Wyvern Entity
A Wyvern entity (represented by the WyvernEntity in code) is the basic unit of data structure that represents a critical object in your pipline. It’s like the database table concept. For example in the case of product ranking, “product”, “user”, “query” or even the “request” are all important entities. Wyvern Entity is built with pydantic model so it’s really easy to work with. Let’s look at the ranking product entity:pipelines/product_ranking/schemas.py
product_id as it’s main identifier.
Wyvern Index
Wyvern Index (current only integrates with Redis) is where you store Near Realtime or Streaming Features of entity data. The concept is quite similar to the elasticsearch index. Wyvern Application has the following indexation APIs:- Upload Entities (POST /api/v1/entities/upload): API to create or update entities. No partial update of an entity.In another word, if an entity is already indexed (already exists) in Wyvern Index, it will be completely replaced by this new entity data with the same entity id.
- Get Entities (POST /api/v1/entities/get): API to read the entities from Wyvern Index.
- Delete Entities (POST /api/entity/v1/entities/delete): API to delete entities in Wyvern Index.
Data Hydration
At the beginning of a wyvern inference request, Wyvern grabs all the entities contained in the request schema, fetches all entity data from Wyvern Index if the entity exists and “hydrate” all the data into the entity object. Therefore, even if your API request is only sending the primary key of the entity, the pipeline code will still have full access to all the indexed data attributes of the entity. This is the what the “Entity Hydration” phase. Let’s look at an example with the ranking request:pipelines/product_ranking/schemas.py
product_id, Wyvern will fetch all the product entities that are found in Wyvern Index and hydrate data into the Product entity. As a result, request.candidates[0].brand will be available within your pipeline code.
Batch Feature Retrieval
Wyvern automates the retrieval of all the features required by your pipeline from feature store and store the features in memory per request. Inside every wyvern component, you can doself.get_feature(identifier, "feature_name") to get the feature value.
Next, let’s see how batch features work in Wyvern.