Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(local relationship): support local relationship tables in old schema #391

Merged
merged 8 commits into from
Aug 15, 2024

Conversation

jsdonn
Copy link
Contributor

@jsdonn jsdonn commented Jul 24, 2024

Summary

As part of the graph query service, we want to support local relationship tables in all GMSes, regardless of which database schema mode they are on (i.e. NEW_SCHEMA_ONLY vs OLD_SCHEMA_ONLY). This requires adding support for local relationship ingestion and querying in the old schema.

Ingestion

Before: local relationship ingestion was bundled with new schema entity table ingestion in EbeanLocalAccess. While bundled in the same add method, there is no technical dependency between the two.

After: local relationship ingestion has been moved to EbeanLocalDAO::saveLatest. The logic is the same:

  1. check if the incoming update is a soft deletion.
  2. if soft deletion, check if the aspect to be soft-deleted has any relationships tied to it.
  3. if yes, throw an error. No aspect soft deletion allowed if it is associated with local relationships.
  4. ingest aspect as usual (into entity tables in new schema or metadata_aspect in old schema)
  5. ingest local relationships using the same logic as before.

Query

Before: querying for local relationships using the findRelationships API would join multiple tables together (relationship + entity tables) to satisfy any filter conditions. This logic was built specifically for the new schema.

After: EbeanLocalRelationshipQueryDAO now has a schemaConfig similar to EbeanLocalDAO, which can take on the values of NEW_SCHEMA_ONLY, DUAL_SCHEMA, and OLD_SCHEMA_ONLY. It can be set using setSchemaConfig(). findRelationships will use different SQL queries depending on the value of schemaConfig. In new schema mode, everything is the same as before.

In old schema mode, there are certain restrictions. Only 1 filter is allowed per source or destination entity. That filter, if it exists, MUST be on the urn field (and not the aspect field for example). Meaning - no joins are necessary; all information in the allowed query can be found just by looking at the local relationship table. This restriction is in place because filtering on metadata_aspect is not feasible since the metadata column is not indexed. Thus, the resulting SQL will look something like:

SELECT rt.* FROM metadata_relationship_<relationship_type> rt
WHERE deleted_ts IS NULL
{ (optional) AND rt.source = urn:li:<sourceType>:1 }
{ (optional) AND rt.destination = urn:li:<destinationType>:2 }

Other Details

Introduced RelationshipSource enum which can have 2 values: RELATIONSHIP_BUILDERS and ASPECT_METADATA. This enum will be used to tell the DAO where to ingest relationship metadata from. Right now, all relationship data comes from looking up an Aspect class in a local relationship builder registry to retrieve the corresponding relationship builder. Then, the relationship is built using that relationship builder and ingested.

In the future, relationships will be stored directly in the aspect metadata so maintaining relationship builders and a registry will not be necessary. However, I will leave the implementation of supporting the ASPECT_METADATA enum for a later PR.

Testing Done

Updated/refactored some unit tests.

Checklist

Copy link
Contributor

@yangyangv2 yangyangv2 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

minor comments, thanks for the hard / great work!

The PR description is very helpful!

* Set a local relationship builder registry.
* Provide a local relationship builder registry. Local relationships will be built based on the builders during data ingestion.
* @param localRelationshipBuilderRegistry All local relationship builders should be registered in this registry.
* Can be set to null to turn off local relationship ingestion.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

in which scenario we want to disable the local relationship? routing GMS?

this is little implicit operation control --- should we keep the registry nonNull but have an more explicit control to enable/disable local relationship ingestion?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We want to disable the local relationships when a relationship is not ready to be onboarded yet (like when the table has not been created yet).

In the future, we might not need this; we can just assume that all relationships will be ingested. But for now, while we still have ES graph and MySQL local relationship tables, we only want to ingest the relationships that are actually onboarded to MySQL local relationships.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

makes sense, can you include a brief explanation in the comment?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same above

@@ -80,6 +87,9 @@ public EbeanLocalRelationshipQueryDAO(EbeanServer server) {
@Nonnull
public <SNAPSHOT extends RecordTemplate> List<SNAPSHOT> findEntities(@Nonnull Class<SNAPSHOT> snapshotClass,
@Nonnull LocalRelationshipFilter filter, int offset, int count) {
if (_schemaConfig == EbeanLocalDAO.SchemaConfig.OLD_SCHEMA_ONLY) {
throw new UnsupportedOperationException("findEntities is not supported in OLD_SCHEMA_MODE");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should this be a Checked Exception to make sure the OLD_SCHEMA can properly handle it? or can we be sure OLD_SCHEMA will not run into this logic

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you clarify what you mean by checked exception?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the caller needs to explicitly catch and handle the exception.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added a throws UnsupportedOperationException to the method signature

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

UnsupportedOperationException is a "RuntimeException" and it does not make caller to explicitly catch it. You can either switch to some exception that extends Exception instead of RuntimeException.

My intention was a question more making the client to be cautious when using this findEntity methods as they may be running on old schema, otherwise they'll see this issue at runtime.. But if you think this risk is low, then you can stay with the Runtime Exception, but make the documentation clear.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Understood. Changed to OperationNotSupportedException from javax which extends Exception. Also added to the javadocs.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general, I don't expect many non-MG users to use the localrelationship dao. As far as I know only the lineage accessor in TMS uses it and we have control over that.

@@ -104,6 +114,9 @@ public <SRC_SNAPSHOT extends RecordTemplate, DEST_SNAPSHOT extends RecordTemplat
@Nonnull Class<DEST_SNAPSHOT> destinationEntityClass, @Nonnull LocalRelationshipFilter destinationEntityFilter,
@Nonnull Class<RELATIONSHIP> relationshipType, @Nonnull LocalRelationshipFilter relationshipFilter, int minHops,
int maxHops, int offset, int count) {
if (_schemaConfig == EbeanLocalDAO.SchemaConfig.OLD_SCHEMA_ONLY) {
throw new UnsupportedOperationException("findEntities is not supported in OLD_SCHEMA_MODE");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same as line 91

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added a throws UnsupportedOperationException to the method signature

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changed to OperationNotSupportedException from javax which extends Exception. Also added to the javadocs.

@jsdonn jsdonn force-pushed the old_schema_local_relationship branch from 95b02b4 to 124dfad Compare August 13, 2024 23:01
Copy link
Contributor

@yangyangv2 yangyangv2 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks for addressing the comments.

@ybz1013 ybz1013 merged commit be56ee0 into linkedin:master Aug 15, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants