Below is DataHub's roadmap for the short, medium and long term. We welcome suggestions from the community.
ETAs are revisited on a regular basis and are subject to change. If you would like to see something prioritized, please reach out to us on Slack or attend the town hall to discuss!
- Models + UI
- Link datasets to jobs & flows
- Models + UI
- Add query-after-write capability to local DAO
- Support the majority of Gremlin-compatible graph DBs
- Add docker-based integration tests
- Migrate from docker-compose to Kubernetes for container orchestration
- Split up unified events to improve scalability & modularity
- Models + impact analysis
- Models + UI
- Models + UI
- Make schemas searchable
- Support GraphQL schemas
- UI to highlight high-value information about entities within search and entity pages
- Simple tag-based data privacy metadata
- Users will be able to like and follow entities
- Dataset & field-level commenting
- Config-driven UI
- Generate TypeScript types from Pegasus
- Use GraphQL exclusively for frontend queries
- Use Redux exclusively for UI state management
- Support a wide range of document stores
- Donate code to a foundation, e.g. Apache or the Linux Foundation
- Run DataHub in Azure and provide how-to guides
- Indexing in OLAP store (Pinot) with TTL
- Initially focus on rest.li services & GraphQL integration
- TypeScript-only frontend development
- Modeling in protobuf + serving in gRPC