Inspiration
Problem
- Data scientist lack on getting different web3 datasets based on there required problem statements.
- Uploading and retrieving machine learning or deep learning models they build was really hard and model management quiet difficult.
- Need to go different data source to grab the required datasets and still hard to search needed data.
- Data and model personalised storage are hard.
What it does
Solutions
- WEB3 DATA HUB is a common solution, using that people can easily pull required datasets (easily access via PIP packages)
- Easy one line cmd to store and pull the models are data as required.
- Easy one line search cmd to get any data as per required
- Personalised encrypted data and model storage.
How we built it
IPFS
- NFT.storage and Web3.storage are used to store the data
- Multiple cron jobs that fetch data daily from various platforms
- Data cleaning and data validation jobs that clean each datasets before push.
- Support encryption if needs.
- personalised support to direct upload to nft storage or web3 storage via function calls.
Challenges we ran into
Getting datasets and models are very hard. keep improving pipelines and model updates.
What's next for IpfsML ( Decentralized WEB3 DataHub )
- SDK (js and go)
- Pipeline improvements for tons of data handling
Built With
- ipfs
- machine-learning
- pip
- python
Log in or sign up for Devpost to join the conversation.