github.com
togethercomputer/RedPajama-Data: The RedPajama-Data repository contains code for preparing large datasets for training large language models.
· The RedPajama-Data repository contains code for preparing large datasets for training large language models. - GitHub - togethercomputer/RedPajama-Data: The RedPajama-Data repository contains code ... · Shared by 9, including Marktechpost AI Research News ⚡, Matt Shaffer
Stanford University
Ecosystem Graphs for Foundation Models
· Shared by 20, including Sebastian Raschka, Alexander Seifert, William El Kaim, Matt Shaffer
armbench.s3-website-us-east-1.amazonaws.com
ARMBENCH DATASET
1 min · · ARMBench is a large-scale benchmark dataset for perception and manipulation challenges in a robotic pick-and-place setting. The dataset is collected in an Amazon warehouse and captures a wide variety… · Shared by 9, including Marktechpost AI Research News ⚡, Matt Shaffer
github.com
beneschwab/awesome-openx: A list of free applications, libraries and datasets concerning the development of automated driving functions with focus on ASAM…
· A list of free applications, libraries and datasets concerning the development of automated driving functions with focus on ASAM OpenX standards - GitHub - beneschwab/awesome-openx: A list of free ... · Shared by 4, including Matt Shaffer
huggingface.co
tatsu-lab/alpaca · Datasets at Hugging Face
· Dataset Preview Go to dataset viewer · Shared by 8, including Matt Shaffer, Andy Matuschak
github.com
GitHub - GAP-LAB-CUHK-SZ/TO-Scene
· (ECCV 2022 Oral) TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes - GitHub - GAP-LAB-CUHK-SZ/TO-Scene: (ECCV 2022 Oral) TO-Scene: A Large-scale Dataset for Understanding 3D Tabl... · Shared by 5, including Matt Shaffer
github.com
GAP-LAB
· generation and analysis of pixels, points and polygons - GAP-LAB · Shared by 5, including Matt Shaffer
jonbarron.info
mip-NeRF
1 min · · Project page for Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields. · Shared by 7, including Matt Shaffer
lakefs.io
Atomic Versioned Data Lake
· lakeFS is an open-source tool that transforms your object storage to Git-like repositories. Start managing data the way you manage your code. · Shared by 13, including Matt Shaffer, Mike Ivars
github.com
InfuseAI/ArtiVC: A version control system to manage large files.
· A version control system to manage large files. Contribute to InfuseAI/ArtiVC development by creating an account on GitHub. · Shared by 5, including Matt Shaffer
github.com
Charmve/Surface-Defect-Detection: 📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are…
· 📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field of surface defect research which are of great importance. - GitHub - Charmve/Surface-Defect-Detect... · Shared by 7, including Matt Shaffer
github.com
shankarpandala / lazypredict
· Lazy Predict help build a lot of basic models without much code and helps understand which models works better without any parameter tuning - GitHub - shankarpandala/lazypredict: Lazy Predict help ... · Shared by 10, including Matt Shaffer, Nico Müller 🇺🇦
github.com
unixpickle/car-data: Scraping and predicting car info
· Scraping and predicting car info. Contribute to unixpickle/car-data development by creating an account on GitHub. · Shared by 4, including Matt Shaffer
Marktechpost AI Research News ⚡
Meet ‘Stack,’ A 3TB of Permissively Licensed Source Code for LLMs (Large Language Models)
2 min · · About a year ago, generating code from a Large Language model (LLM) was like an unachievable task. With the advancement in Artificial Intelligence, LLMs are now successfully being used to generate… · Shared by 4, including Matt Shaffer
blog.roboflow.com
Top 6 Manufacturing Datasets for Computer Vision
4 min · · This post will show you 6 of the best open source datasets for computer vision tasks in the manufacturing industry. · Shared by 5, including Matt Shaffer
github.com
GitHub - LAION-AI/interesting-text-datasets
· Contribute to LAION-AI/interesting-text-datasets development by creating an account on GitHub. · Shared by 4, including Matt Shaffer