DATASETS DE ACESSO LIVRE PARA PROJETOS DE I.A.!
A melhor forma de aprender o poder da inteligência artificial (IA) é criando projetos e aplicações com ela.
O sucesso de um projeto de IA começa pela análise, seleção, tratamento e filtragem do conjunto de dados a serem utilizados no treinamento da IA. O tempo demandado nesta etapa permitirá economizar tempo nas etapas futuras do projeto, minimizar o uso de recursos computacionais e elevar o nível de acurácia da IA.
A seguir são relacionadas alguns dos repositórios de conjuntos de dados (datasets) de acesso livre para projetos de IA mais difundidos.
Datasets de Universidades
- UCI Machine Learning Repository – https://archive.ics.uci.edu/datasets
- Harvard Dataverse (Harvard University) – https://data.harvard.edu/dataverse
- Labelme (CSAIL – MIT) – http://labelme.csail.mit.edu/Release3.0/browserTools/php/dataset.php
- Center of AI in Medicine & Imaging (Stanford University) – https://aimi.stanford.edu/shared-datasets
Datasets de Governos
- Data .gov US – https://data.gov/
- Data .gov UK – https://www.data.gov.uk/
- European Data – https://data.europa.eu/data/datasets?locale=en
- Latin American Data Bank – https://ropercenter.cornell.edu/latin-american-data-bank
- Dados Abertos Brasil – https://dados.gov.br/signin
Datasets de Astronomia e Espaço
- Earth Data (NASA) – https://www.earthdata.nasa.gov/
- CERN Open Data Portal – https://opendata.cern.ch/
Datasets de Saúde
- Health Data (USA) – https://healthdata.gov/
- Centers For Disease Control And Prevention (USA) – https://www.cdc.gov/datastatistics/index.html
- Dataset for Health Care and Public Health (USA) – https://researchguides.dartmouth.edu/c.php?g=517073&p=6289098
- Global Health Observatory Data Repository – World Health Organization (WHO) – https://apps.who.int/gho/data/node.home
- National Library of Medicine (NIH – USA) – https://medpix.nlm.nih.gov/home
Datasets De Tópicos Variados
- ImageNet – https://image-net.org/
- Kaggle Datasets – https://www.kaggle.com/datasets
- Sigma Open Dadasets – https://sigma.ai/open-datasets/
- OpenML – https://www.openml.org/
- Datahub .io – https://datahub.io/collections
- FiveThirtyEight – https://data.fivethirtyeight.com/
- IMDb Non-Commercial Datasets – https://developer.imdb.com/non-commercial-datasets/
- Google Dataset Search – https://datasetsearch.research.google.com/
- IBM Data Asset eXchange – https://developer.ibm.com/exchanges/data/
- AWS Open Data – https://aws.amazon.com/marketplace/search/results?trk=868d8747-614e-4d4d-9fb6-fd5ac02947a8&sc_channel=el&FULFILLMENT_OPTION_TYPE=DATA_EXCHANGE&CONTRACT_TYPE=OPEN_DATA_LICENSES&filters=FULFILLMENT_OPTION_TYPE%2CCONTRACT_TYPE
- BD – https://basedosdados.org/
Datasets Temáticos
- Furnas Dataset (Electrical Power Transmission Lines) – https://github.com/freds0/PTL-AI_Furnas_Dataset?tab=readme-ov-file
- Nasdaq Data Link – https://data.nasdaq.com/institutional-investors
- Antarctic Datasets – https://www.antarcticglaciers.org/antarctica-2/antarctic-datasets/
- BFI film industry statistics (UK) – https://www.bfi.org.uk/industry-data-insights
- NYC Taxi Trip Data – https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
- FBI (USA) Crime Data Explorer – https://cde.ucr.cjis.gov/LATEST/webapp/#/pages/home
Elaborado Por: Dr. Arnaldo de Carvalho Junior
Publicado em: Jun 17, 2024