Best public datasets for machine learning, data science, sentiment analysis, computer vision, natural language processing (NLP), clinical data, and others.