{"id":3992,"date":"2024-04-30T17:14:48","date_gmt":"2024-04-30T11:44:48","guid":{"rendered":"https:\/\/newsdata.io\/blog\/?p=3992"},"modified":"2024-08-30T17:22:09","modified_gmt":"2024-08-30T11:52:09","slug":"news-dataset-compilation","status":"publish","type":"post","link":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/","title":{"rendered":"Dataset Compilation"},"content":{"rendered":"[vc_row type=&#8221;in_container&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/4&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; 
column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221; column_padding_type=&#8221;default&#8221; gradient_type=&#8221;default&#8221; offset=&#8221;vc_hidden-sm vc_hidden-xs&#8221;][\/vc_column][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_direction_desktop=&#8221;default&#8221; column_element_spacing=&#8221;default&#8221; desktop_text_alignment=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_backdrop_filter=&#8221;none&#8221; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; el_class=&#8221;text_block_wrapper&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;3\/4&#8243; tablet_width_inherit=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221; column_padding_type=&#8221;default&#8221; gradient_type=&#8221;default&#8221; offset=&#8221;vc_col-lg-9 vc_col-md-12&#8243;][image_with_animation image_url=&#8221;3994&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;None&#8221; animation_movement_type=&#8221;transform_y&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;15px&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;100%&#8221; max_width_mobile=&#8221;default&#8221;][vc_column_text]\n<h1><b>Introduction<\/b><\/h1>\n[\/vc_column_text][vc_column_text]With the increasing dependency of the world on technology, the amount of 
data being scraped per day has grown. As the volume of scraped data grows, so does the amount of complex, unorganized data, making it hard to find your way through it. This is where datasets come in: they organize and structure data into understandable formats.<\/p>\n<p>In this blog, we will go through the basics of a dataset and list sources that provide reliable, readily available datasets.[\/vc_column_text][vc_column_text]\n<h2><b>What do you mean by Dataset?<\/b><\/h2>\n[\/vc_column_text][vc_column_text]A dataset is a collection of data organized in a specific way. It is made up of separate elements that can be manipulated as needed. A dataset can hold numbers, text, images, or even sounds, and is often arranged in rows and columns, as in a spreadsheet.<\/p>\n<p>Datasets can be used to study any number of things, ranging from customer behavior to scientific discoveries. Moreover, a dataset organizes complex data in a format that is easy to understand and ready to use, which supports better decision-making.<\/p>\n<p>You can also check out websites like <strong>Newsdata.io<\/strong> and <strong>Datarade<\/strong> to get your own customized dataset on the topic of your choice.[\/vc_column_text][vc_column_text]\n<h2><b>What are the possible sources of Datasets?<\/b><\/h2>\n[\/vc_column_text][vc_column_text]Datasets fetched from authentic and reliable sources strengthen the research process. But with hundreds of sources providing datasets on various topics, it is difficult to tell which ones are reliable.<\/p>\n<p>So, to save you from having to go through each source, below is a list of reliable sources to fetch datasets from.[\/vc_column_text][vc_column_text]\n<h3><b>1. 
Government Websites<\/b><\/h3>\n<p>Governments across the globe publish open datasets on their websites on various topics related to economics, health, demographics, and the environment.<\/p>\n<p>Some examples of government websites are mentioned below:<\/p>\n<ul>\n<li><a href=\"https:\/\/www.nic.in\/products\/open-government-data-ogd-platform-india\/\" rel=\"nofollow\">National Informatics Centre<\/a><\/li>\n<li><a href=\"http:\/\/data.gov\">Data.gov<\/a><\/li>\n<li><a href=\"http:\/\/gov.uk\">Gov.UK<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3><b>2. International Organizations<\/b><\/h3>\n<p>Several well-known international organizations publish open datasets on their websites on broader topics related to global issues.<\/p>\n<p>Some examples of international organizations are given below:<\/p>\n<ul>\n<li><a href=\"https:\/\/data.worldbank.org\/\">World Bank Open Data<\/a><\/li>\n<li><a href=\"https:\/\/data.un.org\/\">UN Data<\/a><\/li>\n<li><a href=\"https:\/\/www.earthdata.nasa.gov\/\">Earthdata<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3><b>3. Research Institutes<\/b><\/h3>\n<p>Several research institutes publish their datasets on various topics ranging from health and lifestyle to the economy and finance.<\/p>\n<p>Some examples of research institutes are given below:<\/p>\n<ul>\n<li><a href=\"https:\/\/archive.ics.uci.edu\/\">UCI Machine Learning Repository<\/a><\/li>\n<li><a href=\"https:\/\/dataverse.harvard.edu\/\">Harvard Dataverse<\/a><\/li>\n<li><a href=\"https:\/\/opendata.cern.ch\/\">CERN Open Data Portal<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3><b>4. Non-Profit Organizations<\/b><\/h3>\n<p>Non-profit organizations collect and publish data mainly on issues like poverty, public health, or environmental sustainability. 
These datasets can then be used by researchers and analysts working on similar issues.<\/p>\n<p>Some examples of non-profit organizations publishing datasets are given below:<\/p>\n<ul>\n<li><a href=\"https:\/\/www.kaggle.com\/datasets\">Kaggle<\/a><\/li>\n<li><a href=\"https:\/\/openknowledge.worldbank.org\/home\">Open Knowledge Repository<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3><b>5. Search Engines<\/b><\/h3>\n<p>Search engines like Google have become a major route to free datasets on various topics. Their dataset indexes keep expanding to cover more of the datasets available on the internet.<\/p>\n<ul>\n<li><a href=\"https:\/\/datasetsearch.research.google.com\/\" rel=\"nofollow\">Google Dataset Search<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3><b>6. Communities<\/b><\/h3>\n<p>Recent years have seen a rise in the number of online communities collecting and publishing data. These communities can be used as platforms to fetch and share datasets on various topics.<\/p>\n<p>Some examples of such online communities are listed below:<\/p>\n<ul>\n<li><a href=\"http:\/\/newsdata.io\">Newsdata.io<\/a><\/li>\n<li><a href=\"https:\/\/data.world\/\" rel=\"nofollow\">Data.world<\/a><\/li>\n<li><a href=\"https:\/\/github.com\/\" rel=\"nofollow\">GitHub<\/a><\/li>\n<li><a href=\"https:\/\/www.statista.com\/\" rel=\"nofollow\">Statista<\/a><\/li>\n<\/ul>\n[\/vc_column_text][vc_column_text]These are some of the sources that we at <a href=\"http:\/\/newsdata.io\">Newsdata.io<\/a> have used for our <a href=\"https:\/\/newsdata.io\/blog\/category\/dataset\/\">\u2018Dataset of the Week\u2019<\/a> blog. These resources are not only easy to use but also provide accurate and reliable datasets for various purposes.[\/vc_column_text][vc_column_text]\n<h2><b>\u2018Dataset of the Week\u2019 Blog<\/b><\/h2>\n[\/vc_column_text][vc_column_text]There are different reasons why one would need datasets, ranging from global collaboration to enhanced predictions. 
One such application is research, which is how Newsdata.io uses them.<\/p>\n<p>Specifically, Newsdata.io compiles lists of free, available datasets on various topics. The topics covered so far in the \u2018Dataset of the Week\u2019 blog are listed below:[\/vc_column_text][vc_column_text]\n<ol>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/russia-ukraine-war-news-datasets\/\">Russia-Ukraine War News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/israel-hamas-war-datasets\/\">Israel-Hamas War News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/global-inflation-datasets\/\">Global Inflation Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/us-elections-news-datasets\/\">US Presidential Elections 2024 News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/india-election-2024-news-datasets\/\">India Election 2024 News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/fifa-worldcup-2023-datasets-a-mega-compilation\/\">FIFA World Cup 2023 News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/infrastructure-development-news-dataset\/\">Infrastructure Development News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/carbon-footprint-news-datasets\/\">Carbon Footprint News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/indian-stock-market-news-dataset\/\">Indian Stock Market News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/space-race-news-datasets\/\">Space Race News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/climate-crisis-news-datasets-a-mega-compilation\/\">Climate Crisis News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a 
href=\"https:\/\/newsdata.io\/blog\/oil-trading-news-datasets\/\">Oil Trading News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/ai-progress-news-datasets\/\">AI Progress News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/heat-wave-news-datasets\/\">Heat Waves News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/t20-world-cup-news-datasets\/\">T20 World Cup News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/french-elections-news-datasets\/\">French Elections News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/ev-news-datasets\/\">EV News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/cybercrime-news-datasets\/\">Cyber Crime News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/cybercrime-news-datasets\/\">UEFA Euro Standings News Datasets<\/a><\/h3>\n<\/li>\n<li>\n<h3><a href=\"https:\/\/newsdata.io\/blog\/2024-olympics-news-datasets\/\">2024 Olympics News Datasets<\/a><\/h3>\n<\/li>\n<\/ol>\n<p>Keep following for more such datasets on trending topics.[\/vc_column_text][vc_column_text]\n<h2><b>Conclusion<\/b><\/h2>\n[\/vc_column_text][vc_column_text]With this, we come to the end of this blog. I hope you now have a basic idea of what a dataset is and where to find reliable and accurate datasets on topics of your choice. 
For more such blogs related to datasets, you can follow the <a href=\"https:\/\/newsdata.io\/blog\/category\/dataset\/\">\u2018Dataset of the Week\u2019<\/a> blog series or visit <a href=\"http:\/\/newsdata.io\">Newsdata.io<\/a>.[\/vc_column_text][image_with_animation image_url=&#8221;3027&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;None&#8221; animation_movement_type=&#8221;transform_y&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;15px&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;100%&#8221; max_width_mobile=&#8221;default&#8221; img_link=&#8221;https:\/\/bit.ly\/41MjLOC&#8221;][\/vc_column][\/vc_row]\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>This article reviews the basics of news datasets. This article also includes a list of sources to fetch datasets from. <!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":14,"featured_media":3994,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[165,7],"tags":[169],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Dataset Compilation - Newsdata.io - Stay Updated with the Latest News API Trends<\/title>\n<meta name=\"description\" content=\"This article reviews the basics of news datasets. 
This article also includes a list of sources to fetch datasets from.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Dataset Compilation - Newsdata.io - Stay Updated with the Latest News API Trends\" \/>\n<meta property=\"og:description\" content=\"This article reviews the basics of news datasets. This article also includes a list of sources to fetch datasets from.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/\" \/>\n<meta property=\"og:site_name\" content=\"Newsdata.io - Stay Updated with the Latest News API Trends\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-30T11:44:48+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-08-30T11:52:09+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"675\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Prabhleen Kaur\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Prabhleen Kaur\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/\",\"url\":\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/\",\"name\":\"Dataset Compilation - Newsdata.io - Stay Updated with the Latest News API Trends\",\"isPartOf\":{\"@id\":\"https:\/\/newsdata.io\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1\",\"datePublished\":\"2024-04-30T11:44:48+00:00\",\"dateModified\":\"2024-08-30T11:52:09+00:00\",\"author\":{\"@id\":\"https:\/\/newsdata.io\/blog\/#\/schema\/person\/24759f2a710e24a49089f8cf2b8e70b3\"},\"description\":\"This article reviews the basics of news datasets. 
This article also includes a list of sources to fetch datasets from.\",\"breadcrumb\":{\"@id\":\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1\",\"width\":1200,\"height\":675,\"caption\":\"Mega Compilation: News Datasets\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog\",\"item\":\"https:\/\/newsdata.io\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Dataset Compilation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/newsdata.io\/blog\/#website\",\"url\":\"https:\/\/newsdata.io\/blog\/\",\"name\":\"Newsdata.io - Stay Updated with the Latest News API Trends\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/newsdata.io\/blog\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/newsdata.io\/blog\/#\/schema\/person\/24759f2a710e24a49089f8cf2b8e70b3\",\"name\":\"Prabhleen 
Kaur\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/newsdata.io\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/17fb948d4989270bb762d1379a5e874f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/17fb948d4989270bb762d1379a5e874f?s=96&d=mm&r=g\",\"caption\":\"Prabhleen Kaur\"},\"description\":\"Hello, Curious Minds! Welcome to my corner of the digital world, a space brimming with words and woven with ideas. Fresh out of the rigorous trenches of an Economics honors degree at the esteemed University of Delhi, I know a thing or two about crunching numbers and dissecting trends. But beyond the world of graphs and equations, lies my love for reading and writing. Admittedly, I'm a newbie in the content writing scene, still tasting the ink of fresh beginnings. I believe every corner of life holds a story waiting to be told, and I'm eager to be your storyteller. So, strap yourselves in, dear readers, and let's dive into the captivating world of words together! P.S. Feel free to drop a comment or reach out \u2013 I'm always up for a good conversation!\",\"sameAs\":[\"www.linkedin.com\/in\/prabhleen-kaur-8b8823202\"],\"url\":\"https:\/\/newsdata.io\/blog\/author\/prabhleen\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Dataset Compilation - Newsdata.io - Stay Updated with the Latest News API Trends","description":"This article reviews the basics of news datasets. 
This article also includes a list of sources to fetch datasets from.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/","og_locale":"en_US","og_type":"article","og_title":"Dataset Compilation - Newsdata.io - Stay Updated with the Latest News API Trends","og_description":"This article reviews the basics of news datasets. This article also includes a list of sources to fetch datasets from.","og_url":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/","og_site_name":"Newsdata.io - Stay Updated with the Latest News API Trends","article_published_time":"2024-04-30T11:44:48+00:00","article_modified_time":"2024-08-30T11:52:09+00:00","og_image":[{"width":1200,"height":675,"url":"https:\/\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png","type":"image\/png"}],"author":"Prabhleen Kaur","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Prabhleen Kaur","Est. 
reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/","url":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/","name":"Dataset Compilation - Newsdata.io - Stay Updated with the Latest News API Trends","isPartOf":{"@id":"https:\/\/newsdata.io\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#primaryimage"},"image":{"@id":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1","datePublished":"2024-04-30T11:44:48+00:00","dateModified":"2024-08-30T11:52:09+00:00","author":{"@id":"https:\/\/newsdata.io\/blog\/#\/schema\/person\/24759f2a710e24a49089f8cf2b8e70b3"},"description":"This article reviews the basics of news datasets. This article also includes a list of sources to fetch datasets from.","breadcrumb":{"@id":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/newsdata.io\/blog\/news-dataset-compilation\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#primaryimage","url":"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1","contentUrl":"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1","width":1200,"height":675,"caption":"Mega Compilation: News Datasets"},{"@type":"BreadcrumbList","@id":"https:\/\/newsdata.io\/blog\/news-dataset-compilation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog","item":"https:\/\/newsdata.io\/blog\/"},{"@type":"ListItem","position":2,"name":"Dataset 
Compilation"}]},{"@type":"WebSite","@id":"https:\/\/newsdata.io\/blog\/#website","url":"https:\/\/newsdata.io\/blog\/","name":"Newsdata.io - Stay Updated with the Latest News API Trends","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/newsdata.io\/blog\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/newsdata.io\/blog\/#\/schema\/person\/24759f2a710e24a49089f8cf2b8e70b3","name":"Prabhleen Kaur","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/newsdata.io\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/17fb948d4989270bb762d1379a5e874f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/17fb948d4989270bb762d1379a5e874f?s=96&d=mm&r=g","caption":"Prabhleen Kaur"},"description":"Hello, Curious Minds! Welcome to my corner of the digital world, a space brimming with words and woven with ideas. Fresh out of the rigorous trenches of an Economics honors degree at the esteemed University of Delhi, I know a thing or two about crunching numbers and dissecting trends. But beyond the world of graphs and equations, lies my love for reading and writing. Admittedly, I'm a newbie in the content writing scene, still tasting the ink of fresh beginnings. I believe every corner of life holds a story waiting to be told, and I'm eager to be your storyteller. So, strap yourselves in, dear readers, and let's dive into the captivating world of words together! P.S. 
Feel free to drop a comment or reach out \u2013 I'm always up for a good conversation!","sameAs":["www.linkedin.com\/in\/prabhleen-kaur-8b8823202"],"url":"https:\/\/newsdata.io\/blog\/author\/prabhleen\/"}]}},"jetpack_sharing_enabled":true,"jetpack_featured_media_url":"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1","category":["Dataset","General"],"featured_image_url":"https:\/\/i0.wp.com\/newsdata.io\/blog\/wp-content\/uploads\/2024\/04\/Mega-Compilation-News-Dataset.png?fit=1200%2C675&ssl=1","_links":{"self":[{"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/posts\/3992"}],"collection":[{"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/users\/14"}],"replies":[{"embeddable":true,"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/comments?post=3992"}],"version-history":[{"count":3,"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/posts\/3992\/revisions"}],"predecessor-version":[{"id":4773,"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/posts\/3992\/revisions\/4773"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/media\/3994"}],"wp:attachment":[{"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/media?parent=3992"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/categories?post=3992"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsdata.io\/blog\/wp-json\/wp\/v2\/tags?post=3992"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}