Is a structured collection of data stored?

In computer science, a data structure is a particular way of organising and storing data in a computer such that it can be accessed and modified efficiently. More precisely, a data structure is a collection of data values, the relationships among them, and the functions or operations that can be applied to the data.

Three different data structures

For the analysis of data, it is important to understand that there are three common types of data structures:

Is a structured collection of data stored?

Structured Data

Structured data is data that adheres to a pre-defined data model and is therefore straightforward to analyse. Structured data conforms to a tabular format with relationship between the different rows and columns. Common examples of structured data are Excel files or SQL databases. Each of these have structured rows and columns that can be sorted.

Structured data depends on the existence of a data model – a model of how data can be stored, processed and accessed. Because of a data model, each field is discrete and can be accesses separately or jointly along with data from other fields. This makes structured data extremely powerful: it is possible to quickly aggregate data from various locations in the database.

Structured data is is considered the most ‘traditional’ form of data storage, since the earliest versions of database management systems (DBMS) were able to store, process and access structured data.

Unstructured Data

Unstructured data is information that either does not have a predefined data model or is not organised in a pre-defined manner. Unstructured information is typically text-heavy, but may contain data such as dates, numbers, and facts as well. This results in irregularities and ambiguities that make it difficult to understand using traditional programs as compared to data stored in structured databases. Common examples of unstructured data include audio, video files or No-SQL databases.

The ability to store and process unstructured data has greatly grown in recent years, with many new technologies and tools coming to the market that are able to store specialised types of unstructured data. MongoDB, for example, is optimised to store documents. Apache Giraph, as an opposite example, is optimised for storing relationships between nodes.

The ability to analyse unstructured data is especially relevant in the context of Big Data, since a large part of data in organisations is unstructured. Think about pictures, videos or PDF documents. The ability to extract value from unstructured data is one of main drivers behind the quick growth of Big Data.

Semi-structured Data

Semi-structured data is a form of structured data that does not conform with the formal structure of data models associated with relational databases or other forms of data tables, but nonetheless contain tags or other markers to separate semantic elements and enforce hierarchies of records and fields within the data. Therefore, it is also known as self-describing structure. Examples of semi-structured data include JSON and XML are forms of semi-structured data.

The reason that this third category exists (between structured and unstructured data) is because semi-structured data is considerably easier to analyse than unstructured data. Many Big Data solutions and tools have the ability to ‘read’ and process either JSON or XML. This reduces the complexity to analyse structured data, compared to unstructured data.

Metadata – Data about Data

A last category of data type is metadata. From a technical point of view, this is not a separate data structure, but it is one of the most important elements for Big Data analysis and big data solutions. Metadata is data about data. It provides additional information about a specific set of data.

In a set of photographs, for example, metadata could describe when and where the photos were taken. The metadata then provides fields for dates and locations which, by themselves, can be considered structured data. Because of this reason, metadata is frequently used by Big Data solutions for initial analysis.

A Database Is A Structured Collection of Data

Uploaded by

Amar Sarkar

0 ratings0% found this document useful (0 votes)

113 views21 pages

Document Information

click to expand document information

Description:

Hdhd

Original Title

A Database is a Structured Collection of Data

Copyright

© © All Rights Reserved

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

  • Share on Facebook, opens a new window

    Facebook

  • Share on Twitter, opens a new window

    Twitter

  • Share on LinkedIn, opens a new window

    LinkedIn

  • Share with Email, opens mail client

    Email

  • Copy Link

    Copy Link

Did you find this document useful?

0%0% found this document useful, Mark this document as useful

0%0% found this document not useful, Mark this document as not useful

Is this content inappropriate?

Report this Document

Download now

SaveSave A Database is a Structured Collection of Data For Later

0 ratings0% found this document useful (0 votes)

113 views21 pages

A Database Is A Structured Collection of Data

Original Title:

A Database is a Structured Collection of Data

Uploaded by

Amar Sarkar

Description:

Hdhd

Full description

SaveSave A Database is a Structured Collection of Data For Later

0%0% found this document useful, Mark this document as useful

0%0% found this document not useful, Mark this document as not useful

Embed

Share

Print

Download now

Jump to Page

You are on page 1of 21

Search inside document

You're Reading a Free Preview
Pages 6 to 15 are not shown in this preview.

Buy the Full Version

You're Reading a Free Preview
Page 19 is not shown in this preview.

Buy the Full Version

Reward Your Curiosity

Everything you want to read.

Anytime. Anywhere. Any device.

No Commitment. Cancel anytime.

Is a structured collection of data stored?

Share this document

Share or Embed Document

Sharing Options

  • Share on Facebook, opens a new window
  • Share on Twitter, opens a new window
  • Share on LinkedIn, opens a new window
  • Share with Email, opens mail client
  • Copy Link

Quick navigation

  • Home

  • Books

  • Audiobooks

  • Documents

    , active

What is a structured data store?

The term structured data refers to data that resides in a fixed field within a file or record. Structured data is typically stored in a relational database (RDBMS). It can consist of numbers and text, and sourcing can happen automatically or manually, as long as it's within an RDBMS structure.

Where does structured data stored?

Structured data is commonly stored in data warehouses and unstructured data is stored in data lakes. Both have cloud-use potential, but structured data allows for less storage space and unstructured data requires more.

Is structured data stored in a database?

Structured data is when data is in a standardized format, has a well-defined structure, complies to a data model, follows a persistent order, and is easily accessed by humans and programs. This data type is generally stored in a database.

Is a structured collection of data stored which can be stored in digital form?

In computing, a database is an organized collection of data stored and accessed electronically. Small databases can be stored on a file system, while large databases are hosted on computer clusters or cloud storage.