Deprecated: Function jetpack_form_register_pattern is deprecated since version jetpack-13.4! Use Automattic\Jetpack\Forms\ContactForm\Util::register_pattern instead. in /var/www/html/wp-includes/functions.php on line 6031 Warning: Cannot modify header information - headers already sent by (output started at /var/www/html/wp-includes/functions.php:6031) in /var/www/html/wp-includes/rest-api/class-wp-rest-server.php on line 1794 {"id":396,"date":"2021-06-25T17:30:45","date_gmt":"2021-06-25T23:30:45","guid":{"rendered":"https:\/\/news.registro.gt\/?p=396"},"modified":"2022-02-21T15:00:26","modified_gmt":"2022-02-21T21:00:26","slug":"the-new-data-science-lab-of-the-uvg","status":"publish","type":"post","link":"https:\/\/news.registro.gt\/en\/2021\/06\/25\/the-new-data-science-lab-of-the-uvg\/","title":{"rendered":"The new Data Science Lab of the UVG"},"content":{"rendered":"\n

In March 2021, the Board of Directors of Universidad del Valle de Guatemala (UVG<\/a>), authorized the creation of the Data Science Lab, as a unit within the Center for Studies in Applied Informatics (CEIA <\/a>in Spanish). One of the goals of this new unit is to collect, store and preserve as much data as possible, generated in Guatemala. <\/strong>With this information, an open data repository will be created; which means that the data will be available without restrictions, with the condition of citing the source and sharing.<\/p>\n\n\n

<\/p>\n\n\n

The new Data Science Lab is an innovative proposal because, in addition to working with small databases, it will be one of the pioneers in working with Big Data in the country<\/strong> and making tools available to Guatemalans to work with.<\/p>\n\n\n

What is Big Data?<\/h3>\n\n\n

The term Big Data refers to large and complex data sets, which are so voluminous that greater computational resources are required to work with them. Despite its popularity, there is still no consensus on its definition, so we can simplify it by saying that: if the resources available are not enough to process the data, it is Big Data. <\/strong><\/p>\n\n\n\n

Big Data is characteristic of the 21st century: it is estimated that currently 1.7 Megabytes of data are generated per second, per person in the world.  Despite their complexity, these massive volumes of data can be used to identify and find solutions to problems that were once unsolvable. In other words, Big Data provides a benchmark.<\/strong><\/p>\n\n\n

The \u00abthree Vs\u00bb of Big Data<\/h3>\n\n\n

The idea of \u200b\u200bthe \u201cthree Vs\u201d responds to the characteristics necessary for Big Data to be relevant. These are:<\/p>\n\n\n\n

Volume:<\/strong> The amount of data we have matters because it dictates the resources needed  to model and process it.<\/p>\n\n\n\n

Velocity: <\/strong>This characteristic refers to the rate at which data is received and some action is applied to it. Sometimes this data is acquired in real-time, which requires evaluation and action at the same speed.<\/p>\n\n\n\n

Variety:<\/strong> This refers to the different types of data available. In the past, conventional data was structured and could be clearly organized in a relational database<\/a>. With the increase of data, it is more difficult to structure elements such as text, audio, or video.<\/p>\n\n\n\n

Frequently  two other Vs are mentioned: value and veracity.<\/strong> These respond to the fact that the data has an intrinsic value. However, it is of no use until that value is discovered. To be of any value, the data must be usable, and this depends on its preservation. <\/p>\n\n\n\n

It is equally important to ensure that the data comes from reputable and reliable sources. For this reason, CEIA intends to fill a void in the country, by creating the Data Science Lab as a point of reference for researchers and companies from various sectors to obtain relevant data<\/strong>.<\/p>\n\n\n

The Data Science Lab projects<\/h3>\n\n\n

The Data Science Lab will begin its operations with 2 projects, which will be available to the public:<\/p>\n\n\n\n