Object

Title: Parallelizing user-defined functions in the ETL workflow using orchestration style sheets

Contributor:

Gamper, Johann - ed. ; Wrembel, Robert - ed.

Group publication title:

AMCS, volume 29 (2019)

Abstract:

Today's ETL tools provide capabilities to develop custom code as user-defined functions (UDFs) to extend the expressiveness of the standard ETL operators. However, while this allows us to easily add new functionalities, it also comes with the risk that the custom code is not intended to be optimized, e.g., by parallelism, and for this reason, it performs poorly for data-intensive ETL workflows. In this paper, we present a novel framework which allows the ETL developer to choose a design pattern in order to write parallelizable code, and which generates a configuration for the UDFs to be executed in a distributed environment. This enables ETL developers with minimum expertise in distributed and parallel computing to develop UDFs without taking care of parallelization configurations and complexities. We perform experiments on large-scale datasets based on TPC-DS and BigBench. The results show that our approach significantly reduces the effort of ETL developers and at the same time generates efficient parallel configurations to support complex and data-intensive ETL tasks.

Publisher:

Zielona Góra: Uniwersytet Zielonogórski

Resource Identifier:

oai:zbc.uz.zgora.pl:85921

DOI:

10.2478/amcs-2019-0005

Pages:

69-79

Source:

AMCS, volume 29, number 1 (2019)

Language:

eng

License:

CC BY 4.0

Rights:

Biblioteka Uniwersytetu Zielonogórskiego

Last modified:

14 Jul 2025

Date added:

10 Jul 2025

Number of content views:

18

All available versions of this object:

https://zbc.uz.zgora.pl/repozytorium/publication/100935
