Research Repository

All Outputs (18)

Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints' (2016)
Presentation / Conference Contribution
Bonner, S., Brennan, J., Theodoropoulos, G., Kureshi, I., & McGough, A. (2016, August). Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints'. Presented at Twelfth Workshop on Mining and Learning with Graphs (MLG) at KDD'16, San Francisco, USA

The problem of how to compare empirical graphs is an area of great interest within the field of network science. The ability to accurately but efficiently compare graphs has a significant impact in such areas as temporal graph evolution, anomaly dete...

Towards large-scale what-if traffic simulation with exact-differential simulation (2015)
Presentation / Conference Contribution
Hanai, M., Suzumura, T., Theodoropoulos, G., Perumalla, K., & Yilmaz, L. (2015, December). Towards large-scale what-if traffic simulation with exact-differential simulation. Presented at 2015 Winter Simulation Conference, WSC '15, Huntington Beach, California

To analyze and predict the behavior of large-scale traffic with what-if simulation, the simulation must be repeated many times with various patterns of what-if scenarios. In this paper, we propose new techniques to efficiently repeat what-if simulation tasks with...

Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain (2015)
Presentation / Conference Contribution
Bonner, S., McGough, S., Kureshi, I., Brennan, J., Theodoropoulos, G., Moss, L., Corsar, D., & Antoniou, G. (2015, October). Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain. Presented at IEEE International Conference on Big Data, Santa Clara

Recent technological advances in modern healthcare have led to the ability to collect a vast wealth of patient monitoring data. This data can be utilised for patient diagnosis but it also holds the potential for use within medical research. However,...

Fast Compression of Large Semantic Web Data using X10 (2015)
Journal Article
Cheng, L., Avinash, M., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2016). Fast Compression of Large Semantic Web Data using X10. IEEE Transactions on Parallel and Distributed Systems, 27(9), 2603-2617. https://doi.org/10.1109/tpds.2015.2496579

The Semantic Web comprises enormous volumes of semi-structured data elements. For interoperability, these elements are represented by long strings. Such representations are not efficient for the purposes of applications that perform computations over...

Towards an Info-Symbiotic Decision Support System for Disaster Risk Management (2015)
Presentation / Conference Contribution
Kureshi, I., Theodoropoulos, G., Mangina, E., O'Hare, G., & Roche, J. (2015, October). Towards an Info-Symbiotic Decision Support System for Disaster Risk Management. Presented at 2015 IEEE/ACM 19th International Symposium on Distributed Simulation and Real Time Applications (DS-RT), Chengdu, China

This paper outlines a framework for an info-symbiotic modelling system using cyber-physical sensors to assist in decision-making. Using a dynamic data-driven simulation approach, this system can help with the identification of target areas and resour...

Exact-Differential Large-Scale Traffic Simulation (2015)
Presentation / Conference Contribution
Hanai, M., Suzumura, T., Theodoropoulos, G., & Perumalla, K. (2015, June). Exact-Differential Large-Scale Traffic Simulation. Presented at 3rd ACM SIGSIM Conference on Principles of Advanced Discrete Simulation - SIGSIM-PADS '15, London, United Kingdom

Analyzing large-scale traffic by simulation requires repeating execution many times with various patterns of scenarios or parameters. Such repeated execution introduces considerable redundancy because the change from a prior scenario to a later scenario is v...

High throughput indexing for large-scale semantic web data (2015)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2015, April). High throughput indexing for large-scale semantic web data. Presented at 30th Annual ACM Symposium on Applied Computing - SAC '15, Salamanca, Spain

Distributed RDF data management systems become increasingly important with the growth of the Semantic Web. Several such systems have been proposed; however, their indexing methods meet performance bottlenecks either on data loading or quer...

Design and evaluation of parallel hashing over large-scale data (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, December). Design and evaluation of parallel hashing over large-scale data. Presented at 2014 21st International Conference on High Performance Computing (HiPC), Velha Goa, India

High-performance analytical data processing systems often run on servers with large amounts of memory. A common data structure used in such environments is the hash table. This paper focuses on investigating efficient parallel hash algorithms for pro...

Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, November). Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems. Presented at 23rd ACM International Conference on Information and Knowledge Management - CIKM '14, Shanghai, China

The performance of joins in parallel database management systems is critical for data intensive operations such as querying. Since data skew is common in many applications, poorly engineered join operations result in load imbalance and performance bo...

A two-tier index architecture for fast processing large RDF data over distributed memory (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, September). A two-tier index architecture for fast processing large RDF data over distributed memory. Presented at 25th ACM conference on Hypertext and social media - HT '14, Santiago, Chile

We propose an efficient method for fast processing large RDF data over distributed memory. Our approach adopts a two-tier index architecture on each computation node: (1) a light-weight primary index, to keep loading times low, and (2) a dynamic, mul...

Robust and Efficient Large-Large Table Outer Joins on Distributed Infrastructures (2014)
Book Chapter
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014). Robust and Efficient Large-Large Table Outer Joins on Distributed Infrastructures. In F. Silva, I. Dutra, & V. S. Costa (Eds.), Euro-Par 2014 Parallel Processing : 20th International Conference, Porto, Portugal, August 25-29, 2014 ; proceedings (258-269). Springer Verlag. https://doi.org/10.1007/978-3-319-09873-9_22

Outer joins are ubiquitous in many workloads but are sensitive to load-balancing problems. Current approaches mitigate such problems caused by data skew by using (partial) replication. However, contemporary replication-based approaches (1) introduce...

Efficiently Handling Skew in Outer Joins on Distributed Systems (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, May). Efficiently Handling Skew in Outer Joins on Distributed Systems. Presented at 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, Chicago, IL, USA

Outer joins are ubiquitous in databases and big data systems. The question of how best to execute outer joins in large parallel systems is particularly challenging as real world datasets are characterized by data skew leading to performance issues. A...

Automated Dynamic Resource Provisioning and Monitoring in Virtualized Large-Scale Datacenter (2014)
Presentation / Conference Contribution
Abar, S., Lemarinier, P., Theodoropoulos, G., & O'Hare, G. (2014, May). Automated Dynamic Resource Provisioning and Monitoring in Virtualized Large-Scale Datacenter. Presented at 2014 IEEE 28th International Conference on Advanced Information Networking and Applications, Victoria, Canada

Infrastructure as a Service (IaaS) is a pay-as-you-go cloud provisioning model that outsources, on demand, physical servers, guest virtual machine (VM) instances, storage resources, and networking connections. This article reports the design an...

Space-Time Matching Algorithms for Interest Management in Distributed Virtual Environments (2014)
Journal Article
Liu, E., & Theodoropoulos, G. (2014). Space-Time Matching Algorithms for Interest Management in Distributed Virtual Environments. ACM Transactions on Modeling and Computer Simulation, 24(3), Article 15. https://doi.org/10.1145/2567922

Interest management in Distributed Virtual Environments (DVEs) is a data-filtering technique designed to reduce bandwidth consumption and therefore enhance the scalability of the system. This technique usually involves a process called interest matc...

Synchronised Range Queries in Distributed Simulations of Multi-Agent Systems (2013)
Journal Article
Suryanarayanan, V., & Theodoropoulos, G. (2013). Synchronised Range Queries in Distributed Simulations of Multi-Agent Systems. ACM Transactions on Modeling and Computer Simulation, 23(4), Article 25. https://doi.org/10.1145/2517449

Range queries are an increasingly important associative form of data access encountered in different computational environments including peer-to-peer systems, wireless communications, database systems, distributed virtual environments, and, more rec...