Skip to main content

Research Repository

Advanced Search

All Outputs (12)

Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints' (2016)
Presentation / Conference Contribution
Bonner, S., Brennan, J., Theodoropoulos, G., Kureshi, I., & McGough, A. (2016, August). Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints'. Presented at Twelfth Workshop on Mining and Learning with Graphs (MLG) at KDD'16., San Francisco, USA

The problem of how to compare empirical graphs is an area of great interest within the field of network science. The ability to accurately but efficiently compare graphs has a significant impact in such areas as temporal graph evolution, anomaly dete... Read More about Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints'.

Towards large-scale what-if traffic simulation with exact-differential simulation (2015)
Presentation / Conference Contribution
Hanai, M., Suzumura, T., Theodoropoulos, G., Perumalla, K., & Yilmaz, L. (2015, December). Towards large-scale what-if traffic simulation with exact-differential simulation. Presented at 2015 Winter Simulation Conference, WSC '15, Huntington Beach, California

To analyze and predict a behavior of large-scale traffics with what-if simulation, it needs to repeat many times with various patterns of what-if scenarios. In this paper, we propose new techniques to efficiently repeat what-if simulation tasks with... Read More about Towards large-scale what-if traffic simulation with exact-differential simulation.

Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain (2015)
Presentation / Conference Contribution
Bonner, S., McGough, S., Kureshi, I., Brennan, J., Theodoropoulos, G., Moss, L., Corsar, D., & Antoniou, G. (2023, October). Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain. Presented at IEEE International Conference on Big Data, Santa Clara

Recent technological advances in modern healthcare have lead to the ability to collect a vast wealth of patient monitoring data. This data can be utilised for patient diagnosis but it also holds the potential for use within medical research. However,... Read More about Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain.

Towards an Info-Symbiotic Decision Support System for Disaster Risk Management (2015)
Presentation / Conference Contribution
Kureshi, I., Theodoropoulos, G., Mangina, E., O'Hare, G., & Roche, J. (2015, October). Towards an Info-Symbiotic Decision Support System for Disaster Risk Management. Presented at 2015 IEEE/ACM 19th International Symposium on Distributed Simulation and Real Time Applications (DS-RT), Chengdu, China

This paper outlines a framework for an info-symbiotic modelling system using cyber-physical sensors to assist in decision-making. Using a dynamic data-driven simulation approach, this system can help with the identification of target areas and resour... Read More about Towards an Info-Symbiotic Decision Support System for Disaster Risk Management.

Exact-Differential Large-Scale Traffic Simulation (2015)
Presentation / Conference Contribution
Hanai, M., Suzumura, T., Theodoropoulos, G., & Perumalla, K. (2015, June). Exact-Differential Large-Scale Traffic Simulation. Presented at 3rd ACM SIGSIM Conference on Principles of Advanced Discrete Simulation - SIGSIM-PADS '15, London, United Kingdom

Analyzing large-scale traffics by simulation needs repeating execution many times with various patterns of scenarios or parameters. Such repeating execution brings about big redundancy because the change from a prior scenario to a later scenario is v... Read More about Exact-Differential Large-Scale Traffic Simulation.

High throughput indexing for large-scale semantic web data (2015)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2015, April). High throughput indexing for large-scale semantic web data. Presented at 30th Annual ACM Symposium on Applied Computing - SAC '15, Salamanca, Spain

Distributed RDF data management systems become increasingly important with the growth of the Semantic Web. Currently, several such systems have been proposed, however, their indexing methods meet performance bottlenecks either on data loading or quer... Read More about High throughput indexing for large-scale semantic web data.

Design and evaluation of parallel hashing over large-scale data (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, December). Design and evaluation of parallel hashing over large-scale data. Presented at 2014 21st International Conference on High Performance Computing (HiPC), Velha Goa, India

High-performance analytical data processing systems often run on servers with large amounts of memory. A common data structure used in such environment is the hash tables. This paper focuses on investigating efficient parallel hash algorithms for pro... Read More about Design and evaluation of parallel hashing over large-scale data.

Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, November). Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems. Presented at 23rd ACM International Conference on Conference on Information and Knowledge Management - CIKM '14, Shanghai, China

The performance of joins in parallel database management systems is critical for data intensive operations such as querying. Since data skew is common in many applications, poorly engineered join operations result in load imbalance and performance bo... Read More about Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems.

A two-tier index architecture for fast processing large RDF data over distributed memory (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, September). A two-tier index architecture for fast processing large RDF data over distributed memory. Presented at 25th ACM conference on Hypertext and social media - HT '14, Santiago, Chile

We propose an efficient method for fast processing large RDF data over distributed memory. Our approach adopts a two-tier index architecture on each computation node: (1) a light-weight primary index, to keep loading times low, and (2) a dynamic, mul... Read More about A two-tier index architecture for fast processing large RDF data over distributed memory.

Efficiently Handling Skew in Outer Joins on Distributed Systems (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, T. (2014, May). Efficiently Handling Skew in Outer Joins on Distributed Systems. Presented at 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, Chicago, IL, USA

Outer joins are ubiquitous in databases and big data systems. The question of how best to execute outer joins in large parallel systems is particularly challenging as real world datasets are characterized by data skew leading to performance issues. A... Read More about Efficiently Handling Skew in Outer Joins on Distributed Systems.

Automated Dynamic Resource Provisioning and Monitoring in Virtualized Large-Scale Datacenter (2014)
Presentation / Conference Contribution
Abar, S., Lemarinier, P., Theodoropoulos, G., & OHare, G. (2014, May). Automated Dynamic Resource Provisioning and Monitoring in Virtualized Large-Scale Datacenter. Presented at 2014 IEEE 28th International Conference on Advanced Information Networking and Applications, Victoria, Victoria, Canada

Infrastructure as a Service (IaaS) is a pay-as-you go based cloud provision model which on demand outsources the physical servers, guest virtual machine (VM) instances, storage resources, and networking connections. This article reports the design an... Read More about Automated Dynamic Resource Provisioning and Monitoring in Virtualized Large-Scale Datacenter.