Skip to main content

Research Repository

Advanced Search

Outputs (18)

Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints' (2016)
Presentation / Conference Contribution
Bonner, S., Brennan, J., Theodoropoulos, G., Kureshi, I., & McGough, A. (2016, August). Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints'. Presented at Twelfth Workshop on Mining and Learning with Graphs (MLG) at KDD'16., San Francisco, USA

The problem of how to compare empirical graphs is an area of great interest within the field of network science. The ability to accurately but efficiently compare graphs has a significant impact in such areas as temporal graph evolution, anomaly dete... Read More about Efficient Comparison of Massive Graphs Through The Use Of 'Graph Fingerprints'.

Towards large-scale what-if traffic simulation with exact-differential simulation (2015)
Presentation / Conference Contribution
Hanai, M., Suzumura, T., Theodoropoulos, G., Perumalla, K., & Yilmaz, L. (2015, December). Towards large-scale what-if traffic simulation with exact-differential simulation. Presented at 2015 Winter Simulation Conference, WSC '15, Huntington Beach, California

To analyze and predict a behavior of large-scale traffics with what-if simulation, it needs to repeat many times with various patterns of what-if scenarios. In this paper, we propose new techniques to efficiently repeat what-if simulation tasks with... Read More about Towards large-scale what-if traffic simulation with exact-differential simulation.

Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain (2015)
Presentation / Conference Contribution
Bonner, S., McGough, S., Kureshi, I., Brennan, J., Theodoropoulos, G., Moss, L., Corsar, D., & Antoniou, G. (2023, October). Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain. Presented at IEEE International Conference on Big Data, Santa Clara

Recent technological advances in modern healthcare have lead to the ability to collect a vast wealth of patient monitoring data. This data can be utilised for patient diagnosis but it also holds the potential for use within medical research. However,... Read More about Data Quality Assessment and Anomaly Detection Via Map / Reduce and Linked Data: A Case Study in the Medical Domain.

Fast Compression of Large Semantic Web Data using X10 (2015)
Journal Article
Cheng, L., Avinash, M., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2016). Fast Compression of Large Semantic Web Data using X10. IEEE Transactions on Parallel and Distributed Systems, 27(9), 2603-2617. https://doi.org/10.1109/tpds.2015.2496579

The Semantic Web comprises enormous volumes of semi-structured data elements. For interoperability, these elements are represented by long strings. Such representations are not efficient for the purposes of applications that perform computations over... Read More about Fast Compression of Large Semantic Web Data using X10.

Towards an Info-Symbiotic Decision Support System for Disaster Risk Management (2015)
Presentation / Conference Contribution
Kureshi, I., Theodoropoulos, G., Mangina, E., O'Hare, G., & Roche, J. (2015, October). Towards an Info-Symbiotic Decision Support System for Disaster Risk Management. Presented at 2015 IEEE/ACM 19th International Symposium on Distributed Simulation and Real Time Applications (DS-RT), Chengdu, China

This paper outlines a framework for an info-symbiotic modelling system using cyber-physical sensors to assist in decision-making. Using a dynamic data-driven simulation approach, this system can help with the identification of target areas and resour... Read More about Towards an Info-Symbiotic Decision Support System for Disaster Risk Management.

Exact-Differential Large-Scale Traffic Simulation (2015)
Presentation / Conference Contribution
Hanai, M., Suzumura, T., Theodoropoulos, G., & Perumalla, K. (2015, June). Exact-Differential Large-Scale Traffic Simulation. Presented at 3rd ACM SIGSIM Conference on Principles of Advanced Discrete Simulation - SIGSIM-PADS '15, London, United Kingdom

Analyzing large-scale traffics by simulation needs repeating execution many times with various patterns of scenarios or parameters. Such repeating execution brings about big redundancy because the change from a prior scenario to a later scenario is v... Read More about Exact-Differential Large-Scale Traffic Simulation.

High throughput indexing for large-scale semantic web data (2015)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2015, April). High throughput indexing for large-scale semantic web data. Presented at 30th Annual ACM Symposium on Applied Computing - SAC '15, Salamanca, Spain

Distributed RDF data management systems become increasingly important with the growth of the Semantic Web. Currently, several such systems have been proposed, however, their indexing methods meet performance bottlenecks either on data loading or quer... Read More about High throughput indexing for large-scale semantic web data.

Design and evaluation of parallel hashing over large-scale data (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, December). Design and evaluation of parallel hashing over large-scale data. Presented at 2014 21st International Conference on High Performance Computing (HiPC), Velha Goa, India

High-performance analytical data processing systems often run on servers with large amounts of memory. A common data structure used in such environment is the hash tables. This paper focuses on investigating efficient parallel hash algorithms for pro... Read More about Design and evaluation of parallel hashing over large-scale data.

Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, November). Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems. Presented at 23rd ACM International Conference on Conference on Information and Knowledge Management - CIKM '14, Shanghai, China

The performance of joins in parallel database management systems is critical for data intensive operations such as querying. Since data skew is common in many applications, poorly engineered join operations result in load imbalance and performance bo... Read More about Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems.

A two-tier index architecture for fast processing large RDF data over distributed memory (2014)
Presentation / Conference Contribution
Cheng, L., Kotoulas, S., Ward, T., & Theodoropoulos, G. (2014, September). A two-tier index architecture for fast processing large RDF data over distributed memory. Presented at 25th ACM conference on Hypertext and social media - HT '14, Santiago, Chile

We propose an efficient method for fast processing large RDF data over distributed memory. Our approach adopts a two-tier index architecture on each computation node: (1) a light-weight primary index, to keep loading times low, and (2) a dynamic, mul... Read More about A two-tier index architecture for fast processing large RDF data over distributed memory.