Learning through Research
Research Experiences for our Undergraduate Students
IS470: Guided Research in Information Systems (Technology Area or Management Area)

The IS470: Guided Research in Information Systems module aims to introduce students to academic research in Information Systems. It allows students to experience first hand the challenges and exhilaration of research, discovery and innovation, and enriches their academic experience by working at/near the frontiers of research in IS technology or management.

Below are examples of research projects done by our students who completed IS470.

To find out more information, please visit IS470 Depth Elective-Technology Area or IS470 Depth Elective-Management Area.

Year .2011/2012
Object Persistence for Big Data
Speaker
THIA Kai Xin
BSc (IS Management), 2008 intake
2nd Major in Advanced Business Technology
(Business Intelligence & Analytics Track)
SIS Faculty Reviewers C Jason WOODARD (supervisor), Assistant Professor of Information Systems (Research)
DING Xuhua, Associate Professor of Information Systems (Research)
Kevin STEPPE, Lecturer of Information Systems
Abstract Object-relational mapping technologies make it easy to store and retrieve complex objects from databases but tend to be poorly adapted for data-intensive analytical applications. As a result, many researchers and small firms still prefer to analyse "big data" using flat text files -- a comparatively ancient and crude approach. Using Kuala, which is a simulation framework designed by Professor Jason, I generated scenarios of varying complexity and data size in order to stress test the “big data” technologies against traditional solutions. I found out that while it is possible to replace traditional databases like Derby with Greenplum Database Community Edition (GPDB) for day-to-day operation, it is neither really feasible nor recommended. Derby simply outperforms GPDB in object-relational mapping operations by huge margins. We also needed to make changes to the core Kuala code to gain reasonable performance from GPDB, thus undermining GPDB's generality in tool implementation. If the user is concerned about performance of modest sized object-relational mapping operations, traditional databases like Derby are still a better choice than GPDB. The purpose of “big data” technologies like GPDB is to support the traditional databases like Derby for large-scale analytics, not replace Derby for small to modest scale object-relational mapping operations.
Presentation File
Download here.
Final Report Download here.

Exploration of Disk I/O and ARM architecture
Speaker WANG Yaoyao
International Exchange Student, 2011 intake
SIS Faculty Reviewers DING Xuhua (supervisor), Associate Professor of Information Systems (Research)
C Jason WOODARD, Assistant Professor of Information Systems (Research)
Kevin STEPPE, Lecturer of Information Systems (Practice)
Abstract My work includes two parts: exploration of Linux disk I/O mechanism and study of ARM architecture. In the first part, a virtual file system is the interface between the user programme and the kernel, which allows same commands to behave equally in different file systems. In the generic block layer, it handles the requests for all block devices. An I/O scheduler merges or re-orders the requests from the generic block layer to increase the throughput. Finally, a block device driver converts these requests to commands that are understood by the disk controller. In the second part, arm supports multiple ISA, where each kind of instruction can be executed in the same program. Arm memory management unit plays an important part in the memory. MMU translates virtual addresses into physical addresses, controls memory access permission, determines the individual behavior of the cache and writes buffer for each page in memory. Exception handlers are responsible for handling errors, interrupts, and other events generated by the external system.
Presentation File
Download here.
Final Report Download here.

H.264/SVC Security on Android Smart Phone
Speaker ZHAO Yifan
International Exchange Student, 2011 intake
SIS Faculty Reviewers Robert H DENG (supervisor), Professor of Information Systems (Research)
C Jason WOODARD, Assistant Professor of Information Systems (Research)
ZHU Feida, Assistant Professor of Information Systems (Research)
Abstract

In this course, I studied techniques for encrypting and authenticating H.264/Scalable Video Coding (SVC) streams. I customised the H.264/SVC Codec Opensvc, JSVM and RTSP (Real Time Streaming Protocol) software Live555 and implemented them in the Android platform. Then I tested the performance of the EStream fast stream encryption algorithms (Salsa20/12, HC-128/256, Rabbit, SOSEMANUK) and compared them with RC4 and AES in order to determine the best encryption algorithms for SVC encryption in mobile devices. I also implemented two SVC stream authentication schemes and studied their performance through computer simulation, where I used the Gilbert Channel Model to simulate packet loss statistics in wireless networks.

Presentation File
Download here.
Final Report Download here.

Identifying Linux Bug Fixing Patches
Speaker TIAN Yuan
International Exchange Student, 2011 intake
SIS Faculty Reviewers David LO (supervisor), Assistant Professor of Information Systems (Research)
Robert H DENG, Professor of Information Systems (Research)
ZHU Feida, Assistant Professor of Information Systems (Research)
Abstract

There is a continuing tension between the need to develop and test new features, and the need to provide a stable and secure execution environment. A compromise, adopted by the developers of the Linux kernel, is to release new versions, including bug fixes and new features, frequently, while maintaining some older “longterm” version. This strategy raises the problem of how to identify bug fixing patches that are submitted to the current version but should be applied to the long-term versions as well. The current approach is to rely on the individual subsystem maintainers to forward patches that seem relevant to the maintainers of the longterm kernel. The reactivity and diligence of the maintainers, however, varies, and thus many important patches could be missed by this approach. In this project, we propose an approach that automatically identifies bug fixing patches based on the changes and commit messages recorded in code repositories. We compare our approach with the keyword-based approach for identifying bug fixing patches used in the literature. We have applied our approach to patches from Linux. The results show that our approach can achieve a much higher recall (45.11% relative improvement in recall) as compared to keyword-based approaches, with similar precision.

Presentation File
Download here.
Final Report Download here.

 

Data mining on social network
Speaker LI Cheng
International Exchange Student, 2011 intake
SIS Faculty Reviewers ZHU Feida (supervisor), Assistant Professor of Information Systems (Research)
Robert H DENG, Professor of Information Systems (Research)
David LO, Assistant Professor of Information Systems (Research)
Abstract

My work in this semester is conducted around the theme of Twitter. Firstly, I surveyed the Twitter studies mainly from the event and the individual perspective. Considering the large volume of literature, while I have collected more than one hundred papers and sorted them into different categories according to the problems they addressed and methods used, there are still more works need to be included. I'm also responsible for the building of the frontend for the demo system of the Twitter users' real-life friend identification algorithm, the website of which is http://twitterbud2011.appspot.com/. Then I helped in strengthening the identification algorithm by doing experiments as well as finding social science work supporting the principles the algorithm based upon. After submission of the paper illustrating this algorithm to www2012 conference, I'm currently engaged in the design of the demo paper which is also expected to be submitted to www2012 conference. This demo paper is based on the aforementioned website, on which we'll do some improvements to make it more interesting for users to play with and, at the meantime, a platform to exemplify the concepts proposed in the related paper.

Presentation File
Download here.
Final Report Download here.

 

Mining Top K longest sequential patterns in large dataset
Speaker WU Ziyang
International Exchange Student, 2011 intake
SIS Faculty Reviewers ZHU Feida (supervisor), Assistant Professor of Information Systems (Research)
David LO, Assistant Professor of Information Systems (Research)
DING Xuhua, Associate Professor of Information Systems (Research)
Abstract

Sequential pattern mining is an important data mining problem with broad application. Long patterns reflect important nature of the large sequence database and they are more informative in characterising large sequential data. However, with regard to long patterns in large datasets like genome-scale datasets, some elegant algorithms like PrefixSpan would still take a great amount of time. In our project, we try to develop a new algorithm that can efficiently mine the top k longest sequential patterns. First, by using suffix trees, we can derive all the frequent common substrings. Then several probing strategies, for example, random probing and evenly probing, are proposed to generate the initial candidate set. Pairwise comparison is iteratively performed to merge the segments in the candidate set. In each iteration, the segments that can be merged would be merged to a new pattern and put in the candidate set for the next iteration, while those that can’t be merged would be abandoned unless they meet up some threshold, like the length of the pattern or the number of the segments in the pattern. Moreover, several heuristic methods are used to speed up the mining process.

Presentation File
Download here.

 

Year .2010/2011
Visualising Geographic and Temporal Patterns in Twitter
Speaker
Subeesh S/O BASHEER
BSc (IS Management), 2008 intake
2nd Major in Advanced Business Technology
SIS Faculty Reviewers KAM Tin Seong (supervisor), Associate Professor of Information Systems (Practice)
LIM Ee-Peng, Professor of Information Systems (Research)
KOH Lian Chee, Senior Instructor

Project Description

This project investigates collection and analysis of large volumes of social network data. It uses Twitter as an example, and explores various methods to discover temporal and geographic patterns of messages posted to Twitter.
Abstract The growth of social networking applications such as Twitter and Foursquare, combined with the extensive adoption of smartphones, provide a way for users to post their thoughts and messages, acting as a channel for communication and the expression of ideas. Messages posted on these platforms are typically tagged with geographic information, allowing associations between geographic regions and the topics posted from each region. Using Twitter as a source of data, this research explores various methods by which the topics and ideas posted by users in tweets can be analysed and visualised, to determine temporal and geographic patterns. Algorithms were developed to perform the extraction and compilation of comprehensive spatial, chronological and topic-based information from a dataset of tweets. Geospatial information visualisation approaches were then adapted to create a visualisation that conveys this information in a manner that is easy to manipulate and interpret. These were combined to develop a set of analysis and visualisation techniques that can be applied to social networks, allowing analysts to discover and understand the underlying patterns present in such social network data. Additionally, this research will also examine the suitability, benefits, and shortcomings of using platform-as-a-service solutions, such as Google App Engine, to perform data collection and analysis of large volumes of social network data.
Presentation File
Download here.

Does Being Healthy Really Mean Falling Sick Less Often? - Using Visual Analytics to Explore Corporate Health Levels and Expenditure
Speaker
Jonathon KHOO Kay Chin
BSc (IS Management), 2008 intake
2nd Major in Advanced Business Technology
(Business Intelligence & Analytics Track)
SIS Faculty Reviewers KAM Tin Seong (supervisor), Associate Professor of Information Systems (Practice)
ZHU Feida, Assistant Professor of Information Systems (Research)
KOH Lian Chee, Senior Instructor

Project Description

More and more employers have been putting lots of efforts into improving the health of their employees, hoping that this will make staff happier and better performing. This project studies the relationship between the frequency of employee's falling sick and the size of medical bills burned by the employer, via analysing real medical screening data using both visual and statistical analysis tools.
Abstract Does how healthy you are determine how often you fall sick and how much you will cost your company in medical claims and lost work hours? More and more employers have been putting in concerted efforts into improving the health of their employees, hoping that this will translate into happier and better performing staff. This is none more important than for front line employees in the healthcare sector, many of whom are exposed to many germs and viruses every day. Using medical screening data from a major Singaporean hospital and a combination of visual and statistical analysis tools in JMP, we look into whether there is any real relationship between how healthy someone is, how often they fall sick, and how much they cost the company in medical bills. We look at both a person's overall health, determined by the algorithm, and many individual health indicators to show that there is more to this relationship than meets the eye.
Presentation File
Download here.

Scope-Shapes: Place and Text in Twitter
Speaker
Josuah NAIR
BSc (IS Management), 2007 intake
Double Degree in Bachelor of Business Management
2nd Major in Finance & Sociology
SIS Faculty Reviewers KAM Tin Seong (supervisor), Associate Professor of Information Systems (Practice)
LIM Ee-Peng, Professor of Information Systems (Research)
KOH Lian Chee, Senior Instructor

Project Description

This project designs and developments a cloud-based information visualisation application which allows users to uncover new patterns between text and location within users' statements on Twitter.
Abstract

Twitter and other social network sites have formed a rich source of data over the last few years. These sites have recently added location based data in a more comprehensive manner, providing another opportunity to gain insights from place perspective. In addition, research on social media to date tends to focus on the social aspects of social network sites, neglecting the rich semantic data which can be analysed through ‘status updates'. This research focuses on the aspects of location and text within users' statements on Twitter, to provide a means to uncover new patterns for social science researchers to investigate, by allowing exploration of previously unexplored aspects of Twitter using new data visualisation techniques.

This is done through the design and development of a cloud-based information visualisation application. Two new information visualisation approaches are employed in the design of the application. First, the geographic substrate, where text and time based data in the form of a graph can be visualised off a map, and vice-versa. Second, the re-applying of C.S Pierce's idea of a scope to manage the different types of relationships presented in the data through enclosing graphs and sub-graphs of certain relationship types into an oval. These two approaches form a flexible information visualisation approach, based on a ProtoVis web toolkit interactively accessing data hosted on Google App Engine.

Social Coordination for Dynamic Flow Distribution in a Massive Multi-Agent Environment:
Maximising Visitor Self-Interest by Utilising Combinatorial Auctions to Allocate a Socially Optimised Queue Priority in a Theme Park
Speaker
Danny TAN Wei Yang
BSc (IS Management), 2007 intake
2nd Major in Operations Management
SIS Faculty Reviewers LAU Hoong Chuin, Associate Professor of Information Systems (Research)

Project Description

How it Began

  • Began as a LARC initiative
  • Project involving Resorts World Singapore
  • Cooperate with Universal Studios as a platform
  • Site visits & observations
  • Interviews with Stakeholders

Which led me in thinking of how we can…

  • Formalise the problems identified that are inherent to Universal Studios.
  • Develop a project that can expand the use computational social science.
  • Find a novel solution to use behavioral information observed and collected in order to solve the problem.
Abstract In this paper, an innovative valuation mechanism is proposed to award theme park visitors with priority pass to an attraction. Using bidding theories and auction mechanisms, we engage system agents to determine optimal next-plan-of-action to incentivise and maximise self-interest for visitors. An archetypal scenario often relies on either letting users pay for priority, or providing one-dimensional information such as wait times of different queues to empower visitors with state-of-the-world knowledge, and letting them adapt their next plan of action accordingly. However, such practices are not socially optimal and often do not suffice the demands given the varying subsets of preferences that visitors might have. As such, visitors often fail to maximise their time at the park. The author explains the concept of using visitor preference, experience and social environment as valuation to a combinatorial auction bid and award priority passes to the winning visitors (visitors that desires a ride most), thus effectively dynamically planning visitor’s activities efficiently, rationally, and eventually maximising visitor’s time at the park. To assess effectiveness of the concept, the problem is first defined, solution explained, evaluated and the results examined.
Final Report Download here.

Dynamic Patient Dispatch Policies for Improving Patient Flows in a Hospital A&E Department
Speaker
WANG Chao
BSc (IS Management), 2007 intake
2nd Major in Advanced Business Technology
SIS Faculty Reviewers LAU Hoong Chuin (supervisor), Associate Professor of Information Systems (Research)
MA Nang Laik, Assistant Professor of Information Systems (Practice)
RAMASUBBU Narayan, Assistant Professor of Information Systems (Research)

Project Description

In this project, we consider the queuing process in a hospital's A&E department and propose dynamic dispatch policies that seek to reduce the average waiting time for all patients. We show, by discrete-event simulation, the performance gain of these policies against the standard first-come-first-served (FCFS) policy.
Abstract In this project, we consider the queuing process in a hospital's A&E department, which is modeled as an M/M/s queue with reentrant customers. Patients arrive stochastically to be consulted by one of the doctors (servers), who will diagnose the patient and (if necessary) book lab tests for the patient. After taking the lab test(s), that patient will rejoin the queue for the same doctor to review the lab results. We study the current process of a local hospital, which serves patients on FCFS policy. The objective is to model the behaviour of this queuing system and find dispatch policies that seek to reduce the average waiting time for all patients.
Presentation File
Download here.

Year .2009/2010
Distributed Data Analysis Using Map-Reduce
Speaker
Shahfik Bin AMASHA
BSc (IS Management), 2006 intake
2nd Major in Advanced Business Technology (Service Systems & Solutions Track)
SIS Faculty Reviewers C Jason WOODARD (supervisor), Assistant Professor of Information Systems (Research)
Abstract Hadoop clusters are relatively easy to set up and manage, making them an invaluable tool for researchers to quickly get up and running on crunching large datasets. The Hadoop benchmarks show an a linear trend for processing different size of data for the same number of nodes and a leveling off for throughput performance. This is useful to know the optimum number of nodes to provision in a cluster. Finally, there is a huge potential for further the project into interactive use cases and real-time data analytics
Final Report Download here.

C.H.I.M.E.S Approach to Designing Emergence in Games
Speaker
Don BAEY Chwee Leng
BSc (IS Management), 2006 intake
2nd Major in Advanced Business Technology (Service Systems & Solutions Track)
SIS Faculty Reviewers Ori SASSON (supervisor), Former Assistant Professor of Information Systems
Abstract “A good game is a series of interesting choices." – Sid Meier, Game Designer

Many games today offer highly immersive gameplay as well as strong re-playability. Often enough, these games are designed based on sets of simple game rules which the players interact with or even cross-interact with other rules. Having such form of dynamic interaction can result in emergent gameplay. By analysing the emergence characteristics of various game examples, this paper aims to appreciate the effects and challenges of designing emergence in games propose an approach towards designing emergence.

Final Report Download here.

Recommending People in Developers' Social Network
Speaker LIU Nian
International Exchange Student, 2009 intake
SIS Faculty Reviewers David LO (supervisor), Assistant Professor of Information Systems
Abstract Many open-source projects histories stored in software repositories provide the opportunities to extract important information to help software developers in performing their tasks. If given a platform to let developers know each other and cooperate, it can lead to more and more software tools being developed. Also, it is likely that more interesting or large scale tools are developed as more and more collaboration between developers are made.

The purpose of this project is to do one or more of the following:

  1. Collecting information on these data sources
  2. Visualising this information
  3. Finding nice patterns on the social phenomenon behind this “developers” social network
  4. Developing an automated data mining approach to recommending people on developer's social network
In this IS470 project, we finish task 1 and task 4, and will work on task 2 in future work.

Final Report Download here.

Clustering Software Components for Effective Reuse
Speaker WANG Shaowei
International Exchange Student, 2009 intake
SIS Faculty Reviewers David LO (supervisor), Assistant Professor of Information Systems (Research)
Abstract

There is a wealth of software data available from years of software development efforts. These data are publicly available via systems like Sourceforge, Bugzilla, etc. The data revolves around the various activities that developers typically do in a software development effort, such as adding new code, modifying existing code, participating in multiple projects, forming connections with peers in the same project, creating new projects, fixing bugs, handling feature requests, handling bug reports, etc.

Plug-ins are gaining popularity because of the convenience they offer, and the size of plug-ins increases over time as changes are made by developers. If we can find out the specifications that a plug-in follows or group similar plug-ins together automatically, this will help developers save a great amount of time and energy when he/she takes on a legacy system, especially systems developed without any formal documentation.

We propose an effective approach of clustering plug-ins into several groups according to their similarity and doing some specification mining to get some common functions about each of the clustered plug-ins. In this way, developers can pick out plug-ins more easily, because one just needs to read the document about every cluster to get the main point of these plug-ins.

In this project, we take eclipse plug-ins as a case study, because eclipse plug-ins are freely available, are frequently updated, have been developed over a long period of time (over 10 years), and come with various functionalities.
Final Report Download here.

Exploring and Analysing Space-Time-Attribute Datasets:
A Rich Internet Application (RIA) based Geo-Visual Analytics Approach
Speaker
Eunice YEO Zhi Shu
BSc (IS Management), 2006 intake
2nd Major in 2nd Major in Advanced Business Technology (Business Intelligence & Analytics Track)
SIS Faculty Reviewers KAM Tin Seong (supervisor), Associate Professor of Information Systems (Practice)
Abstract

Current GIS tools are no longer adequate to support an efficient analysis of space-time-attribute datasets. Hence we have developed an interactive analytical application that has adopted various geo-visualisation techniques, such as heat maps, coordinated linked view and brushing. At the end of the paper, we will use this application to investigate the time-series patterns of taxi trips at different locations.

Final Report Download here.

A Survey of Contemporary Virtualisation on x86 Platform and Security Status
Speaker
ZHAO Siqi
BSc (IS Management), 2006 intake
2nd Major in Advanced Business Technology
Abstract

This report is aimed at understanding the big picture of virtualisation on the x86 platform. Classification of virtualisation is presented first, and then the architecture, then more details of the implementations are summarised. For a long time, virtualisation had remained a privilege for mainframes. Only recently, when x86 based microcomputers gradually took over the world of business computing, did people realise the need for virtualisation on x86 based machines. However, the x86 micro architecture had never been designed with support for virtualisation in mind. Virtualising an x86 box had proven to be intrinsically difficult, but not impossible. In the effort to bring virtualisation to x86 platforms, various techniques have been discovered and developed. Each approach exhibits varying aspects of advantages and disadvantages.

Final Report Download here.
Year .2008/2009
Optimising selling schedule in the TAC-SCM game
Speaker
NGUYEN Thi Duong
BSc (IS Management), 2005 intake
SIS Faculty Reviewers CHENG Shih-Fen (supervisor), Assistant Professor of Information Systems (Research)
LAU Hoong Chuin, Associate Professor of Information Systems (Research)
C Jason WOODARD, Assistant Professor of Information Systems (Research)
Jing JIANG, Assistant Professor of Information Systems (Research)
Abstract Because computer prices in the Trading Agent Competition-Supply Chain Management (TAC-SCM) game is usually volatile, it's important to make a good decision on the amount of computers to sell over a period of time. In this paper, we introduce an optimisation model to determine a selling schedule which would maximise final revenue. This model can be used both offline as another benchmark of agent's sales performance and in game as part of the decision making process. Using the result of our model as a benchmark, we found a positive correlation between optimising agent's selling schedule and its overall performance in the game, suggesting the potential value of this approach.
Final Report Download here.
Year .2007/2008
Authentiplus: A new method of identification
Speaker
CHIA Jiawei Alvin
BSc (IS Management), 2004 intake
Double Degree in Bachelor of Business Management
SIS Faculty Reviewers Debin GAO, Assistant Professor of Information Systems (Research)

Abstract

There are three generally accepted techniques for user authentication: (1) Knowledge-based systems,(2) token-based systems, and (3) systems based on biometries (i.e. fingerprints, irises). However, in this study I have chosen to apply the assumption that the authenticating:
  • user is "naked" (Without access cards, identification cards, phones, keys, etc).
  • system's hardware is basic (No eye-tracking, thumbprint recognition or other similar optical mechanisms).
The reason for this is simple. Secure and functional authentication systems have already been in development (Li & Shum, 2002) and often, it is difficult to balance security levels with costs (installation of scanning or biometric devices) and convenience (the availabiiity of these devices at any location). Users do not desire to be burdened with tokens like access cards and companies may not necessarily have the financial means to implement those token-based systems, let alone biometric ones. As such, I will only consider investigating new authentication methods of knowledge-based systems in this report in the hopes of creating a realistic innovation.

The objective of this research is to analyse some of the existing textual and non-textual* knowledge-based authentication schemes, to identify their strengths and weaknesses and propose a new design which has the following competing requirements:

  1. user only needs to remember one password,
  2. each time the user tries to login, he will be given a different challenge and a different answer will be sent to the server.

The level of complexity and usability of the system will also be taken into account.

*Non-Textual passwords may consist of mediums like biometrics, OTP electronic cards, physical keys, etc
Final Report Download here.

Social Network and Data Analysis of the CCMixter community
Speaker
Ankit GUGLANI
BSc (IS Management), 2005 intake
SIS Faculty Reviewers CHELIOTIS Giorgos (supervisor), Visiting Associate Professor of Information Systems
LAU Hoong Chuin (coordinator), Associate Professor of Information Systems (Research)
Abstract Creative Commons, or CC as it is called, provides free licenses to copyright holders to enable them to express exactly which rights they wish to waive and which restriction they wish to impose on their creative digital works. A myriad of licenses are offered by CC which provide for a different combination of freedoms and restrictions. Each license can be viewed as a combination of certain licensing attributes (each linked to a specific freedom or restriction).

We were interested to look at CC-Mixter as a community since it provided a unique opportunity to study human interactions through remixing of work and through the forums. We are also interested to know how communities based on collaboration evolve.

Since CC-Mixter had both, the element of being a community based and collaboration and reuse and it fit well with the rest of the research in that it used creative commons licenses.

Initially another cause of interest in this rich dataset was a chance of observing license choices of users for the first time; and their impact on the further chains of reuse. This was later not found feasible as it appears that the community assigns the license based on the sources / samples rather than the user making an active choice.

Presentation File
Download here.
Final Report Download here.

Complex Network Metrics and Software Evolvability
Speaker
LAI Siu Chun
BSc (IS Management), 2006 intake
SIS Faculty Reviewers C Jason WOODARD (supervisor), Assistant Professor of Information Systems (Research)
Abstract

More than other forms of engineering, software engineering requires evolvability on top of functionality. Being a non-functional and rather subjective requirement, efforts have been made to define and systematically measure it.

The research project aims to establish complex network metrics as a measure of software evolvability. It extends Myers' study on software systems and complex networks and MacCormack, Rusnak and Baldwin's empirical study on management decision and software design structures. We will study a modularisation phase of the Mozilla Application Suite and track the changes in several network properties like average path length, betweenness, and in-out distributions.

Final Report Download here.

Innovation in the ProgrammableWeb: Characterising the Mashup Ecosystem
Speaker YU Shuli
BSc (IS Management), 2004 intake
Double Degree in Bachelor of Social Science (Psychology)
SIS Faculty Reviewers C Jason WOODARD (supervisor), Assistant Professor of Information Systems (Research)
Abstract

This paper investigates the current structure and characteristics within the mashup and API ecosystem by analysing empirical data from the website ProgrammableWeb (www.programmableweb.com). Social network analysis and regression models were used to explore the statistical distributions, network topology and attribute relationships within the ecosystem. It was found that the ecosystem has grown slowly but steadily from 2005 to 2007 and that the API-affiliation network is likely small world and scale-free. The market share of APIs has become less concentrated, where highly popular APIs are growing at slower rate than less popular APIs. There also appears to be time and category influences on API popularity.

Final Report Download here.
Last updated on 10 May, 2012 by School of Information Systems.