Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks

0
1200
Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks

Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks

Abstract

Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks management report in data mining.The last few years have witnessed the emergence and evolution of a vibrant research stream on a large variety of online Social Media Network (SMN) platforms. Recognizing anonymous, yet identical users among multiple SMNs is still an intractable problem. Clearly, cross-platform exploration may help solve many problems in social computing in both theory and applications. Since public profiles can be duplicated and easily impersonated by users with different purposes, most current user identification resolutions, which mainly focus on text mining of users’ public profiles, are fragile. Some studies have attempted to match users based on the location and timing of user content as well as writing style.

However, the locations are sparse in the majority of SMNs, and writing style is difficult to discern from the short sentences of leading SMNs such as Sina Microblog and Twitter. Moreover, since online SMNs are quite symmetric, existing user identification schemes based on network structure are not effective. The real-world friend cycle is highly individual and virtually no two users share a congruent friend cycle. Therefore, it is more accurate to use a friendship structure to analyze cross-platform SMNs. Since identical users tend to set up partial similar friendship structures in different SMNs, we proposed the Friend Relationship-Based User Identification (FRUI) algorithm. FRUI calculates a match degree for all candidate User Matched Pairs (UMPs), and only UMPs with top ranks are considered as identical users. We also developed two propositions to improve the efficiency of the algorithm. Results of extensive experiments demonstrate that FRUI performs much better than current network structure-based algorithms.

INTRODUCTION

In the most recent decade, many sorts of social networking sites have emerged and contributed immensely to huge volumes of true data on social practices. Twitter 1, the most critical smaller scale blog benefit, has more than 600 million users and delivers upwards of 340 million tweets for every day. Sina Microblog2, the primary Twitter-style Chinese miniaturized scale blog site, has a ton of than500 million records and creates run out a hundred million tweets for every day. Because of this decent variety of on-line online networking systems (SMNs), people tend to utilize entirely unexpected SMNs for different capacities. for instance, Ren 3, a Facebook-style however antonymous SMN, is utilized in China for sites, though Sina Micro blog is utilized to share statuses.

In various words, each existent SMN fulfills some user needs. Regarding SMN administration, coordinating mysterious users crosswise over various SMN stages will give incorporated subtle elements on each users and educate relating laws, such as focusing on administrations arrangements. In principle, the cross-stage investigations empower a bird’s-eye perused of SMN user practices. In any case, about all current SMN-construct ponders center with respect to one SMN stage, yielding fragmented data. Subsequently, this examination researches the procedure of intersection various SMN stages to shading a complete picture of those practices. Regardless, cross-stage investigation faces different difficulties. As appeared, with the extension of SMN stages on the net, the cross-stage approach has coordinated various SMN stages to frame wealthier data and a great deal of finish SMNs for social registering assignments. SMN users sort the common scaffolds for these SMN stages. the main theme for cross-stage SMN investigation is users recognizable proof for different SMNs. Investigation of this subject establishes a framework for extra cross-stage SMN examination. Proposing a novel Friend Relationship-based User Identification (FRUI)algorithm.

In our analysis of cross-platform SMNs, we deeply mined friend relationships and network structures. In the real world, people tend to have mostly the same friends in different SMNs, or the friend cycle is highly individual. The more matches in two un-mapped users’ known friends, the higher the probability that they belong to the same individual in the real world. Based on this fact, we proposed the FRUI algorithm. Since FRUI employs a unified friend relationship, it is apt to identify users from a heterogeneous network structure. Unlike existing algorithms, FRUI chooses candidate matching pairs from currently known identical users rather than unmapped ones. This operation reduces computational complexity, since only a very small portion of unmapped users are involved in each iteration.

System Configuration:

H/W System Configuration:-

Processor          : Pentium IV

Speed               : 1 Ghz

RAM                  : 512 MB (min)

Hard Disk          : 20GB

Keyboard           : Standard Keyboard

Mouse               : Two or Three Button Mouse

Monitor             : LCD/LED Monitor

S/W System Configuration:-

Operating System               : Windows XP/7

Programming Language       : Java/J2EE

Software Version                 : JDK 1.7 or above

Database                            : MYSQL

 

Cross-Plat Form:

cross-platform software (multi-platform, or platform independent software) is computer software that is implemented on multiple computing plat formsCross-platform software may be divided into two types; one requires individual building or compilation for each platform that it supports, and the other one can be directly run on any platform without special preparation, e.g., software written in an interpreted language or pre-compiled portable byte ode for which the interpreters or run-time packages are common or standard components of all platforms.

EXISTING SYSTEMS:-

Existing algorithms FRUI chooses candidate matching pairs from currently known identical users rather than unmapped ones. This operation reduces computational complexity, since only a very small portion of unmapped users are involved in each iteration. Moreover, since only mapped users are exploited, our solution is scalable and can be easily extended to online user identification applications. In contrast with current algorithms FRUI requires no control parameters. The main question in the above scenario is the overlap of the users’ friends. To address this issue, we discuss the overlap of SMNs, including node and edge overlap, below. Node overlap. Many studies have verified that numerous users are overlapped in different SMNs. Nearly all cross-platform user identification studies mention node overlap, because it is the fundamental assumption to solve this issue. Early in 2007, 64% of Facebook users had MySpace accounts.

PROPOSED SYSTEMS:-

Proposing a novel Friend Relationship-based User Identification (FRUI) algorithm. In our analysis of cross platform SMNs, we deeply mined friend relationships and network structures. In the real world, people tend to have mostly the same friends in different SMNs, or the friend cycle is highly individual. The more matches in two unmapped users’ known friends, the higher the probability that they belong to the same individual in the real world. Based on this fact, we proposed the FRUI algorithm. A preprocessor is designed to acquire as many Priori UMPs as possible. Currently, there is no common approach available to obtain UMPs between two SMNs. Specified methods must be formulated according to given SMNs. Although no unified process is suitable for the Preprocessor, some algorithms can be adopted according to the application, e.g., email address, screen name, URL, etc. Edge overlap. Until very recently, no statistical studies quantified relationship overlap in two SMNs. However, some studies noted that these relationships overlap to a certain extent. NS which identifies users purely through networks in ground-truth datasets, proved that users have similar relationships in Twitter and Flickr. Paridhi also found that users tend to connect with a segment of the same people across SMNs, and introduced network structure to improve the accuracy of user identification between Twitter and Facebook.

ADVANTAGES:-

Advances in SMN services, more SMNs allow users to bind their accounts with other major SMNs. In this case, priori knowledge can be obtained with bound information. For example, PaPa and ChangBa, two major mobile applications (apps) in China, encourage users to link their Sina Microblog accounts for commercial interests, bridging their websites with the largest microblog service in China. Twitter provides an attribute, called a URL, for user self-identification. Preprocessors can directly use URLs to match a Twitter account to Facebook or other SMN accounts. When no extra information except the network structure can be employed, the seed identification approach in NS and the de-anonymization attacks in are alternatives for the Preprocessor.

IMPLEMENTATION

Implementation is the stage of the project when the theoretical design is turned out into a working system. Thus it can be considered to be the most critical stage in achieving a successful new system and in giving the user, confidence that the new system will work and be effective. The implementation stage involves careful planning, investigation of the existing system and it’s constraints on implementation, designing of methods to achieve changeover and evaluation of changeover methods.

Modules:

In this project we have following four modules.

  • Cross-PlatForm In SMN’s
  • Anonymous Identical User
  • Friends And Relation

Cross-PlatForm In SMN’s:-

SMN connections fall into two categories: single-following connections and mutual-following connections. Single following connections are also called following relationships or following links. If user A follows user B, then user A and user B have a following relationship (single-way fans in which one knows the other, but not vice versa). Following relationships are common in micro blogging SMNs, such as Twitter and Sina Micro blog. Likewise, mutual-following connections are called friend relationships. In microblogging SMNs, a friend relationship refers to the mutual following relationships between two users. In our analysis of crossplatform SMNs, we deeply mined friend relationships and network structures. In the real world, people tend to have mostly the same friends in different SMNs, or the friend cycle is highly individual. The more matches in two unmapped users’ known friends, the higher the probability that they belong to the same individual in the real world. Based on this fact, we proposed the FRUI algorithm.

Anonymous Identical User:-

Anonymous is a loosely associated international network of activist and hacktivist entities. A website nominally associated with the group describes it as “an Internet gathering” with “a very loose and decentralized command structure that operates on ideas rather than directives”. The group became known for a series of wellpublicized publicity stunts and distributed denial-of-service attacks on government, religious, and corporate websites. Although no unified process is suitable for the Preprocessor, some algorithms can be adopted according to the application, e.g., email address, screen name, URL, etc. An email address appears to be a unique feature for each account, and can be used to collect Priori UMPs. Node overlap. Many studies have verified that numerous users are overlapped in different SMNs. Nearly all cross-platform user identification studies mention node overlap, because it is the fundamental assumption to solve this issue. The identifier finds UMPs using connections among users and Priori UMPs. As noted above, a match degree for each candidate UMP should be calculated in advance. NS formulates the match degree using in- and out-degrees in directed networks.

Friends and Relation:-

The friend relationship requires confirmation by the two users, and is much more reliable and consistent in SMNs. Thus, it can reduce the noise introduced by a discretionary single-following relationship. Making use of the friend relationship in undirected networks, JLA defines the match degree as, For any two SMNs, SMNA and SMNB can be considered as mirrors of the real world. Suppose that people set up random friendships in the real world; then the probability of a friendship between any two persons is p (0 < p < 1), and for any friendship, sa (0 < sa < 1) and sb (0 < sb < 1) are probabilities that it exists in SMNA and SMNB, respectively. Therefore, the probabilities that a friendship exists in SMNA and SMNB are psa and psb, respectively. we use ground truth datasets to evaluate the user identification resolution. In order to verify FRUI in different types of SMNs, we collected data from two hetero generous SMNs: Sina Micro blog and RenRen. The Sina Micro blog dataset was captured from the Sina Micro blog search page, while the RenRen dataset was directly obtained from its Open API. As showing the Sina Micro blog dataset consisted of 1.17 million users and 1.9 million friend relationships, and each user hadan average of 3.2 friends. The RenRen dataset was comprised of 5.5million node sand 14.6 million edges, and each user had anaverage of 5.3friends. Therefore, the RenRen dataset was much denser than Sina Micro blog’s.

Conclusion 

Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks management report in data mining.study addressed the problem of user identification across SMN platforms and offered an innovative solution. As a key aspect of SMN, network structure is of paramount importance and helps resolve de-anonymization user identification tasks. Therefore, we proposed a uniform network structure-based user identification solution. We also developed a novel friend relationship-based algorithm called FRUI. To improve the efficiency of FRUI, we described two propositions and addressed the complexity. Finally, we verified our algorithm in both synthetic networks and ground-truth networks. Results of our empirical experiments reveal that network structure can accomplish important user identification work.

Our FRUI algorithm is simple, yet efficient, and performed much better than NS, the existing state-of-art network structure-based user identification solution. In scenarios when raw text data is sparse, incomplete, or hard to obtain due to privacy settings, FRUI is extremely suitable for cross-platform tasks. Moreover, our resolution can be easily applied to any SMNs with friendship networks, including Twitter, Facebook and Foursquare. It can also be extended to other studies in social computing with cross-platform problems such as targeted marketing, information retrieval, collaborative filtering, sentiment analysis  and more.

In addition, since only the Adjacent Users are involved in each iteration process, our method is scalable and can be easily applied to large datasets and online user identification applications. Identifying anonymous users across multiple SMNs is challenging work. Therefore, only a portion of identical users with different nicknames can be recognized with this method. This study built the foundation for further studies on this issue. Ultimately, it is our hope that a final approach can be developed to identify all identical users with different nicknames. Other user identification methods can be applied simultaneously to examine multiple SMN platforms. These methods are complementary and not mutually exclusive, since the final decision may rely on human user’s involvement. Therefore, we suggest using these methods synergistically and considering strengths and weaknesses for the best results.