WYA? (Where You At)
The acclaimed Netflix documentary The Social Dilemma unveiled hidden machinations behind social media and its control over billions of users. For our generation, the millennials and gen Z, social media forms a very important aspect of our lives. Almost all our experiences make it to Instagram as pictures, or on Twitter as texts, or Facebook Posts or even on LinkedIn. Tristan Harris, the primary subject of the documentary, has spent his career studying how today's major technology platforms have increasingly become the social fabric by which we live and think, giving these platforms dangerous power over our ability to make sense of the world.
Following in his footsteps of uncovering hidden security issues, our team (Ayushman Panda, Kancharla Aditya Hari, P. Sahithi Reddy, Pratishtha Abrol, Snehal Ranjan, and V.J.S. Pranavasri) came up with the idea of this project. Where You At attempts to gauge how a person's location information can be inferred from his posts online. Location forms a vital bridge connecting the virtual world to the real world, and has multitude ways to be misused, which makes it an important aspect to study, keeping in mind the privacy and safety of the users of social media.
For the first and foremost step of the project, it was important to know which, among the wide variety of social media platforms would be the most suitable for the scope of this project.
For this, we had to conduct a survey, the targeted age group being 15 to 30, according to which unsurprisingly, Instagram easily made it to the most influential and most used platform. In fact, on further studying about user engagement on Instagram, we found that close to 100 MILLION posts are made per day on Instagram. This made Instagram the perfect choice for our project.
In another survey conducted by us, on the same participants, we asked them of their preferences about revealing the location of their posts, 43% responded with aversion, close to half of that percentage stating that they do it for privacy concerns. Although, the results of our survey did show that the participants were aware of some of the privacy issues, they were unaware that even a friend in their network, or even a random post with a similar background could be used to identify their location. This reinforced the idea of this project.
Implementation
Deep Learning Models formed were used to analyze images and extract features from user's social media posts. These features would later be used to create a comparison with the posts made by the people in the victim's social network. We limited our search to only the people in the network of the target user to reduce computational load on our devices. This limit is easy to expand, by creating a larger image dataset, or even using location tagged images from Google, but would require infrastructure and time, and thus could not be included in the scope of this project. We encourage people to build upon this and carry out further research in the field.
Our model followed the following approach:
Scraper: The scraper forms the first step in the implementation of this project. It creates a dataset according to each image url input, scraping the target user's network for their friends and their posts. It outputs a file of images, metadata and a list of followees.
Filter: The scraped images from the user's network are passed through a filter, to filter out images with no location tags.
DELF model: DELF (DEep Local Features) is a TensorFlow model, useful for large-scale instance-level image recognition. IT detects and describes semantic local features which can be geometrically verified between images showing similar object instances. DELF forms the core of the whole model, algorithmically matching images with the same background data, to output the best matches and the probable location information.
Result
The picture above showcases an output example. The red dot represents the victim, green, the victim's friend network, blue represents the posts with location information, and finally, the gold ones for the posts with a high similarity score. It can easily be drawn from the result, an image of another friend with the same location as the victim.
Click here to see our project explained in a video.
Comments
Post a Comment