We are a team PhD students at Faculty of Technical Science, Novi Sad, Serbia where we work as teaching assistants. We have a strong background in relational databases, data mining and web development. We are cleaning, integrating and transforming data from various sources (websites, social media, databases, various text formats such as xlsx, csv, xml...) and building reliable web applications which are easy to use and contain various complex analysis based on advanced statistics and data mining which are easy to understand and are used to help you in making right choices in your business.
Cambridge English: Advanced (CAE) offers proof of the English skills that education institutions and employers seek for high-achieving study and work situations.
Architecture and Algorithms for Filtering Tweets Based on Chosen Countries and Cities
In this paper we present an algorithm for filtering Twitter data based on the tweet geographic location. Desired geographic locations are provided as a set of parameters, and different properties of a tweet are considered to determine the location. A user may also choose the number of threads and amount of memory used in filtering process. In this way, the user may fine-tune the algorithm performance. Filtered data are stored in the Hadoop distributed file system which runs on a 16 nodes cluster.