Affable

Java consumer which consumes influencer's information for storing, ranking, aggregation and time-series analytics.

Installation

To install and package -

$ mvn package

To run the consumer for Cassandra writing

$ java -cp affable-1.0-SNAPSHOT.jar com.affable.consumer.DBConsumer

To run the consumer for time-series analytics, ranking etc.

$ java -cp affable-1.0-SNAPSHOT.jar com.affable.consumer.AnalyticsConsumer

Prerequisites/Setup

Create a kafka topic 'influencers-analytics', in which the influencer's update after cassandra writing would be pushed.

$ bin/kafka-topics --create --zookeeper localhost:2181 --replication-factor 1 --partitions 10 --topic influencers-analytics

Create cassandra keyspace affable and table users

CREATE KEYSPACE affable WITH REPLICATION = { 'class' : 'NetworkTopologyStrategy', 'datacenter1' : 3 } AND DURABLE_WRITES = false;

CREATE TABLE users(userid int PRIMARY KEY, username varchar, followerCount varint, followingCount varint, isSuspicious boolean, time varint);

Install/configure InfluxDB and create database, retention policy

$ influx

> CREATE DATABASE affable_influencers

> CREATE RETENTION POLICY defaultPolicy ON affable_influencers DURATION 30d REPLICATION 1

Architecture

Please read the WIKI for this part of details

Benchmarks

Benchmarking on -

Macbook Air
8 GB 1600 MHz DDR3
1.6 GHz Intel Core i5

With single instance of each consumer -

Cassandra Writing: 800 writes/sec

Ranking and Influx: 700 writes/sec
With two instances of DBConsumer -

Cassandra Writing: 1500 writes/sec
Making cassandra writes as asynchronous, increased writing performance by 33 percent.

Scaling

Increasing consumer instances, increases computing capability of the consumers.

Ideally the number of topic partitions should be equal to number of consumer instances and both can scale pretty easily.

Redis

To scale out redis horizontally, we can have keys sharding by using hash of hash.

To further increase the speed of ranking/averaging through Redis, we can follow batching approach. With this way, we would only be doing ranking/averaging only after 'n' updates.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.idea		.idea
src		src
target		target
.DS_Store		.DS_Store
README.md		README.md
affable.iml		affable.iml
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Affable

Installation

Prerequisites/Setup

Architecture

Benchmarks

Scaling

Redis

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Affable

Installation

Prerequisites/Setup

Architecture

Benchmarks

Scaling

Redis

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages