Big data analysis

Topic: "Graph-based analysis of big data"

Big data is a term that describes the large volume of data, both structured and unstructured, that inundates a business on a day-to-day basis. But it's not the amount of data that's important. It's what organizations do with the data that matters. With the help of graphs, big data can be analyzed for insights that lead to better decisions and strategic business moves. Graphs are ubiquitous, and the volume and diversity of graph data are growing strongly.

The continuous growth of new real-time applications produces a huge amount of data that is modeled as graphs. A graph database represents data as interconnections between objects, and the study of these interconnections leaves the following research gaps:

-Overhead reduction of Cypher queries.

-Manual attribute-based graph summarization.

-Complex link analysis to discover fraud patterns in big data analysis.

-Detecting and preventing fraud as it happens, in real time.

Interactive graph analytics supported by suitable visualizations is highly desirable, because it puts the human in the loop for exploring and analyzing graph data. The existing separation between interactive query processing with graph databases and batch-oriented graph analytics should therefore be overcome by providing all kinds of analysis in a unified, distributed platform with support for interactive and visual analysis. Some graph systems, e.g., Blazegraph, System G and Titan, try to move in this direction, but there are still many open issues in finding suitable visualizations and forms of interaction for the different kinds of analysis. At the same time, this poses a number of implementation challenges, which are observed as follows:

Problem 1: Cypher queries mostly request complete nodes and relationships, which causes considerable overhead due to inefficiencies in the data structure used by Neo4j.
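One possible way to limit this overhead, sketched here under the assumption that the caller knows in advance which properties it needs, is to project only those properties in the `RETURN` clause instead of returning whole nodes. The helper name `projection_query`, the `Person` label and the property names are hypothetical; whether this actually helps depends on the workload and on the Neo4j version in use.

```python
import re

# Only plain identifiers are accepted, because labels and property names
# cannot be passed as Cypher query parameters.
_IDENT = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def projection_query(label, properties, limit=None):
    """Build a Cypher query that returns selected properties, not whole nodes."""
    if not properties:
        raise ValueError("at least one property is required")
    for name in [label, *properties]:
        if not _IDENT.match(name):
            raise ValueError(f"unsafe identifier: {name!r}")
    returns = ", ".join(f"n.{p} AS {p}" for p in properties)
    query = f"MATCH (n:{label}) RETURN {returns}"
    if limit is not None:
        query += f" LIMIT {int(limit)}"
    return query


# Returning the complete node (the pattern Problem 1 describes).
full_node_query = "MATCH (n:Person) RETURN n"

# Returning only the two properties the caller needs (hypothetical names).
projected_query = projection_query("Person", ["name", "age"])
print(projected_query)
```

The resulting string can be passed to any Cypher client. Measuring both variants on the real dataset would show whether the projection pays off.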

Problem 2: Graph visualization and summarization: Graph visualization methods are primarily designed to lay out a graph well, so that users can understand it visually. However, as graphs become large, displaying an entire graph on a limited computer screen is challenging from both the usability and the visual performance perspectives. To overcome the problems raised by large graph sizes, navigation, interaction and summarization techniques are often incorporated into graph visualization tools.

Graph summarization techniques are crucial in such domains, as they can help uncover useful insights about the patterns hidden in the underlying data. Earlier graph summarization methods produce small and informative summaries based on user-selected node attributes and relationships, and allow users to interactively drill down or roll up to navigate through summaries with different resolutions. Earlier we used the K-Snap method, which only deals with categorical node attributes (a categorical or discrete variable is one that has two or more categories). In the real world, however, many node attributes are numerical, such as the age of a social network user or the number of publications of an author in a coauthorship network. Simply running the graph summarization method on numerical attributes results in large summaries (at least as large as the number of distinct numerical values).
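The effect of numerical attributes on summary size can be illustrated with a minimal sketch. It is not the K-Snap method itself: it groups nodes by attribute values only, and it assumes that fixed-width buckets are an acceptable way to turn a numerical attribute into a categorical one. The attribute names, the bucket width and the toy data are all illustrative.

```python
from collections import Counter


def age_bucket(age, width=10):
    """Map a numerical value to a range label, e.g. 34 -> '30-39'."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"


def summarize(nodes, edges, width=10):
    """Group nodes by (gender, age bucket) and count edges between groups.

    nodes: dict of node id -> {"gender": str, "age": int}
    edges: iterable of (node id, node id) pairs, treated as undirected
    """
    group_of = {
        node_id: (attrs["gender"], age_bucket(attrs["age"], width))
        for node_id, attrs in nodes.items()
    }
    group_sizes = Counter(group_of.values())
    group_edges = Counter()
    for u, v in edges:
        # Sort the pair so that (a, b) and (b, a) are counted together.
        pair = tuple(sorted((group_of[u], group_of[v])))
        group_edges[pair] += 1
    return group_sizes, group_edges


# Toy social network: five users with five distinct ages.
nodes = {
    1: {"gender": "F", "age": 23},
    2: {"gender": "F", "age": 27},
    3: {"gender": "M", "age": 31},
    4: {"gender": "M", "age": 38},
    5: {"gender": "F", "age": 35},
}
edges = [(1, 2), (1, 3), (2, 4), (3, 4), (4, 5)]

sizes, links = summarize(nodes, edges)
print(sizes)
print(links)
```

Grouping on the raw age would yield one group per distinct value (five here), while the bucketed version yields three. Choosing the bucket boundaries automatically, rather than by hand, is the open part of the problem.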

Problem 3: Complex link analysis to discover fraud patterns in big data analysis: Uncovering fraud rings requires you to traverse data relationships with high computational complexity. This problem is exacerbated as a fraud ring grows with the size of incremental
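As a rough illustration of link analysis for fraud rings, the sketch below assumes that two accounts are linked when they share an identifier value such as a phone number or an address. That linking rule, the function name `candidate_rings`, the `min_size` threshold and the sample data are assumptions made for the example. It uses a union-find structure so that each account is processed once rather than through repeated traversals.

```python
from collections import defaultdict


def find(parent, x):
    """Return the representative of x's set, compressing the path on the way."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def candidate_rings(accounts, min_size=3):
    """Group accounts that are linked by any shared identifier value.

    accounts: dict of account id -> dict of identifier type -> value
    Returns a list of sets of account ids with at least min_size members.
    """
    parent = {acc: acc for acc in accounts}
    first_owner = {}  # (identifier type, value) -> first account seen with it
    for acc, identifiers in accounts.items():
        for key in identifiers.items():
            if key in first_owner:
                # Shared identifier: merge the two accounts' sets.
                parent[find(parent, acc)] = find(parent, first_owner[key])
            else:
                first_owner[key] = acc
    groups = defaultdict(set)
    for acc in accounts:
        groups[find(parent, acc)].add(acc)
    return [members for members in groups.values() if len(members) >= min_size]


# Synthetic data: A and B share a phone number, B and C share an address.
accounts = {
    "A": {"phone": "555-0101", "address": "1 Main St"},
    "B": {"phone": "555-0101", "address": "9 Oak Ave"},
    "C": {"phone": "555-0199", "address": "9 Oak Ave"},
    "D": {"phone": "555-0123", "address": "4 Elm Rd"},
}
rings = candidate_rings(accounts)
print(rings)
```

A group found this way is only a candidate. Legitimate households also share phone numbers and addresses, so a real system would need further evidence before flagging an account.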


Problem 4: Detect and prevent fraud as it happens in real time: To prevent a fraud ring, you need real-time link analysis on an interconnected dataset, from the time a false account is created to when a fraudulent transaction occurs.
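One way to approach the real-time requirement, shown here only as a sketch, is to update the link structure incrementally on every account-creation event so that a check at transaction time is a cheap lookup. The class name `RingMonitor`, its method names, the shared-identifier linking rule and the cluster-size threshold are all hypothetical.

```python
class RingMonitor:
    """Maintain clusters of linked accounts incrementally as events arrive."""

    def __init__(self, max_cluster_size=2):
        self.max_cluster_size = max_cluster_size
        self.parent = {}  # union-find parent pointers
        self.size = {}    # cluster size, valid at each cluster root
        self.owner = {}   # identifier value -> an account that used it

    def _find(self, x):
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def _union(self, a, b):
        ra, rb = self._find(a), self._find(b)
        if ra == rb:
            return
        if self.size[ra] < self.size[rb]:
            ra, rb = rb, ra
        self.parent[rb] = ra
        self.size[ra] += self.size[rb]

    def account_created(self, account, identifiers):
        """Register an account and link it to accounts sharing an identifier."""
        self.parent.setdefault(account, account)
        self.size.setdefault(account, 1)
        for value in identifiers:
            if value in self.owner:
                self._union(account, self.owner[value])
            else:
                self.owner[value] = account

    def is_suspicious(self, account):
        """Check at transaction time whether the account's cluster is too large."""
        if account not in self.parent:
            return False
        return self.size[self._find(account)] > self.max_cluster_size


monitor = RingMonitor(max_cluster_size=2)
monitor.account_created("A", ["555-0101", "1 Main St"])
monitor.account_created("B", ["555-0101", "9 Oak Ave"])
before = monitor.is_suspicious("B")   # cluster {A, B} is within the limit
monitor.account_created("C", ["9 Oak Ave"])
after = monitor.is_suspicious("B")    # cluster {A, B, C} exceeds the limit
print(before, after)
```

Because each event touches only the clusters it links, the check can run between account creation and the first transaction, which is the window the problem statement describes.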

Skills: Java, Python, Research, Database Programming, Neo4j

About the employer:
(0 reviews) Lucknow, India

Project ID: #29420342

3 freelancers are bidding an average of ₹51481 for this job


Hi. I am a data scientist. I am very familiar with deep learning APIs such as TensorFlow, TensorFlow Lite, TFLearn, Keras, PyTorch, fastai, and MXNet. I have good hands-on experience working with advanced R, Python, and BI tools

INR in 20 days
(4 reviews)

I am a professional business plan researcher with many years of experience. I am committed to giving my clients the ultimate service that they are looking for. With vast writing experience, I believe in high-quality wo

INR in 7 days
(1 review)

Hello sir. Thank you for giving me a chance to bid on your project. I have gone through your requirements. I have much experience in this field. I am sure that I can finish your project as you want. I have skills as f

INR in 10 days
(0 reviews)