Practical 1: Write a program in Map Reduce for WordCount operation.
WordCount.java (create a file named WordCount.java)
import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
public class WordCount {
  public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
    private Text word = new Text();
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      // Tokenize the line and emit (word, 1) for every token
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, new IntWritable(1));
      }
    }
  }
  public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      // Sum all the 1s emitted for this word
      int sum = 0;
      for (IntWritable x : values) {
        sum += x.get();
      }
      context.write(key, new IntWritable(sum));
    }
  }
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "word count");
    job.setJarByClass(WordCount.class); // mentioning the main class
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class); // combiner pre-aggregates on each mapper
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
input.txt (input data)
how are you where are you
Steps to run the program:
start hadoop:
start-dfs.sh
start-yarn.sh
hadoop com.sun.tools.javac.Main WordCount.java
ls -l
hdfs dfs -ls /
hdfs dfs -rm -r /wordcount
jar cf wc.jar WordCount*.class
Check on localhost (the NameNode web UI) whether the file is there.
hdfs dfs -mkdir -p /wordcount/input
hdfs dfs -copyFromLocal input.txt /wordcount/input
hadoop jar wc.jar WordCount /wordcount/input /wordcount/output
hdfs dfs -cat /wordcount/output/part-r-00000
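For the sample input above, part-r-00000 should contain one tab-separated count per word:
are	2
how	1
where	1
you	2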
Practical 2: Write a program in Map Reduce for Matrix Multiplication.
MatrixMultiply.java
import java.io.IOException;
import java.util.*;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.*;
import org.apache.hadoop.mapreduce.*;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;
public class MatrixMultiply {
public static void main(String[] args) throws Exception {
if (args.length != 2) {
System.err.println("Usage: MatrixMultiply <in_dir> <out_dir>");
System.exit(2);
}
Configuration conf = new Configuration();
[Link]("n", "100");
[Link]("p", "1000");
@SuppressWarnings("deprecation")
Job job = new Job(conf, "MatrixMultiply");
job.setJarByClass(MatrixMultiply.class);
job.setOutputKeyClass(Text.class);
job.setOutputValueClass(Text.class);
job.setMapperClass(Map.class);
job.setReducerClass(Reduce.class);
job.setInputFormatClass(TextInputFormat.class);
job.setOutputFormatClass(TextOutputFormat.class);
FileInputFormat.addInputPath(job, new Path(args[0]));
FileOutputFormat.setOutputPath(job, new Path(args[1]));
job.waitForCompletion(true);
}
}
Map.java
import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
public class Map
extends Mapper<LongWritable, Text, Text, Text> {
@Override
public void map(LongWritable key, Text value, Context context) throws IOException,
InterruptedException {
Configuration conf = context.getConfiguration();
int m = Integer.parseInt(conf.get("m"));
int p = Integer.parseInt(conf.get("p"));
String line = value.toString();
// Input lines look like: M,i,j,value or N,j,k,value
String[] indicesAndValue = line.split(",");
Text outputKey = new Text();
Text outputValue = new Text();
if (indicesAndValue[0].equals("M")) {
for (int k = 0; k < p; k++) {
// M(i,j) contributes to every cell (i,k) of the product
outputKey.set(indicesAndValue[1] + "," + k);
outputValue.set(indicesAndValue[0] + "," + indicesAndValue[2] + "," + indicesAndValue[3]);
context.write(outputKey, outputValue);
}
} else {
for (int i = 0; i < m; i++) {
[Link](i + "," + indicesAndValue[2]);
[Link]("N," + indicesAndValue[1] + "," + indicesAndValue[3]);
[Link](outputKey, outputValue);
}
}
}
}
Reduce.java
import java.io.IOException;
import java.util.HashMap;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;
public class Reduce
extends Reducer<Text, Text, Text, Text> {
@Override
public void reduce(Text key, Iterable<Text> values, Context context) throws IOException,
InterruptedException {
String[] value;
HashMap<Integer, Float> hashA = new HashMap<Integer, Float>();
HashMap<Integer, Float> hashB = new HashMap<Integer, Float>();
for (Text val : values) {
value = val.toString().split(",");
if (value[0].equals("M")) {
hashA.put(Integer.parseInt(value[1]), Float.parseFloat(value[2]));
} else {
hashB.put(Integer.parseInt(value[1]), Float.parseFloat(value[2]));
}
}
int n = Integer.parseInt(context.getConfiguration().get("n"));
float result = 0.0f;
float m_ij;
float n_jk;
for (int j = 0; j < n; j++) {
// key is "i,k"; compute the dot product sum over j of M(i,j) * N(j,k)
m_ij = hashA.containsKey(j) ? hashA.get(j) : 0.0f;
n_jk = hashB.containsKey(j) ? hashB.get(j) : 0.0f;
result += m_ij * n_jk;
}
if (result != 0.0f) {
context.write(null,
new Text(key.toString() + "," + Float.toString(result)));
}
}
}
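How the job works: for each product cell (i,k), the mapper replicates M(i,j) and N(j,k) under the key "i,k", and the reducer for that key computes the dot product, the sum over j of M(i,j) * N(j,k).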
Create a file named M.txt and put the following into it:
M,0,0,12
M,0,1,13
M,1,0,14
M,1,1,15
Create a file named N.txt and put the following into it:
N,0,0,11
N,0,1,13
N,1,0,14
N,1,1,19
Steps to run the program:
start-dfs.sh
start-yarn.sh
hadoop com.sun.tools.javac.Main MatrixMultiply.java Map.java Reduce.java
jar cf MatrixMultiply.jar *.class
ls -l
hdfs dfs -mkdir /MatrixMultiply
hdfs dfs -mkdir /MatrixMultiply/input
hdfs dfs -ls /
hdfs dfs -copyFromLocal M.txt N.txt /MatrixMultiply/input
hadoop jar MatrixMultiply.jar MatrixMultiply /MatrixMultiply/input /MatrixMultiply/output
hdfs dfs -cat /MatrixMultiply/output/part-r-00000
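With the 2x2 sample matrices above and the dimensions set to 2 in the driver, the product M x N is [[314, 403], [364, 467]], so the output should read:
0,0,314.0
0,1,403.0
1,0,364.0
1,1,467.0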
MONGODB
Command prompt 1: mongod
Command prompt 2: mongosh
Practical 2: Sample Database Creation
Start cmd -> mongod
Start a new cmd -> mongosh
show dbs
use tanvi (any database name works)
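Note: the new database will not appear in show dbs until you create a collection or insert a document into it.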
Practical 3: Query the Sample Database using MongoDB querying commands
[Link]("student")
[Link]({name: "Tanvi Tawade", rollno:61, div:"A"})
db.student.insertMany([{name: "Namrata Gaikwad", rollno:12, div: "B"},
{name: "Omkar Daifale", rollno:10, div:"A"},
{name: "Chinmay Warang", rollno:69, div:"A"},
{name: "Shreya Nikam", rollno:33, div:"B"},
{name: "Pratiksha Majrekar", rollno:31, div:"A"},
{name: "Heth Shah", rollno:52, div:"B"},
{name: "Ketan Bhoir", rollno:6, div:"B"},
{name: "Uday Gavada", rollno:16, div:"A"},
{name: "Prathmesh Patil", rollno:38, div:"B"},
{name: "Swaraj Wadkar", rollno:67, div:"A"}]) (this is all a single command)
db.student.find({})
db.student.find().pretty()
db.student.find({name:"Tanvi Tawade"})
db.student.find({name: {$in:["Tanvi Tawade", "Swaraj Wadkar"]}})
db.student.find({$and:[{name:"Tanvi Tawade"},{rollno:61}]})
db.student.find({$or:[{name:"Tanvi Tawade"},{rollno:31}]})
db.student.find({rollno:{$lt:62}, $or:[{name:"Tanvi Tawade"},{div:"A"}] })
db.student.find({rollno:{$lt:62}, $or:[{name:"Tanvi Tawade"},{div:"B"}] })
db.student.find({$or:[{name:/^C/},{name:/^T/}]})
db.student.find({$nor:[{name:"Swaraj Wadkar"},{div:"B"}]})
[Link]({name:"Heth Shah"})
[Link]({name:"Heth Shah"},{$set: {div:"A"}})
[Link]([{name: "Namrata Gaikwad", rollno:12, div: "B"},
{name: "Omkar Daifale", rollno:10, div:"A"},
{name: "Shreya Nikam", rollno:33, div:"B"}])
[Link]({div:"B"},{$set:{div:"A"}})
[Link]({name:"Namrata Gaikwad"},{$set:{div:"B",rollno:13} })
[Link]({rollno:38})
[Link]({$or: [{rollno:{$lt:30}},{div:"B"}]})
[Link]({name:1, rollno:1},{name: "idx_name_rollno"})
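To confirm that queries on name can use the new index, inspect the query plan:
db.student.find({name:"Swaraj Wadkar"}).explain("executionStats")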
HIVE
Practical 3: Create Database & Table in Hive
To start Hive, go to /home/hadoop/apache-hive-3.1.2-bin (Hadoop must already be running):
start-dfs.sh
start-yarn.sh
hive
create database tanvi;
show databases;
use tanvi;
create table student(rno int, name string,section string, marks int);
show tables;
insert into table student values(61,'Tanvi', 'A', 83);
select * from student;
insert into table student values(12, 'Namrata', 'B', 54), (10,'Omkar','A',53),
(31,'Pratiksha','A',89),(33,'Shreya','B',23),(6,'Ketan','B',47),(69,'Chinmay','B',59),
(16,'Uday','A',78),(52,'Heth','B',68),(38,'Prathmesh','B',48), (67,'Swaraj','A',56);
Practical 4: Hive Partitioning
set hive.exec.dynamic.partition=true;
set hive.exec.dynamic.partition; (prints the current value)
set hive.exec.dynamic.partition.mode=nonstrict;
Create a file student.txt:
61,Tanvi,A,83
12,Namrata,B,54
10,Omkar,A,53
31,Pratiksha,A,89
33,Shreya,B,23
6,Ketan,B,47
69,Chinmay,B,59
16,Uday,A,78
52,Heth,B,68
38,Prathmesh,B,48
67,Swaraj,A,56
create table student_part(rno int, name string,marks int)
partitioned by(section string)
row format delimited fields terminated by ',' ;
-- LOAD DATA cannot assign rows to partitions by value; stage the file in a
-- plain table first, then insert with dynamic partitioning:
create table student_stage(rno int, name string, section string, marks int)
row format delimited fields terminated by ',';
LOAD DATA LOCAL INPATH '/home/hadoop/hive/student.txt' INTO TABLE student_stage;
INSERT INTO TABLE student_part PARTITION(section)
SELECT rno, name, marks, section FROM student_stage;
DESCRIBE FORMATTED student_part;
SELECT COUNT(*) FROM student_part WHERE section = 'A';
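To verify that one partition directory per section was created:
SHOW PARTITIONS student_part;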
Practical 7: Hive Views and Indexes
CREATE VIEW emp_view AS SELECT * FROM employee WHERE salary>60000;
select * from emp_view;
drop view emp_view;
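Note: CREATE INDEX was removed in Hive 3.0, so on apache-hive-3.1.2 this practical covers views only.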
Practical 8: HiveQL : Select Where, Select OrderBy, Select GroupBy, Select Joins
Create a text file emp.txt:
61,Tanvi,Manager,83000
12,Namrata,Developer,54000
10,Omkar,Tester,53000
31,Pratiksha,Manager,89000
33,Shreya,Developer,23000
6,Ketan,Tester,47000
69,Chinmay,Developer,59000
16,Uday,Tester,78000
52,Heth,Developer,68000
38,Prathmesh,Developer,48000
67,Swaraj,Tester,56000
CREATE TABLE employee (empcode INT, ename STRING, job STRING, salary INT)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';
LOAD DATA LOCAL INPATH '/home/hadoop/hive/emp.txt' INTO TABLE employee;
select * from employee;
select count(*) from employee;
select avg(salary) from employee;
ALTER TABLE employee RENAME TO emp;
Create a file emp1.txt:
61,Tanvi,1,83000
12,Namrata,3,54000
10,Omkar,2,53000
31,Pratiksha,1,89000
33,Shreya,2,23000
6,Ketan,2,47000
69,Chinmay,3,59000
16,Uday,2,78000
52,Heth,3,68000
38,Prathmesh,3,48000
67,Swaraj,2,56000
37,Rupali,2,66000
CREATE TABLE employee (empcode INT, ename STRING, dno INT, salary INT)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';
LOAD DATA LOCAL INPATH '/home/hadoop/hive/emp1.txt' INTO TABLE employee;
select * from employee;
Create a file dept.txt:
1,Manager,Mumbai
2,Tester,Pune
3,Developer,Delhi
CREATE TABLE department (dno INT, dname STRING, location STRING)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ',';
LOAD DATA LOCAL INPATH '/home/hadoop/hive/dept.txt' INTO TABLE department;
select * from department;
select * from employee e, department d where e.dno = d.dno;
select count(*) from employee group by dno;
select count(*) from employee e, department d where e.dno = d.dno and d.dname = 'Manager';
ALTER TABLE employee ADD COLUMNS (dept STRING COMMENT 'Department
name');
SELECT dno, COUNT(*) FROM employee GROUP BY dno;
SELECT e.empcode, e.ename, e.salary, d.dno, d.dname FROM employee e JOIN
department d ON (e.dno = d.dno);
SELECT e.empcode, e.ename, e.salary, d.dno, d.dname FROM employee e LEFT OUTER
JOIN department d ON (e.dno = d.dno);
SELECT e.empcode, e.ename, e.salary, d.dno, d.dname FROM employee e RIGHT OUTER
JOIN department d ON (e.dno = d.dno);
SELECT e.empcode, e.ename, d.dno, d.dname FROM employee e FULL OUTER JOIN
department d ON (e.dno = d.dno);
PIG
Practical 2: Pig Latin Basic
1. Display total number of students
Create a file student.txt:
61, Tanvi Tawade, maths, 85
61, Tanvi Tawade, aiml, 90
61, Tanvi Tawade, dscc, 78
12, Namrata Gaikwad, maths, 75
12, Namrata Gaikwad, aiml, 82
12, Namrata Gaikwad, dscc, 90
10, Omkar Daifale, maths, 92
10, Omkar Daifale, aiml, 88
10, Omkar Daifale, dscc, 76
69, Chinmay Warang, maths, 80
69, Chinmay Warang, aiml, 85
69, Chinmay Warang, dscc, 92
33, Shreya Nikam, maths, 88
33, Shreya Nikam, aiml, 78
33, Shreya Nikam, dscc, 85
31, Pratiksha Majrekar, maths, 76
31, Pratiksha Majrekar, aiml, 90
31, Pratiksha Majrekar, dscc, 82
52, Heth Shah, maths, 90
52, Heth Shah, aiml, 85
52, Heth Shah, dscc, 88
6, Ketan Bhoir, maths, 82
6, Ketan Bhoir, aiml, 76
6, Ketan Bhoir, dscc, 90
16, Uday Gavada, maths, 85
16, Uday Gavada, aiml, 92
16, Uday Gavada, dscc, 78
38, Prathmesh Patil, maths, 78
38, Prathmesh Patil, aiml, 85
38, Prathmesh Patil, dscc, 90
67, Swaraj Wadkar, maths, 92
67, Swaraj Wadkar, aiml, 80
67, Swaraj Wadkar, dscc, 86
stud = LOAD 'student.txt' using PigStorage(',') AS (rno:int, name:chararray, sub:chararray, mark:int);
dump stud;
describe stud;
stud: {rno: int,name: chararray,sub: chararray,mark: int}
A = group stud all;
dump A;
B = foreach A generate COUNT(stud);
dump B;
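Expected output: (33), the total number of records in the file (11 students x 3 subjects each).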
2. Display subject wise student count
A = group stud by sub;
dump A;
B = foreach A generate COUNT(stud);
dump B;
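Expected output: one tuple (11) per subject group, since each of the three subjects has 11 rows.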
B = foreach A generate AVG(stud.mark);
dump B;
B = foreach A generate group, AVG(stud.mark);
dump B;
B = foreach A generate group, SUM(stud.mark);
dump B;
B = foreach A generate group, MAX(stud.mark);
dump B;
B = foreach A generate MAX(stud.mark);
dump B;
B = foreach A generate group, MIN(stud.mark);
dump B;
Practical 4: Download the data
pig -x local
Create a file student1.txt:
61, Tanvi, Tawade, 22, 9766543210, Mumbai
12, Namrata, Gaikwad, 23, 9876543210, Mumbai
10, Omkar, Daifale, 22, 8765432109, Bangalore
69, Chinmay, Warang, 24, 7654321098, Delhi
33, Shreya, Nikam, 21, 6543210987, Mumbai
31, Pratiksha, Majrekar, 25, 5432109876, Hyderabad
52, Heth, Shah, 23, 4321098765, Bangalore
6, Ketan, Bhoir, 24, 3210987654, Mumbai
16, Uday, Gavada, 22, 2109876543, Delhi
38, Prathmesh, Patil, 21, 1098765432, Hyderabad
67, Swaraj, Wadkar, 25, 9876543210, Chennai
student1 = LOAD 'student1.txt' using PigStorage(',') AS (rno:chararray, fname:chararray,
lname:chararray, age:int, phone:chararray, city:chararray); -- phone as chararray: ten-digit values overflow int
dump student1;
STORE student1 into 'student_output.txt' using PigStorage('|');
Practical 5: Create your Script
1. Write the following Pig Latin commands in a file called emp_data.pig.
emp = load 'emp.txt' using PigStorage(',') AS (eid:chararray, name:chararray,
designation:chararray, deptid:chararray, salary:int);
STORE emp into 'emp_output.txt' using PigStorage(',');
ss = FOREACH emp GENERATE eid, name, deptid;
dump ss;
Practical 6: Save and Execute the Script
2. Execute the Apache Pig script using the following command.
pig -x local emp_data.pig
exec emp_data.pig
run emp_data.pig
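Note: exec runs the script in a separate Grunt context, whereas run executes it as if typed at the grunt> prompt, so aliases defined in the script stay available afterwards.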
Practical 7: Pig Operations : Diagnostic Operators, Grouping and Joining, Combining
& Splitting, Filtering, Sorting
Create a file student1.txt:
61, Tanvi, Tawade, 22, 9766543210, Mumbai
12, Namrata, Gaikwad, 23, 9876543210, Mumbai
10, Omkar, Daifale, 22, 8765432109, Bangalore
69, Chinmay, Warang, 24, 7654321098, Delhi
33, Shreya, Nikam, 21, 6543210987, Mumbai
31, Pratiksha, Majrekar, 25, 5432109876, Hyderabad
52, Heth, Shah, 23, 4321098765, Bangalore
6, Ketan, Bhoir, 24, 3210987654, Mumbai
16, Uday, Gavada, 22, 2109876543, Delhi
38, Prathmesh, Patil, 21, 1098765432, Hyderabad
67, Swaraj, Wadkar, 25, 9876543210, Chennai
student1 = LOAD 'student1.txt' using PigStorage(',') AS (rno:chararray, fname:chararray,
lname:chararray, age:int, phone:chararray, city:chararray);
a. Diagnostic Operators
dump student1;
describe student1;
explain student1;
stud_11 = FILTER student1 BY age < 23;
dump stud_11;
C = FOREACH student1 GENERATE rno, fname, city;
dump C;
illustrate C;
b. Grouping and Joining
stud_1 = GROUP student1 BY city;
dump stud_1;
describe stud_1;
stud_2 = GROUP student1 BY (city,age);
dump stud_2;
describe stud_2;
1. self join
A = LOAD 'student1.txt' using PigStorage(',') AS (rno:chararray, fname:chararray,
lname:chararray, age:int, phone:chararray, city:chararray);
B = LOAD 'student1.txt' using PigStorage(',') AS (rno:chararray, fname:chararray,
lname:chararray, age:int, phone:chararray, city:chararray);
C = JOIN A BY age, B BY age;
dump C;
2. Inner Join (equijoin)- An inner join returns rows when there is a match in both
tables.
Create a file emp.txt (no spaces after the commas, so the chararray deptid values match dept.txt exactly):
61,Tanvi,Manager,1,83000
12,Namrata,Quality Assurance,3,54000
10,Omkar,Engineering,2,53000
31,Pratiksha,Manager,1,89000
33,Shreya,Testing,2,23000
6,Ketan,Testing,2,47000
69,Chinmay,Quality Assurance,3,59000
16,Uday,Testing,2,78000
52,Heth,Quality Assurance,3,68000
38,Prathmesh,Quality Assurance,3,48000
67,Swaraj,Testing,2,56000
37,Rupali,Testing,2,66000
emp = LOAD 'emp.txt' using PigStorage(',') AS (eid:chararray, name:chararray,
designation:chararray, deptid:chararray, salary:int);
dump emp;
Create a file dept.txt:
1,Finance
2,Testing
3,Quality Assurance
dept = LOAD 'dept.txt' using PigStorage(',') AS (deptid:chararray, dname:chararray);
dump dept;
emp_dept_innerjoin = JOIN emp BY deptid, dept BY deptid;
dump emp_dept_innerjoin;
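With this data, every emp row finds its matching dept row; the first result tuple, for example, is:
(61,Tanvi,Manager,1,83000,1,Finance)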
3. LEFT join
emp_dept_left = JOIN emp BY deptid LEFT, dept BY deptid;
dump emp_dept_left ;
4. RIGHT JOIN
emp_dept_right = JOIN emp BY deptid RIGHT, dept BY deptid;
dump emp_dept_right;
5. FULL outer join
emp_dept_full= JOIN emp BY deptid FULL OUTER, dept BY deptid;
dump emp_dept_full ;
Cross Product
cross_prod = CROSS emp, dept;
dump cross_prod;
c. Combining & Splitting
SPLIT emp into sal1 if salary<54000, sal2 if salary>=54000;
dump sal1;
dump sal2;
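For the combining half of this practical, UNION (a standard Pig operator) merges the two relations back together:
combined = UNION sal1, sal2;
dump combined;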
d. Filtering, Sorting
filter_designation = FILTER emp BY designation == 'Manager';
dump filter_designation;
Order by
S = order emp by name desc;
dump S;
S = order emp by name asc;
dump S;
SPARK
Practical 2: Downloading a Data Set and Processing it in Spark
spark-shell
val mydfT = spark.read.csv("/home/hadoop/SparkT/student.csv")
mydfT.show()
mydfT.printSchema
mydfT.createOrReplaceTempView("BVIMIT")
val mydf2 = spark.sql("SELECT * FROM BVIMIT")
mydf2.show()
val mydf2 = spark.sql("describe BVIMIT")
mydf2.show
val mydf2 = spark.sql("SELECT * FROM BVIMIT where _c1 > 50")
mydf2.show
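Because the CSV is read without a header, Spark assigns default column names _c0, _c1, and so on, which is why the last query filters on _c1.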
Step 1: Create a DataFrame from a JSON file
val df = spark.read.json("/home/hadoop/SparkT/student.json")
If an error is shown (the JSON spans multiple lines), use this instead:
val df1 = spark.read.option("multiline","true").json("/home/hadoop/SparkT/student.json")
df1.show()
df1.printSchema()
df1.select("name").show()
df1.select("name","div").show()
OR
df1.select(df1("name"), df1("div")).show()
df1.filter(df1("rollno") > 50).show()
df1.groupBy("div").count().show()
df1.createOrReplaceTempView("people2")
val sqlDF1 = spark.sql("SELECT * FROM people2")
sqlDF1.show
df1.write.json("output")
Practical 3: Word Count in Apache Spark.
Create a file para.txt:
As we all know, a paragraph is a group of sentences that are connected and make absolute
sense. While writing a long essay or letter, we break them into paragraphs for better
understanding and to make a well-structured writing piece.
val data3 = sc.textFile("/home/hadoop/SparkT/para.txt")
data3.collect
val splitdata = data3.flatMap(line => line.split(" "));
splitdata.collect;
val mapdata = splitdata.map(word => (word,1));
mapdata.collect
val reducedata = mapdata.reduceByKey(_+_);
reducedata.collect
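To persist the counts instead of collecting them to the driver, the standard RDD call is (the output path here is just an example):
reducedata.saveAsTextFile("/home/hadoop/SparkT/wc_output")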