[데이터마이닝] 5.링크분석(Link analysis) 5.2절 연습문제(Exercise 5.2.1 )

Exercise 5.2.1

Suppose we wish to store an n × n boolean matrix (0 and 1 elements only). We could represent it by the bits themselves, or we could represent the matrix by listing the positions of the 1’s as pairs of integers, each
integer requiring ⌈log2 n⌉ bits. The former is suitable for dense matrices; the latter is suitable for sparse matrices. How sparse must the matrix be (i.e., what fraction of the elements should be 1’s) for the sparse representation to save space?

(x,x)로 1들의 위치를 표현한다. (여기서 x는 ⌈log2 n⌉ bits 로 표현 가능한 정수)

따라서 하나의 1을 표현할때 필요한 메모리 용량은 2* ⌈log2 n⌉ bits 이며

전체 n × n boolean matrix에서 총 k개의 1이 있다고 할때 필요한 메모리 용량은 2*k*⌈log2 n⌉ bits 이다.

따라서 2*k*⌈log2 n⌉ < n^2을 만족하는 만큼의 1의 개수인 k 를 구하면 된다.

k< n^2 / 2*⌈log2 n⌉

Exercise 5.2.2

Using the method of Section 5.2.1, represent the transition
matrices of the following graphs:
(a) Figure 5.4.

(b) Figure 5.7.

5.2.1절의 방식이란 다음과 같다.

We can thus represent a column by one integer for the out-degree, and one integer
per nonzero entry in that column, giving the row number where that entry
is located.

각 노드의 출력 차수와 그 후속자(successors) 리스트로 전이행렬M 을 표현할 수 있다.

Exercise 5.2.3

Using the method of Section 5.2.4, represent the transition matrices of the graph of Fig. 5.3, assuming blocks have side 2.

이것을 5.2.4절의 방식으로 표현해보자

5.2.4절의 방식이란 다음과 같은 방식

참고

r_new 전체를 읽으면 cost가 너무 많이 드니까. r_new를 fit in mermoy한 k blocks 나눈다

한 block내에 노드가 0과 1뿐이므로 노드 0의 전체 outgoing degree는 4이지만 block 내 에서는 destination이 0과 1 뿐이다. 따라서 전이행렬 M에서 destination 0,1에 해당하는 값만 읽어오면 된다.

Exercise 5.2.4

Consider a Web graph that is a chain, like Fig. 5.9, with n nodes. As a function of k, which you may assume divides n, describe the representation of the transition matrix for this graph, using the method of Section 5.2.4

'데이터마이닝' 카테고리의 다른 글

[데이터마이닝] 5.링크분석(Link analysis) 5.4절 연습문제(Exercise 5.4 ) (0)	2020.05.16
[데이터마이닝] 5.링크분석(Link analysis) 5.3절 연습문제(Exercise 5.3.5 ) (0)	2020.05.12
[데이터마이닝] 3. 유사항목 찾기(Finding Similar Items) - 3.3연습문제 (0)	2020.05.05
[데이터마이닝] 3. 유사항목 찾기(Finding Similar Items) -3 민해싱 실제 구현은? Minhashing (0)	2020.05.05
[데이터마이닝] 3. 유사항목 찾기(Finding Similar Items) -3 민해싱개념 Minhashing (0)	2020.05.05

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

앙창

[데이터마이닝] 5.링크분석(Link analysis) 5.2절 연습문제(Exercise 5.2.1 )

Exercise 5.2.1

Exercise 5.2.2

5.2.4절의 방식이란 다음과 같은 방식

참고

Exercise 5.2.4

'데이터마이닝' 카테고리의 다른 글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역

[데이터마이닝] 5.링크분석(Link analysis) 5.2절 연습문제(Exercise 5.2.1 )

Exercise 5.2.1

Exercise 5.2.2

5.2.4절의 방식이란 다음과 같은 방식

참고

Exercise 5.2.4

'데이터마이닝' 카테고리의 다른 글

'데이터마이닝' Related Articles

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역