
Some ideas from Maximilian.

Recap: Pseudocode

Look at the pseudocode again:

Input: KG \(\mathcal K:\langle\mathcal E,\mathcal R,\mathcal F\rangle\), triple \(x_{hrt}=(h,r,t)\in\mathcal F\), embedding score function \(s:\mathcal E\times\mathcal R\times\mathcal E\to\mathbb R\), sample size \(k\in\mathbb N\)

Output: \(ReliK_{LB}(x_{hrt})\) or \(ReliK_{Apx}(x_{hrt})\)

  1. \(S_H\gets\) sample \(k\) triples from \(\mathcal N^-(h)\); \(S_T\gets\) sample \(k\) triples from \(\mathcal N^-(t)\).

  2. \(rank_H\gets1\); \(rank_T\gets1\)

  3. for \(x_{h'r't'}\in S_H\cup S_T\) do

  4. if \(s(x_{hrt})\le s(x_{h'r't'})\) then

    • if \(h'=h\) then

    \(rank_H\gets rank_H+1\)

    • if \(t'=t\) then

    \(rank_T\gets rank_T+1\)

  5. return \(\frac{1}{2}\left(\frac{1}{rank_H+|\mathcal N^-(h)|-|S_H|}+\frac{1}{rank_T+|\mathcal N^-(t)|-|S_T|}\right)\) for \(ReliK_{LB}\)

or \(\frac{1}{2}\left(\frac{1}{rank_H\cdot\frac{|\mathcal N^-(h)|}{|S_H|}}+\frac{1}{rank_T\cdot\frac{|\mathcal N^-(t)|}{|S_T|}}\right)\) for \(ReliK_{Apx}\)
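For reference, here is a minimal sequential Python sketch of this pseudocode. The function name, the representation of \(\mathcal N^-(h)\) and \(\mathcal N^-(t)\) as lists of triples, and the `mode` switch are illustrative assumptions, not part of the original.

```python
import random

def relik_sampled(x, score, neg_head, neg_tail, k, mode="lb"):
    """Sampled ReliK for a triple x = (h, r, t) -- a sketch, names illustrative.

    score    -- embedding score function s(h, r, t) -> float (higher = more plausible)
    neg_head -- list of negatives N^-(h), corrupted triples sharing head h
    neg_tail -- list of negatives N^-(t), corrupted triples sharing tail t
    k        -- sample size
    mode     -- "lb" for ReliK_LB, "apx" for ReliK_Apx
    """
    h, r, t = x
    S_H = random.sample(neg_head, min(k, len(neg_head)))
    S_T = random.sample(neg_tail, min(k, len(neg_tail)))

    rank_H, rank_T = 1, 1
    s_x = score(h, r, t)
    for h2, r2, t2 in S_H + S_T:
        # A negative scoring at least as high as x pushes x one rank down.
        if s_x <= score(h2, r2, t2):
            if h2 == h:
                rank_H += 1
            if t2 == t:
                rank_T += 1

    if mode == "lb":
        # ReliK_LB: pessimistically treat every unsampled negative as outranking x.
        d_H = rank_H + len(neg_head) - len(S_H)
        d_T = rank_T + len(neg_tail) - len(S_T)
    else:
        # ReliK_Apx: extrapolate the sampled rank to the full negative set.
        d_H = rank_H * len(neg_head) / len(S_H)
        d_T = rank_T * len(neg_tail) / len(S_T)

    return 0.5 * (1.0 / d_H + 1.0 / d_T)
```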

Ideas for Parallelization

Idea 1: Parallel Sampling

Sampling the negative sets \(S_H\) and \(S_T\) can be done in parallel for the head and the tail entity. Libraries such as PyTorch can run this kind of large-scale sampling efficiently on the GPU.
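A minimal PyTorch sketch of such a batched draw, assuming integer entity ids and corruption by uniform replacement; the entity count, the sample size, and the omission of filtering against \(\mathcal F\) are assumptions for illustration:

```python
import torch

num_entities = 14_541                      # assumed size of the entity set E
k = 1024                                   # sample size per side
device = "cuda" if torch.cuda.is_available() else "cpu"

# One batched call draws both candidate sets at once:
# row 0 -> replacement tails for S_H (negatives keeping head h),
# row 1 -> replacement heads for S_T (negatives keeping tail t).
replacements = torch.randint(num_entities, (2, k), device=device)
tails_for_SH, heads_for_ST = replacements[0], replacements[1]
# Checking the corruptions against the true triples in F is omitted here,
# but would be needed to sample from N^-(h) and N^-(t) exactly.
```

For a whole batch of query triples, the same single call scales to shape `(batch, 2, k)`.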

I will think about it.

Idea 2: Rank Update

The rank update can be parallelized across all sampled triples \(x_{h'r't'}\) in \(S_H\cup S_T\), i.e., the loop becomes a parallel for.

GPUs are very efficient at this kind of data-parallel computation, especially when it is expressed as batched tensor operations.
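A minimal PyTorch sketch of this batched comparison, assuming the score of the query triple and the scores of the sampled negatives have already been computed into tensors; all names and values are placeholders:

```python
import torch

s_x = torch.tensor(7.3)                    # score s(x_hrt) of the query triple
scores_SH = torch.randn(1024)              # (k,) scores of the samples in S_H
scores_ST = torch.randn(1024)              # (k,) scores of the samples in S_T

# All k comparisons per side run as one vectorized op; the per-sample
# increments of step 4 collapse into a single sum reduction.
rank_H = 1 + (s_x <= scores_SH).sum()
rank_T = 1 + (s_x <= scores_ST).sum()
```

Written this way there is no shared counter being incremented by concurrent threads: the reduction happens inside the kernel.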

✅ I will be working on this.

Info

Will there be write-after-write or read-after-write issues? Should I add a write lock?