diff --git a/index.html b/index.html index 385fd1a..1e13077 100644 --- a/index.html +++ b/index.html @@ -25,7 +25,7 @@ Academic Project Page - + @@ -47,80 +47,77 @@ -
-
-
-
-
-

Academic Project Page

-
- - - First Author*, - - Second Author*, +
+
+
+
+
+

TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space

+
+ - Third Author - -
- -
- Institution Name
Conferance name and year
-
*Indicates Equal Contribution
-
- -
- + +
+ 1Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS)
2University of Chinese Academy of Sciences
ACL 2024 Main Conference
+
+ +
-
-
-
+ + + @@ -129,8 +126,8 @@

Academic Project Page

Aliquam vitae elit ullamcorper tellus egestas pellentesque. Ut lacus tellus, maximus vel lectus at, placerat pretium mi. Maecenas dignissim tincidunt vestibulum. Sed consequat hendrerit nisl ut maximus. @@ -142,143 +139,39 @@

-
-
-
-

Abstract

-
-

- Lorem ipsum dolor sit amet, consectetur adipiscing elit. Proin ullamcorper tellus sed ante aliquam tempus. Etiam porttitor urna feugiat nibh elementum, et tempor dolor mattis. Donec accumsan enim augue, a vulputate nisi sodales sit amet. Proin bibendum ex eget mauris cursus euismod nec et nibh. Maecenas ac gravida ante, nec cursus dui. Vivamus purus nibh, placerat ac purus eget, sagittis vestibulum metus. Sed vestibulum bibendum lectus gravida commodo. Pellentesque auctor leo vitae sagittis suscipit. -

-
-
-
-
-
- - - - -
-
-
- -
-
-
- - - - - - -
-
-
- -

Video Presentation

+
- -
- - +

Abstract

+
+

+ Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks. However, they sometimes suffer from producing hallucinations, particularly in cases where they may generate untruthful responses despite possessing the correct knowledge. In this paper, we propose TruthX, an inference-time method to elicit the truthfulness of LLMs by editing their internal representations in truthful space. TruthX employs an auto-encoder to map LLM’s representations into semantic and truthful latent spaces respectively, and applies contrastive learning to identify a truthful editing direction within the truthful space. During inference, by editing LLM’s internal representations in truthful space, TruthX effectively enhances the truthfulness of LLMs. Experiments show that TruthX effectively improves the truthfulness of 13 advanced LLMs by an average of 20% on TruthfulQA benchmark. Further analyses suggest that the truthful space acquired by TruthX plays a pivotal role in controlling LLM to produce truthful or hallucinatory responses. +

-
-
- - - - -
-
-
-

Another Carousel

- -
-
-
- - - - - - - - -
-
-
-

Poster

- - - -
-
- + + -
+

BibTeX

-
BibTex Code Here
+

+ If you have any questions, please contact Shaolei Zhang (zhangshaolei20z@ict.ac.cn). +

+
@inproceedings{truthx,
+        title={TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space}, 
+        author={Shaolei Zhang and Tian Yu and Yang Feng},
+        year={2024},
+        url={https://arxiv.org/abs/2402.17811},
+        booktitle = {Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
+        publisher = {Association for Computational Linguistics},
+}
@@ -289,12 +182,12 @@

BibTeX

- +
+

- This page was built using the Academic Project Page Template which was adopted from the Nerfies project page. - You are free to borrow the of this website, we just ask that you link back to this page in the footer.
This website is licensed under a Creative - Commons Attribution-ShareAlike 4.0 International License. + TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space (ACL 2024)

+
@@ -302,6 +195,7 @@

BibTeX

+ diff --git a/meta/.DS_Store b/meta/.DS_Store new file mode 100644 index 0000000..9a23aee Binary files /dev/null and b/meta/.DS_Store differ diff --git a/meta/images/favicon.ico b/meta/images/favicon.ico new file mode 100644 index 0000000..dc70283 Binary files /dev/null and b/meta/images/favicon.ico differ diff --git a/meta/images/ill.png b/meta/images/ill.png new file mode 100644 index 0000000..6e1a869 Binary files /dev/null and b/meta/images/ill.png differ diff --git a/meta/video/.DS_Store b/meta/video/.DS_Store new file mode 100644 index 0000000..719a74e Binary files /dev/null and b/meta/video/.DS_Store differ diff --git a/meta/video/demo.mov b/meta/video/demo.mov new file mode 100644 index 0000000..09c5242 Binary files /dev/null and b/meta/video/demo.mov differ diff --git a/static/.DS_Store b/static/.DS_Store new file mode 100644 index 0000000..4768cf3 Binary files /dev/null and b/static/.DS_Store differ