At the Shoji Laboratory, we conduct research on a wide variety of topics centered on information access technologies.

This page provides an overview of our main research themes and selected examples of our work.

Our publications are summarized on Shoji’s personal website.

Research Themes

Information Retrieval Closer to Real-World Search Click

Today’s web search is often unnatural.

Think about how you search for something in the real world. For example, when you go to an electronics store to buy a new TV and ask a staff member for recommendations, you typically describe your situation and purpose, such as: “Which TV is best for watching period dramas?” or “Which TV is suitable for FPS games?” The same applies when looking for a book in a library—you rarely search by specifying exact entities or keywords in advance. Search is fundamentally an act of learning about something unknown, so users cannot always describe the target using the “right” terms.

However, many web search engines require users to perform exactly that kind of unnatural search. Users are expected to input “keywords that are likely to appear on the page they ultimately want to find.” In other words, even though search is supposed to help you discover what you do not know, you still need to guess the words that might appear in the relevant pages.

To free users from such unnatural interactions, the Shoji Laboratory studies retrieval algorithms that allow more flexible and expressive queries.

Examples:

  • Searching for places or items using a user’s purpose as input
  • Web page search using “people’s impressions” as input
…and more.

Information Access That Sticks in Memory Click

So much daily web browsing—yet it hardly becomes knowledge.

Some reports suggest that people spend close to four hours per day browsing the web. Compared with TV, magazines, or books, the web may be the medium people engage with the most.

At the same time, it has been pointed out that information obtained on the web is difficult to retain. It is said that a large fraction of web queries are “revisitation queries.” These include not only returning to familiar sites, but also re-searching information that was previously read but later forgotten.

Even without relying on data, many people have an intuitive sense that web browsing leaves little lasting learning despite the time invested. If you practiced an instrument for four hours every day, you would become fairly skilled within a year. If you watched two films every day, you would become a movie enthusiast. If you trained for four hours daily, you would become physically strong. …But what about daily web browsing? Does it really leave you with lasting skills or knowledge?

The Shoji Laboratory studies information access technologies that help transform everyday web experiences into more meaningful learning, by supporting memory and knowledge consolidation from what users read online.

Examples:

  • Organizing daily browsing history into “cards,” or turning it into quizzes to support recall
  • Notifying users when they approach real-world places related to their past web searches
…and more.

Information Retrieval from Aggregated Opinions Click

Reviews are often the only evidence—but reading them all is impossible.

In recent years, people increasingly rely on online reviews to make decisions. For example, when choosing a movie to watch, official sites often provide limited information, so viewers consult audience reviews. Similarly, for many products, a spec sheet alone cannot convey real-world usability, so purchase decisions often depend on reviews.

Although reviews are widely used in everyday decision-making, technologies for searching, summarizing, and making large volumes of reviews usable are still underdeveloped. For example, if you want to find “a movie with an amazing plot twist,” how can you search for it through reviews? In many cases, reviewers do not use the literal phrase “plot twist”; instead they describe it in many ways, such as “an unexpected turn near the end” or “I immediately started rewatching after finishing.” Moreover, consider two movies: one where 100 people say “the ending was somewhat surprising,” and another where 5 people say “it was the most shocking ending of my life.” Which one is more “plot-twist-heavy”?

The Shoji Laboratory studies methods to aggregate and leverage information that may be unhelpful in isolation but becomes meaningful at scale, such as reviews, user-submitted recipes, and social media posts.

Examples:

  • Ranking “movies that are X” by aggregating review content
  • Mining cases like “people who bought X tend to do Y afterward” from social media
…and more.

Information Presentation That Makes You Want to Read Further Click

Links in social feeds are often ignored if only the title is shown.

Today, users are frequently presented with links: “How about reading this page?” or “What about this product?” Sharing news links with titles on social media is common, and recommendation algorithms suggest what news to read or what products to buy.

However, if a link only shows a headline or a product name, people often do not bother clicking. Even if we build sophisticated retrieval and recommendation algorithms, they become ineffective if no one actually visits the suggested pages.

One approach is to rewrite link text to match the user, or to replace a bare product name with a compelling message that highlights its appeal. For example, suppose there is a news story about a baseball game. A Giants fan might be more likely to click on “Giants dominate!” while a Tigers fan might respond better to “Tigers fought hard but narrowly lost.” Likewise, “ISO 25600 camera” may be less clickable than “a camera that captures clear faces even at night.”

The Shoji Laboratory studies information presentation and design that make search and recommendation results easier to understand and more attractive to read, even for non-experts. Using large language models and diverse data sources, we develop algorithms that summarize content, transform descriptions, and deliver information in a way that reaches the people who need it.

Examples:

  • Generating catchphrases from product specifications
  • Personalized rewriting of news headlines
…and more.

Information Access in the Physical World (Museum Information Access) Click

Simply visiting a museum casually does not always lead to much learning.

Information access is not limited to computers or the web. The physical world is filled with information, and people continuously acquire it in everyday life—from schools and libraries to bulletin boards in town.

The Shoji Laboratory focuses on museums as a particularly important domain where information access support can make a difference. We use information technologies to help visitors engage more deeply with exhibits, consolidate what they learned into lasting knowledge, and make the viewing experience more meaningful.

Museums attract a wide range of visitors. Many visitors come for non-self-motivated reasons—for example, school trips or free tickets. We study systems that help such visitors view exhibits more actively, become interested, and remember what they experienced.

Examples:

  • Analyzing interaction logs from a guide device and converting the viewing experience into a single “postcard”
  • Automatically generating a “treasure-hunt game” that encourages visitors to find exhibits they have not yet seen but may like
…and more.

Other Projects Click

If it is interesting, we are open to it.

Beyond the themes above, we have explored many other projects related (and occasionally not directly related) to information access.

Examples:
  • Automatically generating training charts by analyzing interaction logs from rhythm games
  • Web information retrieval within VR environments
…and more.

Selected Research Examples

ある観点に言及したレビューに含まれそうな「文の断片」を見つけよう!

iiWAS2025「Expanding Aspect Queries into Review Sentence Fragments for Product Comparison via LLM-Generated Synthetic Reviews」

生成AIにレシピを読ませて、「ほかのレシピとの違い」を言語化させよう!

iiWAS2025「Generating Distinctive Recipe Names via Relative Feature Comparison in Recipe Set」

エルフやドワーフと質問を出し合いながらミュージアムを観賞しよう

ICADL2025「Question-Based Viewing with LLM-Powered Personified Characters: A Role-Playing Dialogue System for Perspective-Taking in Museums」

手元の画像と似たスタイルに画像を変換できる画像生成AIを探したい

MMM2025「Image-Generation AI Model Retrieval by Contrastive Learning-based Style Distance Calculation」

レビュー文を、キャッチフレーズ文体に変換しよう!

2025年 電子情報通信学会論文誌 情報マネジメント特集 「生成的言語モデルによるレビュー文のキャッチフレーズ文体への変換」

「ある商品の購入目的を達成できる、ぜんぜん別の商品」を探そう

2024年 データベース学会論文誌 「商品レビューグラフを用いた利用目的を達成可能な代替商品の検索」

分かりづらい製品スペック情報を、体験的なエピソードに変換しよう!

BigComp2024「Generating Experiential Descriptions and Estimating Evidence Using Generative Language Model and User Products Reviews」

「みんなが○○と評している映画」を、キーワードで検索できるようにしよう

BigComp2024「BERT-Based Movie Keyword Search Leveraging User-Generated Movie Rankings and Reviews」

日常的なWeb閲覧の記録を、カードにまとめて集めよう

iiWAS2023「Digital Index Card Creation and Management for Memorizing What You See on the Web」

博物館鑑賞を「個人の興味に合わせた宝探しゲーム」にしてしまおう!

ICADL 2023 「Personalized Treasure Hunt Game for Proactive Museum Appreciation by Analyzing Guide App Operation Log」

レビュー中の記述が、何について書かれているかを整理しよう

iiWAS2023「Generating Fine-Grained Aspect Names from Movie Review Sentences Using Generative Language Model」

「ギターを弾ける場所」はどこ?目的を達成できる地物の検索

Rinton Press 論文誌 JDI「Geographic Entity Retrieval for Finding Places Suitable for Certain Purposes by Using Relevance Graphs on Places and Reviews」

任意のキーワードに対して、レビュー中に現れる多様な言い換え表現を発見しよう

Rinton Press 論文誌 JDI 「Doc2Vec-based Approach for Extracting Diverse Evaluation Expressions from Online Review Data」