TSG-Bench

Multi Action Scene Graph Generation(MA-SGG)

Generate scene graphs by decomposing actions when descriptions involve multiple action

MA-SGG aims to generate scene graphs by decomposing actions when given complex descriptions that involve multiple actions. The task formulation is identical to that of SA-SGG, except that an additional clue indicating the number of actions is provided, and the complexity of \( d_i \) is greater than 1. This makes MA-SGG more challenging than SA-SGG because the amount of information to process, especially to generate, is larger, and target actions may be implicit in the description. Also, although the number of actions is given, the task still requires the ability to accurately decompose, identify, and order valid actions from the description.

Scene Graph Generation

Single Action Scene Graph Generation(SA-SGG)

Example 1

Example 2

Multi Action Scene Graph Generation(MA-SGG)

Example 1

Example 2