As a user, I would like to indicate to a human interpreter that an audio file contains a mix of two languages (for example, mostly Spanish with some English).