Evaluating AI assistants on everyday business tasks