Text this: EgoSep: Egocentric On-Screen Sound Source Separation for Real-Time Edge Computing