It might be kind of insane, but if you have the programming ability, in theory you could use MediaPipe to calculate a per-frame face mesh. Store the mesh's oriented bounding box for each frame, then output a transformed image per frame so that all the faces overlap. Feed those aligned images into Stable Diffusion, apply the inverse transform to each result, and use the final images to generate the video. Essentially, this would remove as much of the face's frame-to-frame change as possible before Stable Diffusion sees it. That should make the output more temporally consistent, since the transforms cancel out the back-and-forth movement.
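To make the transform and inverse-transform step a bit more concrete, here is a rough sketch of just the geometry, in plain Python. It is not a full pipeline. It assumes you already have the face landmarks for a frame as pixel coordinates (for example from a face mesh detector), and that two of them (say, the eye corners) can serve as a stable reference axis. The function names, the `left_idx`/`right_idx` parameters and the `out_size` default are all made up for illustration.

```python
import math

def invert_affine(m):
    """Invert a 2x3 affine matrix given as ((a, b, tx), (d, e, ty))."""
    (a, b, tx), (d, e, ty) = m
    det = a * e - b * d
    ia, ib = e / det, -b / det
    ic, ie = -d / det, a / det
    return ((ia, ib, -(ia * tx + ib * ty)),
            (ic, ie, -(ic * tx + ie * ty)))

def apply_affine(m, point):
    """Apply a 2x3 affine matrix to a single (x, y) point."""
    (a, b, tx), (d, e, ty) = m
    x, y = point
    return (a * x + b * y + tx, d * x + e * y + ty)

def alignment_transform(landmarks, left_idx, right_idx, out_size=512):
    """Return (forward, inverse) 2x3 matrices for one frame.

    forward maps the face's oriented bounding box onto a fixed
    out_size x out_size canvas; inverse maps it back into the frame.
    left_idx / right_idx are hypothetical indices of two landmarks
    that define the box orientation (an assumption, not a rule).
    """
    lx, ly = landmarks[left_idx]
    rx, ry = landmarks[right_idx]
    angle = math.atan2(ry - ly, rx - lx)
    c, s = math.cos(-angle), math.sin(-angle)

    # Rotate every landmark so the reference axis becomes horizontal,
    # then take the axis-aligned box in that rotated space. That box
    # is the oriented bounding box of the face in the original frame.
    rotated = [(c * x - s * y, s * x + c * y) for x, y in landmarks]
    xs = [p[0] for p in rotated]
    ys = [p[1] for p in rotated]
    cx = (min(xs) + max(xs)) / 2
    cy = (min(ys) + max(ys)) / 2
    scale = out_size / max(max(xs) - min(xs), max(ys) - min(ys))

    # forward(p) = scale * (R * p - box_center) + canvas_center
    forward = ((scale * c, -scale * s, out_size / 2 - scale * cx),
               (scale * s, scale * c, out_size / 2 - scale * cy))
    return forward, invert_affine(forward)
```

In a real pipeline, each frame's `forward` matrix would go to an image warp (OpenCV's `cv2.warpAffine` takes a 2x3 matrix like this), the warped frame would go through Stable Diffusion, and the result would be warped back with `inverse` and pasted into the original frame. Because the box is computed per frame, a face that only moves, rotates in-plane or changes size should land on the same canvas coordinates every time. Out-of-plane head turns and expression changes would still leak through.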