I have a deployed a Golang app to Google Cloud Platform's App Engine Standard environment.
My app.yaml included the following lines:
automatic_scaling:
min_idle_instances: 0
max_idle_instances: 0
min_pending_latency: automatic
max_pending_latency: automatic
min_instances: 0
max_instances: 2
My intention was that the app should scale down to 0 instances when idle and spawn up to 2 instances to handle incoming requests.
After calling the service I noticed that it never scales down to 0 instances.
I opened the service configuration in the web console and found the following:
runtime: go
api_version: go1
env: standard
threadsafe: true
instance_class: F1
handlers:
- url: /penguin
script: _go_app
automatic_scaling:
min_idle_instances: automatic
max_idle_instances: automatic
min_pending_latency: automatic
max_pending_latency: automatic
max_instances: 2
Is it possible to scale App Engine Standard services down to 0 instances? What should that look like in the yaml?