Config Related Code in Spring Cloud Netflix - Eureka
Server Dependency
Your Eureka project -> spring-cloud-starter-eureka-server -> spring-cloud-starter-netflix-eureka-server, which depends on
- spring-cloud-starter
- spring-cloud-netflix-eureka-server
- spring-cloud-starter-netflix-archaius
- spring-cloud-starter-netflix-ribbon
- com.netflix.ribbon.ribbon-eureka
Note that there is no concrete code until spring-cloud-netflix-eureka-server
During Server Bootstrap
- Look for spring.profiles.active
- Look for spring.profiles.default
- Look for eureka.datacenter
- Look for eureka.environment
- Check if applicationInfoManager’s dataCenterInfo is AWS/Amazon; if so, start the AWS binder. Note that the binding strategy defaults to EIP
- For other settings, check ServerConfigBean, which is populated via reflection/injection. AVOID VANILLA EUREKA CONFIG PATH FROM NETFLIX DOC!
Spring Cloud Eureka Client Config
- The client looks for an AZ within its region; if no AZ is defined, it uses “defaultZone”
- The client then looks for serviceUrl.“myZone”; if that is undefined, it looks for serviceUrl.defaultZone
- serviceUrl defaults to “defaultZone” -> “http://localhost:8761/eureka”. That is why a local run works without setting any service URL (sketched right after this list)
- For other settings, check ClientConfigBean, which is populated via reflection/injection. AVOID VANILLA EUREKA CONFIG PATH FROM NETFLIX DOC!
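A minimal sketch of the fallback order above, assuming a flattened view of the config (region, availabilityZones, serviceUrl); it illustrates the documented behaviour rather than the actual EurekaClientConfigBean code:

```java
import java.util.List;
import java.util.Map;

// Illustration of the documented fallback, not Spring Cloud's actual implementation.
class ZoneResolutionSketch {
    static final String DEFAULT_ZONE = "defaultZone";
    static final String DEFAULT_URL = "http://localhost:8761/eureka";

    // availabilityZones: region -> comma-separated zone list
    // serviceUrl: zone -> comma-separated Eureka URLs
    static List<String> resolveServiceUrls(String region,
                                           Map<String, String> availabilityZones,
                                           Map<String, String> serviceUrl) {
        // 1. Find the zones for this region; fall back to "defaultZone" if none are configured.
        String zones = availabilityZones.get(region);
        String myZone = (zones == null || zones.isEmpty()) ? DEFAULT_ZONE : zones.split(",")[0];

        // 2. Look up serviceUrl.<myZone>; fall back to serviceUrl.defaultZone,
        //    which itself defaults to http://localhost:8761/eureka.
        String urls = serviceUrl.get(myZone);
        if (urls == null) {
            urls = serviceUrl.getOrDefault(DEFAULT_ZONE, DEFAULT_URL);
        }
        return List.of(urls.split(","));
    }
}
```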
Note
- In my Eureka project, application.yml and application.properties can co-exist in src/main/resources and both will be read. I use application.properties to configure the logger, which is logback
- Following the guide, the AWS-specific init is placed in the main class of my Eureka server. Note the example has a @Profile(“!default”) annotation, i.e., you need to set spring.profiles.active to something other than “default”, either via -D arguments or application.properties
AWS-specific code in Spring Cloud Netflix - Eureka
Note that Spring Cloud Eureka adds a layer of abstraction on top of vanilla Netflix Eureka, even though they share similar names such as client/server/registry, etc. Therefore, when navigating the code, I often run into definitions embedded in the vanilla jar.
Eureka Client - Client config
- Default region is us-east-1
- serviceUrl is an az -> fully qualified URL map; the default mapping is “defaultZone” -> “http://localhost:8761/eureka”
- availabilityZones is a region -> az map; the default is empty
Eureka Client - Instance config
- aSGName - the autoscaling group associated with this instance
- dataCenterInfo - the data center where this instance is deployed; defaults to DataCenterInfo.Name.MyOwn
Eureka Server - Bootstrap
- awsBinder - during serverContext init, we check whether applicationInfoManager.getInfo() is from AWS:
protected boolean isAws(InstanceInfo selfInstanceInfo) {
    boolean result = DataCenterInfo.Name.Amazon == selfInstanceInfo
            .getDataCenterInfo().getName();
    log.info("isAws returned " + result);
    return result;
}
Eureka Server - Server Config
- aWSAccessId
- aWSSecretKey
- aWSBindingStrategy, default EIP
Eureka Server - Controller
- In header
if (info.getName() == DataCenterInfo.Name.Amazon) {
    AmazonInfo amazonInfo = (AmazonInfo) info;
    model.put("amazonInfo", amazonInfo);
    model.put("amiId", amazonInfo.get(AmazonInfo.MetaDataKey.amiId));
    model.put("availabilityZone", amazonInfo.get(AmazonInfo.MetaDataKey.availabilityZone));
    model.put("instanceId", amazonInfo.get(AmazonInfo.MetaDataKey.instanceId));
}
- When populating apps on the controller, it does a similar check as above
Outstanding Questions
- How is an instance of ApplicationInfoManager populated?
- In the Eureka client, how is the region info populated?
- Why is my Spring Cloud Eureka not logging while the original is?
Round 439
869A - The Artful Expedient
Claim: Karen always wins.
Proof sketch: if x(i) xor y(j) = x(k), then x(k) xor y(j) = x(i) as well, and i != k because the values are distinct and positive; the same pairing applies when the xor lands among the y values. So the valid pairs group into twos, the count is always even, and Karen wins.
On Netflix Eureka
Deploying a new mid-tier service on AWS without Eureka
This means our service does not face clients on the internet, i.e., it is not an edge service; instead, its customers are other services.
- Add a DNS entry, e.g., in Route 53, so that traffic within the same AZ is routed within the AZ, e.g., inter-service calls from within the same AZ. The name is the single endpoint exposed to other services; the value is the internal load balancer name.
- Add an internal load balancer, whose endpoint is the value of the DNS name in the previous step. This serves the standard LB purposes within the AZ.
- Add an auto scaling group (ASG) to host all service instances, so that the number of instances can be scaled based on traffic.
Note that these steps must be performed for EVERY SINGLE service, i.e., each service has 3 fixed points (DNS, LB, ASG) that the service’s team has to manage.
Deploying a new mid-tier service on AWS with Eureka
- After Eureka is up, the service just uses Eureka’s API to register its instances. Moreover, with the spring-cloud-netflix suite, such registration is handled automatically by the client => the service team doesn’t need to worry about any fixed point, except its own stable service name!
- The only fixed points Eureka ops need to maintain are the ones directly related to Eureka itself, and they have nothing to do with the registered services; i.e., Eureka is like a service database, and ops don’t worry about the data inside it, only the database parameters
Notes
- The Eureka client uses client-side load balancing, whereas ELB is traditional proxy-based load balancing, i.e., all traffic goes through the ELB
- Netflix prefers stateless servers, i.e., centralize all state (“fixed points”) into DBs
- Registrations may happen on an orphaned server, so some clients may see new registrations while others may not
- It seems that Eureka doesn’t maintain persistent state itself; instead, the registry is constructed purely from heartbeats, both between Eureka servers and from application servers
- ZooKeeper is a CP system. By comparison, Eureka is an AP system, since service discovery is a fundamental service that must stay available
HA Eureka in AWS
- One ASG for a Eureka cluster within a region
- One Elastic IP (EIP) for every server in the cluster. The Eureka server will find an unused EIP and bind it to itself during startup, after you configure Eureka with the list of EIP addresses
- Add a DNS TXT record for the whole region; the records should be of the format:
txt.region1.eureka-server-domain-name="az1.eureka-server-domain-name" "az2.eureka-server-domain-name" "az3.eureka-server-domain-name"
txt.az1.eureka-server-domain-name="domain-with-EIP1-in-zone1" "domain-with-EIP2-in-zone2"
txt.az2.eureka-server-domain-name="domain-with-EIP3-in-zone2"
txt.az3.eureka-server-domain-name="domain-with-EIP4-in-zone3"
By default, the Eureka client searches for the property file eureka-client.properties on the classpath. In every eureka-client.properties, set:
eureka.shouldUseDns=true
eureka.eurekaServer.domainName=mydomaintest.netflix.net
eureka.eurekaServer.port=7001
eureka.eurekaServer.context=eureka/v2
Note that values within a TXT record are separated by spaces (see the lookup sketch below).
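For illustration only, a JNDI-based sketch of how a client could walk the two-level TXT hierarchy above (region record -> zone records -> EIP-bound hostnames); the domain names are the placeholders from the records above, and this is not the actual DiscoveryClient code:

```java
import javax.naming.Context;
import javax.naming.directory.Attribute;
import javax.naming.directory.InitialDirContext;
import java.util.ArrayList;
import java.util.Hashtable;
import java.util.List;

class EurekaDnsLookupSketch {
    // Return the space-separated values of all TXT records under `name`.
    static List<String> txtRecords(String name) throws Exception {
        Hashtable<String, String> env = new Hashtable<>();
        env.put(Context.INITIAL_CONTEXT_FACTORY, "com.sun.jndi.dns.DnsContextFactory");
        Attribute txt = new InitialDirContext(env)
                .getAttributes(name, new String[] {"TXT"})
                .get("TXT");
        List<String> values = new ArrayList<>();
        for (int i = 0; txt != null && i < txt.size(); i++) {
            for (String v : txt.get(i).toString().split("\\s+")) {
                values.add(v.replace("\"", ""));
            }
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        // Placeholder domain names from the example records above.
        // The region-level record lists one zone-level record per AZ...
        for (String zoneName : txtRecords("txt.region1.eureka-server-domain-name")) {
            // ...and each zone-level record lists the EIP-bound server hostnames.
            System.out.println(zoneName + " -> " + txtRecords("txt." + zoneName));
        }
    }
}
```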
You can also hardcode Eureka instances in the Eureka config files, but this means the file content will be baked into the AMI and distributed to ALL instances!
MemSQL Start[c]UP 3.0 Round 2
865A: Save the problem!
Official solution: suppose we have only denominations 1 and 2; then given amount A, there are exactly A/2 + 1 ways to form it, because the only exchange we can make is replacing two 1s with a 2.
This is way quicker than my reverse dp approach during the contest!!!
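A quick check of the claim (with denominations {1, 2}, the number of ways to form amount a is a/2 + 1, since the only choice is how many 2s to use):

```java
class SaveTheProblemCheck {
    public static void main(String[] args) {
        for (int a = 0; a <= 10; a++) {
            int ways = 0;
            for (int twos = 0; 2 * twos <= a; twos++) ways++; // the rest is filled with 1s
            System.out.printf("amount=%d ways=%d formula=%d%n", a, ways, a / 2 + 1);
        }
    }
}
```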
865B: Ordering Pizza
I tried a two-pointer approach, and it got TLE!!! Very likely because I put the number of pizzas in an infinite loop.
Official solution:
To simplify the implementation, we add a dummy person who eats all the leftover slices without gaining any happiness. Also, there is at most one pizza shared by people preferring different types. So there are two potential cases, one for each type of pizza; we just try both and pick the better one.
865D: Buy Low Sell High
Every day, we gain one option to buy. If today’s price is higher than some option’s price, we know we can sell that option at today’s price, but we don’t know whether it is the best time to sell, so we just sell and give ourselves an option to buy back at the price we sold at. This transformation works because doing nothing on a day <=> selling and buying back on the same day.
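A short sketch of that exchange argument with a min-heap of “options to buy”: selling and immediately pushing the sell price back in lets a later, better day undo the sale.

```java
import java.util.PriorityQueue;

class BuyLowSellHighSketch {
    static long maxProfit(long[] prices) {
        PriorityQueue<Long> options = new PriorityQueue<>(); // cheapest buy option on top
        long profit = 0;
        for (long p : prices) {
            if (!options.isEmpty() && options.peek() < p) {
                profit += p - options.poll(); // sell the cheapest option we hold at today's price
                options.add(p);               // re-add p so this sale can be undone by a later sell
            }
            options.add(p);                   // every day also adds one new option to buy at p
        }
        return profit;
    }
}
```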
On TCP
- Even though we colloquially call TCP units “packets”, the correct term is segment for TCP, datagram for IP, and frame for the link layer
- The length of the data section is not in the TCP segment header, and thus needs to be inferred
- During the 3-way handshake that establishes a connection, the initial sequence number on either side is chosen randomly
- During the 4-way handshake of connection termination, each side terminates independently. Also, the side that sends the first FIN will wait for a timeout period after sending the final ACK before closing the connection, so the local port is unavailable for new connections during that time. Some optimizations turn the handshake into 3 ways by letting the receiving end send FIN & ACK together, since they are neighboring steps.
- Most implementations map a session to an OS process, with address and port (0 to 65535) as the identifier. Note that the port exists only at the TCP layer, and the IP address only at the IP layer.
- The window field specifies the number of bytes the other side is willing to receive
DR Drills with AWS Aurora
Objectives
- Verify that our services on Aurora can still perform within SLA with a degraded Aurora service
- Build tools and procedures for such drills, so that we can repeat the drills against other services on different platforms
Note
- All drills should happen in the context of our performance-testing load, i.e., with new traffic coming in
- Upon an injected failure, Aurora will just restart on the spot instead of failing over; this means the service will wait until the master recovers
- Between AZs, the replica lag is around 20 ms. However, this only refers to lag in the cache; because Aurora uses a shared-disk design, the data is always consistent
- Failover times are typically 60-120 seconds. This means most connections will time out during failover
- Aurora also has the capability to inject disk failures and disk congestion, but drilling on those brings debatable additional value until we gain more experience
Crash dispatcher on writer
- master: create db instance
- read replica: Read replica has been disconnected from master. Restarting Mysql => create db instance
Crash instance on writer
- master: DB instance restarted
- read replica: DB instance restarted
Crash node on writer
- master: DB instance restarted
- read replica: Read replica has fallen behind the master too much. Restarting Mysql => DB instance restarted
Failover
- old master:
a. Started cross AZ failover to DB instance
b. A new writer was promoted. Restarting database as a reader.
c. DB instance restarted
d. Completed failover to DB instance
- new master:
a. Started cross AZ failover to DB instance
b. DB instance shutdown
c. DB instance restarted
d. Completed failover to DB instance
Schedule
- Such drills should happen once a month, before major version releases.
- The drill should start during low-traffic times, e.g., 2am local time
Drill 1: Failover
- Ensure traffic is going through our service. Either through traffic replication or load testing tool
- Failover the current writer to a reader in a different AZ
- During the failover, the service health check should remain OK the whole time
- During the failover, write failure is expected, but read failure should not happen
Drill 2: Deleting a read replica
- Ensure traffic is going through our service. Either through traffic replication or load testing tool
- Ensure we have at least 2 healthy aurora instances running
- Pick a read replica and delete it
- During the failover, the service health check should remain OK the whole time
- During the failover, neither write nor read failures should happen
- Create a new read replica off the current writer
Drill 3: Deleting the current writer
- Ensure traffic is going through our service. Either through traffic replication or load testing tool
- Ensure we have at least 2 healthy aurora instances running
- Pick the current writer and delete it
- During the failover, the service health check should remain OK the whole time
- During the failover, write failure is expected, but read failure should not happen
- Create a new read replica off the current writer
CS Academy Round 50: Min Races
I solved it during the contest, but the official explanation is cleaner than mine.
Official solution
Sort all drivers by their final rank, i.e., we will have a list of classes.
Claim: the answer to the problem = the length of the longest decreasing subsequence.
Proof: this is actually Dilworth’s theorem: in any finite partially ordered set, the maximum number of elements in any antichain equals the minimum number of chains in any partition of the set into chains.
Our greedy construction actually serves as a proof.
Codeforces Notes
855B: Marvolo Gaunt’s Ring
The official solution uses a multi-stage DP approach:
v[0][0] = p * a[0];
v[0][1] = (p + q) * a[0];
v[0][2] = (p + q + r) * a[0];
for (int i = 1; i < n; i++) {
    v[i][0] = max(v[i-1][0], p * a[i]);
    v[i][1] = max(v[i-1][1], q * a[i] + v[i][0]);
    v[i][2] = max(v[i-1][2], r * a[i] + v[i][1]);
}
return v[n-1][2];
855C: Helga Hufflepuff’s Cup
I got the DP states during the contest, but I got stuck calculating the number of ways to distribute the max-security nodes among the children!!!
Consider a node with n children; the number of ways to distribute k top-security nodes among them is
ways(childI, type, secN) = sum over sofar of ways(childI - 1, type, secN - sofar) * f(childI, type, sofar)
Note that the runtime/memory is OK because secN <= 10 (see the merge sketch below)
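A sketch of just that child-merge convolution, with hypothetical arrays: merged[s] is the number of ways to place s top-security nodes in the children processed so far, and child[s] plays the role of f(childI, type, s) for the next child’s subtree (the current node’s type is fixed outside this helper):

```java
class MaxSecurityMergeSketch {
    // merged/child are hypothetical per-type count arrays; O(K^2) merge per child,
    // which stays cheap across the whole tree since K <= 10.
    static long[] mergeChild(long[] merged, long[] child, int K, long MOD) {
        long[] next = new long[K + 1];
        for (int already = 0; already <= K; already++) {
            for (int inChild = 0; already + inChild <= K; inChild++) {
                next[already + inChild] =
                        (next[already + inChild] + merged[already] * child[inChild]) % MOD;
            }
        }
        return next;
    }
}
```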
864E: Fire
Obviously, if two items are both saved in the final solution, we should greedily save the one that burns earlier first. So we sort by burn time, and the final solution must be a subsequence of this sorted array.
Now we have 2 states to worry about: the current time and the value. I got stuck on how to handle this!!!
Note that the value range is small, so we can force that parameter to be part of the DP state, i.e., the knapsack idea
minT(i, v) = minimum time to save value v among the first i items
Alternatively, we could use maxV(i, t) too, but the value range for v is 400, much less than the t range (see the knapsack sketch below)
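A knapsack-style sketch of minT(i, v), assuming the items are already sorted by the moment d[i] at which they burn; t[i] is the time needed to save item i and p[i] its value, and the “finish strictly before it burns” check reflects my reading of the statement, so treat this as a sketch rather than a verified submission:

```java
import java.util.Arrays;

class FireSketch {
    static int bestValue(int[] t, int[] d, int[] p) {
        int totalValue = 0;
        for (int v : p) totalValue += v;
        final int INF = Integer.MAX_VALUE / 2;
        int[] minT = new int[totalValue + 1];          // minT[v] = min time to save total value v
        Arrays.fill(minT, INF);
        minT[0] = 0;
        for (int i = 0; i < t.length; i++) {
            for (int v = totalValue; v >= p[i]; v--) { // iterate value downwards: 0/1 knapsack
                if (minT[v - p[i]] + t[i] < d[i]) {    // assumption: item i must be saved before it burns
                    minT[v] = Math.min(minT[v], minT[v - p[i]] + t[i]);
                }
            }
        }
        int best = 0;
        for (int v = 0; v <= totalValue; v++) if (minT[v] < INF) best = v;
        return best;
    }
}
```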
On HA of AWS Mysql RDS and Aurora
MySQL RDS
In multi-AZ deployments for MySQL, data in the primary DB is synchronously replicated to a standby instance in a different AZ. Failover is automatic, and the endpoint remains the same. Backups are taken from the standby, so that I/O is not suspended on the primary.
Note that a read replica, a different concept from the multi-AZ standby, uses MySQL’s async replication.
Aurora RDS
Storage is handled by the storage layer, i.e., a shared-disk approach, and is replicated across 3 AZs, each with 2 copies.
Aurora uses an SSD-backed virtualized storage layer.
Each write has to be acknowledged by 4 out of 6 copies (synchronously), while a read covers 3 (4 + 3 > 6). This ensures:
- If 1 AZ plus 1 more machine is down, we can still read
- If we lose 2 replicas, even if they are in the same AZ, we can still write
For each source DB cluster, you can have only one cross-region read replica DB cluster. Data transfer between regions incurs extra charges.
It is possible to promote the cross-region replica to a standalone cluster, but this also means we need to add a cross-region replica to the newly promoted master - the old master ends up on the other side of a split brain and cannot be re-incorporated!
Also note that for EACH region, Aurora gives two endpoints: one reader endpoint and one writer endpoint. It is not smart enough to do automatic load balancing!
CODE FESTIVAL 2017 qual A
C: Palindromic Matrix
I used classification to solve it. The official solution uses brute force
Given H and W, we can tell how many positions have 4 mirrors, 2 mirrors, and 1 mirror.
So for each letter, we try to fit it into 4-mirror positions, then 2-mirror, then 1-mirror positions, in that order, one by one.
The answer is no if we run out of positions along the way; otherwise, yes.
D: Four Coloring
- Convert Manhattan distance to Chebyshev distance
- Divide the converted grid into d * d squares
- Assign 4 colors to the squares in a repeating 2x2 pattern. This is correct because each pair at distance d lies in neighboring converted squares
Distributed Systems Testing - from TiDB talk
- Profile everything, even in production - to catch the once-in-a-lifetime bug
- Tests may make your code less beautiful - you may need to add members to structs just for testing, but we still need to design for tests
- Fault injection: to test network failure cases, try to automate it without human intervention - otherwise it is inefficient
- Disks fail at a rate of > 8% after 3 years
- Important to monitor NTP and detect clocks jumping back => normally bad!!
- Reading data from disk without a checksum => no protection against potential data corruption
- Fault injection: disk errors, network card, CPU, clock, file system, network & protocol => you need to simulate everything so that you can inject errors
- Common tools: libfiu, the OpenStack fault injection factory, Jepsen (the most famous)
- FoundationDB limitation: its fake multi-process approach does not work well with languages where a single thread is in effect multi-threaded, e.g., channels
- TiKV uses Namazu. Planning to introduce OpenTracing (in Go) to fill a role similar to Google Dapper
- Don’t test failure cases by actually triggering the failure; use your simulation layer
My TiDB benchmark runs
Documenting the results here since there are not many English sources on this.
Setup
6 r3.2xlarge (8 vCPU, 61 GB RAM): 3 act as PD/TiDB, 3 act as TiKV. All TiDB/TiKV processes run on mounted instance-store SSDs.
2 c4.2xlarge (8 vCPU, 15 GB RAM) as test driver servers.
All 8 servers are within the same availability zone. Ping between servers is around 0.15 ms.
Sysbench: Oltp_read_only
A. Prepare data: 16 tables, each with 1M rows
./sysbench ./lua/oltp_read_only.lua --mysql-host=$TIDB1 --mysql-port=4000 --mysql-user=root --mysql-password="" --mysql-db=test --tables=16 --table-size=1000000 --report-interval=10 --threads=10 --time=0 prepare
B. On the first test driver server, run
./sysbench ./lua/oltp_read_only.lua --mysql-host=$TIDB1 --mysql-port=4000 --mysql-user=root --mysql-password="" --mysql-db=test --tables=16 --table-size=1000000 --report-interval=10 --threads=10 --time=0 run
C. On the second test driver server, run
./sysbench ./lua/oltp_read_only.lua --mysql-host=$TIDB2 --mysql-port=4000 --mysql-user=root --mysql-password="" --mysql-db=test --tables=16 --table-size=1000000 --report-interval=10 --threads=10 --time=0 run
Note that the two test servers use different --mysql-host values in an attempt to balance network traffic across the TiDB/PD servers.
Result: combined QPS from the two test servers is around 12.5k; 95th-percentile latency is between 25-39 ms.
Sysbench: Oltp_insert
A. Prepare data: same command as oltp_read_only: 16 tables each with 1M rows
B. On the first test driver server, run
./sysbench ./lua/oltp_insert.lua --mysql-host=$TIDB1 --mysql-port=4000 --mysql-user=root --mysql-password="" --mysql-db=test --tables=16 --table-size=1000000 --report-interval=10 --threads=20 --time=0 run
C. On the second test driver server, run
./sysbench ./lua/oltp_insert.lua --mysql-host=$TIDB2 --mysql-port=4000 --mysql-user=root --mysql-password="" --mysql-db=test --tables=16 --table-size=1000000 --report-interval=10 --threads=20 --time=0 run
Again, the two test servers use different --mysql-host values in an attempt to balance network traffic across the TiDB/PD servers.
Result: combined TPS from the two test servers is around 4.5k; 95th-percentile latency is around 14 ms.
Findings from sysbench runs
- When I run the same commands with --range_selects=false, the combined QPS is around 17k, with 95th-percentile latency between 14-22 ms
- Among the 4 range queries in oltp_read_only.lua, only execute_distinct_ranges is slower. Such behaviour is expected, given the nature of the distinct_ranges query.
- The latency/QPS improves almost linearly as I take out the range queries one by one, which is expected
TPCC: latency test
I populate data with a minimal load
./tpcc_load -h 127.0.0.1 -P 4000 -d tpcc1000 -u root -p "" -w 1
and run a simple test
./tpcc_start -h 127.0.0.1 -P 4000 -d tpcc1000 -u root -w 1 -c 1 -r 10 -l 40
Results
10, trx: 80, 95%: 59.947, 99%: 62.177, max_rt: 62.718, 80|54.310, 8|61.505, 8|144.893, 9|304.212
20, trx: 86, 95%: 59.696, 99%: 60.778, max_rt: 61.960, 83|55.630, 8|59.896, 8|142.030, 8|290.288
30, trx: 81, 95%: 59.911, 99%: 62.270, max_rt: 63.184, 85|55.237, 9|61.069, 8|144.407, 8|300.596
40, trx: 74, 95%: 61.437, 99%: 62.344, max_rt: 64.457, 75|54.663, 7|58.993, 8|145.278, 9|308.553
Findings from TPCC runs
- When I deploy 3 instances across AZs but within the same DC, the 95th percentile jumps to about 110 ms
- Because of MVCC, if the number of warehouses is too low (< 10 * number of threads), we will see “unable to update” errors from time to time
- Even though TPCC is the established benchmark, both AliSQL and TiDB primarily use sysbench
Expected Values in Codeforces
Easy problems in the last two contests, but I didn’t solve them quickly because of formulas with expected values
839C: Journey
Idea 1
ans = sum over leaves of Pr(leaf) * Depth(leaf). Pr(leaf) can be calculated in DP steps, while Depth(leaf) can be computed in a separate DFS and used when we reach the leaf.
Idea 2
For each subtree, ans(root) = 1 + (sum of ans(child)) / k, where k is the number of children and ans(leaf) = 0. This formula works because every step down increases the depth by 1, so we can factor all the 1s out (see the sketch below).
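A recursive sketch of Idea 2, where children is an adjacency list of the tree already rooted at node 1 (parent edges removed):

```java
import java.util.List;

class JourneySketch {
    static double expectedLength(int v, List<List<Integer>> children) {
        if (children.get(v).isEmpty()) return 0.0;     // leaf: the journey ends here
        double sum = 0.0;
        for (int c : children.get(v)) {
            sum += expectedLength(c, children);
        }
        return 1.0 + sum / children.get(v).size();     // one step down, then a uniformly random child
    }
}
```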
841C: Leha and function
Intuition is obvious, but how to prove it? I used experimentation to prove the greedy solution - a slow approach.
For any K-size subset of [1, N] with elements a(1) < … < a(K), we can use a bijection to represent it as a list of deltas d(1), d(2), …, d(K), d(K+1), where d(1) = a(1), the sum of all d(i) is N + 1, and d(K+1) = N + 1 - a(K).
Since this applies to any K-size subset of [1, N], we know Ex(d(1)) + Ex(d(2)) + … + Ex(d(K+1)) = N + 1, by linearity of expectation.
The hard part is to prove Ex(d(1)) = (N + 1) / (K + 1).
Idea 1
Use the hockey-stick identity!
sum over i in [r, n] of choose(i, r) = choose(n + 1, r + 1)
Idea 2
Note that if we swap any d(i) and d(j) and keep the others the same, the corresponding subset is still valid, i.e., d(i) and d(j) follow the same distribution => Ex(d(i)) = Ex(d(j))
835D: Palindromic characteristics
Insights
- if a string is k-palindrome, then it is also a k-1 palindrome!!!
Proof: simple induction on k
- a string is a k-palindrome <=> it is a palindrome and its left half is a (k-1)-palindrome
Proof: induction on k, base case k = 2
So we need to calculate maxK(l, r) = the max degree k of the substring [l, r]. This means we need two DPs: one for whether the substring is a palindrome, and another for the max degree of the left half being >= k - 1. In the end, we do a reverse prefix sum from |s| down to 1 (see the sketch below).
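A sketch of the two DPs and the final reverse prefix sum: pal[l][r] marks palindromic substrings, deg[l][r] is the largest k for which s[l..r] is a k-palindrome, and cnt turns “exactly degree d” counts into the required “at least k” counts:

```java
import java.util.Arrays;

class PalindromicCharacteristicsSketch {
    static long[] characteristics(String s) {
        int n = s.length();
        boolean[][] pal = new boolean[n][n];
        int[][] deg = new int[n][n];
        long[] cnt = new long[n + 2];
        for (int len = 1; len <= n; len++) {
            for (int l = 0; l + len - 1 < n; l++) {
                int r = l + len - 1;
                pal[l][r] = s.charAt(l) == s.charAt(r) && (len <= 2 || pal[l + 1][r - 1]);
                if (pal[l][r]) {
                    // degree = degree of the left half plus one; a single char has degree 1
                    deg[l][r] = len == 1 ? 1 : deg[l][l + len / 2 - 1] + 1;
                    cnt[deg[l][r]]++;
                }
            }
        }
        for (int k = n - 1; k >= 1; k--) cnt[k] += cnt[k + 1]; // a k-palindrome is also a (k-1)-palindrome
        return Arrays.copyOfRange(cnt, 1, n + 1);
    }
}
```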
TiDB vs CockroachDB
Last updated: Apr 8, 2019
Similar designs
- Both follow the architecture of a stateless SQL layer on top of a replicated, strongly consistent KV store.
- Both use RocksDB to serve the KV store. Consistency is guaranteed by the standard write ahead log (WAL) + replicated state machine (RSM) model. WAL is replicated by raft.
- This architecture is inspired by Spanner/F1.
- Each DB has its own optimizations on top of vanilla Raft. I will not dive into the details here, because the actual optimizations keep evolving
- To implement transactions, both use the standard 2PC + MVCC idea.
- Both, by default, follow a serializable concurrency model, implemented with lease reads.
- Both use online schema change, based on the paper “Online, Asynchronous Schema Change in F1”, i.e., multi-stage schema change
The similarities end at this high-level design; there are huge differences in the implementation details.
Performance
- See the test I did in 2017. Note that I was completely new to TiDB; if I ran it again, the numbers would be much better.
- Use snapshot reads instead of consistent reads
- For standard InnoDB MySQL, our planning expectation is 5k QPS at 10 ms average latency on an r4.4xlarge.
System design: distribute whitelist
Consider a mail client that gets updates to the blocked-website list from our service. Design such a service so that we can distribute the block list: the list is updated on the server once per second, and each client pulls from our service once every 30 minutes.
Design: Obviously, we need to optimize for read performance.
So the list has been updated about 1800 times between two consecutive pulls from a client.
The client should send a request fetchUpdate(clientVersion), which should return the deltas since the client’s version.
On the read side, we should keep a list of deltas keyed by version and return the deltas after the client’s version. In a KV store that supports range scans, this should be highly efficient, because consecutive versions are laid out together in the persistence layer.
To further improve performance for clients that have missed many updates, we can keep a compacted snapshot every certain number of versions; if the request’s version is older than the last compacted version, just return the whole list. We should also run log compaction from time to time to purge deltas that are too old (see the sketch below).
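A minimal sketch of that read path, assuming a sorted KV store stood in for by a ConcurrentSkipListMap keyed by version; all names here are hypothetical:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ConcurrentSkipListMap;

class BlocklistStoreSketch { // hypothetical names, not a production design
    record Delta(long version, List<String> added, List<String> removed) {}
    record Update(long latestVersion, List<String> fullList, List<Delta> deltas) {}

    private final ConcurrentSkipListMap<Long, Delta> deltas = new ConcurrentSkipListMap<>();
    private volatile long snapshotVersion = 0;            // version of the latest compacted snapshot
    private volatile List<String> snapshot = List.of();   // full block list as of snapshotVersion

    Update fetchUpdate(long clientVersion) {
        long latest = deltas.isEmpty() ? snapshotVersion : deltas.lastKey();
        if (clientVersion < snapshotVersion) {
            // Client is older than the compaction horizon: ship the full compacted list
            // plus every delta we still keep, instead of deltas we have already purged.
            return new Update(latest, snapshot, new ArrayList<>(deltas.values()));
        }
        // Range scan over (clientVersion, latest]; cheap when versions are adjacent on disk.
        return new Update(latest, null, new ArrayList<>(deltas.tailMap(clientVersion, false).values()));
    }
}
```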
For global updates, we designate one DC as the master and the others as slaves, and use a cross-region ZK to track which DC is alive and forward all write traffic to it. When the old master recovers, it just talks to the new master and syncs from the single source of truth.
Codeforces notes
828C: String Reconstruction(!!!)
A brute-force approach will time out. What can we do to remove duplicate operations?
My idea
So we need to keep track of contiguous filled intervals and skip positions that are already filled.
We can keep, for each position, the previous-empty and next-empty index, with path compression to help the runtime. However, this gives TLE?!
Official solution
Sort the strings by starting index, and keep track of the index before which everything has been filled. At each step, either advance that index and copy the part of the string that extends past it, or skip the current string because it lies entirely within the processed prefix (see the sketch below).
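A sketch of that approach: process occurrences in increasing start position and only write the part that extends past filled, the prefix that is already final; unset cells become 'a':

```java
import java.util.Arrays;
import java.util.Comparator;

class StringReconstructionSketch {
    record Occ(int pos, String s) {}                     // 0-based start position of one occurrence

    static String reconstruct(Occ[] occs, int n) {       // n = length of the answer (max of pos + len)
        char[] res = new char[n];
        Arrays.sort(occs, Comparator.comparingInt(Occ::pos));
        int filled = 0;                                   // everything before this index is final
        for (Occ o : occs) {
            for (int i = Math.max(o.pos(), filled); i < o.pos() + o.s().length(); i++) {
                res[i] = o.s().charAt(i - o.pos());
            }
            filled = Math.max(filled, o.pos() + o.s().length());
        }
        for (int i = 0; i < n; i++) if (res[i] == 0) res[i] = 'a';
        return new String(res);
    }
}
```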
831C: Jury Marks
Calculate the prefix sums of the scores and sort both arrays. The min value of b must match one of the entries at positions [0, k - n] of the sorted prefix-sum array, so we can just brute-force each potential match, check whether the implied initial score is feasible, and add it to the result set.
Note that once we fix ONE mapping between a b value and a prefix sum, the whole sequence becomes deterministic; i.e., the answer is <= k - n + 1 (see the sketch below).
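A sketch of that brute force: every feasible initial score equals b[0] minus some prefix sum, so try each candidate and keep the distinct ones under which every remembered value is reachable:

```java
import java.util.HashSet;
import java.util.Set;

class JuryMarksSketch {
    static int countInitialScores(int[] a, int[] b) {
        Set<Long> prefixSums = new HashSet<>();
        long[] prefix = new long[a.length];
        long run = 0;
        for (int i = 0; i < a.length; i++) {
            run += a[i];
            prefix[i] = run;
            prefixSums.add(run);
        }
        Set<Long> feasibleStarts = new HashSet<>();       // distinct initial scores that work
        for (long p : prefix) {
            long init = b[0] - p;                         // candidate initial score
            boolean ok = true;
            for (int x : b) {
                if (!prefixSums.contains(x - init)) { ok = false; break; }
            }
            if (ok) feasibleStarts.add(init);
        }
        return feasibleStarts.size();
    }
}
```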
Weekend Contests
agc076b: Built?
A naive MST approach is too slow, namely, there are too many potential edges. Now that we are interested in an optimal solution, can we use some greedy insight to eliminate edges that are not actually required? This is where I got stuck!!!
What I should have done is experiment with simpler cases and see whether there are edges that I know will never appear in the final solution.
Consider 3 points a, b, c with xa < xb < xc: the edge from a to c will never be needed in the final solution, because we can keep a-b and b-c in the MST instead. Therefore, we only need to add edges between immediate neighbors (in x order, and by symmetry in y order), reducing the number of edges to O(n) (see the sketch below).
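A sketch of the reduced-edge MST: sort the towns by x and by y, keep only edges between neighbors in each order (with weight min(|dx|, |dy|)), and run plain Kruskal on those O(n) edges:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class BuiltSketch {
    record Edge(int u, int v, long w) {}

    static long minCost(long[] x, long[] y) {
        int n = x.length;
        Integer[] byX = new Integer[n], byY = new Integer[n];
        for (int i = 0; i < n; i++) { byX[i] = i; byY[i] = i; }
        Arrays.sort(byX, (a, b) -> Long.compare(x[a], x[b]));
        Arrays.sort(byY, (a, b) -> Long.compare(y[a], y[b]));
        List<Edge> edges = new ArrayList<>();
        for (int i = 0; i + 1 < n; i++) {                 // only immediate neighbors are candidates
            edges.add(makeEdge(byX[i], byX[i + 1], x, y));
            edges.add(makeEdge(byY[i], byY[i + 1], x, y));
        }
        edges.sort((a, b) -> Long.compare(a.w(), b.w()));
        int[] parent = new int[n];
        for (int i = 0; i < n; i++) parent[i] = i;
        long total = 0;
        for (Edge e : edges) {                            // plain Kruskal with union-find
            int ru = find(parent, e.u()), rv = find(parent, e.v());
            if (ru != rv) { parent[ru] = rv; total += e.w(); }
        }
        return total;
    }

    static Edge makeEdge(int a, int b, long[] x, long[] y) {
        return new Edge(a, b, Math.min(Math.abs(x[a] - x[b]), Math.abs(y[a] - y[b])));
    }

    static int find(int[] p, int v) { return p[v] == v ? v : (p[v] = find(p, p[v])); }
}
```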
821D: Okabe and City
Easy to see that we can model it as a shortest path problem. But the problem is how to reduce the number of edges in the naive approach. Similar to the atCoder problem, I am stuck proving/reducing number of edges!!!
Insights
- In the final solution, we will light each row and column at most once. Otherwise, there is a cycle in our path.
- This means we can reach each cell in 3 ways:
  a. from adjacent cells
  b. by lighting the row above, below, or the same row
  c. by lighting the column above, below, or the same column
- Therefore, during the traversal, we need to relax all 3 cases. Because of insight 1, in cases b and c each row/column gets updated at most once (if we use a priority queue to dequeue the next cell to visit, the first time we light a row/column is when we absolutely need it, and by insight 1 that is the first and only time we light it in the solution). The overall cost is linear in k.
- To detect whether we can reach the destination, we need to check whether the destination cell itself is reachable, or whether anything in the last or second-to-last row/column is reachable!!!
agc016c: +/- Rectangle
Key insight I missed!!!
If the sum of every h by w rectangle is negative, how can the sum of the whole grid be positive?
Consider the 1-row case: it becomes clear that this can happen only if W % w != 0. Otherwise, the sum of the whole row can be broken down into the sum of W/w non-intersecting sub-rows => and the sum of these negative numbers cannot be positive.
By extension, in the H by W case, as long as the grid cannot be divided cleanly into h by w blocks, we can fulfill the requirement. We can prove this by constructing a solution for all such cases.
Construction
We will make the sum of each h by w rectangle equal to -1. The total number of complete blocks is ncs = (W/w) * (H/h), and the number of padding cells is npc = H * W - ncs * h * w.
So we fill each cell with value v = ncs/npc + 1 when i % h != h-1 || j % w != w-1; otherwise, we fill the cell with value -v * (h * w - 1) - 1 (sketched below).
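A direct transcription of that construction (it handles the cleanly-tiled impossible case, but I have not re-checked it against the statement’s bound on cell values):

```java
class PlusMinusRectangleSketch { // hypothetical helper for the construction above
    static long[][] construct(int H, int W, int h, int w) {
        long ncs = (long) (W / w) * (H / h);           // number of complete h-by-w blocks
        long npc = (long) H * W - ncs * h * w;         // padding cells outside those blocks
        if (npc == 0) return null;                     // grid tiles cleanly: impossible
        long v = ncs / npc + 1;                        // value of an ordinary cell
        long[][] grid = new long[H][W];
        for (int i = 0; i < H; i++) {
            for (int j = 0; j < W; j++) {
                boolean blockCorner = (i % h == h - 1) && (j % w == w - 1);
                // Each h-by-w window contains exactly one "corner" cell, so its sum is
                // v * (h*w - 1) - v * (h*w - 1) - 1 = -1, while the padding keeps the total positive.
                grid[i][j] = blockCorner ? -v * (h * w - 1) - 1 : v;
            }
        }
        return grid;
    }
}
```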