dongtui2029 2018-06-11 03:30

Goroutine not running when called via recursion

I'm doing the Web Crawler problem from the tour of go. Here's my solution so far:

func GatherUrls(url string, fetcher Fetcher) []string {
    body, urls, err := fetcher.Fetch(url)
    if err != nil {
        fmt.Println("error:", err)
    } else {
        fmt.Printf("found: %s %q
", url, body)
    }
    return urls
}

// Crawl uses fetcher to recursively crawl
// pages starting with url, to a maximum of depth.
func Crawl(url string, depth int, fetcher Fetcher) {
    // get all urls for depth
    // check if url has been crawled
    //  Y: noop
    //  N: crawl url
    // when depth is 0, stop
    fmt.Printf("crawling %q...
", url)
    if depth <= 0 {
        return
    }
    urls := GatherUrls(url, fetcher)
    fmt.Println("urls:", urls)
    for _, u := range urls {
        fmt.Println("currentUrl:", u)
        if _, exists := cache[u]; !exists {
            fmt.Printf("about to crawl %q
", u)
            go Crawl(u, depth - 1, fetcher)
        } else {
            cache[u] = true
        }
    }
}

func main() {
    cache = make(map[string]bool)
    Crawl("https://golang.org/", 4, fetcher)
}

When I run this code, Crawl() is never called when the function recurses (I know this because fmt.Printf("crawling %q...\n", url) only ever prints once).

Here are the logs:

crawling "https://golang.org/"...
found: https://golang.org/ "The Go Programming Language"
urls: [https://golang.org/pkg/ https://golang.org/cmd/]
currentUrl: https://golang.org/pkg/
about to crawl "https://golang.org/pkg/"
currentUrl: https://golang.org/cmd/
about to crawl "https://golang.org/cmd/"

What am I doing wrong? I suspect that spawning a goroutine to do the recursion is the wrong way to do this. Please advise.

Please note that I want to do this with as few libraries as possible. I've seen some answers that use sync.WaitGroup; I don't want to use that.

NOTE: The full code, including the lesson boilerplate, is below:

package main

import (
    "fmt"
)

var cache map[string]bool

type Fetcher interface {
    // Fetch returns the body of URL and
    // a slice of URLs found on that page.
    Fetch(url string) (body string, urls []string, err error)
}

func GatherUrls(url string, fetcher Fetcher) []string {
    body, urls, err := fetcher.Fetch(url)
    if err != nil {
        fmt.Println("error:", err)
    } else {
        fmt.Printf("found: %s %q
", url, body)
    }
    return urls
}

// Crawl uses fetcher to recursively crawl
// pages starting with url, to a maximum of depth.
func Crawl(url string, depth int, fetcher Fetcher) {
    // get all urls for depth
    // check if url has been crawled
    //  Y: noop
    //  N: crawl url
    // when depth is 0, stop
    fmt.Printf("crawling %q...
", url)
    if depth <= 0 {
        return
    }
    urls := GatherUrls(url, fetcher)
    fmt.Println("urls:", urls)
    for _, u := range urls {
        fmt.Println("currentUrl:", u)
        if _, exists := cache[u]; !exists {
            fmt.Printf("about to crawl %q
", u)
            go Crawl(u, depth - 1, fetcher)
        } else {
            cache[u] = true
        }
    }
}

func main() {
    cache = make(map[string]bool)
    Crawl("https://golang.org/", 4, fetcher)
}

// fakeFetcher is Fetcher that returns canned results.
type fakeFetcher map[string]*fakeResult

type fakeResult struct {
    body string
    urls []string
}

func (f fakeFetcher) Fetch(url string) (string, []string, error) {
    if res, ok := f[url]; ok {
        return res.body, res.urls, nil
    }
    return "", nil, fmt.Errorf("not found: %s", url)
}

// fetcher is a populated fakeFetcher.
var fetcher = fakeFetcher{
    "https://golang.org/": &fakeResult{
        "The Go Programming Language",
        []string{
            "https://golang.org/pkg/",
            "https://golang.org/cmd/",
        },
    },
    "https://golang.org/pkg/": &fakeResult{
        "Packages",
        []string{
            "https://golang.org/",
            "https://golang.org/cmd/",
            "https://golang.org/pkg/fmt/",
            "https://golang.org/pkg/os/",
        },
    },
    "https://golang.org/pkg/fmt/": &fakeResult{
        "Package fmt",
        []string{
            "https://golang.org/",
            "https://golang.org/pkg/",
        },
    },
    "https://golang.org/pkg/os/": &fakeResult{
        "Package os",
        []string{
            "https://golang.org/",
            "https://golang.org/pkg/",
        },
    },
}

3 Answers

doushang2571 2018-06-11 08:55

    As you can see in this exercise (https://tour.golang.org/concurrency/10), we need to do the following:

    • Fetch URLs in parallel.
    • Don't fetch the same URL twice.
    • Cache URLs that have already been fetched in a map, but maps alone are not safe for concurrent use!

    So we can take the following steps to accomplish these tasks:

    Create a struct to store the fetch result:

    type Result struct {
        body string
        urls []string
        err  error
    }
    

    Create a struct that records which URLs have already been fetched in a map, guarded by a sync.Mutex so the map is safe for concurrent use:

    type Cache struct {
        store map[string]bool
        mux   sync.Mutex
    }
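
    As a usage note, the same check-and-mark logic could also be wrapped in a small method on Cache so callers never touch the map without holding the mutex. This helper is purely illustrative; it is not part of this answer's code, which locks inline in Crawl below:

    func (c *Cache) Visit(url string) bool {
        c.mux.Lock()
        defer c.mux.Unlock()
        if c.store[url] {
            // already fetched by some goroutine
            return false
        }
        // mark as fetched; the caller should go on to crawl it
        c.store[url] = true
        return true
    }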
    

    Fetch the URL and body in parallel: add each discovered URL to the cache as it is found, but first guard the concurrent reads and writes with the mutex. So we can modify the Crawl function like this:

    func Crawl(url string, depth int, fetcher Fetcher) {
        if depth <= 0 {
            return
        }
    
        ch := make(chan Result)
    
        go func(url string, res chan Result) {
            body, urls, err := fetcher.Fetch(url)
    
            if err != nil {
                ch <- Result{body, urls, err}
                return
            }
    
            var furls []string
            cache.mux.Lock()
            for _, u := range urls {
                if _, exists := cache.store[u]; !exists {
                    furls = append(furls, u)
                }
                cache.store[u] = true
            }
            cache.mux.Unlock()
    
            ch <- Result{body: body, urls: furls, err: err}
    
        }(url, ch)
    
        res := <-ch
    
        if res.err != nil {
            fmt.Println(res.err)
            return
        }
    
        fmt.Printf("found: %s %q
    ", url, res.body)
    
        for _, u := range res.urls {
            Crawl(u, depth-1, fetcher)
        }
    }
    

    You can view the full code and run this in the playground: https://play.golang.org/p/iY9uBXchx3w
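
    For completeness, here is a minimal sketch of how the shared cache and main could be wired up. The exact declarations live in the playground link above; the initialization shown here is an assumption:

    var cache = Cache{store: make(map[string]bool)}

    func main() {
        // Optionally mark the root URL as fetched up front so pages that
        // link back to it do not schedule it again.
        cache.mux.Lock()
        cache.store["https://golang.org/"] = true
        cache.mux.Unlock()

        Crawl("https://golang.org/", 4, fetcher)
    }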

    Hope this helps.

    This answer was accepted by the asker.
