PHP: Indexing a large array of RSS feeds

Currently, I retrieve individual RSS feeds and store the data I need from each source (about 100 of them) in a JSON format like this:

{
"status": "ok",
"source": "source-string",
"sortBy": "top",
"unixTimeStampLastUpdated": 1513555729,
"articles": [{
    "author": null,
    "title": "Article Title",
    "description": "Short Description",
    "urlToImage": null,
    "publishedAt": 1513536447,
    "id": "2017-12_5a370775559fa"
},
 ...and so on

I store a monthly JSON file for each source (about 100 sources) in that format.

From that, I generate pages based on each source's monthly JSON file. Every article listed has a unique ID that needs to point to something on my server; to do this, I keep an ENORMOUS monthly array of just the article IDs and a few of their attributes, like this:

{
"2017-12_5a3701fb89c99": {
    "title": "Sample Article Title",
    "url": "https:\/\/www.example.com\/",
    "feed": "the-source",
    "origin": "2017-12"
},
"2017-12_5a3701fba9c9a": {
    "title": "Sample Article Title",
    "url": "https:\/\/www.example.com\/",
    "feed": "the-source-2",
    "origin": "2017-12"
},

My Question:

What is the best way to retrieve articles, index them, display them, and act on callbacks keyed by their ID, while staying lightning fast and organized?

I am not sure whether an SQL database will solve my problems, as I have not had to set one up yet and I think this could be simpler...

Is there a way to do this with each article listed in only one JSON file instead of being referenced in a few places? Or would that be too slow?

Any input would be greatly appreciated!

1 answer

doudao7113 2017-12-19 02:39
Sounds like your data isn't terribly relational and you want:

A key-value/document store [fast retrieval, e.g. id -> JSON doc].

Something to build and search indexes on top of data with loose schemas [fast search, e.g. author -> doc ID].

Welcome to NoSQL land.
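
To make those two needs concrete, here is a minimal in-memory sketch in plain PHP, not a real store. The sample records copy the shape of the ID map in the question; the `buildIndex` helper and the choice of `feed` as the indexed field are assumptions made for illustration.

```php
<?php
// Sample data in the shape of the question's monthly ID map.
$articles = [
    '2017-12_5a3701fb89c99' => [
        'title'  => 'Sample Article Title',
        'url'    => 'https://www.example.com/',
        'feed'   => 'the-source',
        'origin' => '2017-12',
    ],
    '2017-12_5a3701fba9c9a' => [
        'title'  => 'Sample Article Title',
        'url'    => 'https://www.example.com/',
        'feed'   => 'the-source-2',
        'origin' => '2017-12',
    ],
];

// Hypothetical helper: build a secondary index mapping a field value
// to the list of article IDs that carry that value.
function buildIndex(array $docs, string $field): array
{
    $index = [];
    foreach ($docs as $id => $doc) {
        if (isset($doc[$field])) {
            $index[$doc[$field]][] = $id;
        }
    }
    return $index;
}

$byFeed = buildIndex($articles, 'feed');

// Key-value retrieval: id -> document (a single hash lookup).
$doc = $articles['2017-12_5a3701fb89c99'] ?? null;

// Indexed search: feed -> list of document IDs.
$ids = $byFeed['the-source-2'] ?? [];
```

A document store does the same two things, but persists the data and keeps the indexes up to date for you instead of rebuilding them on every request.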

There are plenty of simple services that each accomplish one task or the other [e.g. Lucene or Solr for search], and plenty of consolidated services that accomplish both. If you're running this app in a public cloud, chances are the provider already offers a service that does what you want [e.g. AWS DynamoDB, GCP Datastore]; otherwise, you'll probably want to look into something like Couchbase, Cassandra, or Elasticsearch.

I've tried to be as broad as possible so as not to ignite a holy war, but your question itself really rides the line between "Too Broad" and "Primarily Opinion-based" to begin with.

Lastly, if all this is too daunting, you can always cobble together a loose approximation of a NoSQL system inside an RDBMS. In fact, Postgres has some fairly nice tools for working with schemaless data.
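
As a rough sketch of that last idea, the snippet below keeps each article as a JSON string in a single table, keyed by ID, with one indexed column for lookups by feed. It uses SQLite through PDO rather than Postgres, purely so the example runs without a server; it assumes the pdo_sqlite extension is enabled. The table, column, and function names are all made up for illustration.

```php
<?php
// Assumption: pdo_sqlite is available. An in-memory database keeps the
// sketch self-contained; a real setup would point at a file or a server.
$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);

// One row per article: the ID is the key, the whole document is stored
// as JSON text, and "feed" is pulled out into its own indexed column.
$pdo->exec('CREATE TABLE articles (
    id   TEXT PRIMARY KEY,
    feed TEXT NOT NULL,
    doc  TEXT NOT NULL
)');
$pdo->exec('CREATE INDEX idx_articles_feed ON articles (feed)');

// Insert or overwrite one article document.
function saveArticle(PDO $pdo, string $id, array $doc): void
{
    $stmt = $pdo->prepare(
        'INSERT OR REPLACE INTO articles (id, feed, doc) VALUES (?, ?, ?)'
    );
    $stmt->execute([$id, $doc['feed'], json_encode($doc)]);
}

// id -> document, or null when the ID is unknown.
function findArticle(PDO $pdo, string $id): ?array
{
    $stmt = $pdo->prepare('SELECT doc FROM articles WHERE id = ?');
    $stmt->execute([$id]);
    $json = $stmt->fetchColumn();
    return $json === false ? null : json_decode($json, true);
}

// feed -> list of article IDs, served by the index on "feed".
function idsByFeed(PDO $pdo, string $feed): array
{
    $stmt = $pdo->prepare('SELECT id FROM articles WHERE feed = ? ORDER BY id');
    $stmt->execute([$feed]);
    return $stmt->fetchAll(PDO::FETCH_COLUMN);
}

saveArticle($pdo, '2017-12_5a3701fb89c99', [
    'title' => 'Sample Article Title',
    'url'   => 'https://www.example.com/',
    'feed'  => 'the-source',
]);
$found = findArticle($pdo, '2017-12_5a3701fb89c99');
```

With this layout each article lives in exactly one place, and the per-source pages and the ID lookups both become queries against the same table.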