时间:2021-07-01 10:21:17 帮助过:2人阅读
如果stream下面没有reader,那么这个targets返回NULL。
我们跟到GetLocalStreamReaders里面看看
Bitmapset * GetLocalStreamReaders(Oid relid) { Bitmapset *readers = GetAllStreamReaders(relid); if (stream_targets && readers) {
.....
} return readers; }
中间的if不看,这个readers是通过调用GetAllStreamReaders来获取的,我们继续跟进去看看。
Bitmapset * GetAllStreamReaders(Oid relid) { HeapTuple tup = SearchSysCache1(PIPELINESTREAMRELID, ObjectIdGetDatum(relid)); bool isnull; ..... raw = SysCacheGetAttr(PIPELINESTREAMRELID, tup, Anum_pipeline_stream_queries, &isnull); if (isnull) return NULL; ...... ReleaseSysCache(tup); return result; }
这段代码就很有意思了。
如果isnull直接return,而后面的ReleaseSysCaceh没有执行。
这样上面的tup就一直存在,没有释放掉。
这样会导致后面的一个断言错误。
来看看下面堆栈信息。
TRAP: FailedAssertion("!(ct->refcount == 0)", File: "catcache.c", Line: 588, PID: 3829, Query: (null))
assertion failure at:
pipeline: bgworker: worker [postgres] (ExceptionalCondition+0xaf)[0x906b0f]
pipeline: bgworker: worker [postgres] (AtEOXact_CatCache+0x1e6)[0x8eb735]
pipeline: bgworker: worker [postgres] [0x4fe75a]
pipeline: bgworker: worker [postgres] (CommitTransactionCommand+0x72)[0x4ff19c]
pipeline: bgworker: worker [postgres] (ContinuousQueryWorkerMain+0x6cd)[0x7366a1]
pipeline: bgworker: worker [postgres] [0x7343f9]
pipeline: bgworker: worker [postgres] (StartBackgroundWorker+0x2bd)[0x7427ea]
pipeline: bgworker: worker [postgres] [0x75532a]
pipeline: bgworker: worker [postgres] [0x755646]
pipeline: bgworker: worker [postgres] [0x750473]
pipeline: bgworker: worker [postgres] (PostmasterMain+0x110c)[0x74f92a]
pipeline: bgworker: worker [postgres] [0x694f85]
/lib64/libc.so.6(__libc_start_main+0xf5)[0x7fb8eb84caf5]
pipeline: bgworker: worker [postgres] [0x462e09]
我们看看catcache.c:588
582 dlist_foreach(iter, bucket) 583 { 584 CatCTup *ct; 585 586 ct = dlist_container(CatCTup, cache_elem, iter.cur); 587 Assert(ct->ct_magic == CT_MAGIC); 588 Assert(ct->refcount == 0); 589 Assert(!ct->dead); 590 }
我们看看ct->refcount的解释:
int refcount; /* number of active references */
这其实跟我修改的代码有关系,我们从上面堆栈信息分析。
ContinuousQueryWorkerMain-->CommitTransactionCommand
我在ContinuousQueryWorkerMain里面自己调用了
Bitmapset *targets = GetLocalStreamReaders(relid);
而我判断targets的时候,
if (!targets) { donothing... }
我特意看了一下官方的用法。
src/backend/pipeline/stream.c:200
if (targets == NULL) { char *name = get_rel_name(pstmt->relid); ereport(ERROR, (errcode(ERRCODE_INVALID_PARAMETER_VALUE), errmsg("no continuous views are currently reading from stream %s", name), errhint("Use CREATE CONTINUOUS VIEW to create a continuous view that includes %s in its FROM clause.", name))); }
很清楚的看到,这个里面直接丢了个ERROR,
直接abort,这样就不会像我上面堆栈信息那样,后面commit就会断言异常。
话说,一个stream下面没有readers是很正常的,但是这么明显是代码有错误,该释放的没有释放。
修改如下:
src/backend/catalog/pipeline_stream.c
GetAllStreamReaders函数
if (isnull) return NULL;
修改成
if (isnull){ ReleaseSysCache(tup); return NULL; }
这样在返回的时候就直接释放了tup。
这个里面还有别的问题。后面再写。
关于pipelineDB调用GetLocalStreamReaders的BUG
标签: