; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy3G058080 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy3G058080
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionDNA-directed RNA polymerase I subunit RPA1-like
Genome locationchrH03:11466602..11467127
RNA-Seq ExpressionChy3G058080
SyntenyChy3G058080
Gene Ontology termsGO:0032774 - RNA biosynthetic process (biological process)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXC30509.1 hypothetical protein L484_010758 [Morus notabilis]6.83e-1531.58Show/hide
Query:  ENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIY
        E++ K   FR    + R  +   F+A     D    KL  LY LE+ LIPK+  N+I+  HL ++D+ E F +YPWGR  Y +   + K++  + +   Y
Subjt:  ENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIY

Query:  -FQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTL
           GF  A+I W +ETIP L   ++   + +G   P I+  E  +   +R +
Subjt:  -FQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTL

KAA0038287.1 DNA-directed RNA polymerase I subunit RPA1-like [Cucumis melo var. makuwa]6.65e-3969.15Show/hide
Query:  HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSD
        HNHINL HLDIL++EE FK YPWGR  YIL  QF KKASYNDKD +Y QGF LAL+YW  E IP+  NLDVGFGR L I+GP I+T ECL++SD
Subjt:  HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSD

XP_022132727.1 uncharacterized protein LOC111005524 [Momordica charantia]1.06e-1334.51Show/hide
Query:  ITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKA-SYNDKDAIYFQGFRLALIYWIF
        I R  +   F  +    D    K+  LYILE FL+ KQ    IN ++  ++D +E F+ YPWGR  Y +   F KKA   ND  AI   GF  AL+ W +
Subjt:  ITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKA-SYNDKDAIYFQGFRLALIYWIF

Query:  ETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTLNMNI
        ETIP L      F   +    P +         +WR L+  I
Subjt:  ETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTLNMNI

XP_024031030.1 uncharacterized protein LOC21394043 [Morus notabilis]6.81e-1531.58Show/hide
Query:  ENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIY
        E++ K   FR    + R  +   F+A     D    KL  LY LE+ LIPK+  N+I+  HL ++D+ E F +YPWGR  Y +   + K++  + +   Y
Subjt:  ENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIY

Query:  -FQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTL
           GF  A+I W +ETIP L   ++   + +G   P I+  E  +   +R +
Subjt:  -FQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTL

XP_038891747.1 pescadillo homolog [Benincasa hispida]7.20e-2145.65Show/hide
Query:  ILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTLNMNI
        +LD EE F+ YPWGR  + L  +FFKK   N   +I+ QGF L L+YW FE IP+L N  +GF R +   GP +   E  E +DW+ +N NI
Subjt:  ILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRTLNMNI

TrEMBL top hitse value%identityAlignment
A0A1S3B065 uncharacterized protein LOC103484737 isoform X41.5e-1031.58Show/hide
Query:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI
        +TGL   +   +++ + ++       F  +  I R  +   F  +    +    K+  LYILE F++ KQ    IN ++  ++D +E F  YPWGR  Y 
Subjt:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI

Query:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG
        +   F KKA   ND  AI   GF  AL  W +ETIP L      F + + FG
Subjt:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X51.5e-1031.58Show/hide
Query:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI
        +TGL   +   +++ + ++       F  +  I R  +   F  +    +    K+  LYILE F++ KQ    IN ++  ++D +E F  YPWGR  Y 
Subjt:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI

Query:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG
        +   F KKA   ND  AI   GF  AL  W +ETIP L      F + + FG
Subjt:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG

A0A1S3B181 uncharacterized protein LOC103484737 isoform X71.5e-1031.58Show/hide
Query:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI
        +TGL   +   +++ + ++       F  +  I R  +   F  +    +    K+  LYILE F++ KQ    IN ++  ++D +E F  YPWGR  Y 
Subjt:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI

Query:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG
        +   F KKA   ND  AI   GF  AL  W +ETIP L      F + + FG
Subjt:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG

A0A5A7T9I0 DNA-directed RNA polymerase I subunit RPA1-like1.5e-3169.15Show/hide
Query:  HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSD
        HNHINL HLDIL++EE FK YPWGR  YIL  QF KKASYNDKD +Y QGF LAL+YW  E IP+  NLDVGFGR L I+GP I+T ECL++SD
Subjt:  HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSD

A0A5D3CNI7 TF-B3 domain-containing protein1.5e-1031.58Show/hide
Query:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI
        +TGL   +   +++ + ++       F  +  I R  +   F  +    +    K+  LYILE F++ KQ    IN ++  ++D +E F  YPWGR  Y 
Subjt:  MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYI

Query:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG
        +   F KKA   ND  AI   GF  AL  W +ETIP L      F + + FG
Subjt:  LKCQFFKKA-SYNDKDAIYFQGFRLALIYWIFETIPKL------FNLDVGFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G32960.1 Domain of unknown function (DUF1985)2.7e-0429.73Show/hide
Query:  EEDSLKEK--LVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKK--ASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVG
        +EDS +E+  L  L ++E+  +         +++L+     E   +YPWG   + +     KK  AS   K      GF LAL  WI E+IP    L   
Subjt:  EEDSLKEK--LVNLYILETFLIPKQ*HNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKK--ASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVG

Query:  FGRTLGIQGPT
        + R   I+ PT
Subjt:  FGRTLGIQGPT

AT5G28810.1 Domain of unknown function (DUF1985)3.5e-0432.95Show/hide
Query:  FKDYPWG--------RSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRT
        F+ YPWG        RS  I+KC         D D+    G   AL+ WI+E++P       G G   G  G T +T  C+   DWR+
Subjt:  FKDYPWG--------RSPYILKCQFFKKASYNDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMT*ECLESSDWRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGGATTAAAATGGCACAAATGCCTAGAGTTAAATCTAAGAGAAAGAAAAGAGAATAGTATGAAAAATGCCCTTTTCCGTCATGACGACTTTATAACAAGAAGAGA
TATAGAAAGAAATTTCAAAGCATTATGCAATGAGGAAGACTCATTGAAAGAAAAATTAGTTAACCTCTATATCTTAGAAACCTTTTTAATTCCTAAGCAATAACACAACC
ACATAAATCTCCAACATCTAGACATATTGGATCATGAAGAAACATTCAAAGACTATCCATGGGGGAGATCACCATACATCCTAAAATGTCAATTCTTCAAGAAAGCATCA
TACAATGACAAGGATGCTATATACTTTCAAGGATTTCGCCTAGCTTTAATTTATTGGATTTTTGAGACAATACCAAAACTTTTCAATTTAGATGTTGGGTTTGGAAGAAC
GCTTGGAATTCAAGGGCCAACAATAATGACATGAGAATGTTTGGAGTCAAGTGACTGGAGGACTCTAAACATGAACATTTTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTGGATTAAAATGGCACAAATGCCTAGAGTTAAATCTAAGAGAAAGAAAAGAGAATAGTATGAAAAATGCCCTTTTCCGTCATGACGACTTTATAACAAGAAGAGA
TATAGAAAGAAATTTCAAAGCATTATGCAATGAGGAAGACTCATTGAAAGAAAAATTAGTTAACCTCTATATCTTAGAAACCTTTTTAATTCCTAAGCAATAACACAACC
ACATAAATCTCCAACATCTAGACATATTGGATCATGAAGAAACATTCAAAGACTATCCATGGGGGAGATCACCATACATCCTAAAATGTCAATTCTTCAAGAAAGCATCA
TACAATGACAAGGATGCTATATACTTTCAAGGATTTCGCCTAGCTTTAATTTATTGGATTTTTGAGACAATACCAAAACTTTTCAATTTAGATGTTGGGTTTGGAAGAAC
GCTTGGAATTCAAGGGCCAACAATAATGACATGAGAATGTTTGGAGTCAAGTGACTGGAGGACTCTAAACATGAACATTTTATTAA
Protein sequenceShow/hide protein sequence
MTGLKWHKCLELNLRERKENSMKNALFRHDDFITRRDIERNFKALCNEEDSLKEKLVNLYILETFLIPKQHNHINLQHLDILDHEETFKDYPWGRSPYILKCQFFKKASY
NDKDAIYFQGFRLALIYWIFETIPKLFNLDVGFGRTLGIQGPTIMTECLESSDWRTLNMNILLX