; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011156 (gene) of Snake gourd v1 genome

Gene IDTan0011156
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-directed RNA polymerase II subunit RPB1-like
Genome locationLG01:28287448..28288762
RNA-Seq ExpressionTan0011156
SyntenyTan0011156
Gene Ontology termsNA
InterPro domainsIPR011057 - Mss4-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146165.1 uncharacterized protein At4g08330, chloroplastic [Cucumis sativus]8.8e-5379.34Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIYSCTECG NLNLNS+HLFPPDFYFEAGNKGTLSFS IDSTKFR EKEDKLRPFFET+NYWGIQRKRTK+ CN+CG LVGYVYDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQWTVEREGIK
        FGPSQ+  R  ++  + + ++
Subjt:  FGPSQLKGRVAQWTVEREGIK

XP_022965337.1 uncharacterized protein LOC111465232 isoform X2 [Cucurbita maxima]1.8e-5388.5Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIYSC ECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFR EKEDKLRPFFETVNYWGIQRKRTKI CN+C  LVGYVYDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQW
        FGPSQ+  R  ++
Subjt:  FGPSQLKGRVAQW

XP_023551931.1 uncharacterized protein LOC111809759 isoform X2 [Cucurbita pepo subsp. pepo]2.0e-5286.73Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIY C ECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFR EKEDKLRPFFETVNYWGIQRKRTKI CN+C  LVGYVYDDGPPLT+SPGQYH
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQW
        FGPSQ+  R  ++
Subjt:  FGPSQLKGRVAQW

XP_028774540.1 uncharacterized protein LOC114731516 [Prosopis alba]1.4e-5376.86Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MAS+YSCTECG+NLNLNS+HLFPPDFYFEAGNKGT SF+++D+TKFRFEKEDK+RPFFETVNYWGIQRKRTKI CN+CGCLVGYVYDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQWTVEREGIK
         GPSQ+  R  ++  + + ++
Subjt:  FGPSQLKGRVAQWTVEREGIK

XP_038903300.1 uncharacterized protein LOC120089928 [Benincasa hispida]6.8e-5380.17Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIYSC ECG NLNLNSSHLFPPDFYFEAGNKGTLSFS IDSTKFR EKEDKLRPFFETVNYWGIQRKRTK+ CN+CG LVGY+YDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQWTVEREGIK
        FGPSQ+  R  ++  + + ++
Subjt:  FGPSQLKGRVAQWTVEREGIK

TrEMBL top hitse value%identityAlignment
A0A0A0L4W0 Uncharacterized protein4.3e-5379.34Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIYSCTECG NLNLNS+HLFPPDFYFEAGNKGTLSFS IDSTKFR EKEDKLRPFFET+NYWGIQRKRTK+ CN+CG LVGYVYDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQWTVEREGIK
        FGPSQ+  R  ++  + + ++
Subjt:  FGPSQLKGRVAQWTVEREGIK

A0A1S3BJW2 uncharacterized protein LOC1034906673.6e-5277.69Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIYSC+ECG NLNLNSSHLFPPDFYFEAGNKGTLSFS IDSTKFR EKEDKLRPFFET+NYWGIQRKRTK+ C +CG LVGY+YDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQWTVEREGIK
        FGPSQ+  R  ++  + + ++
Subjt:  FGPSQLKGRVAQWTVEREGIK

A0A6J1E6R2 uncharacterized protein LOC1114312091.2e-5286.73Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIY C ECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFR EKEDKLRPFFETVNYWGIQR RTKI CN+C  LVGYVYDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQW
        FGPSQ+  R  ++
Subjt:  FGPSQLKGRVAQW

A0A6J1HK24 uncharacterized protein LOC111465232 isoform X28.6e-5488.5Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIYSC ECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFR EKEDKLRPFFETVNYWGIQRKRTKI CN+C  LVGYVYDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQW
        FGPSQ+  R  ++
Subjt:  FGPSQLKGRVAQW

A0A7J7I8I0 Uncharacterized protein1.4e-5169.47Show/hide
Query:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH
        MASIYSCTECGAN NL+++HLFPPDFYFEAGNKGT+SF+LID+TKF+FEKEDK+RPFFET+NYWGIQRKRTKI C++C  LVGY+YDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYH

Query:  FGPSQLKGRVAQWTVEREGIKRSVTCLFPHF
         GPSQ+  R  ++  + + ++    CL   F
Subjt:  FGPSQLKGRVAQWTVEREGIKRSVTCLFPHF

SwissProt top hitse value%identityAlignment
Q9STN5 Uncharacterized protein At4g08330, chloroplastic2.4e-0532.22Show/hide
Query:  YSCTECGANLNLNSSHLFPPDF---YFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPP
        YSC  CG  LNL+S++         Y ++   G +SF  ID  +F    E +  P F   + WG+ R RTK+ C  C   +G    +  P
Subjt:  YSCTECGANLNLNSSHLFPPDF---YFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPP

Arabidopsis top hitse value%identityAlignment
AT2G17705.1 unknown protein1.7e-4663.93Show/hide
Query:  ASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYHF
        ++IY+C ECG++LNLN + LFPPDFYFEAGNKGTLSF+ +D+ KFRFEKEDK+ PFFET+NYWGIQRKRTKI C +C  L+GY+YDDGPPLT   GQY F
Subjt:  ASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYHF

Query:  GPSQLKGRVAQWTVEREGIKRS
        GPSQ+  R  ++  + + ++ S
Subjt:  GPSQLKGRVAQWTVEREGIKRS

AT4G08330.1 unknown protein1.7e-0632.22Show/hide
Query:  YSCTECGANLNLNSSHLFPPDF---YFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPP
        YSC  CG  LNL+S++         Y ++   G +SF  ID  +F    E +  P F   + WG+ R RTK+ C  C   +G    +  P
Subjt:  YSCTECGANLNLNSSHLFPPDF---YFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATATACTCCTGTACAGAATGTGGAGCAAATCTGAATCTCAATTCCAGTCATCTCTTCCCGCCGGATTTCTACTTCGAGGCCGGAAACAAGGGCACCCTTTC
GTTTTCGTTGATCGACTCCACCAAGTTCCGGTTCGAGAAGGAGGACAAGCTCCGACCATTCTTCGAGACCGTTAACTATTGGGGAATCCAGCGGAAACGCACCAAGATCA
ACTGCAATGCCTGTGGATGTCTCGTCGGCTATGTTTACGACGATGGGCCTCCTCTTACCGATAGTCCAGGTCAGTACCACTTCGGACCTAGCCAGTTAAAGGGGAGGGTT
GCTCAATGGACAGTGGAAAGGGAAGGCATTAAAAGGTCTGTCACTTGTCTTTTTCCTCATTTTATCGGCCCTTTCAATCCTTTAACATACACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCATATACTCCTGTACAGAATGTGGAGCAAATCTGAATCTCAATTCCAGTCATCTCTTCCCGCCGGATTTCTACTTCGAGGCCGGAAACAAGGGCACCCTTTC
GTTTTCGTTGATCGACTCCACCAAGTTCCGGTTCGAGAAGGAGGACAAGCTCCGACCATTCTTCGAGACCGTTAACTATTGGGGAATCCAGCGGAAACGCACCAAGATCA
ACTGCAATGCCTGTGGATGTCTCGTCGGCTATGTTTACGACGATGGGCCTCCTCTTACCGATAGTCCAGGTCAGTACCACTTCGGACCTAGCCAGTTAAAGGGGAGGGTT
GCTCAATGGACAGTGGAAAGGGAAGGCATTAAAAGGTCTGTCACTTGTCTTTTTCCTCATTTTATCGGCCCTTTCAATCCTTTAACATACACATAA
Protein sequenceShow/hide protein sequence
MASIYSCTECGANLNLNSSHLFPPDFYFEAGNKGTLSFSLIDSTKFRFEKEDKLRPFFETVNYWGIQRKRTKINCNACGCLVGYVYDDGPPLTDSPGQYHFGPSQLKGRV
AQWTVEREGIKRSVTCLFPHFIGPFNPLTYT