; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007076 (gene) of Snake gourd v1 genome

Gene IDTan0007076
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG02:22581292..22583263
RNA-Seq ExpressionTan0007076
SyntenyTan0007076
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB5545180.1 hypothetical protein DKX38_013292 [Salix brachista]1.2e-1832.72Show/hide
Query:  LLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK-------VNVSD---PETTEEIICSIPQT
        LL+++ L+W+Q++R  WL+ G+RNT +FH+    RR+KN I+GL N +GDW+++   M  + ++YF  LF         N++    P+  E  +  +   
Subjt:  LLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK-------VNVSD---PETTEEIICSIPQT

Query:  VSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFT
        V+ E +    F  G         N S  WR +L+G      G RW+IGSG  V    D W +
Subjt:  VSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFT

KAG7572583.1 Ribonuclease H-like superfamily [Arabidopsis suecica]1.5e-1828.1Show/hide
Query:  EAEIKILESDLTQEKEE--TWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVN
        ++E+ + + D  + +EE      ++ E   ++ LYW Q++R   +Q G+ N+K+FH     RR +N I GL + +G W   + D+  +A+SYF  LF   
Subjt:  EAEIKILESDLTQEKEE--TWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVN

Query:  VSDPETTEEIICSIPQTVSE-------------EILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW----FTSEGR
         + PE  EE +  +   +++             E+ + RYF+    L      +PS+ WRSI     L  KG   ++GSG  +++ +DPW    F    +
Subjt:  VSDPETTEEIICSIPQTVSE-------------EILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW----FTSEGR

Query:  EHPTVVTPNL
         + +V  P+L
Subjt:  EHPTVVTPNL

PWA48861.1 reverse transcriptase [Artemisia annua]9.8e-1831.25Show/hide
Query:  TQEKEETWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVNVSDPETTEEIICS
        T+ K+    K++ ELL  + L W+Q +R  WL+ G++NT++FH  AS R+++N I  L   DG W+    ++  L  SYF+ LF  + S P+  E ++  
Subjt:  TQEKEETWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVNVSDPETTEEIICS

Query:  IPQ--------------TVSEEI--------------LRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW
        I +              T SE++              L+ RY     F +  +G  PS+ W S L   D+  KG +W IG G  VN+  D W
Subjt:  IPQ--------------TVSEEI--------------LRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW

PWA52122.1 hypothetical protein CTI12_AA457560 [Artemisia annua]2.3e-1927.52Show/hide
Query:  KEAEIKILESDL---TQEKEETWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK
        K+  + IL+S     T  +++   + + ELL  + L W+QR+R  WL  G++NT++FH RAS RRK+N I  L + DG WI NE D+  L   YF++LF 
Subjt:  KEAEIKILESDL---TQEKEETWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK

Query:  VNVSDPETTEEIICSI----------------------------------------------------------------------PQTVSEEILRDRYF
         ++  P+  + ++  I                                                                      P T++ ++L+ RYF
Subjt:  VNVSDPETTEEIICSI----------------------------------------------------------------------PQTVSEEILRDRYF

Query:  QTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSEGREHP
            F +  +G  PS+ WRS +   DLF KG +W IG G  VN+ +D W     R  P
Subjt:  QTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSEGREHP

VFQ91828.1 unnamed protein product [Cuscuta campestris]2.1e-2034.3Show/hide
Query:  KKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLF-----------------KVNVSDPE
        K++  L+++++ YWRQRA++ WL  G+RNT++FH  AS RR+KN+I+ L +++G+W     ++ +L   ++++LF                   NV+   
Subjt:  KKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLF-----------------KVNVSDPE

Query:  TTEEIICSIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW
             +   P ++   + + RY+  G FLN  LGS+PSF WRSIL G +L + G   +IG G    I D PW
Subjt:  TTEEIICSIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW

TrEMBL top hitse value%identityAlignment
A0A2U1LSZ3 RNase H domain-containing protein1.1e-1927.52Show/hide
Query:  KEAEIKILESDL---TQEKEETWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK
        K+  + IL+S     T  +++   + + ELL  + L W+QR+R  WL  G++NT++FH RAS RRK+N I  L + DG WI NE D+  L   YF++LF 
Subjt:  KEAEIKILESDL---TQEKEETWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK

Query:  VNVSDPETTEEIICSI----------------------------------------------------------------------PQTVSEEILRDRYF
         ++  P+  + ++  I                                                                      P T++ ++L+ RYF
Subjt:  VNVSDPETTEEIICSI----------------------------------------------------------------------PQTVSEEILRDRYF

Query:  QTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSEGREHP
            F +  +G  PS+ WRS +   DLF KG +W IG G  VN+ +D W     R  P
Subjt:  QTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSEGREHP

A0A484MV31 CCHC-type domain-containing protein1.0e-2034.3Show/hide
Query:  KKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLF-----------------KVNVSDPE
        K++  L+++++ YWRQRA++ WL  G+RNT++FH  AS RR+KN+I+ L +++G+W     ++ +L   ++++LF                   NV+   
Subjt:  KKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLF-----------------KVNVSDPE

Query:  TTEEIICSIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW
             +   P ++   + + RY+  G FLN  LGS+PSF WRSIL G +L + G   +IG G    I D PW
Subjt:  TTEEIICSIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPW

A0A5N5LR83 RNase H domain-containing protein5.6e-1932.72Show/hide
Query:  LLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK-------VNVSD---PETTEEIICSIPQT
        LL+++ L+W+Q++R  WL+ G+RNT +FH+    RR+KN I+GL N +GDW+++   M  + ++YF  LF         N++    P+  E  +  +   
Subjt:  LLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFK-------VNVSD---PETTEEIICSIPQT

Query:  VSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFT
        V+ E +    F  G         N S  WR +L+G      G RW+IGSG  V    D W +
Subjt:  VSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFT

A0A803LSC1 Uncharacterized protein2.1e-1829.65Show/hide
Query:  NKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVNVSDPETTEEIICSIPQ-----
        ++++ EL   +  YW  RAR + L+ G++NT +FH +AS RR +N I GL + +GDW      +  +AI YF  LF      P   + ++  + Q     
Subjt:  NKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVNVSDPETTEEIICSIPQ-----

Query:  -------TVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE
               +++ ++++  YF++   +    G +PSF WRSI     +F  G  WK+G G  + + +D W   E
Subjt:  -------TVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE

A0A803QGC3 Uncharacterized protein6.6e-2032.52Show/hide
Query:  KEETWNKKMVELLEED--------NLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVNVSDPETTE
        K E W  K    LE+D         +YW+QR++  WL+ G++NTK+FH +AS RRKKN I+GL + +  W T   D+  +A SYF  LF  +    E  +
Subjt:  KEETWNKKMVELLEED--------NLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVNVSDPETTE

Query:  EI-------------------------------------ICSIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVN
         +                                     I + P  +  +IL+  YF    FL    G   S  W SILWG DL  +G RW +G G Q+ 
Subjt:  EI-------------------------------------ICSIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVN

Query:  IMDDPW
        I +DPW
Subjt:  IMDDPW

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003101.4e-0634.33Show/hide
Query:  PQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE
        P T+   +LR RYF     +  ++G+ PS+ WRSI+ G +L  +G    IG G    +  D W   E
Subjt:  PQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein6.4e-0733.33Show/hide
Query:  SIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE
        S P+++  ++ + RYF     LN  LGS PSF W+SI    ++  +G R  +G+G  + I    W  S+
Subjt:  SIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.8e-0834.33Show/hide
Query:  PQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE
        P T+   +LR RYF     +  ++G+ PS+ WRSI+ G +L  +G    IG G    +  D W   E
Subjt:  PQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAGGCCGAAATTAAGATTCTAGAGTCTGACTTAACACAAGAGAAAGAGGAGACCTGGAATAAGAAAATGGTTGAACTACTAGAAGAGGATAATCTTTATTGGCG
TCAGAGAGCAAGAGAGGACTGGTTGCAGTGGGGGGAGCGTAATACCAAGTGGTTTCATATTAGAGCATCGACAAGACGAAAGAAAAACCACATTCAGGGCCTTTCTAACA
ACGACGGTGATTGGATTACAAATGAGGTTGACATGGGAAGGCTGGCTATTTCATACTTTGCTAATTTATTCAAAGTCAACGTATCTGATCCTGAAACAACCGAGGAAATT
ATCTGCAGTATTCCCCAAACCGTCTCAGAGGAGATCCTCCGCGACCGTTATTTTCAGACAGGGAAGTTTCTAAACGGAACTCTTGGCTCTAACCCATCATTTACCTGGCG
GAGTATTTTGTGGGGTTGGGATCTATTCATGAAAGGGTACAGATGGAAAATTGGAAGTGGATACCAGGTAAATATCATGGATGACCCCTGGTTTACTAGTGAAGGTCGAG
AGCACCCGACGGTGGTTACTCCCAATCTTGCTCATGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAGGCCGAAATTAAGATTCTAGAGTCTGACTTAACACAAGAGAAAGAGGAGACCTGGAATAAGAAAATGGTTGAACTACTAGAAGAGGATAATCTTTATTGGCG
TCAGAGAGCAAGAGAGGACTGGTTGCAGTGGGGGGAGCGTAATACCAAGTGGTTTCATATTAGAGCATCGACAAGACGAAAGAAAAACCACATTCAGGGCCTTTCTAACA
ACGACGGTGATTGGATTACAAATGAGGTTGACATGGGAAGGCTGGCTATTTCATACTTTGCTAATTTATTCAAAGTCAACGTATCTGATCCTGAAACAACCGAGGAAATT
ATCTGCAGTATTCCCCAAACCGTCTCAGAGGAGATCCTCCGCGACCGTTATTTTCAGACAGGGAAGTTTCTAAACGGAACTCTTGGCTCTAACCCATCATTTACCTGGCG
GAGTATTTTGTGGGGTTGGGATCTATTCATGAAAGGGTACAGATGGAAAATTGGAAGTGGATACCAGGTAAATATCATGGATGACCCCTGGTTTACTAGTGAAGGTCGAG
AGCACCCGACGGTGGTTACTCCCAATCTTGCTCATGTTTAG
Protein sequenceShow/hide protein sequence
MKEAEIKILESDLTQEKEETWNKKMVELLEEDNLYWRQRAREDWLQWGERNTKWFHIRASTRRKKNHIQGLSNNDGDWITNEVDMGRLAISYFANLFKVNVSDPETTEEI
ICSIPQTVSEEILRDRYFQTGKFLNGTLGSNPSFTWRSILWGWDLFMKGYRWKIGSGYQVNIMDDPWFTSEGREHPTVVTPNLAHV