CuGenDBv2

Tan0022781 (gene) of Snake gourd v1 genome

Gene ID: Tan0022781
Organism: Trichosanthes anguina (Snake gourd v1)
Description: RNase H domain-containing protein
Genome location: LG07:70494751..70495449
RNA-Seq Expression: Tan0022781
Synteny: Tan0022781
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
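The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state), the field can be parsed and the span length computed as below. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG07:70494751..70495449")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the span is 699 bp, which is consistent with the 232-residue protein on this page plus one stop codon (233 codons of 3 bases each).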


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 1.4e-15; % identity: 30.9)
Query:  RDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEAL
        RDI  A + Y+  S G  +  K +S              +  +W PP  + WKLN++A+W      GG GW +RD  G ++ A  + ++ +  I  LE +
Subjt:  RDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEAL

Query:  AILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
        AI EGL A      EHC  + +ESDSLE I  L+   +D +E+   ++ I  ++     ++     R  N +AH  AR
Subjt:  AILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 2.5e-12; % identity: 35.46)
Query:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL
        HS+  RDI  A + Y+  S G  +  K +S + H            W PP  + WKLN+DA+W       G GW +RD  G ++  G + ++ +  I  L
Subjt:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL

Query:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE
        E +AI EGL A      EHC  + +ESDSLE I  L+  V+
Subjt:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia] (e value: 2.5e-12; % identity: 35.46)
Query:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL
        HS+  RDI  A + Y+  S G  +  K +S + H            W PP  + WKLN+DA+W       G GW +RD  G ++  G + ++ +  I  L
Subjt:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL

Query:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE
        E +AI EGL A      EHC  + +ESDSLE I  L+  V+
Subjt:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 1.4e-15; % identity: 40.58)
Query:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLE--ASISCMSEHCISLTVESDSLEVIKGLNGGVEDM
        +W PP +  W LN+DASW+DS   GG GW IR  +G +V AG + V+    +K+LEA AILEGL    ++  +      L +E+DS EV   LN   ED+
Subjt:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLE--ASISCMSEHCISLTVESDSLEVIKGLNGGVEDM

Query:  SELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
        ++    V+ IL+L +S   + F K  R TN  AH+ A+
Subjt:  SELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 1.6e-14; % identity: 30.32)
Query:  HSKVIRDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIK
        HS+  RDI    + Y+  S G  +  K +S              +   W PP  + WKLN+DA+W      GG GW +RD  G ++ A  + ++ +  I 
Subjt:  HSKVIRDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIK

Query:  ILEALAILEGLEA-----SISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
         LE +AI EGL A           EHC  + +ESDSLE I  L+   +D +E+   ++ I  ++     ++     R  N +AH+ AR
Subjt:  ILEALAILEGLEA-----SISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 6.9e-16; % identity: 30.9)
Query:  RDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEAL
        RDI  A + Y+  S G  +  K +S              +  +W PP  + WKLN++A+W      GG GW +RD  G ++ A  + ++ +  I  LE +
Subjt:  RDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEAL

Query:  AILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
        AI EGL A      EHC  + +ESDSLE I  L+   +D +E+   ++ I  ++     ++     R  N +AH  AR
Subjt:  AILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 1.2e-12; % identity: 35.46)
Query:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL
        HS+  RDI  A + Y+  S G  +  K +S + H            W PP  + WKLN+DA+W       G GW +RD  G ++  G + ++ +  I  L
Subjt:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL

Query:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE
        E +AI EGL A      EHC  + +ESDSLE I  L+  V+
Subjt:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 6.9e-16; % identity: 40.58)
Query:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLE--ASISCMSEHCISLTVESDSLEVIKGLNGGVEDM
        +W PP +  W LN+DASW+DS   GG GW IR  +G +V AG + V+    +K+LEA AILEGL    ++  +      L +E+DS EV   LN   ED+
Subjt:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLE--ASISCMSEHCISLTVESDSLEVIKGLNGGVEDM

Query:  SELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
        ++    V+ IL+L +S   + F K  R TN  AH+ A+
Subjt:  SELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X2 (e value: 1.2e-12; % identity: 35.46)
Query:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL
        HS+  RDI  A + Y+  S G  +  K +S + H            W PP  + WKLN+DA+W       G GW +RD  G ++  G + ++ +  I  L
Subjt:  HSKVIRDISTANEEYLATSRGHGSPTKLESCESH----------GEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKIL

Query:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE
        E +AI EGL A      EHC  + +ESDSLE I  L+  V+
Subjt:  EALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVE

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 7.6e-15; % identity: 30.32)
Query:  HSKVIRDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIK
        HS+  RDI    + Y+  S G  +  K +S              +   W PP  + WKLN+DA+W      GG GW +RD  G ++ A  + ++ +  I 
Subjt:  HSKVIRDISTANEEYLATSRGHGSPTKLESC------------ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIK

Query:  ILEALAILEGLEA-----SISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
         LE +AI EGL A           EHC  + +ESDSLE I  L+   +D +E+   ++ I  ++     ++     R  N +AH+ AR
Subjt:  ILEALAILEGLEA-----SISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 7.8e-04; % identity: 28.28)
Query:  ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCIS-----LTVESDSLEVIKGL
        ++H  W  PE    K N D S+ +       GW +RDSNGS + AG  Q  G+     LE+      ++A I  M +HC S     +  E D+  +   +
Subjt:  ESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCIS-----LTVESDSLEVIKGL

Query:  NGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
        NG  +    + N+++ I   +  F +I+F    R  N+ A   A+
Subjt:  NGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR

AT1G52990.1 thioredoxin family protein (e value: 3.2e-05; % identity: 26.98)
Query:  KLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILD
        K N DAS ++   V G GW IR+S G+++  G  + +G+   +  E  A++  ++A+ +      I    E D+  V + +N    D   L +Y+  I  
Subjt:  KLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILD

Query:  LVNSFSAINFVKCHRSTNSLAHNFAR
         + SF++  F+  HR  N  A    +
Subjt:  LVNSFSAINFVKCHRSTNSLAHNFAR

AT2G02650.1 Ribonuclease H-like superfamily protein (e value: 7.8e-04; % identity: 26.09)
Query:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSE
        +W PP     K N D+ +        +GW IR+ NG +V  G  +++        EAL  L  L+      +     +  ESDS  ++  +N G ED S 
Subjt:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSE

Query:  LSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFARGV
        L   +  I   +      +    +R  NS A   A  V
Subjt:  LSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFARGV

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.6e-07; % identity: 26.47)
Query:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSE
        +W+PP  D  K N DAS ++   V G GW +R+S G+++  G  + +G+   +  E   ++  ++AS     +  I    E D+  + + +N    +   
Subjt:  EWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILEGLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSE

Query:  LSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR
        L +++  I   + SF +I F   HR  N  A   A+
Subjt:  LSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFAR


Sequences
CDS sequence
ATGTTAAATGCTAATAGTTTCTCTTCTTTGCCGATAGATCATTCCAAAGTCATTAGGGACATCTCCACTGCGAATGAAGAGTATCTAGCGACTTCCAGAGGCCATGGGTC
ACCGACAAAACTAGAGAGTTGTGAGAGTCATGGAGAATGGACCCCTCCTGAGCTCGACTGTTGGAAATTAAACAGTGACGCATCTTGGAACGATAGCATTAAGGTAGGTG
GAACTGGATGGGCTATCCGTGACTCTAATGGATCCCTGGTTTGTGCAGGTGGTAAGCAAGTAAAAGGGAAGTGGCCTATCAAAATCTTGGAAGCCTTAGCCATCTTGGAA
GGCTTAGAAGCTTCTATTAGCTGTATGAGTGAGCATTGCATATCATTGACCGTTGAGTCTGATTCTCTTGAAGTTATAAAGGGTCTGAATGGAGGCGTAGAGGACATGTC
GGAGTTGAGCAACTATGTTAAAGCCATCCTAGATCTTGTTAACTCCTTTTCTGCTATAAATTTTGTAAAATGTCACAGATCCACAAATTCTCTAGCCCATAATTTCGCTA
GAGGTGTGTGCCTTCATGGTAGCTTTGATGGGAGGCTTTCTCCCTCTTTTTTAGTTAGTAGTGACAGGTTTAGTTTTGGGGAGTATCCGTCTTGGGTCTCCGAGATTCTT
TCTCTTTTCTGTAGGCTCGTTCGACCTCTTCTGCTGTAA
mRNA sequence
ATGTTAAATGCTAATAGTTTCTCTTCTTTGCCGATAGATCATTCCAAAGTCATTAGGGACATCTCCACTGCGAATGAAGAGTATCTAGCGACTTCCAGAGGCCATGGGTC
ACCGACAAAACTAGAGAGTTGTGAGAGTCATGGAGAATGGACCCCTCCTGAGCTCGACTGTTGGAAATTAAACAGTGACGCATCTTGGAACGATAGCATTAAGGTAGGTG
GAACTGGATGGGCTATCCGTGACTCTAATGGATCCCTGGTTTGTGCAGGTGGTAAGCAAGTAAAAGGGAAGTGGCCTATCAAAATCTTGGAAGCCTTAGCCATCTTGGAA
GGCTTAGAAGCTTCTATTAGCTGTATGAGTGAGCATTGCATATCATTGACCGTTGAGTCTGATTCTCTTGAAGTTATAAAGGGTCTGAATGGAGGCGTAGAGGACATGTC
GGAGTTGAGCAACTATGTTAAAGCCATCCTAGATCTTGTTAACTCCTTTTCTGCTATAAATTTTGTAAAATGTCACAGATCCACAAATTCTCTAGCCCATAATTTCGCTA
GAGGTGTGTGCCTTCATGGTAGCTTTGATGGGAGGCTTTCTCCCTCTTTTTTAGTTAGTAGTGACAGGTTTAGTTTTGGGGAGTATCCGTCTTGGGTCTCCGAGATTCTT
TCTCTTTTCTGTAGGCTCGTTCGACCTCTTCTGCTGTAA
Protein sequence
MLNANSFSSLPIDHSKVIRDISTANEEYLATSRGHGSPTKLESCESHGEWTPPELDCWKLNSDASWNDSIKVGGTGWAIRDSNGSLVCAGGKQVKGKWPIKILEALAILE
GLEASISCMSEHCISLTVESDSLEVIKGLNGGVEDMSELSNYVKAILDLVNSFSAINFVKCHRSTNSLAHNFARGVCLHGSFDGRLSPSFLVSSDRFSFGEYPSWVSEIL
SLFCRLVRPLLL
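
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper, and only the first 27 bases and the final line of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the CDS on this page; both start on a codon boundary.
cds_start = "ATGTTAAATGCTAATAGTTTCTCTTCT"
cds_end = "TCTCTTTTCTGTAGGCTCGTTCGACCTCTTCTGCTGTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first nine and last twelve residues of the protein sequence listed above. For full sequences, an established library such as Biopython is likely a better choice than a hand-built table.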