; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002681 (gene) of Snake gourd v1 genome

Gene IDTan0002681
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRNase H domain-containing protein
Genome locationLG09:44067864..44068341
RNA-Seq ExpressionTan0002681
SyntenyTan0002681
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_013694754.1 uncharacterized protein LOC106398790 [Brassica napus]7.1e-0728.46Show/hide
Query:  LSKELSSIELSKAMT--IMWGLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEAT
        L +E    ++ + +T  IMW LW   N+    S    A    ++  +   + K+ EE+     +P       + + +  W PP +G LK N DA+W + T
Subjt:  LSKELSSIELSKAMT--IMWGLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEAT

Query:  RKGGLGWLVRDSRGSLICAGIQK
          GG+GW++RD  G ++ AG ++
Subjt:  RKGGLGWLVRDSRGSLICAGIQK

XP_021816392.1 uncharacterized protein LOC110758782 [Prunus avium]1.4e-0735.9Show/hide
Query:  LSSIELSKAMTIMW---GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKG
        LSS E   A  I W   GLW  W K +  +    A + P     LL+ + +   +     +P P     S + Q+ W  PL G+LK+N DA+W     +G
Subjt:  LSSIELSKAMTIMW---GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKG

Query:  GLGWLVRDSRGSLICAG
        G+GW++RDS G L+CAG
Subjt:  GLGWLVRDSRGSLICAG

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]1.5e-0931.39Show/hide
Query:  WTPKDIWDWLSKELSSIELSKAMTIMWGLWTFWNKC---------KTFSNSLVAFSLPD-RRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPP
        WT KD W+WL   LS  E++ +M I W +W   N+          +    S+V F   +  +G  +   +  ++ DG     LP  R    +    W  P
Subjt:  WTPKDIWDWLSKELSSIELSKAMTIMWGLWTFWNKC---------KTFSNSLVAFSLPD-RRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPP

Query:  LEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG
             KLN DASW E    GG+GW++ D RG ++ AG
Subjt:  LEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia]5.8e-0928.15Show/hide
Query:  DIWDWLSKELSSIELSKAMTIMWGLWTFWN-------------KCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPL
        D +DW+ ++    +    + ++W +WT+ N             K + F+ S +     +  G L + +KNL                       GW PP 
Subjt:  DIWDWLSKELSSIELSKAMTIMWGLWTFWN-------------KCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPL

Query:  EGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICA
        + + KLNVDA+W ++   GGLGW+VRDS G  I A
Subjt:  EGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICA

XP_028948114.1 uncharacterized protein LOC114820933 [Malus domestica]9.2e-0732.59Show/hide
Query:  DIWDWLSKELSSIELSKAM--TIMWGLWTFWNKCKTFSNSLVAFSLPDRRGELLII-EKNLEE---------LDGGASRPLPSNRSKSFVNQNGWLPPLE
        D W+     +   E ++A+     +GLW  W       N +V      +  E+L +  KN+ E         LD GA RPL S+     +    W  P  
Subjt:  DIWDWLSKELSSIELSKAM--TIMWGLWTFWNKCKTFSNSLVAFSLPDRRGELLII-EKNLEE---------LDGGASRPLPSNRSKSFVNQNGWLPPLE

Query:  GVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG
        G  KLN DASWC A+ + G GW++RD  G L  AG
Subjt:  GVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG

TrEMBL top hitse value%identityAlignment
A0A1J3C7A8 Putative ribonuclease H protein (Fragment)4.5e-0730.19Show/hide
Query:  GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQ---NGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLI
        G W  W   K+ ++ ++          +L   +++EE           ++  S  N    N W PP +G LK NVD +W E   + G+GW++RDSRG +I
Subjt:  GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQ---NGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLI

Query:  CAGIQK
          G +K
Subjt:  CAGIQK

A0A1J3D6G7 Uncharacterized protein5.8e-0732.08Show/hide
Query:  GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEE---LDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLI
        G W  W   K  +  ++     D    +   E+++EE      G S+ L ++ +K    Q  W PP EG LK NVD +W +   + G+GW++RD+RG +I
Subjt:  GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEE---LDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLI

Query:  CAGIQK
          G +K
Subjt:  CAGIQK

A0A6J1CQG0 uncharacterized protein LOC1110132167.4e-1031.39Show/hide
Query:  WTPKDIWDWLSKELSSIELSKAMTIMWGLWTFWNKC---------KTFSNSLVAFSLPD-RRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPP
        WT KD W+WL   LS  E++ +M I W +W   N+          +    S+V F   +  +G  +   +  ++ DG     LP  R    +    W  P
Subjt:  WTPKDIWDWLSKELSSIELSKAMTIMWGLWTFWNKC---------KTFSNSLVAFSLPD-RRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPP

Query:  LEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG
             KLN DASW E    GG+GW++ D RG ++ AG
Subjt:  LEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG

A0A6J1D4B6 uncharacterized protein LOC1110171812.8e-0928.15Show/hide
Query:  DIWDWLSKELSSIELSKAMTIMWGLWTFWN-------------KCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPL
        D +DW+ ++    +    + ++W +WT+ N             K + F+ S +     +  G L + +KNL                       GW PP 
Subjt:  DIWDWLSKELSSIELSKAMTIMWGLWTFWN-------------KCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPL

Query:  EGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICA
        + + KLNVDA+W ++   GGLGW+VRDS G  I A
Subjt:  EGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICA

A0A6P5SQ64 uncharacterized protein LOC1107587826.9e-0835.9Show/hide
Query:  LSSIELSKAMTIMW---GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKG
        LSS E   A  I W   GLW  W K +  +    A + P     LL+ + +   +     +P P     S + Q+ W  PL G+LK+N DA+W     +G
Subjt:  LSSIELSKAMTIMW---GLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKG

Query:  GLGWLVRDSRGSLICAG
        G+GW++RDS G L+CAG
Subjt:  GLGWLVRDSRGSLICAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-0442.22Show/hide
Query:  NQNGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG
        N + W  P  G +K N D S+     K   GW+VRDS GS + AG
Subjt:  NQNGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSRGSLICAG

AT4G29090.1 Ribonuclease H-like superfamily protein6.0e-0429.13Show/hide
Query:  IMWGLWTFWNKCKTFSNSLVAFSLPDRRGELL-IIEKNLEELDGGASRPLPSNRSKSFVNQNG---WLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSR
        + W LW  W       N LV         E+L   E +LEE          S  +K  VN++    W PP    +K N DA+W     + G+GW++R+ +
Subjt:  IMWGLWTFWNKCKTFSNSLVAFSLPDRRGELL-IIEKNLEELDGGASRPLPSNRSKSFVNQNG---WLPPLEGVLKLNVDASWCEATRKGGLGWLVRDSR

Query:  GSL
        G +
Subjt:  GSL

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.7e-0631.25Show/hide
Query:  IMWGLW-----TFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDS
        +MW +W       +N  +T   + V  +L D +  L     N  E   G     PS  +K       W PP    LK N DAS  E     GLGW++R+S
Subjt:  IMWGLW-----TFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCEATRKGGLGWLVRDS

Query:  RGSLICAGIQKW
        +G++I  G+ K+
Subjt:  RGSLICAGIQKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTGGAATTTTGGACTCCTAAAGATATCTGGGACTGGTTGTCAAAGGAGTTGAGCTCTATAGAACTTTCTAAAGCCATGACTATTATGTGGGGCTTGTGGACCTT
CTGGAATAAATGTAAAACCTTCTCTAATTCGCTCGTGGCTTTTTCTTTGCCAGATAGAAGGGGTGAGTTGTTGATCATTGAAAAGAACCTGGAGGAGCTTGATGGGGGTG
CGAGTCGCCCTCTTCCCTCGAACAGATCTAAGAGCTTTGTGAATCAAAATGGGTGGCTTCCACCTCTAGAAGGTGTGTTAAAACTTAATGTCGATGCTTCTTGGTGCGAA
GCTACGAGAAAAGGGGGCCTAGGTTGGTTGGTTCGTGATTCCCGTGGTTCTCTCATTTGTGCAGGGATTCAGAAGTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACTTGGAATTTTGGACTCCTAAAGATATCTGGGACTGGTTGTCAAAGGAGTTGAGCTCTATAGAACTTTCTAAAGCCATGACTATTATGTGGGGCTTGTGGACCTT
CTGGAATAAATGTAAAACCTTCTCTAATTCGCTCGTGGCTTTTTCTTTGCCAGATAGAAGGGGTGAGTTGTTGATCATTGAAAAGAACCTGGAGGAGCTTGATGGGGGTG
CGAGTCGCCCTCTTCCCTCGAACAGATCTAAGAGCTTTGTGAATCAAAATGGGTGGCTTCCACCTCTAGAAGGTGTGTTAAAACTTAATGTCGATGCTTCTTGGTGCGAA
GCTACGAGAAAAGGGGGCCTAGGTTGGTTGGTTCGTGATTCCCGTGGTTCTCTCATTTGTGCAGGGATTCAGAAGTGGTAG
Protein sequenceShow/hide protein sequence
MDLEFWTPKDIWDWLSKELSSIELSKAMTIMWGLWTFWNKCKTFSNSLVAFSLPDRRGELLIIEKNLEELDGGASRPLPSNRSKSFVNQNGWLPPLEGVLKLNVDASWCE
ATRKGGLGWLVRDSRGSLICAGIQKW