CuGenDBv2

Tan0006419 (gene) of Snake gourd v1 genome

Gene ID: Tan0006419
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG08:485380..486483
RNA-Seq Expression: Tan0006419
Synteny: Tan0006419
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR044730 - Ribonuclease H-like domain, plant type
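
The genome location field can be split into a scaffold name and a coordinate pair. The following is a minimal sketch, assuming the `name:start..end` layout seen in this record and assuming the coordinates are 1-based and inclusive (the record does not state this). The function names `parse_location` and `span_length` are hypothetical and are not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into scaffold, start, end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span in base pairs.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


scaffold, start, end = parse_location("LG08:485380..486483")
print(scaffold, start, end, span_length(start, end))
# prints: LG08 485380 486483 1104
```

Under that assumption the gene spans 1104 bp on LG08.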


Homology
GenBank top hits | e value | % identity | Alignment
AAS55787.1 hypothetical protein [Oryza sativa Japonica Group] | 1.1e-15 | 27.24
Query:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR
        ++DLI  +  W  + I   FL  D + I  I +      + I W  D+ G+FSV+SAY+L  +L+   +CS S SS +   WE +WK +V  +++I  WR
Subjt:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR

Query:  LLNDIVPCVQG----SLVAFGVLGL-------------------MMELYLDKDSLM--------------SDRSSQTIELSPKGW-KLNVDATWFDKALK
        + ++ +  ++     +L  F V G+                    + + L+K   +                  ++  E    GW KLNVD ++   + K
Subjt:  LLNDIVPCVQG----SLVAFGVLGL-------------------MMELYLDKDSLM--------------SDRSSQTIELSPKGW-KLNVDATWFDKALK

Query:  GGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNSMCVDLSLREN
        GG G I+R+ L   +      ++ C      E  A +EGL   L  W   + P+ VE+D   VI LLN    D S+  N
Subjt:  GGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNSMCVDLSLREN

EEC68026.1 hypothetical protein OsI_35837 [Oryza sativa Indica Group] | 1.8e-18 | 29.66
Query:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR
        ++DLI+ +  W    I   FLP D + ILNI L   Q  + + W  DR G FSV+SAY L   L+   S S S    ++  W+ LWK  V  ++KI  W+
Subjt:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR

Query:  LLNDIVPCVQG----SLVAFGVLGLMMELYLDKDSLMSDRSSQTIELSPKGW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEA
          ++ +P ++     +L A  +  +   +   K  L              GW KLN+D ++     +GG G I+R+     +     F+ERC      E 
Subjt:  LLNDIVPCVQG----SLVAFGVLGLMMELYLDKDSLMSDRSSQTIELSPKGW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEA

Query:  KAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS
         A  EG+   L  W   + P+ +E+D +E + L  S
Subjt:  KAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS

EEC73134.1 hypothetical protein OsI_07152 [Oryza sativa Indica Group] | 3.5e-22 | 31.2
Query:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR
        ++D+++ +  W E+++   FLP DVE+IL I +   Q  + + W  DR G+FSV+SAY+L  +++R   CS S    +K  W  +W  +V  ++KI  WR
Subjt:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR

Query:  LLNDIVPCVQGSLVA-FGVLGLMMELYLDKDSLMSDRSSQTIELSPK-GW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKA
           + +P ++      FG++G  +         M+ +    +   PK GW KLN+D ++  +  +GG G ++R+     +     F+ RC      E  A
Subjt:  LLNDIVPCVQGSLVA-FGVLGLMMELYLDKDSLMSDRSSQTIELSPK-GW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKA

Query:  MIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS
          +GL+  L  W   + P+ VESD +E+I LLNS
Subjt:  MIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS

XP_022149515.1 uncharacterized protein LOC111017927 [Momordica charantia] | 1.8e-15 | 44.44
Query:  WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWRLLNDIVPCV
        W ES+IR+SFL  + + ILNIPL      +E+IW  D+K KFSVKS YRLG  L+ +     S+S E    W+ LW+  V  ++KI  WR+ NDI+  +
Subjt:  WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWRLLNDIVPCV

XP_024200343.1 uncharacterized protein LOC112203645 [Rosa chinensis] | 7.5e-17 | 27.24
Query:  WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSR----SLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWRLLNDIV
        W ES+IR +F PH+V+ IL+IP+   +  + I+W   + G+++VKS   L   L R    S+ CS S + E   +W+ LWK+ +  ++K+ +WR     +
Subjt:  WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSR----SLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWRLLNDIV

Query:  PCV----------------------QGSLVAFGVLGLMMELYLDK--------------------DSLMSDRSSQTIEL-----SPKGWKLNVDATWFDK
        PC                       Q S+V   V GL    +  +                     ++  D + + +++     +    KLN DA    K
Subjt:  PCV----------------------QGSLVAFGVLGLMMELYLDK--------------------DSLMSDRSSQTIEL-----SPKGWKLNVDATWFDK

Query:  ALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNSMCVDLSL
          K G G +VRD   +    G   +     I  +EA A+  G   LL   +     L+VESDS  VI  LN   +DLS+
Subjt:  ALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNSMCVDLSL

TrEMBL top hits | e value | % identity | Alignment
A0A6J1D5Y4 uncharacterized protein LOC111017927 | 8.9e-16 | 44.44
Query:  WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWRLLNDIVPCV
        W ES+IR+SFL  + + ILNIPL      +E+IW  D+K KFSVKS YRLG  L+ +     S+S E    W+ LW+  V  ++KI  WR+ NDI+  +
Subjt:  WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWRLLNDIVPCV

B8AHI8 Uncharacterized protein | 1.7e-22 | 31.2
Query:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR
        ++D+++ +  W E+++   FLP DVE+IL I +   Q  + + W  DR G+FSV+SAY+L  +++R   CS S    +K  W  +W  +V  ++KI  WR
Subjt:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR

Query:  LLNDIVPCVQGSLVA-FGVLGLMMELYLDKDSLMSDRSSQTIELSPK-GW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKA
           + +P ++      FG++G  +         M+ +    +   PK GW KLN+D ++  +  +GG G ++R+     +     F+ RC      E  A
Subjt:  LLNDIVPCVQGSLVA-FGVLGLMMELYLDKDSLMSDRSSQTIELSPK-GW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKA

Query:  MIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS
          +GL+  L  W   + P+ VESD +E+I LLNS
Subjt:  MIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS

B8BK40 Uncharacterized protein | 8.6e-19 | 29.66
Query:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR
        ++DLI+ +  W    I   FLP D + ILNI L   Q  + + W  DR G FSV+SAY L   L+   S S S    ++  W+ LWK  V  ++KI  W+
Subjt:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR

Query:  LLNDIVPCVQG----SLVAFGVLGLMMELYLDKDSLMSDRSSQTIELSPKGW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEA
          ++ +P ++     +L A  +  +   +   K  L              GW KLN+D ++     +GG G I+R+     +     F+ERC      E 
Subjt:  LLNDIVPCVQG----SLVAFGVLGLMMELYLDKDSLMSDRSSQTIELSPKGW-KLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEA

Query:  KAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS
         A  EG+   L  W   + P+ +E+D +E + L  S
Subjt:  KAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNS

Q75M12 Reverse transcriptase domain-containing protein | 5.2e-16 | 27.24
Query:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR
        ++DLI  +  W  + I   FL  D + I  I +      + I W  D+ G+FSV+SAY+L  +L+   +CS S SS +   WE +WK +V  +++I  WR
Subjt:  IADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR

Query:  LLNDIVPCVQG----SLVAFGVLGL-------------------MMELYLDKDSLM--------------SDRSSQTIELSPKGW-KLNVDATWFDKALK
        + ++ +  ++     +L  F V G+                    + + L+K   +                  ++  E    GW KLNVD ++   + K
Subjt:  LLNDIVPCVQG----SLVAFGVLGL-------------------MMELYLDKDSLM--------------SDRSSQTIELSPKGW-KLNVDATWFDKALK

Query:  GGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNSMCVDLSLREN
        GG G I+R+ L   +      ++ C      E  A +EGL   L  W   + P+ VE+D   VI LLN    D S+  N
Subjt:  GGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKAMIEGLSQLLSLWKEPIPPLIVESDSVEVIGLLNSMCVDLSLREN

S8D7I6 Uncharacterized protein (Fragment) | 3.2e-13 | 41.38
Query:  DRRIADLIQVERG-WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSC-SPSDSSEVKA----LWEDLWKIDVM
        D R++DLI   RG W +S +R+ F P D E IL+IPL  T+  +++IW+    G +SVKS    GY L +SL+  +PS +S   A    LW+ LWK+ + 
Subjt:  DRRIADLIQVERG-WKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSC-SPSDSSEVKA----LWEDLWKIDVM

Query:  PRIKIGVWRLLNDIVP
        P+I +  WRL  +I+P
Subjt:  PRIKIGVWRLLNDIVP

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGATTTCGTGTCCTCTGCGAGATAGGCGTATTGCGGATCTCATCCAGGTAGAGAGAGGATGGAAGGAGAGTGTGATTCGTGACTCTTTCCTTCCTCATGATGTCGAGGA
TATCCTAAATATTCCTTTAGGAGGGACTCAGGTGATCGAGGAGATTATATGGGACATGGATAGGAAAGGAAAGTTCTCGGTTAAGAGTGCCTATAGATTAGGTTACCGGC
TTTCCCGATCCCTTAGTTGTTCTCCTTCTGACTCGAGTGAGGTGAAAGCCCTATGGGAGGATTTGTGGAAGATTGACGTTATGCCTAGGATTAAGATCGGGGTGTGGAGG
CTTCTCAATGACATTGTCCCCTGTGTGCAGGGCTCATTGGTTGCCTTTGGAGTATTGGGACTAATGATGGAACTTTATCTGGATAAGGATAGCCTCATGTCTGATAGGAG
TTCTCAAACGATTGAACTGAGCCCGAAGGGTTGGAAGCTGAACGTAGATGCGACTTGGTTTGATAAGGCCCTTAAAGGGGGCTTTGGATGGATTGTTCGTGATTGGCTTA
AGGAGCCTGTTTTGGGTGGGATGACCTTTGTGGAGCGTTGTTGGCCTATTAAGGTCCTTGAAGCAAAAGCCATGATCGAAGGTTTGTCCCAACTTTTGTCGTTGTGGAAG
GAGCCTATACCCCCTCTCATCGTGGAATCTGACTCCGTGGAGGTTATTGGTTTGCTTAATAGCATGTGTGTTGATCTCTCTCTGAGAGAGAATAACAAAACCGTTCATAA
TCTTACAGTTTTGACATCCTCTTCGAGCAATTCTTTGATCGGAAAAGAGTGTTTTGGGACGGATGTTTATCCTCCCCTTGTTGATAGGAGCCGATTGGGTGTGGTTTTCC
TGGGTTCTTGCTGTTTCGTTTAA
mRNA sequence
ATGATTTCGTGTCCTCTGCGAGATAGGCGTATTGCGGATCTCATCCAGGTAGAGAGAGGATGGAAGGAGAGTGTGATTCGTGACTCTTTCCTTCCTCATGATGTCGAGGA
TATCCTAAATATTCCTTTAGGAGGGACTCAGGTGATCGAGGAGATTATATGGGACATGGATAGGAAAGGAAAGTTCTCGGTTAAGAGTGCCTATAGATTAGGTTACCGGC
TTTCCCGATCCCTTAGTTGTTCTCCTTCTGACTCGAGTGAGGTGAAAGCCCTATGGGAGGATTTGTGGAAGATTGACGTTATGCCTAGGATTAAGATCGGGGTGTGGAGG
CTTCTCAATGACATTGTCCCCTGTGTGCAGGGCTCATTGGTTGCCTTTGGAGTATTGGGACTAATGATGGAACTTTATCTGGATAAGGATAGCCTCATGTCTGATAGGAG
TTCTCAAACGATTGAACTGAGCCCGAAGGGTTGGAAGCTGAACGTAGATGCGACTTGGTTTGATAAGGCCCTTAAAGGGGGCTTTGGATGGATTGTTCGTGATTGGCTTA
AGGAGCCTGTTTTGGGTGGGATGACCTTTGTGGAGCGTTGTTGGCCTATTAAGGTCCTTGAAGCAAAAGCCATGATCGAAGGTTTGTCCCAACTTTTGTCGTTGTGGAAG
GAGCCTATACCCCCTCTCATCGTGGAATCTGACTCCGTGGAGGTTATTGGTTTGCTTAATAGCATGTGTGTTGATCTCTCTCTGAGAGAGAATAACAAAACCGTTCATAA
TCTTACAGTTTTGACATCCTCTTCGAGCAATTCTTTGATCGGAAAAGAGTGTTTTGGGACGGATGTTTATCCTCCCCTTGTTGATAGGAGCCGATTGGGTGTGGTTTTCC
TGGGTTCTTGCTGTTTCGTTTAA
Protein sequence
MISCPLRDRRIADLIQVERGWKESVIRDSFLPHDVEDILNIPLGGTQVIEEIIWDMDRKGKFSVKSAYRLGYRLSRSLSCSPSDSSEVKALWEDLWKIDVMPRIKIGVWR
LLNDIVPCVQGSLVAFGVLGLMMELYLDKDSLMSDRSSQTIELSPKGWKLNVDATWFDKALKGGFGWIVRDWLKEPVLGGMTFVERCWPIKVLEAKAMIEGLSQLLSLWK
EPIPPLIVESDSVEVIGLLNSMCVDLSLRENNKTVHNLTVLTSSSSNSLIGKECFGTDVYPPLVDRSRLGVVFLGSCCFV
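
The relationship between the CDS and protein sequences above can be spot-checked by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The `translate` function and the codon-table layout are illustrative choices rather than CuGenDB code, and only the first 30 and last 12 nucleotides of the CDS are used as input.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGATTTCGTGTCCTCTGCGAGATAGGCGT"))  # prints: MISCPLRDRR
# Last 12 nucleotides of the CDS, ending in the TAA stop codon.
print(translate("TGTTTCGTTTAA"))  # prints: CFV
```

Both fragments agree with the start (MISCPLRDRR) and end (CFV) of the listed protein sequence.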