; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001800 (gene) of Snake gourd v1 genome

Gene IDTan0001800
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG09:69191878..69192508
RNA-Seq ExpressionTan0001800
SyntenyTan0001800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_018829827.1 uncharacterized protein LOC108997896 [Juglans regia]1.1e-1030.16Show/hide
Query:  CSVENQDRDLEILNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLFLWVRG-RWVPLDYW-----DYIR------KSTGSKDLRSLVILMWR
        CS E+Q   +E L DI+P+   L KK ++ +  C  C+  EE+  H+LW     + V G R  PL+ W     D+I              L  + I++++
Subjt:  CSVENQDRDLEILNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLFLWVRG-RWVPLDYW-----DYIR------KSTGSKDLRSLVILMWR

Query:  IWLRRNQVRFGNENVSLEVLIFSVFQMLKVPNKEESPLLSN-SDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE
        IW RR    F N   S  VL+ S    ++V ++    L  + +    ++++  W+PP   ++KLN D A+  +    GIG +VRN +GE
Subjt:  IWLRRNQVRFGNENVSLEVLIFSVFQMLKVPNKEESPLLSN-SDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]3.0e-1125.93Show/hide
Query:  SAVNLSKKGVISNVSCLFCRSFEESTCHLLWE--LVLFLWV-------------RGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNENV
        S V L +     N + L  R  EE+T H+LWE  ++  +W+             R  W   +YW+++    G ++ R  +I+  +IW  RN+  F   + 
Subjt:  SAVNLSKKGVISNVSCLFCRSFEESTCHLLWE--LVLFLWV-------------RGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNENV

Query:  SLEVLIFSVFQMLKVPNKEESPLLSNSDQDF-------ELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGEPVLGGMTFV
            +  ++ + + + +  +   L    +DF       + ++  WKPP    WKLN D AW   +   GIGWI+R+ +GE +  G   +
Subjt:  SLEVLIFSVFQMLKVPNKEESPLLSNSDQDF-------ELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGEPVLGGMTFV

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.4e-1137.61Show/hide
Query:  KDLRSLVILMWRIWLRRNQVRFGNENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE
        +DL  L+I  W IW  RN V F  E+ S   +I  + + +   + +    LS   +      K W+PP +  W LN D +WSD +  GGIGWI+R+W G+
Subjt:  KDLRSLVILMWRIWLRRNQVRFGNENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE

Query:  PVLGGMTFV
         VL G  FV
Subjt:  PVLGGMTFV

XP_023929714.1 uncharacterized protein LOC112041034 [Quercus suber]1.5e-1028Show/hide
Query:  VPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLFLWVRGRWVPLDYWDYIRKST--------------GSKDLRS-LVILMWRIWLRRNQVRFGNE
        +P+ +NL K+ V+SN  C  CR+  E   H LW   +    R  W     +++IR                 GS DL +   +++W IW RR ++R G  
Subjt:  VPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLFLWVRGRWVPLDYWDYIRKST--------------GSKDLRS-LVILMWRIWLRRNQVRFGNE

Query:  NVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE
        +  +  ++    ++L      ++P+   + +  +L    WKPP  G +KLN DGA     +  GIG ++R+W+G+
Subjt:  NVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE

XP_042974792.1 uncharacterized protein LOC122306430 [Carya illinoinensis]1.4e-1128.74Show/hide
Query:  VPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWEL------------VLFLWVRGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNENVS
        +P+ +NL K+ V+   +C  C    E+ CH LW               +  W  G     + W         + L  + ++M RIWLRRN + F N+ V 
Subjt:  VPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWEL------------VLFLWVRGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNENVS

Query:  LEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKG--WKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE
           +     Q L      +    S  D    +S K   WKPPM    K+N D AW  K    GIG ++R+  GE
Subjt:  LEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKG--WKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE

TrEMBL top hitse value%identityAlignment
A0A2N9F155 Uncharacterized protein1.1e-1127.43Show/hide
Query:  NDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLF--LWVRGRWVP----------LDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNE
        ++ +P+ +NL  + VI + SC  C    E+  H LW+  +   +W    W            ++ + +        +L+      W IW RRN  R   +
Subjt:  NDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLF--LWVRGRWVP----------LDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNE

Query:  NVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE
        +  L  L+    Q+L      +    S S Q    ++  WKPP    +K+N DGA+ + S E GIG I+RN  GE
Subjt:  NVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X11.5e-1125.93Show/hide
Query:  SAVNLSKKGVISNVSCLFCRSFEESTCHLLWE--LVLFLWV-------------RGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNENV
        S V L +     N + L  R  EE+T H+LWE  ++  +W+             R  W   +YW+++    G ++ R  +I+  +IW  RN+  F   + 
Subjt:  SAVNLSKKGVISNVSCLFCRSFEESTCHLLWE--LVLFLWV-------------RGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNENV

Query:  SLEVLIFSVFQMLKVPNKEESPLLSNSDQDF-------ELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGEPVLGGMTFV
            +  ++ + + + +  +   L    +DF       + ++  WKPP    WKLN D AW   +   GIGWI+R+ +GE +  G   +
Subjt:  SLEVLIFSVFQMLKVPNKEESPLLSNSDQDF-------ELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGEPVLGGMTFV

A0A6J1DNV9 uncharacterized protein LOC1110224036.6e-1237.61Show/hide
Query:  KDLRSLVILMWRIWLRRNQVRFGNENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE
        +DL  L+I  W IW  RN V F  E+ S   +I  + + +   + +    LS   +      K W+PP +  W LN D +WSD +  GGIGWI+R+W G+
Subjt:  KDLRSLVILMWRIWLRRNQVRFGNENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE

Query:  PVLGGMTFV
         VL G  FV
Subjt:  PVLGGMTFV

M5XSK0 Reverse transcriptase domain-containing protein3.9e-1229.38Show/hide
Query:  ILNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLFLWVRGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGN-----ENVSLE
        +++ I+P+  NL++K V  +  C+ C    +S  H+L +     W  G   P D+     +   S+D  + +++ W IW  RN + + N     E VSL 
Subjt:  ILNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLFLWVRGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGN-----ENVSLE

Query:  VLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGEPVLGGMT
          +  +   L+V N      L +  +  ++ Q  W+PP     K+NVDGAW   + EGG+G +VR+  G+ V G  T
Subjt:  VLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGEPVLGGMT

M5Y023 zf-RVT domain-containing protein (Fragment)7.8e-1329.44Show/hide
Query:  ILNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLL---------WE---LVLFLWVRGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFG
        +++ I+P+  NL++K V  +  C+ C    ES+ H+L         W    L    W  G   P D+     +   S+D  + +++ W IW  RN + + 
Subjt:  ILNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLL---------WE---LVLFLWVRGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFG

Query:  N-----ENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQ
        N     E VSL   +  +   L+V N      L +  +  ++ QK W+PP     K+NVDGAW   + EGG+G +VR W+
Subjt:  N-----ENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein5.1e-0925.82Show/hide
Query:  LNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVL--FLW--------VRGRWVPLD----YWDY-IRKSTGSKDLRSLVI--LMWRIWLRRNQ
        L++ +P A  L+ + +    +C+ C S +E+  HLL++       W        + G W        YW + +       +  S ++  L+WR+W  RN+
Subjt:  LNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVL--FLW--------VRGRWVPLD----YWDY-IRKSTGSKDLRSLVI--LMWRIWLRRNQ

Query:  VRF-GNENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE
        + F G E  + EVL  +   + +   + E+       Q    S   W+PP   + K N D  W+  +   GIGW++RN +GE
Subjt:  VRF-GNENVSLEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTGAAATACCGATGTAGTGTCGAGAATCAAGATCGGGATTTGGAGATCCTCAATGACATCGTTCCTTCGGCGGTTAACTTGTCTAAAAAAGGGGTGATTTCTAA
TGTTAGTTGCCTTTTTTGCAGGTCTTTTGAAGAGTCAACTTGTCATCTGTTGTGGGAGTTGGTTCTTTTTCTATGGGTAAGGGGGAGATGGGTGCCTTTGGACTACTGGG
ATTACATCAGGAAATCTACTGGAAGCAAGGACTTGAGGAGCCTTGTGATCTTGATGTGGAGAATTTGGCTTAGGCGAAACCAAGTTAGGTTTGGAAATGAGAATGTGTCC
TTGGAGGTTTTGATCTTCTCAGTCTTTCAAATGCTGAAAGTGCCCAACAAGGAGGAAAGCCCTTTGTTATCAAACAGTGATCAAGATTTCGAGTTGAGTCAGAAAGGTTG
GAAGCCCCCTATGGTAGGTTATTGGAAGTTGAATGTGGATGGAGCTTGGTCTGACAAATCTCTCGAAGGCGGCATTGGGTGGATCGTTAGGAATTGGCAAGGTGAGCCTG
TTTTGGGCGGAATGACGTTTGTTTATCGGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTGAAATACCGATGTAGTGTCGAGAATCAAGATCGGGATTTGGAGATCCTCAATGACATCGTTCCTTCGGCGGTTAACTTGTCTAAAAAAGGGGTGATTTCTAA
TGTTAGTTGCCTTTTTTGCAGGTCTTTTGAAGAGTCAACTTGTCATCTGTTGTGGGAGTTGGTTCTTTTTCTATGGGTAAGGGGGAGATGGGTGCCTTTGGACTACTGGG
ATTACATCAGGAAATCTACTGGAAGCAAGGACTTGAGGAGCCTTGTGATCTTGATGTGGAGAATTTGGCTTAGGCGAAACCAAGTTAGGTTTGGAAATGAGAATGTGTCC
TTGGAGGTTTTGATCTTCTCAGTCTTTCAAATGCTGAAAGTGCCCAACAAGGAGGAAAGCCCTTTGTTATCAAACAGTGATCAAGATTTCGAGTTGAGTCAGAAAGGTTG
GAAGCCCCCTATGGTAGGTTATTGGAAGTTGAATGTGGATGGAGCTTGGTCTGACAAATCTCTCGAAGGCGGCATTGGGTGGATCGTTAGGAATTGGCAAGGTGAGCCTG
TTTTGGGCGGAATGACGTTTGTTTATCGGAGTTGA
Protein sequenceShow/hide protein sequence
MFVKYRCSVENQDRDLEILNDIVPSAVNLSKKGVISNVSCLFCRSFEESTCHLLWELVLFLWVRGRWVPLDYWDYIRKSTGSKDLRSLVILMWRIWLRRNQVRFGNENVS
LEVLIFSVFQMLKVPNKEESPLLSNSDQDFELSQKGWKPPMVGYWKLNVDGAWSDKSLEGGIGWIVRNWQGEPVLGGMTFVYRS