; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004524 (gene) of Snake gourd v1 genome

Gene IDTan0004524
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG04:13181817..13186272
RNA-Seq ExpressionTan0004524
SyntenyTan0004524
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAQ19327.1 bZIP-like protein [Oryza sativa Japonica Group]8.2e-1334.4Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        KG  W++GNG  + I  DSW   D    P           V  ++ EDGSWDV KI + FH +D   IL I  +     + + W  ++ G+F+VRSAYRL
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSHGCPQETSSSNNEKVKEFWKKL
               +E+SSS    + + W+ +
Subjt:  GLSHGCPQETSSSNNEKVKEFWKKL

ABA97040.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]1.3e-1337.7Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        KG  W+IGNG  V I  D W   D    P           V  +L +DGSWDV+K+ R F  +D   IL+I  +  L  + L W  +R G F+VRSAY+L
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSHGCPQETSSSNNEKVKEFW
         +S     E+SSS+ +  ++ W
Subjt:  GLSHGCPQETSSSNNEKVKEFW

EEC68887.1 hypothetical protein OsI_37529 [Oryza sativa Indica Group]8.2e-1334.4Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        KG  W++GNG  + I  DSW   D    P           V  ++ EDGSWDV KI + FH +D   IL I  +     + + W  ++ G+F+VRSAYRL
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSHGCPQETSSSNNEKVKEFWKKL
               +E+SSS    + + W+ +
Subjt:  GLSHGCPQETSSSNNEKVKEFWKKL

EEC81552.1 hypothetical protein OsI_24974 [Oryza sativa Indica Group]4.8e-1336.89Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        KG  W+IGNG  V I  D W   D    P           V  +L +DGSWDV+ + R F  +D   IL+I  +  L  + L W  +R G F+VRSAY+L
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSHGCPQETSSSNNEKVKEFW
         +S     E+SSS+ +  ++ W
Subjt:  GLSHGCPQETSSSNNEKVKEFW

XP_025877668.1 uncharacterized protein LOC4351546 [Oryza sativa Japonica Group]8.2e-1334.4Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        KG  W++GNG  + I  DSW   D    P           V  ++ EDGSWDV KI + FH +D   IL I  +     + + W  ++ G+F+VRSAYRL
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSHGCPQETSSSNNEKVKEFWKKL
               +E+SSS    + + W+ +
Subjt:  GLSHGCPQETSSSNNEKVKEFWKKL

TrEMBL top hitse value%identityAlignment
A0A803NHG3 Uncharacterized protein4.0e-1331.01Show/hide
Query:  EEVIKGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRS
        E ++KGYRW++GNG QV ++ D W          +  P   Q  V  +    G WD   I+ NF+  D   IL++P     L ++++W ++R G +TVRS
Subjt:  EEVIKGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRS

Query:  AYRLGLSHGCPQETSSSNNEKVKEFWKKL
         YR+       +  ++   + +K++W+KL
Subjt:  AYRLGLSHGCPQETSSSNNEKVKEFWKKL

A0A803PYN6 Uncharacterized protein1.0e-1331.75Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        +G  WK+GNG  +  +ND W + +  +   ++       S+ + +   G+WD+ K++ +F +  + HIL IP  G + ++ L+W  +  GIF V+S Y L
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSH-GCPQETSSSNNEKVKEFWKKL
         +SH   P  +S   N   K FW KL
Subjt:  GLSH-GCPQETSSSNNEKVKEFWKKL

A0A803QJV0 Uncharacterized protein1.8e-1331.78Show/hide
Query:  EEVIKGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRS
        E ++KGYRW++GNG QV ++ D W          +  P   Q  V  +    G WD   I+ NF+  D   IL++P     L ++++W ++R G +TVRS
Subjt:  EEVIKGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRS

Query:  AYRLGLSHGCPQETSSSNNEKVKEFWKKL
         YR+       +  ++S  + +K++W KL
Subjt:  AYRLGLSHGCPQETSSSNNEKVKEFWKKL

B8B7F1 zf-RVT domain-containing protein2.3e-1336.89Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        KG  W+IGNG  V I  D W   D    P           V  +L +DGSWDV+ + R F  +D   IL+I  +  L  + L W  +R G F+VRSAY+L
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSHGCPQETSSSNNEKVKEFW
         +S     E+SSS+ +  ++ W
Subjt:  GLSHGCPQETSSSNNEKVKEFW

Q2QUC2 Retrotransposon protein, putative, unclassified6.1e-1437.7Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL
        KG  W+IGNG  V I  D W   D    P           V  +L +DGSWDV+K+ R F  +D   IL+I  +  L  + L W  +R G F+VRSAY+L
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSAYRL

Query:  GLSHGCPQETSSSNNEKVKEFW
         +S     E+SSS+ +  ++ W
Subjt:  GLSHGCPQETSSSNNEKVKEFW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein8.8e-0528.97Show/hide
Query:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGS---WDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSA
        KG R  IG+G  + I  D+  +      P     T+ + ++  +    GS   WD  KI +     D   I +I    S   ++++W +N  G +TVRS 
Subjt:  KGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGS---WDVEKIKRNFHLVDVRHILQIPQTGSLLNNELVWKHNRKGIFTVRSA

Query:  YRLGLSH
        Y L L+H
Subjt:  YRLGLSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGACACACGAAGAACGTTAGAAAACGAACTGTTGTATTACGACAGAGAAATCCTTCTCTTAGACGAGCCCAAGGGATCACGACATTTCGCAGACCATGAATTTCG
GTACGCTTCTTTTTGGATTCATATCCACAATCTACCCTCCGCAGATGCGGAAGAATCGTTAATACATAACGTGAATGCCCTTTCAAAGTCGATATTGAGCCAGAGGGGAA
ACAATTCGAACCTAATCTCTGACATCCCATTTCACACGAGGCATCTCTGGCCAGTTTTTGTTCGAGGAGCATGGACTCCAGAGGACGAGGGAGATCCAACCGAGGGTGAA
ACGAGGGCGAGCGGAGGTGGCTCATCGGCGACACAACAAAAGAAGCAGGAAGCTACCGACGACAATGGCTCAAGAAAGATTTTCGATGAATTTAAAATTATCCTAAATGC
TATTAACTTTTCAGCTACAACGAATCCAAAGGAGGAACTTGCGGCGGGATCCATGGAACATTCAAAAGCGCCCCGAAAGGTAAAGAAAGTTGTCGTTACGGAAGTCACCA
AGGCGTTAGGGTTTGAGGAATCAGTTTCGGATGGAAAAAGTGAAGATGAAGGGAAACAACGATTTTCGAAGGAGACGGTTGTGGGACCCACAATAATTAATCGACCACGA
GCAAAGATGGATAAGTTGGCAAGAAAATTGAATCCAAGCGGTGAAGGAAATGGGTCGAATGACTTATTGGGCCTAATTTTTAATGTGGGTTGTGACTCGTCTAAGATTAA
GTTGGACCACAATGTCGGGCCTCGTGGTTCGGCGAGCTGTAATCCGAAGAAGAAACTAGAGATGGGTGTCGTGGGCCTCAAAAAGTCCAACAAGGTTTCAGGCCAGAAGA
GAATCCAAGGAAAACTATCAGAAGAAGGACCATCTAGCACAGTGCTCAGTAGAGTTGAAAAGGTGGAACCACCGTCGCATGGGTGGCTCGATTCGGGCTCTAACCCGCTA
GACCCCAAGGCAATAGAGGAAGTTATTCGAAGTATCCCTCAAACAATTTCGGAGGATACTAATAATTGGCCTTTAAAAGACTTCACAAGAATGGAGATTGAGGAGGTGAT
AAAGGGTTACAGATGGAAGATTGGAAATGGTTGCCAGGTAACTATCATGAATGACTCCTGGTTTATAAGTGATGATCGGGTGCATCCCAAGGAGGTTGCTCCTACCTTTG
CCCAAGGCTCCGTCCAGTTTATTCTAAGGGAAGACGGTTCTTGGGATGTAGAGAAGATTAAGAGAAACTTCCACTTGGTGGATGTGCGCCATATTTTACAGATTCCCCAG
ACAGGGTCTCTTTTAAACAACGAGCTAGTTTGGAAGCACAATAGAAAAGGGATCTTCACAGTCAGGAGTGCGTACAGGTTAGGCCTCAGTCATGGTTGTCCTCAGGAAAC
TTCAAGTTCTAATAACGAGAAAGTGAAGGAATTCTGGAAGAAATTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGACACACGAAGAACGTTAGAAAACGAACTGTTGTATTACGACAGAGAAATCCTTCTCTTAGACGAGCCCAAGGGATCACGACATTTCGCAGACCATGAATTTCG
GTACGCTTCTTTTTGGATTCATATCCACAATCTACCCTCCGCAGATGCGGAAGAATCGTTAATACATAACGTGAATGCCCTTTCAAAGTCGATATTGAGCCAGAGGGGAA
ACAATTCGAACCTAATCTCTGACATCCCATTTCACACGAGGCATCTCTGGCCAGTTTTTGTTCGAGGAGCATGGACTCCAGAGGACGAGGGAGATCCAACCGAGGGTGAA
ACGAGGGCGAGCGGAGGTGGCTCATCGGCGACACAACAAAAGAAGCAGGAAGCTACCGACGACAATGGCTCAAGAAAGATTTTCGATGAATTTAAAATTATCCTAAATGC
TATTAACTTTTCAGCTACAACGAATCCAAAGGAGGAACTTGCGGCGGGATCCATGGAACATTCAAAAGCGCCCCGAAAGGTAAAGAAAGTTGTCGTTACGGAAGTCACCA
AGGCGTTAGGGTTTGAGGAATCAGTTTCGGATGGAAAAAGTGAAGATGAAGGGAAACAACGATTTTCGAAGGAGACGGTTGTGGGACCCACAATAATTAATCGACCACGA
GCAAAGATGGATAAGTTGGCAAGAAAATTGAATCCAAGCGGTGAAGGAAATGGGTCGAATGACTTATTGGGCCTAATTTTTAATGTGGGTTGTGACTCGTCTAAGATTAA
GTTGGACCACAATGTCGGGCCTCGTGGTTCGGCGAGCTGTAATCCGAAGAAGAAACTAGAGATGGGTGTCGTGGGCCTCAAAAAGTCCAACAAGGTTTCAGGCCAGAAGA
GAATCCAAGGAAAACTATCAGAAGAAGGACCATCTAGCACAGTGCTCAGTAGAGTTGAAAAGGTGGAACCACCGTCGCATGGGTGGCTCGATTCGGGCTCTAACCCGCTA
GACCCCAAGGCAATAGAGGAAGTTATTCGAAGTATCCCTCAAACAATTTCGGAGGATACTAATAATTGGCCTTTAAAAGACTTCACAAGAATGGAGATTGAGGAGGTGAT
AAAGGGTTACAGATGGAAGATTGGAAATGGTTGCCAGGTAACTATCATGAATGACTCCTGGTTTATAAGTGATGATCGGGTGCATCCCAAGGAGGTTGCTCCTACCTTTG
CCCAAGGCTCCGTCCAGTTTATTCTAAGGGAAGACGGTTCTTGGGATGTAGAGAAGATTAAGAGAAACTTCCACTTGGTGGATGTGCGCCATATTTTACAGATTCCCCAG
ACAGGGTCTCTTTTAAACAACGAGCTAGTTTGGAAGCACAATAGAAAAGGGATCTTCACAGTCAGGAGTGCGTACAGGTTAGGCCTCAGTCATGGTTGTCCTCAGGAAAC
TTCAAGTTCTAATAACGAGAAAGTGAAGGAATTCTGGAAGAAATTATAG
Protein sequenceShow/hide protein sequence
MADTRRTLENELLYYDREILLLDEPKGSRHFADHEFRYASFWIHIHNLPSADAEESLIHNVNALSKSILSQRGNNSNLISDIPFHTRHLWPVFVRGAWTPEDEGDPTEGE
TRASGGGSSATQQKKQEATDDNGSRKIFDEFKIILNAINFSATTNPKEELAAGSMEHSKAPRKVKKVVVTEVTKALGFEESVSDGKSEDEGKQRFSKETVVGPTIINRPR
AKMDKLARKLNPSGEGNGSNDLLGLIFNVGCDSSKIKLDHNVGPRGSASCNPKKKLEMGVVGLKKSNKVSGQKRIQGKLSEEGPSSTVLSRVEKVEPPSHGWLDSGSNPL
DPKAIEEVIRSIPQTISEDTNNWPLKDFTRMEIEEVIKGYRWKIGNGCQVTIMNDSWFISDDRVHPKEVAPTFAQGSVQFILREDGSWDVEKIKRNFHLVDVRHILQIPQ
TGSLLNNELVWKHNRKGIFTVRSAYRLGLSHGCPQETSSSNNEKVKEFWKKL