; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013054 (gene) of Snake gourd v1 genome

Gene IDTan0013054
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon gag protein
Genome locationLG01:94283223..94285420
RNA-Seq ExpressionTan0013054
SyntenyTan0013054
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata]4.9e-5945.75Show/hide
Query:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN
        F++E+LSEAWERFK+ LRKCPHHGLPHCIQ+E FYNGLNT+TKQVVDASANG +LS+TYNEA+EILERI+SNNCQW D RS+ GKK R +LE+DAL++IN
Subjt:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN

Query:  AQMASMADMLKNLTVG-----------------------------------------------NATQASPNV----------------------------
        AQ+ASM ++L+NL  G                                                A+Q +P                              
Subjt:  AQMASMADMLKNLTVG-----------------------------------------------NATQASPNV----------------------------

Query:  ---------------------QNAVVHQRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELP
                             Q A       +  QHI GT LESL+KEYMA+NDAVIQ        QQ S+RN EVQ+GQLANE+R+RP GK+P DTE+P
Subjt:  ---------------------QNAVVHQRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELP

Query:  KREGKE
        KREG E
Subjt:  KREGKE

XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]3.7e-5944.95Show/hide
Query:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA
        +DE+ S+AWERFK++LRKCPHHG+PHCIQ+E FYNGLN +++ V+DASANG +LS++YNEAFEILERI+SNN QW+ +R+   +KV G+LE+DALTA+ A
Subjt:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA

Query:  QMASMADMLKNLTVGNATQASPNVQNAVV------------------------------------------------HQRVVAAPQHIPGTSLESLLKEY
        QMASM ++LKN+ +G + Q +  +Q A +                                                  +    PQ    +SLESL+++Y
Subjt:  QMASMADMLKNLTVGNATQASPNVQNAVV------------------------------------------------HQRVVAAPQHIPGTSLESLLKEY

Query:  MAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGKPLAVSKRTTRKTIREGISK
        MAKNDAVIQ        Q AS++N E+QLGQLAN++++RPQG +P+DTE P+R+GKE  +A+ LRSGK +  +   T       I K
Subjt:  MAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGKPLAVSKRTTRKTIREGISK

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]2.7e-5746.43Show/hide
Query:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA
        +DE+ S+AWERFK+LLRKCPHHG+PHCIQ+E FYNGLN +++ V+DASANG +LS++YNEAFEILERI+SNN QW+ +R+   +KV G+LE+DALTA+ A
Subjt:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA

Query:  QMASMADMLKNLTVGNATQASP------------------NVQNAVVHQRVVAAPQHIPG----------------------------------------
        QMASM ++LKN+ +G + Q +                   N  N         A +H P                                         
Subjt:  QMASMADMLKNLTVGNATQASP------------------NVQNAVVHQRVVAAPQHIPG----------------------------------------

Query:  --TSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGK
          +SLESL+++YMAKNDAVIQ        Q AS+RN EVQLGQLAN++++RPQG +P+DTE P+R+GKE  +A+ LRSGK
Subjt:  --TSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGK

XP_030505222.1 uncharacterized protein LOC115720205 [Cannabis sativa]1.2e-5745.88Show/hide
Query:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA
        +DE+ S+AWERFK+LLRKCPHHG+PHCIQ+E FYNGLN +++ V+DASANG +LS++YNEAFEILERI+SNN QW+ +R+   +KV G+LE+DALTA+ A
Subjt:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA

Query:  QMASMADMLKNLTVGNATQASPNVQNAVVH------------------------------QRVVAAPQHIPG------------------TSLESLLKEY
        QMASM ++LKN+ +G + Q +  +Q A +                               Q+        PG                  +SLESL+++Y
Subjt:  QMASMADMLKNLTVGNATQASPNVQNAVVH------------------------------QRVVAAPQHIPG------------------TSLESLLKEY

Query:  MAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGKPLAVSKRTTRK
        MAKNDAVIQ Q  ++        N EVQLGQLAN++++RPQG +P+DTE P+R+ KE  +A+ LRSGK +  +   T K
Subjt:  MAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGKPLAVSKRTTRK

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]1.5e-6054.89Show/hide
Query:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA
        +DES S+AWERFK+LLRKCPHHG+PHCIQME FYNGLN +++ V+DASANG +LS++YNEAFEILE I+SNN QW+++R+   +KV G+LE+DA+TA+ A
Subjt:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA

Query:  QMASMADMLKNLTVG--NATQASPNVQNAVVH-------QRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSR
        QMASM +   NL+ G   A+ ++   Q    +        R     Q+   +SLESL+++YMAKNDAVIQ        Q AS+RN E+QLG LANE+++R
Subjt:  QMASMADMLKNLTVG--NATQASPNVQNAVVH-------QRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSR

Query:  PQGKVPADTELPKREGKEQVQAIELRSGKPLAVSK
        PQG +P+DTE P+R+GKEQ ++I LRSGK L  S+
Subjt:  PQGKVPADTELPKREGKEQVQAIELRSGKPLAVSK

TrEMBL top hitse value%identityAlignment
A0A5B6VWJ0 Retroelement pol polyprotein-like3.2e-4841.08Show/hide
Query:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA
        DDESL EAWERFK+LL+KCPHHG+PHCIQ+E FYNGL   T+ VVDASANG LLS++YNEA+EI+ERI+SNN QW  SR++ G++V G+ E+DA+T++ +
Subjt:  DDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINA

Query:  QMASMADMLKNLTV------------------------GNATQASPNVQNAVVH----------------------------------------------
        Q++S++ M KNLT                         G+  +  P+   +V +                                              
Subjt:  QMASMADMLKNLTV------------------------GNATQASPNVQNAVVH----------------------------------------------

Query:  -------QRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGK
               Q+V    Q     SLESLLK YMAKNDA+IQ        Q A+++N E Q+GQLA E+R+R QG +P+DTE P+  GKE  +A+ LRS K
Subjt:  -------QRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGK

A0A6J1EEI2 uncharacterized protein LOC1114333947.9e-4743.68Show/hide
Query:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN
        F+D++LSEAWERFK++LRKCPHHGLPHCIQME FYNGLN +TKQVVDASANG +LS+TYNEA+EILERI+SNNCQWAD RS+ G+K RG+LE+DAL++IN
Subjt:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN

Query:  AQMASMADMLKNLTVG-----------------------------------------------NATQASP------------------------------
        AQ+AS+ ++L+NL +G                                                A+Q +P                              
Subjt:  AQMASMADMLKNLTVG-----------------------------------------------NATQASP------------------------------

Query:  -----------NVQNAVVH--QRVVAAPQHIP------GTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQ
                    +QN + +  Q+V    + IP      GTS+ESL+KEYMAKND VI       Q QQAS+RN EVQ
Subjt:  -----------NVQNAVVH--QRVVAAPQHIP------GTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQ

A0A6J1EQ90 uncharacterized protein LOC1114364113.9e-4641.37Show/hide
Query:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN
        F+DE+LSEA ERFK++LRKCPHHGLPHCIQME FYNGLN  TKQVVDASANG +LS+TYNEA+EILERI+SNNCQWAD RS+ G+K RG+LE+DAL++IN
Subjt:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN

Query:  AQMASMADMLKNLTVGN--------------------------------------------ATQAS--------------------PN------------
        AQ+AS+ ++L+NL +G                                               QAS                    PN            
Subjt:  AQMASMADMLKNLTVGN--------------------------------------------ATQAS--------------------PN------------

Query:  ------------VQNAVVHQ--------RVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELP
                    +QN + +         +     Q+   TS+ESL+KEYMAKNDAVIQ        QQAS+RN EVQ+G   N  +     +  ADT+  
Subjt:  ------------VQNAVVHQ--------RVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELP

Query:  KREGKEQ
          E   Q
Subjt:  KREGKEQ

A0A6J1G7Q6 uncharacterized protein LOC1114515982.4e-5945.75Show/hide
Query:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN
        F++E+LSEAWERFK+ LRKCPHHGLPHCIQ+E FYNGLNT+TKQVVDASANG +LS+TYNEA+EILERI+SNNCQW D RS+ GKK R +LE+DAL++IN
Subjt:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN

Query:  AQMASMADMLKNLTVG-----------------------------------------------NATQASPNV----------------------------
        AQ+ASM ++L+NL  G                                                A+Q +P                              
Subjt:  AQMASMADMLKNLTVG-----------------------------------------------NATQASPNV----------------------------

Query:  ---------------------QNAVVHQRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELP
                             Q A       +  QHI GT LESL+KEYMA+NDAVIQ        QQ S+RN EVQ+GQLANE+R+RP GK+P DTE+P
Subjt:  ---------------------QNAVVHQRVVAAPQHIPGTSLESLLKEYMAKNDAVIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELP

Query:  KREGKE
        KREG E
Subjt:  KREGKE

A0A6J1H7E4 uncharacterized protein LOC1114611681.3e-4670Show/hide
Query:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN
        F+DE+LSEAWERFK++LRKCPHHGLPHCIQME FYNGLN +TKQVVDASANG +LS+TYNEA+EILERI+SNNCQWAD RS+ GKK RG+LE+DAL++IN
Subjt:  FDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDASANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAIN

Query:  AQMASMADMLKNLTVGNATQASPNVQNAVV
        AQ+AS+ ++L+NL  G  T        A V
Subjt:  AQMASMADMLKNLTVGNATQASPNVQNAVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATTTCCCAGGTTTAGAATTTTTTCTAGATCCTGAACCTGAGAGAACATTTCATAGAAGAAGGAGGATTCAAAATGAAAGAATGGCAGATTTCCGTGGAGGTCA
TGTAGATCCTAATAATCCTCAGAATAACCCTGGAAACCAACCACACGGCCAAGAAGGTTTTGGACCACCTTTATTTGATGATGAATCCTTAAGTGAGGCGTGGGAGAGAT
TTAAAGACTTGCTTAGGAAGTGTCCTCATCATGGTCTCCCTCATTGTATTCAAATGGAGATATTTTACAATGGGTTGAATACATCCACTAAACAAGTTGTTGATGCTTCG
GCAAATGGGACTCTTTTATCAAGAACCTATAATGAAGCATTTGAGATACTGGAGAGAATATCCTCAAACAACTGTCAATGGGCTGATTCAAGGAGCTCCCAAGGTAAAAA
GGTCAGAGGCATGTTGGAGTTAGATGCTTTAACCGCTATAAATGCACAAATGGCATCAATGGCTGATATGTTGAAAAACCTAACTGTTGGAAATGCAACTCAAGCTTCCC
CTAATGTTCAGAATGCAGTTGTGCATCAGAGGGTTGTAGCAGCACCACAGCATATTCCAGGTACTTCTCTTGAAAGTTTGCTGAAGGAATATATGGCTAAGAATGATGCT
GTCATTCAAGGACAACAAGCCGTTAACCAAAGGCAGCAAGCTTCGATGAGAAACTTTGAAGTGCAATTGGGGCAACTTGCTAATGAGATAAGATCTAGGCCTCAAGGTAA
GGTTCCTGCAGATACTGAACTCCCAAAAAGGGAAGGGAAGGAACAAGTGCAAGCTATAGAGTTAAGGAGTGGAAAACCTTTAGCTGTTTCTAAGAGAACAACAAGAAAAA
CCATAAGGGAAGGAATTAGCAAAATTTGGCATAGGCGAAGCTCGACCCATTACTGTCACATTGCAGCTTGCAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGATTTCCCAGGTTTAGAATTTTTTCTAGATCCTGAACCTGAGAGAACATTTCATAGAAGAAGGAGGATTCAAAATGAAAGAATGGCAGATTTCCGTGGAGGTCA
TGTAGATCCTAATAATCCTCAGAATAACCCTGGAAACCAACCACACGGCCAAGAAGGTTTTGGACCACCTTTATTTGATGATGAATCCTTAAGTGAGGCGTGGGAGAGAT
TTAAAGACTTGCTTAGGAAGTGTCCTCATCATGGTCTCCCTCATTGTATTCAAATGGAGATATTTTACAATGGGTTGAATACATCCACTAAACAAGTTGTTGATGCTTCG
GCAAATGGGACTCTTTTATCAAGAACCTATAATGAAGCATTTGAGATACTGGAGAGAATATCCTCAAACAACTGTCAATGGGCTGATTCAAGGAGCTCCCAAGGTAAAAA
GGTCAGAGGCATGTTGGAGTTAGATGCTTTAACCGCTATAAATGCACAAATGGCATCAATGGCTGATATGTTGAAAAACCTAACTGTTGGAAATGCAACTCAAGCTTCCC
CTAATGTTCAGAATGCAGTTGTGCATCAGAGGGTTGTAGCAGCACCACAGCATATTCCAGGTACTTCTCTTGAAAGTTTGCTGAAGGAATATATGGCTAAGAATGATGCT
GTCATTCAAGGACAACAAGCCGTTAACCAAAGGCAGCAAGCTTCGATGAGAAACTTTGAAGTGCAATTGGGGCAACTTGCTAATGAGATAAGATCTAGGCCTCAAGGTAA
GGTTCCTGCAGATACTGAACTCCCAAAAAGGGAAGGGAAGGAACAAGTGCAAGCTATAGAGTTAAGGAGTGGAAAACCTTTAGCTGTTTCTAAGAGAACAACAAGAAAAA
CCATAAGGGAAGGAATTAGCAAAATTTGGCATAGGCGAAGCTCGACCCATTACTGTCACATTGCAGCTTGCAGATAG
Protein sequenceShow/hide protein sequence
MNDFPGLEFFLDPEPERTFHRRRRIQNERMADFRGGHVDPNNPQNNPGNQPHGQEGFGPPLFDDESLSEAWERFKDLLRKCPHHGLPHCIQMEIFYNGLNTSTKQVVDAS
ANGTLLSRTYNEAFEILERISSNNCQWADSRSSQGKKVRGMLELDALTAINAQMASMADMLKNLTVGNATQASPNVQNAVVHQRVVAAPQHIPGTSLESLLKEYMAKNDA
VIQGQQAVNQRQQASMRNFEVQLGQLANEIRSRPQGKVPADTELPKREGKEQVQAIELRSGKPLAVSKRTTRKTIREGISKIWHRRSSTHYCHIAACR