; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028563 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028563
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon Tf2-1 polyprotein isoform X1
Genome locationchr8:25047150..25047983
RNA-Seq ExpressionLag0028563
SyntenyLag0028563
Gene Ontology termsGO:0009987 - cellular process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032309.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]9.2e-4644.37Show/hide
Query:  MTQKKMEERFDANEREMSDMKAGI------EEGMKDVRKVLAAMAV------EIALL--------------RRPEPS---GVTDSSVHKQ--KEKVETSE
        M Q ++EER +  ++E++ +K  +      E  + D+ K +  M        ++ L+              R  EP+    VT+    K+    K   S 
Subjt:  MTQKKMEERFDANEREMSDMKAGI------EEGMKDVRKVLAAMAV------EIALL--------------RRPEPS---GVTDSSVHKQ--KEKVETSE

Query:  HSGSHDHGEDST------SDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKK
         S S D  E  T      +DR+KFKKVEMPVF G++PD+WLFRA RYFQIHKLSD EK+ V+ IS +  ALNWYRS+E+RE F  W +LK+RLL RFR  
Subjt:  HSGSHDHGEDST------SDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKK

Query:  KDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE
        +D G V G+FL VKQ+STV  Y   F++LVA L  + D V+E+TFMNGL P ++AEVR  R  GL +MME AQLVE+RE  ++E
Subjt:  KDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE

KAA0034982.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.4e-4643.28Show/hide
Query:  KKMEERFDANEREMSDMKAGIE-------EGMKDVRKV------------LAAMAVEIALLRRPEPSGVTDSSVHKQKEKVETSEHSGSHD--HGEDSTS
        KK EERF+A E+E+ +++  ++       +  K + K+            L    +E  +++    +  ++ S  K K KV    + G+ +  +GE+  +
Subjt:  KKMEERFDANEREMSDMKAGIE-------EGMKDVRKV------------LAAMAVEIALLRRPEPSGVTDSSVHKQKEKVETSEHSGSHD--HGEDSTS

Query:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV
        DR KFKKVEM VF G++PD WLFRA RYFQIH L+D +K+TVA IS EG ALNWYR++E+R+ FKDW +LK RLL RFR  ++ G +CGQFL ++Q+S+V
Subjt:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV

Query:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGV
        ++Y   F+RL+A +  L D V+E TFM GL P ++AEV   R +GL +MM  AQL+E+RE  ++E G+
Subjt:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGV

KAA0037013.1 peroxidase 64 [Cucumis melo var. makuwa]3.7e-4754.82Show/hide
Query:  EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVK
        E++ +DR+KFKKVEMPVF G++PD+WLFRA RYFQIHKLSD EK+ V+ IS +G ALNWYRS+E+RE F  W +LK RLL RFR  +D G + G+FL VK
Subjt:  EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVK

Query:  QDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGVVITARPKPAYEAKVGWLWAATRNS
        Q+STV  Y   F++LVA L  + D V+E+TFMNGL P ++AEV   R  GL +MME AQLVE+RE  ++E  +   A  K  Y+  V    +A  NS
Subjt:  QDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGVVITARPKPAYEAKVGWLWAATRNS

KAA0060677.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]6.4e-4753.99Show/hide
Query:  RPEPSGVTDSSVHKQKEKVETSE------HSGSHDHG------EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVAL
        R E      S  +K KEK   S        S S D        E++T+DR+KFKKVEMPVF G++PD+WLFRA RYFQIHKLSD EK+ V+ IS +  AL
Subjt:  RPEPSGVTDSSVHKQKEKVETSE------HSGSHDHG------EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVAL

Query:  NWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEK
        NWYRS+E+RE F  W +LK RLL RFR  +D G V G FL VKQ+STV  Y   F++LVA L  + D V+E+TFMNGL P ++AEVR  RL GL +MME 
Subjt:  NWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEK

Query:  AQLVEDRETTKDE
        AQLVE+RE  ++E
Subjt:  AQLVEDRETTKDE

KAA0066460.1 uncharacterized protein E6C27_scaffold21G005610 [Cucumis melo var. makuwa]1.6e-4543.77Show/hide
Query:  MTQKKMEERFDANEREMSDMKAGIEEGMKDVRKVLAAMAVEIALLRRPEPSGVTDSSVHKQ-----KEKVETSEHS-------------GSHDHGEDSTS
        M   ++EER D  ++E+S MK  I + +  +   L  ++  + +L        T+S+ HK      KEK  ++  S             G  D+ E +T 
Subjt:  MTQKKMEERFDANEREMSDMKAGIEEGMKDVRKVLAAMAVEIALLRRPEPSGVTDSSVHKQ-----KEKVETSEHS-------------GSHDHGEDSTS

Query:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV
        DR KFKKVEMPVF G++PD+WLFRA RYFQIHKL++ EK+ V+ IS +G  LNWYRS+E+R+ F  W +LK RLL RFR   D G V G+FL ++Q+S+V
Subjt:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV

Query:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE
        ++Y   F++LVA L  + ++V+E+TFMNGL P ++AEV C R  GL +MM+ AQLVE++E  ++E
Subjt:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE

TrEMBL top hitse value%identityAlignment
A0A5A7SSU9 Transposon Tf2-1 polyprotein isoform X14.5e-4644.37Show/hide
Query:  MTQKKMEERFDANEREMSDMKAGI------EEGMKDVRKVLAAMAV------EIALL--------------RRPEPS---GVTDSSVHKQ--KEKVETSE
        M Q ++EER +  ++E++ +K  +      E  + D+ K +  M        ++ L+              R  EP+    VT+    K+    K   S 
Subjt:  MTQKKMEERFDANEREMSDMKAGI------EEGMKDVRKVLAAMAV------EIALL--------------RRPEPS---GVTDSSVHKQ--KEKVETSE

Query:  HSGSHDHGEDST------SDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKK
         S S D  E  T      +DR+KFKKVEMPVF G++PD+WLFRA RYFQIHKLSD EK+ V+ IS +  ALNWYRS+E+RE F  W +LK+RLL RFR  
Subjt:  HSGSHDHGEDST------SDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKK

Query:  KDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE
        +D G V G+FL VKQ+STV  Y   F++LVA L  + D V+E+TFMNGL P ++AEVR  R  GL +MME AQLVE+RE  ++E
Subjt:  KDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE

A0A5A7V1H3 Transposon Tf2-1 polyprotein isoform X13.1e-4753.99Show/hide
Query:  RPEPSGVTDSSVHKQKEKVETSE------HSGSHDHG------EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVAL
        R E      S  +K KEK   S        S S D        E++T+DR+KFKKVEMPVF G++PD+WLFRA RYFQIHKLSD EK+ V+ IS +  AL
Subjt:  RPEPSGVTDSSVHKQKEKVETSE------HSGSHDHG------EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVAL

Query:  NWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEK
        NWYRS+E+RE F  W +LK RLL RFR  +D G V G FL VKQ+STV  Y   F++LVA L  + D V+E+TFMNGL P ++AEVR  RL GL +MME 
Subjt:  NWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEK

Query:  AQLVEDRETTKDE
        AQLVE+RE  ++E
Subjt:  AQLVEDRETTKDE

A0A5A7VH17 Uncharacterized protein7.6e-4643.77Show/hide
Query:  MTQKKMEERFDANEREMSDMKAGIEEGMKDVRKVLAAMAVEIALLRRPEPSGVTDSSVHKQ-----KEKVETSEHS-------------GSHDHGEDSTS
        M   ++EER D  ++E+S MK  I + +  +   L  ++  + +L        T+S+ HK      KEK  ++  S             G  D+ E +T 
Subjt:  MTQKKMEERFDANEREMSDMKAGIEEGMKDVRKVLAAMAVEIALLRRPEPSGVTDSSVHKQ-----KEKVETSEHS-------------GSHDHGEDSTS

Query:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV
        DR KFKKVEMPVF G++PD+WLFRA RYFQIHKL++ EK+ V+ IS +G  LNWYRS+E+R+ F  W +LK RLL RFR   D G V G+FL ++Q+S+V
Subjt:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV

Query:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE
        ++Y   F++LVA L  + ++V+E+TFMNGL P ++AEV C R  GL +MM+ AQLVE++E  ++E
Subjt:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDE

A0A5D3BT54 Transposon Tf2-1 polyprotein isoform X16.9e-4743.28Show/hide
Query:  KKMEERFDANEREMSDMKAGIE-------EGMKDVRKV------------LAAMAVEIALLRRPEPSGVTDSSVHKQKEKVETSEHSGSHD--HGEDSTS
        KK EERF+A E+E+ +++  ++       +  K + K+            L    +E  +++    +  ++ S  K K KV    + G+ +  +GE+  +
Subjt:  KKMEERFDANEREMSDMKAGIE-------EGMKDVRKV------------LAAMAVEIALLRRPEPSGVTDSSVHKQKEKVETSEHSGSHD--HGEDSTS

Query:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV
        DR KFKKVEM VF G++PD WLFRA RYFQIH L+D +K+TVA IS EG ALNWYR++E+R+ FKDW +LK RLL RFR  ++ G +CGQFL ++Q+S+V
Subjt:  DRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTV

Query:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGV
        ++Y   F+RL+A +  L D V+E TFM GL P ++AEV   R +GL +MM  AQL+E+RE  ++E G+
Subjt:  QKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGV

A0A5D3C468 Peroxidase 641.8e-4754.82Show/hide
Query:  EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVK
        E++ +DR+KFKKVEMPVF G++PD+WLFRA RYFQIHKLSD EK+ V+ IS +G ALNWYRS+E+RE F  W +LK RLL RFR  +D G + G+FL VK
Subjt:  EDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVK

Query:  QDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGVVITARPKPAYEAKVGWLWAATRNS
        Q+STV  Y   F++LVA L  + D V+E+TFMNGL P ++AEV   R  GL +MME AQLVE+RE  ++E  +   A  K  Y+  V    +A  NS
Subjt:  QDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEVRCFRLVGLDDMMEKAQLVEDRETTKDEEGVVITARPKPAYEAKVGWLWAATRNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein4.3e-0928.16Show/hide
Query:  TQKKMEERFDANEREMSDMKAGIEEGMKDVRKVLAAMAVEIALLRRPEPSGVTDSSVHKQKEKVETSEHSGSHDHGEDSTSDRHK---------------
        T++ M+  F   +R  +D K   EE ++ +  VL  +   +A L R E          ++ ++VE  EHS        S+S   +               
Subjt:  TQKKMEERFDANEREMSDMKAGIEEGMKDVRKVLAAMAVEIALLRRPEPSGVTDSSVHKQKEKVETSEHSGSHDHGEDSTSDRHK---------------

Query:  FKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERF
         +++EMPVF G     W  +  R+F++ +  D +KL +  +SLEGVAL W+  +     F+DW   + RLL RF
Subjt:  FKKVEMPVFRGDEPDNWLFRAYRYFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERF

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding2.8e-0827.48Show/hide
Query:  YFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFM
        YF  + + + E+L +   +LEG    W +   K+     W++ K  ++ R  K     +    +  ++Q+ +V++Y E FE L      L  + LE  F+
Subjt:  YFQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFM

Query:  NGLKPVMQAEVRCFRLVGLDDMMEKAQLVED
         GL+P +Q  VR  +  G+  MM+ AQ +E+
Subjt:  NGLKPVMQAEVRCFRLVGLDDMMEKAQLVED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACAAAAGAAGATGGAGGAGCGTTTCGACGCTAACGAGAGAGAAATGTCTGACATGAAGGCGGGAATTGAGGAGGGCATGAAGGATGTTCGTAAGGTGCTTGCTGC
TATGGCGGTGGAAATCGCACTGCTCCGGCGACCCGAGCCCTCAGGGGTCACGGACAGTTCTGTTCACAAACAAAAGGAAAAAGTGGAAACGAGCGAGCATTCCGGCTCTC
ACGATCATGGGGAGGACAGCACCAGTGACAGACACAAGTTTAAGAAGGTCGAGATGCCAGTGTTCAGGGGAGATGAGCCCGACAATTGGTTGTTCCGTGCTTACAGGTAT
TTTCAAATTCATAAACTCTCTGATATTGAAAAGTTAACGGTGGCTGTGATAAGTCTTGAAGGAGTTGCGCTTAATTGGTATCGTTCGAAAGAGAAGCGGGAACCGTTTAA
AGATTGGCGAGATTTGAAACTTCGGTTGCTTGAGAGGTTTCGTAAGAAAAAGGATGGTGGGGATGTGTGTGGCCAATTCTTGGCTGTCAAGCAAGATTCTACTGTGCAGA
AGTATTGTGAGACTTTTGAGCGGTTGGTGGCGTCGTTGCCGCATTTAACGGATGAGGTCCTTGAAAACACTTTTATGAACGGTTTGAAGCCAGTGATGCAAGCAGAAGTT
CGTTGTTTTCGTCTGGTTGGTTTGGATGATATGATGGAAAAAGCTCAGTTAGTTGAAGATCGTGAAACCACTAAGGACGAAGAAGGGGTGGTTATCACGGCCCGACCCAA
ACCAGCGTATGAGGCCAAGGTGGGCTGGCTTTGGGCTGCGACGAGAAACTCCCAAGCCCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGACACAAAAGAAGATGGAGGAGCGTTTCGACGCTAACGAGAGAGAAATGTCTGACATGAAGGCGGGAATTGAGGAGGGCATGAAGGATGTTCGTAAGGTGCTTGCTGC
TATGGCGGTGGAAATCGCACTGCTCCGGCGACCCGAGCCCTCAGGGGTCACGGACAGTTCTGTTCACAAACAAAAGGAAAAAGTGGAAACGAGCGAGCATTCCGGCTCTC
ACGATCATGGGGAGGACAGCACCAGTGACAGACACAAGTTTAAGAAGGTCGAGATGCCAGTGTTCAGGGGAGATGAGCCCGACAATTGGTTGTTCCGTGCTTACAGGTAT
TTTCAAATTCATAAACTCTCTGATATTGAAAAGTTAACGGTGGCTGTGATAAGTCTTGAAGGAGTTGCGCTTAATTGGTATCGTTCGAAAGAGAAGCGGGAACCGTTTAA
AGATTGGCGAGATTTGAAACTTCGGTTGCTTGAGAGGTTTCGTAAGAAAAAGGATGGTGGGGATGTGTGTGGCCAATTCTTGGCTGTCAAGCAAGATTCTACTGTGCAGA
AGTATTGTGAGACTTTTGAGCGGTTGGTGGCGTCGTTGCCGCATTTAACGGATGAGGTCCTTGAAAACACTTTTATGAACGGTTTGAAGCCAGTGATGCAAGCAGAAGTT
CGTTGTTTTCGTCTGGTTGGTTTGGATGATATGATGGAAAAAGCTCAGTTAGTTGAAGATCGTGAAACCACTAAGGACGAAGAAGGGGTGGTTATCACGGCCCGACCCAA
ACCAGCGTATGAGGCCAAGGTGGGCTGGCTTTGGGCTGCGACGAGAAACTCCCAAGCCCACTGA
Protein sequenceShow/hide protein sequence
MTQKKMEERFDANEREMSDMKAGIEEGMKDVRKVLAAMAVEIALLRRPEPSGVTDSSVHKQKEKVETSEHSGSHDHGEDSTSDRHKFKKVEMPVFRGDEPDNWLFRAYRY
FQIHKLSDIEKLTVAVISLEGVALNWYRSKEKREPFKDWRDLKLRLLERFRKKKDGGDVCGQFLAVKQDSTVQKYCETFERLVASLPHLTDEVLENTFMNGLKPVMQAEV
RCFRLVGLDDMMEKAQLVEDRETTKDEEGVVITARPKPAYEAKVGWLWAATRNSQAH