CuGenDBv2

Spg019363 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019363
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Gag/pol protein
Genome location: scaffold1:44444318..44469156
RNA-Seq Expression: Spg019363
Synteny: Spg019363
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.3e-94; % identity: 74.15)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        L VGTG+ ISA+AVGD K+FF   + M L+N+YIV KIKRNL+S+SCL+E  YS++FS+NE FI K G  IC AKLENNLYVLRP E++A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRLG INL+RIGRLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+ARGG+EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYSRYGYLYLM HKSEALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.4e-93; % identity: 73.31)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        L VGTG+ ISA+AVGD K+FF   + M L+N+YIV KIKRNL+S+SCL+E  YS++FS+NE FI K G  IC AKLENNLYVLRP E++A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRLG INL+RIGRLVK+GLL++L+D +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+ARG +EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYSRYGYLYLM HKSEALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.4e-85; % identity: 68.51)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        + VGTG  +SA AVG ++++  +   +LL+NVY+V  +KRNLIS+ CLLEQ YS++F+VN+ FI K G +IC AKLENNLYVLR + S+A+LN EMFKTA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KR K+S  ++ +LWHLRLG INLNRI RLVKNGLLS+LE+ +LP CESCLEGKMTKRPFTGK +RAKEPLELVH++L GPMNV+ARGG+EYFI+F 
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQR
        DDYSRYGY+YLM HKSEALEKFKEYK EVEN L +
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQR

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.0e-92; % identity: 72.46)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        L VGTG+ ISA+AVGD K+FF   + M L+N+YIV KIKRNL+S+SCL+E  YS+SFS+NE FI+K G  IC  KLE+NLYVL+P E +A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRLG INL+RIGRLVKNGLL++LED +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+A GG+EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYS YGYLYL+ HKSEALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.2e-90; % identity: 72.03)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        LMVGTG+ ISA+AVGD+K+FF   + M L+N+YIV KIKRNL+ +SCL+E  YS++FS+NE FI+K G     AKLE+NLYVLRP E++A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRL  INL+RIGRLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+ARGG+EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYSRYGYLYLM HK EALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein (e value: 1.2e-93; % identity: 73.31)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        L VGTG+ ISA+AVGD K+FF   + M L+N+YIV KIKRNL+S+SCL+E  YS++FS+NE FI K G  IC AKLENNLYVLRP E++A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRLG INL+RIGRLVK+GLL++L+D +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+ARG +EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYSRYGYLYLM HKSEALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

A0A5A7TU93 Gag/pol protein (e value: 1.2e-85; % identity: 68.51)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        + VGTG  +SA AVG ++++  +   +LL+NVY+V  +KRNLIS+ CLLEQ YS++F+VN+ FI K G +IC AKLENNLYVLR + S+A+LN EMFKTA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KR K+S  ++ +LWHLRLG INLNRI RLVKNGLLS+LE+ +LP CESCLEGKMTKRPFTGK +RAKEPLELVH++L GPMNV+ARGG+EYFI+F 
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQR
        DDYSRYGY+YLM HKSEALEKFKEYK EVEN L +
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQR

A0A5A7TZD0 Gag/pol protein (e value: 6.1e-95; % identity: 74.15)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        L VGTG+ ISA+AVGD K+FF   + M L+N+YIV KIKRNL+S+SCL+E  YS++FS+NE FI K G  IC AKLENNLYVLRP E++A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRLG INL+RIGRLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+ARGG+EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYSRYGYLYLM HKSEALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

A0A5A7VJG3 Gag/pol protein (e value: 1.6e-90; % identity: 72.03)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        LMVGTG+ ISA+AVGD+K+FF   + M L+N+YIV KIKRNL+ +SCL+E  YS++FS+NE FI+K G     AKLE+NLYVLRP E++A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRL  INL+RIGRLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+ARGG+EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYSRYGYLYLM HK EALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

A0A5D3BNE1 Gag/pol protein (e value: 9.8e-93; % identity: 72.46)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA
        L VGTG+ ISA+AVGD K+FF   + M L+N+YIV KIKRNL+S+SCL+E  YS+SFS+NE FI+K G  IC  KLE+NLYVL+P E +A+LN+EMF+TA
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTA

Query:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI
         TQ KRQ++S   +TYLWHLRLG INL+RIGRLVKNGLL++LED +LPPCESCLEGKMTKRPFTGK YRAKEPLEL+H+DL GPMNV+A GG+EYFISFI
Subjt:  ETQPKRQKVS--QSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFI

Query:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        DDYS YGYLYL+ HKSEALEKFKEYK EVENLL ++
Subjt:  DDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.1e-15; % identity: 26.56)
Query:  ELFWHDSAAVQGCFRIILIKLMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNL
        E  + DS  V    +I + K     GEFI A   G +++    + ++ L++V    +   NL+S+  L E G S+ F  +   I+K G            
Subjt:  ELFWHDSAAVQGCFRIILIKLMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNL

Query:  YVLRPIESRAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGRIN------LNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRA--KEPL
          L  +++  +LNN      +      K   +  LWH R G I+      + R        LL+ LE  +   CE CL GK  + PF   + +   K PL
Subjt:  YVLRPIESRAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGRIN------LNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRA--KEPL

Query:  ELVHTDLYGPMNVRARGGYEYFISFIDDYSRYGYLYLMSHKSEALEKFKEYKIEVE
         +VH+D+ GP+         YF+ F+D ++ Y   YL+ +KS+    F+++  + E
Subjt:  ELVHTDLYGPMNVRARGGYEYFISFIDDYSRYGYLYLMSHKSEALEKFKEYKIEVE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.4e-18; % identity: 27.59)
Query:  VGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTAET
        +G   +     +GDI +  +    ++L +V  V  ++ NLIS   L   GY   F+  +  +TK    I        LY            N      E 
Subjt:  VGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTAET

Query:  QPKRQKVSQSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFIDDYS
           + ++S    LWH R+G ++   +  L K  L+S  +  T+ PC+ CL GK  +  F     R    L+LV++D+ GPM + + GG +YF++FIDD S
Subjt:  QPKRQKVSQSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFIDDYS

Query:  RYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        R  ++Y++  K +  + F+++   VE    R+
Subjt:  RYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein (e value: 2.7e-07; % identity: 30.08)
Query:  KRQKVSQSTY-LWHLRLGRINLNRIGRLVKNGLLSQLEDGTLP-------PCESCLEGKMTK-RPFTGKRYR---AKEPLELVHTDLYGPMNVRARGGYE
        K + V++  Y L H  LG  N   I + +K   ++ L++  +         C  CL GK TK R   G R +   + EP + +HTD++GP++   +    
Subjt:  KRQKVSQSTY-LWHLRLGRINLNRIGRLVKNGLLSQLEDGTLP-------PCESCLEGKMTK-RPFTGKRYR---AKEPLELVHTDLYGPMNVRARGGYE

Query:  YFISFIDDYSRYGYLYLMSHKSE
        YFISF D+ +R+ ++Y +  + E
Subjt:  YFISFIDDYSRYGYLYLMSHKSE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.5e-13; % identity: 27.39)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLE-QGYSV-----SFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNN
        +MV  G  I     G   +  ++ R + L N+  V  I +NLIS+  L    G SV     SF V +      G  +   K ++ LY      S+ +   
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLE-QGYSV-----SFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNN

Query:  EMFKTAETQPKRQKVSQSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDG-TLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEY
         +F +  +     K + S+  WH RLG    + +  ++ N  LS L        C  CL  K  K PF+     +  PLE +++D++    + +   Y Y
Subjt:  EMFKTAETQPKRQKVSQSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDG-TLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEY

Query:  FISFIDDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        ++ F+D ++RY +LY +  KS+  E F  +K  +EN  Q R
Subjt:  FISFIDDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-12; % identity: 26.86)
Query:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLE------QGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNN
        +M+  G  I     G   +  S  R + L+ V  V  I +NLIS+  L        + +  SF V +      G  +   K ++ LY      S+A+   
Subjt:  LMVGTGEFISAKAVGDIKMFFSRERDMLLDNVYIVTKIKRNLISISCLLE------QGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNN

Query:  EMFKTAETQPKRQKVSQSTY-LWHLRLGRINLNRIGRLVKNGLLSQLEDG-TLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYE
         MF +          S++T+  WH RLG  +L  +  ++ N  L  L     L  C  C   K  K PF+     + +PLE +++D++    + +   Y 
Subjt:  EMFKTAETQPKRQKVSQSTY-LWHLRLGRINLNRIGRLVKNGLLSQLEDG-TLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYE

Query:  YFISFIDDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR
        Y++ F+D ++RY +LY +  KS+  + F  +K  VEN  Q R
Subjt:  YFISFIDDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRR

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 1.3e-07; % identity: 34.18)
Query:  QKVSQSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNV
        +     T LWH RL  ++   +  LVK G L   +  +L  CE C+ GK  +  F+  ++  K PL+ VH+DL+G  +V
Subjt:  QKVSQSTYLWHLRLGRINLNRIGRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNV


Sequences
CDS sequence
ATGCACGTTTTGGCTGCGGTTCGACTTCGGTTCAGTGGTTTTCAGGCGGTTCATGGACGATTCGTTGACTGTTTGACTGGTTTTCACTGGTTCGTTGGTTTTACC
AGTGGTTTTGACTGGTTCAAGCATTTTTTGGCACGGTTCGAGCGGTTTTTGACCCGGTTCGAGCTGTTTTGGCACGATTCTGCAGCGGTTCAAGGATGTTTTAGG
ATAATTTTGATCAAATTAATGGTTGGGACTGGGGAGTTCATTTCAGCTAAAGCAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACGCGATATGTTATTAGAT
AATGTATATATAGTTACCAAAATAAAAAGAAATTTGATTTCTATTTCTTGTTTGCTCGAACAAGGATATTCAGTGTCCTTTTCTGTAAATGAAGGCTTCATTACT
AAAAGGGGCGCTGATATCTGTTTTGCAAAATTGGAGAACAATTTATATGTTTTAAGACCAATAGAGTCTAGGGCTATTTTAAACAATGAGATGTTTAAAACAGCT
GAGACTCAACCAAAGAGGCAAAAAGTTTCTCAAAGTACCTATCTTTGGCACTTGAGACTTGGCCGCATAAATCTCAATAGGATTGGGAGATTGGTTAAGAACGGA
CTTCTAAGCCAATTAGAGGATGGTACTTTACCTCCGTGTGAGTCATGTCTCGAAGGTAAAATGACCAAACGACCTTTTACTGGAAAACGTTATCGTGCCAAGGAG
CCCTTAGAGCTTGTACATACGGATCTCTATGGTCCAATGAATGTTAGGGCTCGAGGAGGGTATGAATATTTCATCTCTTTTATAGATGATTATTCAAGGTATGGT
TATTTATACCTAATGAGTCATAAGTCTGAAGCCCTTGAAAAGTTCAAGGAGTATAAGATTGAAGTTGAGAACTTGTTACAGCGTCGCGACGCTCTCGGCTTCCCT
TTCCAGAATCCGCAGTTTCGCGACAGCGTCGGGACGCTGTGCCGAATTTTTTTCCCTATTTATAGATTGCGATTAGCGTCGAGACGCTATGCAAGGCAGCGTCGC
GACGCTACCCATTTCTTGGGCAACAAGACGCGTGCGTTGCAGCGTCGCGACGCTGTGCAATGTAGCGTCGCGACGCTACCCCCATTCCGGGCCTATAAAAAGGCA
CCCTTGGGTGCCTCATGGAGGATCAATCAATTCATTCTTCAATCCATTCTTTCCTTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTCTAGCCTTTTTGAGA
GAATTCTTTAGTGGGAAATATTTTGGGGAATTTTGGGAGCATCTTAGGAAGCTCAAAGGAGTGTTCGGCGGAGCTCTGAGCGCCAAGCGAAGGGAGTTGCCTGAC
ATTGAGGATGAGCATGAAATGCATAAAAAAGATTACTTGGTAGATAATTTTGAGTCTGATCGTGATTACACTGAATCTATTAAGTCTGATCTTGACATTCCTGAA
TGCATGAACCCTGATAATGTTAATACCTTTGATTCGTGTCCTGATGATGTATATAGCATAGAAACTGACCCAGAGGAACTCGAATCTGTGCATAGTATAGAATCT
GACCCTGAAGAGCTTGAATTCTTTAATTCTGATAGTGAATCATGTCTAGTTGAATCAATTTTTAATACTGAGTCTGAGGAATCTGAGGAGCATGTAAATGTTTTT
TCTGATGAATGGTCTGACATGATTGATAGGCCATCTTTAGATCCTAGACCGGTAGATATTATAACGCTTGATGACTCTGTTAACAATTGCTCTGTGAATAGAGAC
TTAGAAAAGAGATCTGATGCTGTTACATATTTTCTGCAATCGTCGAGACGCTATGCAAGGCAGCGTCGCGACGCTACCCATTTCTTGGGCAACAAGACGCGTGCG
TTGCAGCGTCGCGACGCTGTGCAATGTAGCGTCGCGACGCTACCCCCATTCCGGGCCTATAAAAAGGCACCCTTGGGTGCCTCATGGAGGATCAATCAATTCATT
CTTCAATCCATTCTTTCCTTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTCTAGCCTTTTTGAGAGAATTCTTTAGTGGGAAATATTTTGGGGAATTTTGG
GAGCATCTTAGGAAGCTCAAAGGAGTGTTCGGCGGAGCTCTGAGCGCCAAGCGAAGGGAGTTGCCTGACATTGAGGATGAGCATGAAATGCATAAAAAAGATTAC
TTGGTAGATAATTTTGAGTCTGATCGTGATTACACTGAATCTATTAAGTCTGATCTTGACATTCCTGAATGCATGAACCCTGATAATGTTAATACCTTTGATTCG
TGTCCTGATGATGTATATAGCATAGAAACTGACCCAGAGGAACTCGAATCTGTGCATAGTATAGAATCTGACCCTGAAGAGCTTGAATTCTTTAATTCTGATAGT
GAATCATGTCTAGTTGAATCAATTTTTAATACTGAGTCTGAGGAATCTGAGGAGCATGTAAATGTTTTTTCTGATGAATGGTCTGACATGATTGATAGGCCATCT
TTAGATCCTAGACCGGTAGATATTATAACGCTTGATGACTCTGTTAACAATTGCTCTGTGAATAGAGACTTAGAAAAGAGATCTGATGCTGTTACATATTTTCTG
CAATCGTCGAGACGCTATGCAAGGCAGCGTCGCGACGCTACCCATTTCTTGGGCAACAAGACGCGTGCGTTGCAGCGTCGCGACGCTGTGCAATGTAGCGTCGCG
ACGCTACCCCCATTCCGGGCCTATAAAAAGGCACCCTTGGGTGCCTCATGGAGGATCAATCAATTCATTCTTCAATCCATTCTTTCCTTCCTTTGGCTCCTTTGG
AGCCTCTCTCAAGGCTTTCTAGCCTTTTTGAGAGAATTCTTTAGTGGGAAATATTTTGGGGAATTTTGGGAGCATCTTAGGAAGCTCAAAGGAGTGTTCGGCGGA
GCTCTGAGCGCCAAGCGAAGGGAGTTGCCTGACATTGAGGATGAGCATGAAATGCATAAAAAAGATTACTTGGTAGATAATTTTGAGTCTGATCGTGATTACACT
GAATCTATTAAGTCTGATCTTGACATTCCTGAATGCATGAACCCTGATAATGTTAATACCTTTGATTCGTGTCCTGATGATGTATATAGCATAGAAACTGACCCA
GAGGAACTCGAATCTGTGCATAGTATAGAATCTGACCCTGAAGAGCTTGAATTCTTTAATTCTGATAGTGAATCATGTCTAGTTGAATCAATTTTTAATACTGAG
TCTGAGGAATCTGAGGAGCATGTAAATGTTTTTTCTGATGAATGGTCTGACATGATTGATAGGCCATCTTTAGATCCTAGACCGGTAGATATTATAACGCTTGAT
GACTCTGTTAACAATTGCTCTGTGAATAGAGACTTAGAAAAGAGATCTGATGCTGTTACATATTTTCTGCAATGTTATTATGATAATCTCTTTAGTGATGATGGG
CCAGGGGAAAGTTTTTTTTCTCCCCTGTTTGCGCGTCGAGACGCTATGCAAGGCAGCGTCGCGACGCTACCCATTTCTTGGGCAACAAGACGCGTGCGTTGCAGC
GTCGCGACGCTGTGCAATGTAGCGTCGCGACGCTACCCCCATTCCGGGCCTATAAAAAGGCACCCTTGGGTGCCTCATGGAGGATCAATCAATTCATTCTTCAAT
CCATTCTTTCCTTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTCTAG
mRNA sequence
ATGCACGTTTTGGCTGCGGTTCGACTTCGGTTCAGTGGTTTTCAGGCGGTTCATGGACGATTCGTTGACTGTTTGACTGGTTTTCACTGGTTCGTTGGTTTTACC
AGTGGTTTTGACTGGTTCAAGCATTTTTTGGCACGGTTCGAGCGGTTTTTGACCCGGTTCGAGCTGTTTTGGCACGATTCTGCAGCGGTTCAAGGATGTTTTAGG
ATAATTTTGATCAAATTAATGGTTGGGACTGGGGAGTTCATTTCAGCTAAAGCAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACGCGATATGTTATTAGAT
AATGTATATATAGTTACCAAAATAAAAAGAAATTTGATTTCTATTTCTTGTTTGCTCGAACAAGGATATTCAGTGTCCTTTTCTGTAAATGAAGGCTTCATTACT
AAAAGGGGCGCTGATATCTGTTTTGCAAAATTGGAGAACAATTTATATGTTTTAAGACCAATAGAGTCTAGGGCTATTTTAAACAATGAGATGTTTAAAACAGCT
GAGACTCAACCAAAGAGGCAAAAAGTTTCTCAAAGTACCTATCTTTGGCACTTGAGACTTGGCCGCATAAATCTCAATAGGATTGGGAGATTGGTTAAGAACGGA
CTTCTAAGCCAATTAGAGGATGGTACTTTACCTCCGTGTGAGTCATGTCTCGAAGGTAAAATGACCAAACGACCTTTTACTGGAAAACGTTATCGTGCCAAGGAG
CCCTTAGAGCTTGTACATACGGATCTCTATGGTCCAATGAATGTTAGGGCTCGAGGAGGGTATGAATATTTCATCTCTTTTATAGATGATTATTCAAGGTATGGT
TATTTATACCTAATGAGTCATAAGTCTGAAGCCCTTGAAAAGTTCAAGGAGTATAAGATTGAAGTTGAGAACTTGTTACAGCGTCGCGACGCTCTCGGCTTCCCT
TTCCAGAATCCGCAGTTTCGCGACAGCGTCGGGACGCTGTGCCGAATTTTTTTCCCTATTTATAGATTGCGATTAGCGTCGAGACGCTATGCAAGGCAGCGTCGC
GACGCTACCCATTTCTTGGGCAACAAGACGCGTGCGTTGCAGCGTCGCGACGCTGTGCAATGTAGCGTCGCGACGCTACCCCCATTCCGGGCCTATAAAAAGGCA
CCCTTGGGTGCCTCATGGAGGATCAATCAATTCATTCTTCAATCCATTCTTTCCTTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTCTAGCCTTTTTGAGA
GAATTCTTTAGTGGGAAATATTTTGGGGAATTTTGGGAGCATCTTAGGAAGCTCAAAGGAGTGTTCGGCGGAGCTCTGAGCGCCAAGCGAAGGGAGTTGCCTGAC
ATTGAGGATGAGCATGAAATGCATAAAAAAGATTACTTGGTAGATAATTTTGAGTCTGATCGTGATTACACTGAATCTATTAAGTCTGATCTTGACATTCCTGAA
TGCATGAACCCTGATAATGTTAATACCTTTGATTCGTGTCCTGATGATGTATATAGCATAGAAACTGACCCAGAGGAACTCGAATCTGTGCATAGTATAGAATCT
GACCCTGAAGAGCTTGAATTCTTTAATTCTGATAGTGAATCATGTCTAGTTGAATCAATTTTTAATACTGAGTCTGAGGAATCTGAGGAGCATGTAAATGTTTTT
TCTGATGAATGGTCTGACATGATTGATAGGCCATCTTTAGATCCTAGACCGGTAGATATTATAACGCTTGATGACTCTGTTAACAATTGCTCTGTGAATAGAGAC
TTAGAAAAGAGATCTGATGCTGTTACATATTTTCTGCAATCGTCGAGACGCTATGCAAGGCAGCGTCGCGACGCTACCCATTTCTTGGGCAACAAGACGCGTGCG
TTGCAGCGTCGCGACGCTGTGCAATGTAGCGTCGCGACGCTACCCCCATTCCGGGCCTATAAAAAGGCACCCTTGGGTGCCTCATGGAGGATCAATCAATTCATT
CTTCAATCCATTCTTTCCTTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTCTAGCCTTTTTGAGAGAATTCTTTAGTGGGAAATATTTTGGGGAATTTTGG
GAGCATCTTAGGAAGCTCAAAGGAGTGTTCGGCGGAGCTCTGAGCGCCAAGCGAAGGGAGTTGCCTGACATTGAGGATGAGCATGAAATGCATAAAAAAGATTAC
TTGGTAGATAATTTTGAGTCTGATCGTGATTACACTGAATCTATTAAGTCTGATCTTGACATTCCTGAATGCATGAACCCTGATAATGTTAATACCTTTGATTCG
TGTCCTGATGATGTATATAGCATAGAAACTGACCCAGAGGAACTCGAATCTGTGCATAGTATAGAATCTGACCCTGAAGAGCTTGAATTCTTTAATTCTGATAGT
GAATCATGTCTAGTTGAATCAATTTTTAATACTGAGTCTGAGGAATCTGAGGAGCATGTAAATGTTTTTTCTGATGAATGGTCTGACATGATTGATAGGCCATCT
TTAGATCCTAGACCGGTAGATATTATAACGCTTGATGACTCTGTTAACAATTGCTCTGTGAATAGAGACTTAGAAAAGAGATCTGATGCTGTTACATATTTTCTG
CAATCGTCGAGACGCTATGCAAGGCAGCGTCGCGACGCTACCCATTTCTTGGGCAACAAGACGCGTGCGTTGCAGCGTCGCGACGCTGTGCAATGTAGCGTCGCG
ACGCTACCCCCATTCCGGGCCTATAAAAAGGCACCCTTGGGTGCCTCATGGAGGATCAATCAATTCATTCTTCAATCCATTCTTTCCTTCCTTTGGCTCCTTTGG
AGCCTCTCTCAAGGCTTTCTAGCCTTTTTGAGAGAATTCTTTAGTGGGAAATATTTTGGGGAATTTTGGGAGCATCTTAGGAAGCTCAAAGGAGTGTTCGGCGGA
GCTCTGAGCGCCAAGCGAAGGGAGTTGCCTGACATTGAGGATGAGCATGAAATGCATAAAAAAGATTACTTGGTAGATAATTTTGAGTCTGATCGTGATTACACT
GAATCTATTAAGTCTGATCTTGACATTCCTGAATGCATGAACCCTGATAATGTTAATACCTTTGATTCGTGTCCTGATGATGTATATAGCATAGAAACTGACCCA
GAGGAACTCGAATCTGTGCATAGTATAGAATCTGACCCTGAAGAGCTTGAATTCTTTAATTCTGATAGTGAATCATGTCTAGTTGAATCAATTTTTAATACTGAG
TCTGAGGAATCTGAGGAGCATGTAAATGTTTTTTCTGATGAATGGTCTGACATGATTGATAGGCCATCTTTAGATCCTAGACCGGTAGATATTATAACGCTTGAT
GACTCTGTTAACAATTGCTCTGTGAATAGAGACTTAGAAAAGAGATCTGATGCTGTTACATATTTTCTGCAATGTTATTATGATAATCTCTTTAGTGATGATGGG
CCAGGGGAAAGTTTTTTTTCTCCCCTGTTTGCGCGTCGAGACGCTATGCAAGGCAGCGTCGCGACGCTACCCATTTCTTGGGCAACAAGACGCGTGCGTTGCAGC
GTCGCGACGCTGTGCAATGTAGCGTCGCGACGCTACCCCCATTCCGGGCCTATAAAAAGGCACCCTTGGGTGCCTCATGGAGGATCAATCAATTCATTCTTCAAT
CCATTCTTTCCTTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTCTAG
Protein sequence
MHVLAAVRLRFSGFQAVHGRFVDCLTGFHWFVGFTSGFDWFKHFLARFERFLTRFELFWHDSAAVQGCFRIILIKLMVGTGEFISAKAVGDIKMFFSRERDMLLD
NVYIVTKIKRNLISISCLLEQGYSVSFSVNEGFITKRGADICFAKLENNLYVLRPIESRAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGRINLNRIGRLVKNG
LLSQLEDGTLPPCESCLEGKMTKRPFTGKRYRAKEPLELVHTDLYGPMNVRARGGYEYFISFIDDYSRYGYLYLMSHKSEALEKFKEYKIEVENLLQRRDALGFP
FQNPQFRDSVGTLCRIFFPIYRLRLASRRYARQRRDATHFLGNKTRALQRRDAVQCSVATLPPFRAYKKAPLGASWRINQFILQSILSFLWLLWSLSQGFLAFLR
EFFSGKYFGEFWEHLRKLKGVFGGALSAKRRELPDIEDEHEMHKKDYLVDNFESDRDYTESIKSDLDIPECMNPDNVNTFDSCPDDVYSIETDPEELESVHSIES
DPEELEFFNSDSESCLVESIFNTESEESEEHVNVFSDEWSDMIDRPSLDPRPVDIITLDDSVNNCSVNRDLEKRSDAVTYFLQSSRRYARQRRDATHFLGNKTRA
LQRRDAVQCSVATLPPFRAYKKAPLGASWRINQFILQSILSFLWLLWSLSQGFLAFLREFFSGKYFGEFWEHLRKLKGVFGGALSAKRRELPDIEDEHEMHKKDY
LVDNFESDRDYTESIKSDLDIPECMNPDNVNTFDSCPDDVYSIETDPEELESVHSIESDPEELEFFNSDSESCLVESIFNTESEESEEHVNVFSDEWSDMIDRPS
LDPRPVDIITLDDSVNNCSVNRDLEKRSDAVTYFLQSSRRYARQRRDATHFLGNKTRALQRRDAVQCSVATLPPFRAYKKAPLGASWRINQFILQSILSFLWLLW
SLSQGFLAFLREFFSGKYFGEFWEHLRKLKGVFGGALSAKRRELPDIEDEHEMHKKDYLVDNFESDRDYTESIKSDLDIPECMNPDNVNTFDSCPDDVYSIETDP
EELESVHSIESDPEELEFFNSDSESCLVESIFNTESEESEEHVNVFSDEWSDMIDRPSLDPRPVDIITLDDSVNNCSVNRDLEKRSDAVTYFLQCYYDNLFSDDG
PGESFFSPLFARRDAMQGSVATLPISWATRRVRCSVATLCNVASRRYPHSGPIKRHPWVPHGGSINSFFNPFFPSFGSFGASLKAF