CuGenDBv2

Cmc08g0224171 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc08g0224171
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr08:14037347..14038669
RNA-Seq Expression: Cmc08g0224171
Synteny: Cmc08g0224171
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
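
The genome location field above follows a "sequence:start..end" pattern. The following is a minimal sketch of how such a string could be split into its parts and turned into a span length. The function names parse_location and span_length are hypothetical, and the coordinates are assumed (not confirmed by this record) to be 1-based and inclusive.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr08:14037347..14038669")
print(seqid, start, end, span_length(start, end))
# prints: CMiso1.1chr08 14037347 14038669 1323
```

Under that assumption the span is 1323 bp, which corresponds to 441 codons and so appears consistent with a 440-residue protein plus a stop codon encoded without introns.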


Homology
GenBank top hits | e value | % identity
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] | 8.4e-235 | 94.56
Query:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE
        MDSLREMFGQPSIQIKQEANVAHSK RF PS SGSEKIQKRKEGKGK PTIAVE KGKAKV IK KCFHCNVDEHWKTNCPKYLVKKKE EGKYDLLVLE
Subjt:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE

Query:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
        TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
Subjt:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE

Query:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR
        AFI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRTANTQNK+QRISPNNNTYL HLRLGHINLD+IGRLVKN LLNKL+D SLPPCESCLEGKMTKR
Subjt:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR

Query:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG
        PFT  GYRAKEPLELIHSDLCGPMNVKARGGF YFISFIDD SRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQ+YMIEHG
Subjt:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG

Query:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        IQSQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
Subjt:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] | 7.4e-231 | 93.2
Query:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE
        MDSLREMFGQPSIQIKQEANVAHSK RF PS SGSEKIQKRKEGKGK PTIAVE KGKAKV IK KCFHCNVDEHWKTNCPKYLVKK E E KYDLLVLE
Subjt:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE

Query:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
        TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
Subjt:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE

Query:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR
        AFI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRTANTQNK+QRISPNNNTYL HLRLGHINLD+IGRLVK+ LLNKL+D SLPPCESCLEGKMTKR
Subjt:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR

Query:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG
        PFT  GYRAKEPLELIHSDLCGPMNVKARG F YFISFIDD SRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQ+YMIEHG
Subjt:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG

Query:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        IQSQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
Subjt:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

KAA0046415.1 gag/pol protein [Cucumis melo var. makuwa] | 1.9e-162 | 66.96
Query:  DSLREMFGQPSIQIKQEANVAHSKR-----------FAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLV-KKKEN
        +SL ++ GQ     K EANVA S R             PSSSG++K +K+K G+G    +A     K     K  CFHCN + HWK NCPKYL  KKK  
Subjt:  DSLREMFGQPSIQIKQEANVAHSKR-----------FAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLV-KKKEN

Query:  EGKYDLLVLETCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEH
        +GKYDLLVLETCLVEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG VISA AVG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E 
Subjt:  EGKYDLLVLETCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEH

Query:  MYSINFSMNEAFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCE
         YS+ F++N+ FI KNGV ICSAKLE+NLYVLR   +KA+LN EMF+TA TQNK+ +ISP  N +L HLRLGHINL++I RLVKN LL++LE++SLP CE
Subjt:  MYSINFSMNEAFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCE

Query:  SCLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDL
        SCLEGKMTKRPFT  G+RAKEPLEL+HSDLCGPMNVKARGGF YFI+F DD SRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL
Subjt:  SCLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDL

Query:  RFQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        +FQNY++E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMSYA LP+SF
Subjt:  RFQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa] | 7.4e-231 | 92.95
Query:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET
        MDSLREMFGQPSIQIKQE NVAHSKRFA SS GSEKIQKRKEGKGK PTIA+EGKGK KVVIKEK FHCNV+EHWKTNCPKYLVKKKE EGKYDLLVLET
Subjt:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET

Query:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA
        CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEM LKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEA
Subjt:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA

Query:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP
        FISKNGVHICS KLEDNLYVL+PNE KAVLNHEMFRTANTQNK+QRIS NNNTYL HLRLGHINLD+IGRLVKN LLNKLEDDSLPPCESCLEGKMTKRP
Subjt:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP

Query:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI
        FT  GYRAKEPLELIHSDLCGPMNVKA GGF YFISFIDD S YGYLYL+EHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQ+YMIEHGI
Subjt:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI

Query:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        QSQLSAP TPQQNGVSERRNRTLLDMV SMMSY QLPSSF
Subjt:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa] | 2.9e-211 | 88.18
Query:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET
        MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKG+ PTIAVEGKGKAKVVIK KCFHCNVDEHWKTNCPKYLVKKKE E          
Subjt:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET

Query:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA
                      GATNHVCSSLQETSSFKQLEESEMTL VGTGDVISARAVGD KLFF  KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEA
Subjt:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA

Query:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP
        FISKNG     AKLEDNLYVLRPNEAKAVLNHEMFRTANTQNK+QRISPNNNTYL HLRL HINLD+IGRLVKN LLNKL+DDSLPPCESCLEGKMTKRP
Subjt:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP

Query:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI
        FT   YRAKEPLELIHSDLCGPMNVKARGGF YFISFIDD SRYGYLYLMEHK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQ+YMIEHGI
Subjt:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI

Query:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        QSQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
Subjt:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

TrEMBL top hits | e value | % identity
A0A5A7T2V9 Gag/pol protein | 3.6e-231 | 93.2
Query:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE
        MDSLREMFGQPSIQIKQEANVAHSK RF PS SGSEKIQKRKEGKGK PTIAVE KGKAKV IK KCFHCNVDEHWKTNCPKYLVKK E E KYDLLVLE
Subjt:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE

Query:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
        TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
Subjt:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE

Query:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR
        AFI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRTANTQNK+QRISPNNNTYL HLRLGHINLD+IGRLVK+ LLNKL+D SLPPCESCLEGKMTKR
Subjt:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR

Query:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG
        PFT  GYRAKEPLELIHSDLCGPMNVKARG F YFISFIDD SRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQ+YMIEHG
Subjt:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG

Query:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        IQSQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
Subjt:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

A0A5A7TYF5 Gag/pol protein | 9.2e-163 | 66.96
Query:  DSLREMFGQPSIQIKQEANVAHSKR-----------FAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLV-KKKEN
        +SL ++ GQ     K EANVA S R             PSSSG++K +K+K G+G    +A     K     K  CFHCN + HWK NCPKYL  KKK  
Subjt:  DSLREMFGQPSIQIKQEANVAHSKR-----------FAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLV-KKKEN

Query:  EGKYDLLVLETCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEH
        +GKYDLLVLETCLVEND +AWI+DSGATNHVCSS Q  SS++QLE  EMT++VGTG VISA AVG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E 
Subjt:  EGKYDLLVLETCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEH

Query:  MYSINFSMNEAFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCE
         YS+ F++N+ FI KNGV ICSAKLE+NLYVLR   +KA+LN EMF+TA TQNK+ +ISP  N +L HLRLGHINL++I RLVKN LL++LE++SLP CE
Subjt:  MYSINFSMNEAFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCE

Query:  SCLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDL
        SCLEGKMTKRPFT  G+RAKEPLEL+HSDLCGPMNVKARGGF YFI+F DD SRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK  RSDRGGEYMDL
Subjt:  SCLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDL

Query:  RFQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        +FQNY++E GI SQLSAP TPQQNGVSERRNRTLLDMVRSMMSYA LP+SF
Subjt:  RFQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

A0A5A7TZD0 Gag/pol protein | 4.1e-235 | 94.56
Query:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE
        MDSLREMFGQPSIQIKQEANVAHSK RF PS SGSEKIQKRKEGKGK PTIAVE KGKAKV IK KCFHCNVDEHWKTNCPKYLVKKKE EGKYDLLVLE
Subjt:  MDSLREMFGQPSIQIKQEANVAHSK-RFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLE

Query:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
        TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLE+SEMTLKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE
Subjt:  TCLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNE

Query:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR
        AFI KNGVHICSAKLE+NLYVLRPNEAKAVLNHEMFRTANTQNK+QRISPNNNTYL HLRLGHINLD+IGRLVKN LLNKL+D SLPPCESCLEGKMTKR
Subjt:  AFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKR

Query:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG
        PFT  GYRAKEPLELIHSDLCGPMNVKARGGF YFISFIDD SRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQ+YMIEHG
Subjt:  PFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHG

Query:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        IQSQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
Subjt:  IQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

A0A5A7VJG3 Gag/pol protein | 1.4e-211 | 88.18
Query:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET
        MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKG+ PTIAVEGKGKAKVVIK KCFHCNVDEHWKTNCPKYLVKKKE E          
Subjt:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET

Query:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA
                      GATNHVCSSLQETSSFKQLEESEMTL VGTGDVISARAVGD KLFF  KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEA
Subjt:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA

Query:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP
        FISKNG     AKLEDNLYVLRPNEAKAVLNHEMFRTANTQNK+QRISPNNNTYL HLRL HINLD+IGRLVKN LLNKL+DDSLPPCESCLEGKMTKRP
Subjt:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP

Query:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI
        FT   YRAKEPLELIHSDLCGPMNVKARGGF YFISFIDD SRYGYLYLMEHK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQ+YMIEHGI
Subjt:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI

Query:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        QSQLSAP TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
Subjt:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

A0A5D3BNE1 Gag/pol protein | 3.6e-231 | 92.95
Query:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET
        MDSLREMFGQPSIQIKQE NVAHSKRFA SS GSEKIQKRKEGKGK PTIA+EGKGK KVVIKEK FHCNV+EHWKTNCPKYLVKKKE EGKYDLLVLET
Subjt:  MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLET

Query:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA
        CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEM LKVGTGDVISARAVGDAKLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEA
Subjt:  CLVENDQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEA

Query:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP
        FISKNGVHICS KLEDNLYVL+PNE KAVLNHEMFRTANTQNK+QRIS NNNTYL HLRLGHINLD+IGRLVKN LLNKLEDDSLPPCESCLEGKMTKRP
Subjt:  FISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRP

Query:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI
        FT  GYRAKEPLELIHSDLCGPMNVKA GGF YFISFIDD S YGYLYL+EHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQ+YMIEHGI
Subjt:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI

Query:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        QSQLSAP TPQQNGVSERRNRTLLDMV SMMSY QLPSSF
Subjt:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

SwissProt top hits | e value | % identity
P04146 Copia protein | 4.1e-35 | 31.14
Query:  KGKAKVVIKEKCFHCNVDEHWKTNCPKYL----VKKKENEGKYDL-----LVLETCLVEN----DQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLK
        KG +K   K KC HC  + H K +C  Y      K KENE +        +      V N    D   ++LDSGA++H+   + + S +    E    LK
Subjt:  KGKAKVVIKEKCFHCNVDEHWKTNCPKYL----VKKKENEGKYDL-----LVLETCLVEN----DQNAWILDSGATNHVCSSLQETSSFKQLEESEMTLK

Query:  VGT---GDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLEDNLYVLRPNEAKAVLNHEMFRT
        +     G+ I A   G  +L  +++ + LE++    +   NL+SV  L E   SI F  +   ISKNG+ +  ++ + +N+          V+N + + +
Subjt:  VGT---GDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLEDNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLV-KNV-----LLNKLEDDSLPPCESCLEGKMTKRPF--TRNGYRAKEPLELIHSDLCGPMNVKARG
         N ++K       NN  L H R GHI+  ++  +  KN+     LLN LE  S   CE CL GK  + PF   ++    K PL ++HSD+CGP+      
Subjt:  ANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLV-KNV-----LLNKLEDDSLPPCESCLEGKMTKRPF--TRNGYRAKEPLELIHSDLCGPMNVKARG

Query:  GFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRS
           YF+ F+D  + Y   YL+++KS+    F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+
Subjt:  GFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRS

Query:  MMSYAQLPSSF
        M+S A+L  SF
Subjt:  MMSYAQLPSSF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 4.9e-44 | 28.22
Query:  SEKIQKRKEGKGKCPTIAVEG---------------KGKAKVVIKEK---CFHCNVDEHWKTNCPKYLVKKKENEGK------------YDLLVL-----
        +EK++K+ E +G+       G               +GK+K   K +   C++CN   H+K +CP     K E  G+             D +VL     
Subjt:  SEKIQKRKEGKGKCPTIAVEG---------------KGKAKVVIKEK---CFHCNVDEHWKTNCPKYLVKKKENEGK------------YDLLVL-----

Query:  ETCL-VENDQNAWILDSGATNHV-------CSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHM
        E C+ +   ++ W++D+ A++H        C  +       ++  +  +   G GD+     VG          + L+++  VP ++ NL+S   L    
Subjt:  ETCL-VENDQNAWILDSGATNHV-------CSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHM

Query:  YSINFSMNEAFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCES
        Y   F+  +  ++K  + I        LY       +  LN            +  IS +    L H R+GH++   +  L K  L++  +  ++ PC+ 
Subjt:  YSINFSMNEAFISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCES

Query:  CLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLR
        CL GK  +  F  +  R    L+L++SD+CGPM +++ GG  YF++FIDD SR  ++Y+++ K +  + F+++   VE    +K+K LRSD GGEY    
Subjt:  CLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLR

Query:  FQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
        F+ Y   HGI+ + + P TPQ NGV+ER NRT+++ VRSM+  A+LP SF
Subjt:  FQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

Q12491 Transposon Ty2-B Gag-Pol polyprotein | 3.2e-19 | 26.53
Query:  ILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC
        ++DSGA+  +  S            SE+ +       I   A+G+    F+N           P I  +L+S+S L     +  F+ N      +G  + 
Subjt:  ILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC

Query:  SAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTY-LCHLRLGHINLDQIGR-LVKNVLLNKLEDD------SLPPCESCLEGKMTKRPFT
              + Y L  ++   + +H    T N  NK +  S N   Y L H  LGH N   I + L KN +    E D      S   C  CL GK TK    
Subjt:  SAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTY-LCHLRLGHINLDQIGR-LVKNVLLNKLEDD------SLPPCESCLEGKMTKRPFT

Query:  RNGYRAK-----EPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSE--ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYM
        + G R K     EP + +H+D+ GP++   +    YFISF D+ +R+ ++Y +  + E   L  F      ++N  + ++ +++ DRG EY +     + 
Subjt:  RNGYRAK-----EPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSE--ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYM

Query:  IEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPS
           GI +  +     + +GV+ER NRTLL+  R+++  + LP+
Subjt:  IEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 4.7e-31 | 28.24
Query:  NAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAF
        N W+LDSGAT+H+ S     S  +     +  + V  G  I     G   L  +++ + L N+  VP I +NL+SV  L          +  +F + +  
Subjt:  NAWILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAF

Query:  ISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLE-DDSLPPCESCLEGKMTKRP
            GV +   K +D LY     E     +  +   A+  +K    S        H RLGH     +  ++ N  L+ L        C  CL  K  K P
Subjt:  ISKNGVHICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLE-DDSLPPCESCLEGKMTKRP

Query:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI
        F+++   +  PLE I+SD+     + +   + Y++ F+D  +RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L    Y  +HGI
Subjt:  FTRNGYRAKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGI

Query:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
            S P TP+ NG+SER++R +++   +++S+A +P ++
Subjt:  QSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 5.1e-33 | 28.41
Query:  NVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKY-----LVKKKENEGKYDLLVLETCLVEN---DQNAWI
        N   S  + PSSSGS     R + +   P +              +C  C+V  H    CP+         ++++   +        L  N   + N W+
Subjt:  NVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKY-----LVKKKENEGKYDLLVLETCLVEN---DQNAWI

Query:  LDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIS--KNGVH
        LDSGAT+H+ S      SF Q       + +  G  I     G A L   ++ + L  +  VP I +NL+SV  L   +  S+ F      +     GV 
Subjt:  LDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIS--KNGVH

Query:  ICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLE-DDSLPPCESCLEGKMTKRPFTRNGYR
        +   K +D LY      ++AV    MF  A+  +K    S        H RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+ +   
Subjt:  ICSAKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLE-DDSLPPCESCLEGKMTKRPFTRNGYR

Query:  AKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGIQSQLSAP
        + +PLE I+SD+     + +   + Y++ F+D  +RY +LY ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR  +Y+ +HGI    S P
Subjt:  AKEPLELIHSDLCGPMNVKARGGFIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGIQSQLSAP

Query:  CTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
         TP+ NG+SER++R +++M  +++S+A +P ++
Subjt:  CTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF

Arabidopsis top hits | e value | % identity
ATMG00300.1 Gag-Pol-related retrotransposon family protein | 1.4e-06 | 36
Query:  NNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNV
        + T L H RL H++   +  LVK   L+  +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNV


Sequences
CDS sequence
ATGGACTCTCTTAGAGAGATGTTTGGGCAACCGTCCATTCAGATCAAACAAGAGGCAAATGTTGCTCATTCTAAGAGGTTTGCACCTTCATCTTCTGGATCTGAGAAAAT
TCAGAAGAGAAAAGAAGGGAAGGGGAAATGTCCTACTATTGCTGTTGAAGGCAAAGGGAAGGCTAAGGTAGTCATCAAGGAAAAATGTTTCCACTGCAATGTTGATGAGC
ATTGGAAAACAAATTGCCCTAAGTACCTTGTTAAGAAGAAAGAAAATGAAGGTAAATATGATTTACTTGTCTTGGAAACATGTTTAGTGGAAAATGACCAAAATGCCTGG
ATACTTGATTCAGGGGCCACTAACCATGTTTGTTCTTCTTTACAAGAAACTAGTTCCTTCAAGCAACTTGAGGAGAGTGAGATGACACTCAAGGTTGGAACGGGAGATGT
CATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTATTTTTCGAAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTT
CTTGTCTTATTGAACATATGTACTCGATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAGACAACTTGTATGTA
TTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAAGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGCCA
TTTAAGATTAGGTCACATAAATCTTGATCAGATCGGGAGATTGGTAAAGAATGTTCTTCTAAACAAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAAG
GTAAAATGACAAAGAGACCTTTTACTAGAAATGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGC
TTTATATACTTCATCTCTTTTATAGACGATTGTTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGT
TGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAGAACTATATGATAGAACATGGAATCCAATCCCAAC
TCTCAGCACCTTGTACGCCTCAACAAAATGGTGTATCGGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTT
TAG
mRNA sequence
ATGGACTCTCTTAGAGAGATGTTTGGGCAACCGTCCATTCAGATCAAACAAGAGGCAAATGTTGCTCATTCTAAGAGGTTTGCACCTTCATCTTCTGGATCTGAGAAAAT
TCAGAAGAGAAAAGAAGGGAAGGGGAAATGTCCTACTATTGCTGTTGAAGGCAAAGGGAAGGCTAAGGTAGTCATCAAGGAAAAATGTTTCCACTGCAATGTTGATGAGC
ATTGGAAAACAAATTGCCCTAAGTACCTTGTTAAGAAGAAAGAAAATGAAGGTAAATATGATTTACTTGTCTTGGAAACATGTTTAGTGGAAAATGACCAAAATGCCTGG
ATACTTGATTCAGGGGCCACTAACCATGTTTGTTCTTCTTTACAAGAAACTAGTTCCTTCAAGCAACTTGAGGAGAGTGAGATGACACTCAAGGTTGGAACGGGAGATGT
CATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTATTTTTCGAAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTT
CTTGTCTTATTGAACATATGTACTCGATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAGACAACTTGTATGTA
TTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAAGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGCCA
TTTAAGATTAGGTCACATAAATCTTGATCAGATCGGGAGATTGGTAAAGAATGTTCTTCTAAACAAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAAG
GTAAAATGACAAAGAGACCTTTTACTAGAAATGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGC
TTTATATACTTCATCTCTTTTATAGACGATTGTTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGACTGAAGT
TGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAGAACTATATGATAGAACATGGAATCCAATCCCAAC
TCTCAGCACCTTGTACGCCTCAACAAAATGGTGTATCGGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTT
TAG
Protein sequence
MDSLREMFGQPSIQIKQEANVAHSKRFAPSSSGSEKIQKRKEGKGKCPTIAVEGKGKAKVVIKEKCFHCNVDEHWKTNCPKYLVKKKENEGKYDLLVLETCLVENDQNAW
ILDSGATNHVCSSLQETSSFKQLEESEMTLKVGTGDVISARAVGDAKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLEDNLYV
LRPNEAKAVLNHEMFRTANTQNKKQRISPNNNTYLCHLRLGHINLDQIGRLVKNVLLNKLEDDSLPPCESCLEGKMTKRPFTRNGYRAKEPLELIHSDLCGPMNVKARGG
FIYFISFIDDCSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQNYMIEHGIQSQLSAPCTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSF
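
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The sketch below is illustrative only: it assumes the standard nuclear genetic code applies, the translate helper is a hypothetical name, and only the first 33 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed above.
cds_prefix = "ATGGACTCTCTTAGAGAGATGTTTGGGCAACCG"
print(translate(cds_prefix))
# prints: MDSLREMFGQP
```

The output matches the first eleven residues of the protein sequence. Passing the full CDS, with its lines joined, should reproduce the whole protein up to the terminal TAG stop codon.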