CuGenDBv2

MELO.jh101032.1 (gene) of Melon (Harukei-3) v1.41 genome

Gene ID: MELO.jh101032.1
Organism: Cucumis melo var. reticulatus cv. Harukei-3 (Melon (Harukei-3) v1.41)
Description: Gag/pol protein
Genome location: chr01:21456742..21475930
RNA-Seq Expression: MELO.jh101032.1
Synteny: MELO.jh101032.1
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
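
The genome location in the record above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string and computes the length of the span. The function names are illustrative, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split 'chr01:21456742..21475930' into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr01:21456742..21475930")
print(chrom, span_length(start, end))
```

The span is the genomic extent of the gene model, which can include introns, so it need not match the CDS length.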


Homology
GenBank top hits (hit; E-value; % identity; alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]; E-value 3.32e-226; identity 68.57%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL E QIIIPDDG+EDPLTYKQ MN VD DQWIKAM+LEMESMY N VWTLVD PS+V+PIGCKWIYKRKRDQAGKVQTFKARLV KGYTQ+EG+
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFY+YEIWQMDVKT FLNGNLEESIYMVQPEGFI + QEQKVCKLQ SIYGLKQASRSWNIRFDT IKSYGFEQNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------
        PCVYK+I+NS VAFL+LY DDILLIGNDV                                                                       
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------

Query:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR
                                                  V IV+RYQSN GRDHWTAVKNILKYLRRT++YMLVYG+KDLILTGYTDSDFQ+DKDAR
Subjt:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR

Query:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV
        KSTSGS+FTLNGGAV+WRS+KQ+CIADSTMEAEYVA CEAAKEAVWL+KF TDLEVVPNMHLPITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV

Query:  HRGDVIVTK
        HRGDV+VT+
Subjt:  HRGDVIVTK
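
In the alignment blocks, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Because the Subjt lines in this record repeat the query, the sketch below reads the counts from the middle line only. It is a rough illustration under that assumed convention, with a hypothetical function name, and is not how the database computes its reported % identity.

```python
def midline_counts(query: str, midline: str) -> dict:
    """Count identical and conservative columns from a BLAST-style midline."""
    # Trailing spaces of the midline may have been trimmed, so pad to query length.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {"columns": len(query), "identical": identical, "conservative": conservative}


# Last block of the alignment above.
counts = midline_counts("HRGDVIVTK", "HRGDV+VT+")
percent_identity = round(100 * counts["identical"] / counts["columns"], 2)
print(counts, percent_identity)
```

This figure covers only the final nine-column block; the % identity reported for a hit is computed over the whole alignment.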

KAA0025159.1 gag/pol protein [Cucumis melo var. makuwa]; E-value 2.33e-229; identity 71.73%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL+E Q +IPDDG+EDPL+YKQ MN VD DQW+KAMDLEMESMY N VW LVD P  V+PIGCKWIYKRKRD AGKVQTFKARLV KGYTQRE +
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFYDYEIWQMDVKT FLNGNLEESI+M +PEGFI +GQEQKVCKL  SIYGLKQAS+SWNIRFD  IKSYGF+QNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-------------------------------------------------------VEIVNRYQSNLGRDHW
        PC+YK+I    VAFLVLY DDIL IGND+                                                       V IV+RYQSN G DHW
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-------------------------------------------------------VEIVNRYQSNLGRDHW

Query:  TAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVP
        TAVK ILKYLRRT+DYMLVYG+KDLILTGYTDSDFQTDKD+RKST GS+FTLNGGAV+WRSIKQ CIADSTMEAEYVA CEA KEAVWL+KF  DLEVVP
Subjt:  TAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVP

Query:  NMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR
        NM+LPITLY DNS AVANS+EPRSHKRGKHIERKYHLIREIV RGD IVTKI+S+ N+ADPFTK LTAK   G L +L +R
Subjt:  NMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]; E-value 8.87e-228; identity 66.05%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL+E Q++IPDDG+EDPL+YKQ MN VD DQW+KAMDLEMESMY N VW LVD P  V+PIGCKWIYKRKRD AGKVQTFKARLV KGYTQREG+
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFYDYEIWQMDVKT FLNGNLEESI+M QPEGFI +GQEQKVCKL  SIYGLKQASRSWNIRFDT IKSYGF+QNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------
        PCVYK+I    VAFLVLY DDILLIGNDV                                                                       
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------

Query:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR
                                                  V IV+RYQSN G DHWTAVK +LKYLRRT+DYMLVYG+KDLILTGYTDSDFQTDKD+R
Subjt:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR

Query:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV
        KSTSGS+FTLNGGAV+WRSIKQ CIADSTMEAEYVA CEAAKEAVWL+KF  DLEVVPNM+LPITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV

Query:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR
         RGDVIVTKI+S+ N+ADPFTK LTAK   G L +L +R
Subjt:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]; E-value 9.24e-226; identity 66.05%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL+E Q++IPDDG+EDPL+YKQ MN VD DQW+KAMDLEMESMY N VW LVD P  V+PIGCKWIYKRKRD AGKVQTFKARLV KGYTQREG+
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFYDYEIWQMDVKT FLNGNLEESI+M QPEGFI +GQEQKVCKL  SIYGLKQASRSWNIRFDT IKSYGF+QNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------
        PCVYK+I    VAFLVLY DDILLIGNDV                                                                       
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------

Query:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR
                                                  V IV+RYQSN G DHWTAVK +LKYLRRT+DYMLVYG+KDLILTGYTDSDFQTDKD+R
Subjt:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR

Query:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV
        KSTSGS+FTLNGGAV+WRSIKQ CIADSTMEAEYVA CEAAKEAVWL+KF  DLEVVPNM+LPITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV

Query:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR
         RGDVIVTKI+S+ N+ADPFTK LTAK   G L +L +R
Subjt:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]; E-value 1.91e-226; identity 68.6%
Query:  DRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGID
        DRYLGLSEAQIIIPDDGIEDPLTYK  MN VD DQWIKAMDLEMESMYSN VWTLVDQP++V+PIGCKWIYKRKRDQAGKVQTFKARLV KGYTQ+EGID
Subjt:  DRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGID

Query:  YEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEP
        YEE FS  AMIKSIRILLSI TFYDYEIWQMDVKTTFLN NLEESIYMVQPE FIQKGQEQK+CKLQ SIYGLKQASRS NIRFDT IKSYG EQNVDEP
Subjt:  YEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEP

Query:  CVYKRIINSTVAFLVLYGDDILLIGNDV------------------------------------------------------------------------
        CVYKRI+NSTVAFLVLY DDILLIGNDV                                                                        
Subjt:  CVYKRIINSTVAFLVLYGDDILLIGNDV------------------------------------------------------------------------

Query:  -----------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARK
                                                 V IV+R QS  GRDHWT VKNILKYLRRTKDYMLVYGSKDLILTGYTD  FQTDKDARK
Subjt:  -----------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARK

Query:  STSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVH
        STSG +FT+NGGAV+WRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKF TDLEVVPNMHLP TLYCDNSGAV NSREPRSHKRGKHIERK  LIR+IVH
Subjt:  STSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVH

Query:  RGDVIVTKISSKQNVA
        +G V VTKIS +Q ++
Subjt:  RGDVIVTKISSKQNVA

TrEMBL top hits (hit; E-value; % identity; alignment)
A0A5A7SIN2 Gag/pol protein; E-value 1.13e-229; identity 71.73%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL+E Q +IPDDG+EDPL+YKQ MN VD DQW+KAMDLEMESMY N VW LVD P  V+PIGCKWIYKRKRD AGKVQTFKARLV KGYTQRE +
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFYDYEIWQMDVKT FLNGNLEESI+M +PEGFI +GQEQKVCKL  SIYGLKQAS+SWNIRFD  IKSYGF+QNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-------------------------------------------------------VEIVNRYQSNLGRDHW
        PC+YK+I    VAFLVLY DDIL IGND+                                                       V IV+RYQSN G DHW
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-------------------------------------------------------VEIVNRYQSNLGRDHW

Query:  TAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVP
        TAVK ILKYLRRT+DYMLVYG+KDLILTGYTDSDFQTDKD+RKST GS+FTLNGGAV+WRSIKQ CIADSTMEAEYVA CEA KEAVWL+KF  DLEVVP
Subjt:  TAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVP

Query:  NMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR
        NM+LPITLY DNS AVANS+EPRSHKRGKHIERKYHLIREIV RGD IVTKI+S+ N+ADPFTK LTAK   G L +L +R
Subjt:  NMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR

A0A5A7TZD0 Gag/pol protein; E-value 4.30e-228; identity 66.05%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL+E Q++IPDDG+EDPL+YKQ MN VD DQW+KAMDLEMESMY N VW LVD P  V+PIGCKWIYKRKRD AGKVQTFKARLV KGYTQREG+
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFYDYEIWQMDVKT FLNGNLEESI+M QPEGFI +GQEQKVCKL  SIYGLKQASRSWNIRFDT IKSYGF+QNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------
        PCVYK+I    VAFLVLY DDILLIGNDV                                                                       
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------

Query:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR
                                                  V IV+RYQSN G DHWTAVK +LKYLRRT+DYMLVYG+KDLILTGYTDSDFQTDKD+R
Subjt:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR

Query:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV
        KSTSGS+FTLNGGAV+WRSIKQ CIADSTMEAEYVA CEAAKEAVWL+KF  DLEVVPNM+LPITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV

Query:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR
         RGDVIVTKI+S+ N+ADPFTK LTAK   G L +L +R
Subjt:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR

A0A5A7UYE8 Gag/pol protein; E-value 4.47e-226; identity 66.05%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL+E Q++IPDDG+EDPL+YKQ MN VD DQW+KAMDLEMESMY N VW LVD P  V+PIGCKWIYKRKRD AGKVQTFKARLV KGYTQREG+
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFYDYEIWQMDVKT FLNGNLEESI+M QPEGFI +GQEQKVCKL  SIYGLKQASRSWNIRFDT IKSYGF+QNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------
        PCVYK+I    VAFLVLY DDILLIGNDV                                                                       
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------

Query:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR
                                                  V IV+RYQSN G DHWTAVK +LKYLRRT+DYMLVYG+KDLILTGYTDSDFQTDKD+R
Subjt:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR

Query:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV
        KSTSGS+FTLNGGAV+WRSIKQ CIADSTMEAEYVA CEAAKEAVWL+KF  DLEVVPNM+LPITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV

Query:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR
         RGDVIVTKI+S+ N+ADPFTK LTAK   G L +L +R
Subjt:  HRGDVIVTKISSKQNVADPFTKALTAK---GVLLTLSIR

A0A5D3BX45 Gag/pol protein; E-value 9.24e-227; identity 68.6%
Query:  DRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGID
        DRYLGLSEAQIIIPDDGIEDPLTYK  MN VD DQWIKAMDLEMESMYSN VWTLVDQP++V+PIGCKWIYKRKRDQAGKVQTFKARLV KGYTQ+EGID
Subjt:  DRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGID

Query:  YEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEP
        YEE FS  AMIKSIRILLSI TFYDYEIWQMDVKTTFLN NLEESIYMVQPE FIQKGQEQK+CKLQ SIYGLKQASRS NIRFDT IKSYG EQNVDEP
Subjt:  YEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEP

Query:  CVYKRIINSTVAFLVLYGDDILLIGNDV------------------------------------------------------------------------
        CVYKRI+NSTVAFLVLY DDILLIGNDV                                                                        
Subjt:  CVYKRIINSTVAFLVLYGDDILLIGNDV------------------------------------------------------------------------

Query:  -----------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARK
                                                 V IV+R QS  GRDHWT VKNILKYLRRTKDYMLVYGSKDLILTGYTD  FQTDKDARK
Subjt:  -----------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARK

Query:  STSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVH
        STSG +FT+NGGAV+WRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKF TDLEVVPNMHLP TLYCDNSGAV NSREPRSHKRGKHIERK  LIR+IVH
Subjt:  STSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVH

Query:  RGDVIVTKISSKQNVA
        +G V VTKIS +Q ++
Subjt:  RGDVIVTKISSKQNVA

E2GK51 Gag/pol protein (Fragment); E-value 1.61e-226; identity 68.57%
Query:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI
        P+RYLGL E QIIIPDDG+EDPLTYKQ MN VD DQWIKAM+LEMESMY N VWTLVD PS+V+PIGCKWIYKRKRDQAGKVQTFKARLV KGYTQ+EG+
Subjt:  PDRYLGLSEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGI

Query:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE
        DYEETFSPVAM+KSIRILLSI TFY+YEIWQMDVKT FLNGNLEESIYMVQPEGFI + QEQKVCKLQ SIYGLKQASRSWNIRFDT IKSYGFEQNVDE
Subjt:  DYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDE

Query:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------
        PCVYK+I+NS VAFL+LY DDILLIGNDV                                                                       
Subjt:  PCVYKRIINSTVAFLVLYGDDILLIGNDV-----------------------------------------------------------------------

Query:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR
                                                  V IV+RYQSN GRDHWTAVKNILKYLRRT++YMLVYG+KDLILTGYTDSDFQ+DKDAR
Subjt:  ------------------------------------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDAR

Query:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV
        KSTSGS+FTLNGGAV+WRS+KQ+CIADSTMEAEYVA CEAAKEAVWL+KF TDLEVVPNMHLPITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  KSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIV

Query:  HRGDVIVTK
        HRGDV+VT+
Subjt:  HRGDVIVTK

SwissProt top hits (hit; E-value; % identity; alignment)
P04146 Copia protein; E-value 1.8e-58; identity 29.76%
Query:  PLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILLSI
        P ++ +     D   W +A++ E+ +   N  WT+  +P     +  +W++  K ++ G    +KARLV +G+TQ+  IDYEETF+PVA I S R +LS+
Subjt:  PLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILLSI

Query:  VTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVY---KRIINSTVAFLVLY
        V  Y+ ++ QMDVKT FLNG L+E IYM  P+G         VCKL  +IYGLKQA+R W   F+  +K   F  +  + C+Y   K  IN  + +++LY
Subjt:  VTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVY---KRIINSTVAFLVLY

Query:  GDDILLIGNDV-----------------------------------------------------------------------------------------
         DD+++   D+                                                                                         
Subjt:  GDDILLIGNDV-----------------------------------------------------------------------------------------

Query:  ---------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLI----LTGYTDSDFQTDKDARKSTSGSIFTL-NGGAVIWRSIKQ
                       V I++RY S    + W  +K +L+YL+ T D  L++  K+L     + GY DSD+   +  RKST+G +F + +   + W + +Q
Subjt:  ---------------VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLI----LTGYTDSDFQTDKDARKSTSGSIFTL-NGGAVIWRSIKQ

Query:  SCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTK
        + +A S+ EAEY+A  EA +EA+WLK   T + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE V    + +  I ++  +AD FTK
Subjt:  SCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTK

Query:  ALTA
         L A
Subjt:  ALTA

P0CV72 Secreted RxLR effector protein 161; E-value 3.3e-17; identity 44.34%
Query:  VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLI-LTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAA
        V +++++ S+    HW A+K +L+YL+ T+ Y L +       L GY+D+D+  D ++R+STSG +F LNGG V WRS KQ  +A S+ E EY+A  EA 
Subjt:  VEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLI-LTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAA

Query:  KEAVWL
        +EAVWL
Subjt:  KEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; E-value 2.1e-83; identity 35.52%
Query:  SEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFS
        S   ++I DD   +P + K+ ++  + +Q +KAM  EMES+  N  + LV+ P   RP+ CKW++K K+D   K+  +KARLVVKG+ Q++GID++E FS
Subjt:  SEAQIIIPDDGIEDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFS

Query:  PVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVY-KR
        PV  + SIR +LS+    D E+ Q+DVKT FL+G+LEE IYM QPEGF   G++  VCKL  S+YGLKQA R W ++FD+ +KS  + +   +PCVY KR
Subjt:  PVAMIKSIRILLSIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVY-KR

Query:  IINSTVAFLVLYGDDILLIGND------------------------------------------------------------------------------
           +    L+LY DD+L++G D                                                                              
Subjt:  IINSTVAFLVLYGDDILLIGND------------------------------------------------------------------------------

Query:  -----------------------------------VVEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTSGS
                                            V +V+R+  N G++HW AVK IL+YLR T    L +G  D IL GYTD+D   D D RKS++G 
Subjt:  -----------------------------------VVEIVNRYQSNLGRDHWTAVKNILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTSGS

Query:  IFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVI
        +FT +GGA+ W+S  Q C+A ST EAEY+A  E  KE +WLK+F  +L +    ++   +YCD+  A+  S+    H R KHI+ +YH IRE+V    + 
Subjt:  IFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVI

Query:  VTKISSKQNVADPFTKAL
        V KIS+ +N AD  TK +
Subjt:  VTKISSKQNVADPFTKAL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; E-value 1.8e-55; identity 29.55%
Query:  DQWIKAMDLEMESMYSNYVWTLV-DQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILLSIVTFYDYEIWQMD
        ++W  AM  E+ +   N+ W LV   PS V  +GC+WI+ +K +  G +  +KARLV KGY QR G+DY ETFSPV    SIRI+L +     + I Q+D
Subjt:  DQWIKAMDLEMESMYSNYVWTLV-DQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILLSIVTFYDYEIWQMD

Query:  VKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVYKRIINSTVAFLVLYGDDILLIGND-----
        V   FL G L + +YM QP GFI K +   VCKL+ ++YGLKQA R+W +     + + GF  +V +  ++      ++ ++++Y DDIL+ GND     
Subjt:  VKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVYKRIINSTVAFLVLYGDDILLIGND-----

Query:  ----------------------------------------VVEIVNR--------------------------------YQSNLG---------------
                                                +++++ R                                Y+  +G               
Subjt:  ----------------------------------------VVEIVNR--------------------------------YQSNLG---------------

Query:  ------------RDHWTAVKNILKYLRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAA
                     +H  A+K IL+YL  T ++ + +     L L  Y+D+D+  DKD   ST+G I  L    + W S KQ  +  S+ EAEY +    +
Subjt:  ------------RDHWTAVKNILKYLRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAA

Query:  KEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALT
         E  W+    T+L +   +  P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +S+   +AD  TK L+
Subjt:  KEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; E-value 5.1e-58; identity 29.92%
Query:  DPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLV-DQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILL
        +P T  Q M     D+W +AM  E+ +   N+ W LV   P  V  +GC+WI+ +K +  G +  +KARLV KGY QR G+DY ETFSPV    SIRI+L
Subjt:  DPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLV-DQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILL

Query:  SIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVYKRIINSTVAFLVLYG
         +     + I Q+DV   FL G L + +YM QP GF+ K +   VC+L+ +IYGLKQA R+W +   T + + GF  ++ +  ++      ++ ++++Y 
Subjt:  SIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVYKRIINSTVAFLVLYG

Query:  DDILLIGNDVVEI---------------------------------------------------------------------------------------
        DDIL+ GND V +                                                                                       
Subjt:  DDILLIGNDVVEI---------------------------------------------------------------------------------------

Query:  --------------VNR---YQSNLGRDHWTAVKNILKYLRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIAD
                      VNR   Y      DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G I  L    + W S KQ  +  
Subjt:  --------------VNR---YQSNLGRDHWTAVKNILKYLRRTKDY-MLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIAD

Query:  STMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALT
        S+ EAEY +    + E  W+    T+L +   +  P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +S+   +AD  TK L+
Subjt:  STMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALT

Arabidopsis top hits (hit; E-value; % identity; alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; E-value 2.9e-56; identity 29.05%
Query:  EDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILL
        ++P TY +    +    W  AMD E+ +M + + W +   P   +PIGCKW+YK K +  G ++ +KARLV KGYTQ+EGID+ ETFSPV  + S++++L
Subjt:  EDPLTYKQTMNGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILL

Query:  SIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQE----QKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVYKRIINSTVAFL
        +I   Y++ + Q+D+   FLNG+L+E IYM  P G+  +  +      VC L+ SIYGLKQASR W ++F  T+  +GF Q+  +   + +I  +    +
Subjt:  SIVTFYDYEIWQMDVKTTFLNGNLEESIYMVQPEGFIQKGQE----QKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVYKRIINSTVAFL

Query:  VLYGDDILLIGNDVVEI-----------------------------------------------------------------------------VNRYQS
        ++Y DDI++  N+   +                                                                                Y+ 
Subjt:  VLYGDDILLIGNDVVEI-----------------------------------------------------------------------------VNRYQS

Query:  NLGR---------------------------DHWTAVKNILKYLRRTKDYMLVYGSK-DLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQS
         +GR                            H  AV  IL Y++ T    L Y S+ ++ L  ++D+ FQ+ KD R+ST+G    L    + W+S KQ 
Subjt:  NLGR---------------------------DHWTAVKNILKYLRRTKDYMLVYGSK-DLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQS

Query:  CIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIRE
         ++ S+ EAEY A   A  E +WL +F  +L++   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  CIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIRE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); E-value 2.1e-14; identity 41.18%
Query:  WIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILLSI
        W +AM  E++++  N  W LV  P     +GCKW++K K    G +   KARLV KG+ Q EGI + ET+SPV    +IR +L++
Subjt:  WIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILLSI


Sequences
CDS sequence
ATGAAATATTTAAAACTTCTTTTAGTGTTTCATTTCATGAATAGCTCGATAGTTCAATTGTTAGCTTCCGAAAAACTTAACGACGATAACTATGCGGCATGGAAATCAAA
TCTTAACACAATACTAGTTGCTGATGATTTACAGTTTGTCTTAACTGAGGAATGTCCTCAAACCTCAGCCTCAAATGCAAACCGAGCTAGTCGAAAAGCATATGATCGAT
GGATAAAAGCAAACAAGAAGCCCCCTGATCGCTATTTGGGTTTAAGTGAAGCTCAAATCATCATACCAGATGATGGCATAGAGGATCCATTGACCTATAAACAGACTATG
AATGGTGTGGATTGTGACCAATGGATCAAAGCCATGGACCTTGAAATGGAATCTATGTATTCCAATTATGTCTGGACTTTAGTAGATCAACCAAGTGAGGTAAGACCTAT
TGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGGTAAAAGGTTACACACAAAGGGAGGGAATAGATTATG
AAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATACGAATACTCTTGTCCATCGTCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTTTG
AATGGTAATCTTGAAGAGAGTATTTATATGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAATATCCATTTATGGATTGAAACA
AGCTTCTAGATCCTGGAATATAAGGTTTGATACTACGATCAAATCTTATGGTTTTGAACAGAATGTTGATGAACCTTGTGTTTACAAAAGGATCATCAATTCTACTGTAG
CATTCTTAGTTCTGTATGGAGATGACATTCTACTCATTGGGAATGACGTAGTGGAGATTGTTAATCGATATCAGTCCAATCTTGGACGTGATCATTGGACCGCCGTTAAA
AATATTCTAAAATATCTTAGAAGAACAAAAGACTATATGCTTGTGTATGGTTCTAAGGATCTGATCCTTACTGGATACACTGACTCCGATTTTCAAACTGATAAAGATGC
TAGAAAGTCTACATCAGGATCAATTTTCACTCTGAACGGAGGAGCAGTAATATGGAGAAGCATAAAACAATCTTGTATTGCCGACTCCACTATGGAAGCTGAATATGTAG
CTACCTGTGAAGCAGCGAAAGAAGCAGTATGGCTTAAAAAGTTCTCAACAGATTTGGAAGTCGTTCCAAATATGCATCTACCTATCACCTTATATTGTGACAACAGTGGT
GCAGTTGCAAACTCACGAGAACCTAGAAGTCATAAACGAGGAAAGCATATTGAACGAAAGTACCATCTGATCAGAGAAATCGTACATCGAGGAGACGTTATAGTAACAAA
AATCTCCTCTAAGCAAAATGTGGCTGATCCATTTACAAAAGCTCTCACGGCTAAAGGTGTTCTGTTAACTTTGTCTATCAGAGTCGTACCTGCATGGGTGTCCTTCGGGA
TCACCACCTATTTAGGACTGAGTGGTCCGACGGGACGCCAGTCTAGCATGGATAAAGATATGATTCGAGTGATTCGACGGGGTCCTCGCATCCCGATTGTCTTAGTGTTA
CCCCCAGGCGAAGGTAAAGGTAGACGAAAGCTGGCGAGCGACAGAGAAGGATCCGTGACATGCCATATGGGGACTCAGTTTTGCTTCCGCGAAGAGAAAAAGCTTTGGAT
CAAAGAAGAAGTTGAGAACTACAAAGGTAAGTTCATCGTTTACCTTCCTTTCACTTCATTTACGTTCATCGCATGCAACGTCACGATATTCTACAAGGATATAATAACAA
TATTTCAAGTTTTCAACAACAAAATAAATTTTATCAGACTAAGGAAAGGATTAAAAATATAA
mRNA sequence
ATGAAATATTTAAAACTTCTTTTAGTGTTTCATTTCATGAATAGCTCGATAGTTCAATTGTTAGCTTCCGAAAAACTTAACGACGATAACTATGCGGCATGGAAATCAAA
TCTTAACACAATACTAGTTGCTGATGATTTACAGTTTGTCTTAACTGAGGAATGTCCTCAAACCTCAGCCTCAAATGCAAACCGAGCTAGTCGAAAAGCATATGATCGAT
GGATAAAAGCAAACAAGAAGCCCCCTGATCGCTATTTGGGTTTAAGTGAAGCTCAAATCATCATACCAGATGATGGCATAGAGGATCCATTGACCTATAAACAGACTATG
AATGGTGTGGATTGTGACCAATGGATCAAAGCCATGGACCTTGAAATGGAATCTATGTATTCCAATTATGTCTGGACTTTAGTAGATCAACCAAGTGAGGTAAGACCTAT
TGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTCAAAGCTCGACTAGTGGTAAAAGGTTACACACAAAGGGAGGGAATAGATTATG
AAGAAACTTTCTCTCCTGTTGCCATGATAAAGTCGATACGAATACTCTTGTCCATCGTCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTTTG
AATGGTAATCTTGAAGAGAGTATTTATATGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTAAGCTTCAAATATCCATTTATGGATTGAAACA
AGCTTCTAGATCCTGGAATATAAGGTTTGATACTACGATCAAATCTTATGGTTTTGAACAGAATGTTGATGAACCTTGTGTTTACAAAAGGATCATCAATTCTACTGTAG
CATTCTTAGTTCTGTATGGAGATGACATTCTACTCATTGGGAATGACGTAGTGGAGATTGTTAATCGATATCAGTCCAATCTTGGACGTGATCATTGGACCGCCGTTAAA
AATATTCTAAAATATCTTAGAAGAACAAAAGACTATATGCTTGTGTATGGTTCTAAGGATCTGATCCTTACTGGATACACTGACTCCGATTTTCAAACTGATAAAGATGC
TAGAAAGTCTACATCAGGATCAATTTTCACTCTGAACGGAGGAGCAGTAATATGGAGAAGCATAAAACAATCTTGTATTGCCGACTCCACTATGGAAGCTGAATATGTAG
CTACCTGTGAAGCAGCGAAAGAAGCAGTATGGCTTAAAAAGTTCTCAACAGATTTGGAAGTCGTTCCAAATATGCATCTACCTATCACCTTATATTGTGACAACAGTGGT
GCAGTTGCAAACTCACGAGAACCTAGAAGTCATAAACGAGGAAAGCATATTGAACGAAAGTACCATCTGATCAGAGAAATCGTACATCGAGGAGACGTTATAGTAACAAA
AATCTCCTCTAAGCAAAATGTGGCTGATCCATTTACAAAAGCTCTCACGGCTAAAGGTGTTCTGTTAACTTTGTCTATCAGAGTCGTACCTGCATGGGTGTCCTTCGGGA
TCACCACCTATTTAGGACTGAGTGGTCCGACGGGACGCCAGTCTAGCATGGATAAAGATATGATTCGAGTGATTCGACGGGGTCCTCGCATCCCGATTGTCTTAGTGTTA
CCCCCAGGCGAAGGTAAAGGTAGACGAAAGCTGGCGAGCGACAGAGAAGGATCCGTGACATGCCATATGGGGACTCAGTTTTGCTTCCGCGAAGAGAAAAAGCTTTGGAT
CAAAGAAGAAGTTGAGAACTACAAAGGTAAGTTCATCGTTTACCTTCCTTTCACTTCATTTACGTTCATCGCATGCAACGTCACGATATTCTACAAGGATATAATAACAA
TATTTCAAGTTTTCAACAACAAAATAAATTTTATCAGACTAAGGAAAGGATTAAAAATATAA
Protein sequence
MKYLKLLLVFHFMNSSIVQLLASEKLNDDNYAAWKSNLNTILVADDLQFVLTEECPQTSASNANRASRKAYDRWIKANKKPPDRYLGLSEAQIIIPDDGIEDPLTYKQTM
NGVDCDQWIKAMDLEMESMYSNYVWTLVDQPSEVRPIGCKWIYKRKRDQAGKVQTFKARLVVKGYTQREGIDYEETFSPVAMIKSIRILLSIVTFYDYEIWQMDVKTTFL
NGNLEESIYMVQPEGFIQKGQEQKVCKLQISIYGLKQASRSWNIRFDTTIKSYGFEQNVDEPCVYKRIINSTVAFLVLYGDDILLIGNDVVEIVNRYQSNLGRDHWTAVK
NILKYLRRTKDYMLVYGSKDLILTGYTDSDFQTDKDARKSTSGSIFTLNGGAVIWRSIKQSCIADSTMEAEYVATCEAAKEAVWLKKFSTDLEVVPNMHLPITLYCDNSG
AVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKISSKQNVADPFTKALTAKGVLLTLSIRVVPAWVSFGITTYLGLSGPTGRQSSMDKDMIRVIRRGPRIPIVLVL
PPGEGKGRRKLASDREGSVTCHMGTQFCFREEKKLWIKEEVENYKGKFIVYLPFTSFTFIACNVTIFYKDIITIFQVFNNKINFIRLRKGLKI
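
The protein sequence above is expected to be the translation of the CDS sequence. The sketch below shows one way to check that correspondence on a short prefix. It assumes the standard nuclear genetic code, which this record does not state explicitly, and the `translate` helper is illustrative rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; unknown codons become 'X', stops become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 21 nucleotides of the CDS listed above.
prefix = translate("ATGAAATATTTAAAACTTCTT")
print(prefix)
```

To check the whole record, pass the full CDS text, strip the trailing `*` produced by the terminal stop codon, and compare the result with the protein sequence after removing line breaks.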