CuGenDBv2

Pay0001519 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0001519
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag-pol polyprotein
Genome location: chr11:11770933..11773530
RNA-Seq Expression: Pay0001519
Synteny: Pay0001519
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits
KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 8.2e-146; % identity: 43.18)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVSI KPEV WTD +EQASVGNARALN IFNGVDLNVFKLIN CSTAKEAWKTLEVAYEGTSK                    DESVSDYNK VL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVWKEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIG
        EI NESL+L   I       +EAHDITTLKLDELFGSLLTFEM  A+RESKK KGI+FKSTHV+E+A  DTE NM+E                +    +G
Subjt:  EIVNESLMLGEKIPNSKIVWKEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIG

Query:  SNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDS
              + + RK  +N     ++                      +E+ D+ D +   ++NAFT+ ++  ++ D+SECS +  +   + E+L+ LWKED 
Subjt:  SNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDS

Query:  EARTLKSIKMLNSRTE------NLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCY
        EAR ++  ++ +   E      NLD +L +G NG ++YGLGF A A     T+EIKFVPAS+  + DT+     +    K+    C+Y G+KGHIR  CY
Subjt:  EARTLKSIKMLNSRTE------NLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCY

Query:  KLRR-----------------------------------------------------------------------------------------------D
        KL+R                                                                                               D
Subjt:  KLRR-----------------------------------------------------------------------------------------------D

Query:  ILYVDGLKANLISVSQLYNY-----------VVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNI
        + YVDGLKANLI++SQL +            VV+NK+NQI M+G  QA+NCYHW  N S+ C L + DQT LWH KLGH+ ++ ++K +K   V+G+PN+
Subjt:  ILYVDGLKANLISVSQLYNY-----------VVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNI

Query:  NVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKGVKIVRIR
        +VN    C DC   KQ +++HKSLKEC TNRVLELLHMDL G +QT+SLGGK                       DT ++C +LCL LQRE+  KI RIR
Subjt:  NVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKGVKIVRIR

Query:  SNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK
        S+HGKEF N   N+FC  EG HHE+SAPIT   NGVVE KNKTL +MA+VM+HAK
Subjt:  SNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK

KAA0045252.1 gag-proteinase polyprotein [Cucumis melo var. makuwa] (e value: 1.1e-163; % identity: 68.74)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVS+LKPEV  TD +EQAS+GNARALN IFNGVDLNVFKLINSCSTAKEA KTLEVAYEGTSK                    DE VSDYNKRVL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI
        EI NESLML EKIP+SKIV K                 EAHDITTLKLDELFGSLLTFEMA  DRESKK KGI+FK THVSE+AVSDT+ NMNESIDLLI
Subjt:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI

Query:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYE---------------------------DTGDGEE
        KQFSNV+KKFKNLNT GSNA N+INYQRKDGENNTRR NENSNRRNSDYGRKKE EGRVFRCRE E                           DTGDGEE
Subjt:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYE---------------------------DTGDGEE

Query:  AKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR-------------------------------------TLKSIKMLNSRTENLD
          SMNAFTVC+SETDSGDESECS QICDKNFTFEEL+VLWKED EAR                                     TLKS+KMLNS TENLD
Subjt:  AKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR-------------------------------------TLKSIKMLNSRTENLD

Query:  VMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILY
        V+LNSGQNGLN++GLGFD  ARK+NTTTEI FVPASVNDKTDTV  TKVVSPSAKTTKWICHY GQK HIRPFCYKL RDILY
Subjt:  VMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILY

KAA0046862.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.0e-143; % identity: 54.97)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSKDESVSDYNKRVLEIVNESLMLGEKIPNSKIVW
        MI V+ V +LKPEV WT+AKEQASVGNARALN+IFNGVDLNVFK+                     ++DES+SDYNKRVLEI NESLMLGEKIP+SKIV 
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSKDESVSDYNKRVLEIVNESLMLGEKIPNSKIVW

Query:  K-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIGSNA
        K                 EAHDITTLKLDELFGSLLTFEM   DRESKK K I+FKSTHVS++ VSDT+ NMNESIDLLIKQFSNVVKKFKNLNT GSNA
Subjt:  K-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIGSNA

Query:  PNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR
         N+INYQRKDGENNTRR +   NR                                  +F   + E  SG                              
Subjt:  PNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR

Query:  TLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILYV
                        V    G  G                                      ++++               KG+I         D+ YV
Subjt:  TLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILYV

Query:  DGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVNS
        DGLKANLISVSQL            N VVINKDNQILMNGS QA+NCYHWI NNSEVCHLNKEDQT LWH KLGHIDLKSID+TVK +VVIGVPNI+VNS
Subjt:  DGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVNS

Query:  KLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLIL
        KLVCGDCLTEKQ KASHKSLKECSTNRVLELLHMDL G LQTESLGGKKYVFV EEDFSRFTW+RFLKDKYDTPKV ISLCLIL
Subjt:  KLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLIL

KAA0054435.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 6.9e-145; % identity: 47.28)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVS+ KPE+ WTDA+EQASVG ARA+N IFNGVD NVFKLIN C+TAKEAWK LEVAYEGTSK                    DE+VS+YNKRVL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVW-----------------KEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAV--SDTEGNMNESIDL
        EIVN+ L+LGEKI  SKIV                  KEA DITTL LDELFGSLLTFEMAI+DRESKK KGI+FKS +  +K V  S  E N +ESI L
Subjt:  EIVNESLMLGEKIPNSKIVW-----------------KEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAV--SDTEGNMNESIDL

Query:  LIKQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQ
        L KQFS + KKFK LNT            R D EN+ R+ N++S RRNSD+G+K E  G                                         
Subjt:  LIKQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQ

Query:  ICDKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICH
                                           +LD +L+SGQN  +KYGLGFD   + +  T E K VP                            
Subjt:  ICDKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICH

Query:  YFGQKGHIRPFCYKLRRDILYVDGLKANLISVSQLYNYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDV
                        +DI + DG    +I+   +    + +K+NQ+ M+G  +++NCYHW  N S +CHL K DQT LWH KLGHI L+S+DK ++ + 
Subjt:  YFGQKGHIRPFCYKLRRDILYVDGLKANLISVSQLYNYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDV

Query:  VIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKG
        V+G+P++++N K  CGDC   KQ K+SH  LKEC T RVLELLH+DL G +QTESLGGKKYV V  +D+SRFTW+ FLK K DT K+CISLCL LQ EKG
Subjt:  VIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKG

Query:  VKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK
         KI+RIRS+H KEF N DLNNFC  EGIHHE +APIT   NGVVE KN+TL +MA+VM+HAK
Subjt:  VKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK

XP_008444307.1 PREDICTED: uncharacterized protein LOC103487675 [Cucumis melo] (e value: 7.6e-269; % identity: 80.92)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVS+LKPEV  TD +EQAS+GNARALN IFNGVDLNVFKLINSCSTAKEA KTLEVAYEGTSK                    DE VSDYNKRVL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI
        EI NESLML EKIP+SKIV K                 EAHDITTLKLDELFGSLLTFEMA  DRESKK KGI+FK THVSE+AVSDT+ NMNESIDLLI
Subjt:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI

Query:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQIC
        KQFSNV+KKFKNLNT GSNA N+INYQRKDGENNTRR NENSNRRNSDYGRKKE EGRVFRCREY+DTGDGEE  SMNAFTVC+SETDSGDESECS QIC
Subjt:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQIC

Query:  DKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYF
        DKNFTFEEL+VLWKED EARTLKS+KMLNS TENLDV+LNSGQNGLN++GLGFD  ARK+NTTTEI FVPASVNDKTDTV  TKVVSPSAKTTKWICHY 
Subjt:  DKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYF

Query:  GQKGHIRPFCYKLRRDILYVDGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKS
        GQK HIRPFCYKL RDILYVDGLKANLISVS+L            N VVINKDNQILMNGS QA+NCYHWI N+SEVCHLNKEDQT LWH KLGHIDLKS
Subjt:  GQKGHIRPFCYKLRRDILYVDGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKS

Query:  IDKTVKKDVVIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISL
        ID T+K +VVIGVPNI+VNSKLVC DCLTEKQ KASHKSLKECSTNRVLELLHMDL GSLQTESLGGKKYVFVAEEDFSRFTW+RFLKDKYDTPKVCISL
Subjt:  IDKTVKKDVVIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISL

Query:  CLILQREK
        CLILQREK
Subjt:  CLILQREK

TrEMBL top hits
A0A1S3BAW0 uncharacterized protein LOC103487675 (e value: 3.7e-269; % identity: 80.92)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVS+LKPEV  TD +EQAS+GNARALN IFNGVDLNVFKLINSCSTAKEA KTLEVAYEGTSK                    DE VSDYNKRVL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI
        EI NESLML EKIP+SKIV K                 EAHDITTLKLDELFGSLLTFEMA  DRESKK KGI+FK THVSE+AVSDT+ NMNESIDLLI
Subjt:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI

Query:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQIC
        KQFSNV+KKFKNLNT GSNA N+INYQRKDGENNTRR NENSNRRNSDYGRKKE EGRVFRCREY+DTGDGEE  SMNAFTVC+SETDSGDESECS QIC
Subjt:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQIC

Query:  DKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYF
        DKNFTFEEL+VLWKED EARTLKS+KMLNS TENLDV+LNSGQNGLN++GLGFD  ARK+NTTTEI FVPASVNDKTDTV  TKVVSPSAKTTKWICHY 
Subjt:  DKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYF

Query:  GQKGHIRPFCYKLRRDILYVDGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKS
        GQK HIRPFCYKL RDILYVDGLKANLISVS+L            N VVINKDNQILMNGS QA+NCYHWI N+SEVCHLNKEDQT LWH KLGHIDLKS
Subjt:  GQKGHIRPFCYKLRRDILYVDGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKS

Query:  IDKTVKKDVVIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISL
        ID T+K +VVIGVPNI+VNSKLVC DCLTEKQ KASHKSLKECSTNRVLELLHMDL GSLQTESLGGKKYVFVAEEDFSRFTW+RFLKDKYDTPKVCISL
Subjt:  IDKTVKKDVVIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISL

Query:  CLILQREK
        CLILQREK
Subjt:  CLILQREK

A0A5A7TNK7 Gag-pol polyprotein (e value: 4.0e-146; % identity: 43.18)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVSI KPEV WTD +EQASVGNARALN IFNGVDLNVFKLIN CSTAKEAWKTLEVAYEGTSK                    DESVSDYNK VL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVWKEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIG
        EI NESL+L   I       +EAHDITTLKLDELFGSLLTFEM  A+RESKK KGI+FKSTHV+E+A  DTE NM+E                +    +G
Subjt:  EIVNESLMLGEKIPNSKIVWKEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIG

Query:  SNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDS
              + + RK  +N     ++                      +E+ D+ D +   ++NAFT+ ++  ++ D+SECS +  +   + E+L+ LWKED 
Subjt:  SNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDS

Query:  EARTLKSIKMLNSRTE------NLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCY
        EAR ++  ++ +   E      NLD +L +G NG ++YGLGF A A     T+EIKFVPAS+  + DT+     +    K+    C+Y G+KGHIR  CY
Subjt:  EARTLKSIKMLNSRTE------NLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCY

Query:  KLRR-----------------------------------------------------------------------------------------------D
        KL+R                                                                                               D
Subjt:  KLRR-----------------------------------------------------------------------------------------------D

Query:  ILYVDGLKANLISVSQLYNY-----------VVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNI
        + YVDGLKANLI++SQL +            VV+NK+NQI M+G  QA+NCYHW  N S+ C L + DQT LWH KLGH+ ++ ++K +K   V+G+PN+
Subjt:  ILYVDGLKANLISVSQLYNY-----------VVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNI

Query:  NVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKGVKIVRIR
        +VN    C DC   KQ +++HKSLKEC TNRVLELLHMDL G +QT+SLGGK                       DT ++C +LCL LQRE+  KI RIR
Subjt:  NVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKGVKIVRIR

Query:  SNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK
        S+HGKEF N   N+FC  EG HHE+SAPIT   NGVVE KNKTL +MA+VM+HAK
Subjt:  SNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK

A0A5A7TPF7 Gag-proteinase polyprotein (e value: 5.5e-164; % identity: 68.74)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVS+LKPEV  TD +EQAS+GNARALN IFNGVDLNVFKLINSCSTAKEA KTLEVAYEGTSK                    DE VSDYNKRVL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI
        EI NESLML EKIP+SKIV K                 EAHDITTLKLDELFGSLLTFEMA  DRESKK KGI+FK THVSE+AVSDT+ NMNESIDLLI
Subjt:  EIVNESLMLGEKIPNSKIVWK-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLI

Query:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYE---------------------------DTGDGEE
        KQFSNV+KKFKNLNT GSNA N+INYQRKDGENNTRR NENSNRRNSDYGRKKE EGRVFRCRE E                           DTGDGEE
Subjt:  KQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYE---------------------------DTGDGEE

Query:  AKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR-------------------------------------TLKSIKMLNSRTENLD
          SMNAFTVC+SETDSGDESECS QICDKNFTFEEL+VLWKED EAR                                     TLKS+KMLNS TENLD
Subjt:  AKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR-------------------------------------TLKSIKMLNSRTENLD

Query:  VMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILY
        V+LNSGQNGLN++GLGFD  ARK+NTTTEI FVPASVNDKTDTV  TKVVSPSAKTTKWICHY GQK HIRPFCYKL RDILY
Subjt:  VMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILY

A0A5A7TTT8 Gag-pol polyprotein (e value: 4.8e-144; % identity: 54.97)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSKDESVSDYNKRVLEIVNESLMLGEKIPNSKIVW
        MI V+ V +LKPEV WT+AKEQASVGNARALN+IFNGVDLNVFK+                     ++DES+SDYNKRVLEI NESLMLGEKIP+SKIV 
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSKDESVSDYNKRVLEIVNESLMLGEKIPNSKIVW

Query:  K-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIGSNA
        K                 EAHDITTLKLDELFGSLLTFEM   DRESKK K I+FKSTHVS++ VSDT+ NMNESIDLLIKQFSNVVKKFKNLNT GSNA
Subjt:  K-----------------EAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIGSNA

Query:  PNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR
         N+INYQRKDGENNTRR +   NR                                  +F   + E  SG                              
Subjt:  PNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEAR

Query:  TLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILYV
                        V    G  G                                      ++++               KG+I         D+ YV
Subjt:  TLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILYV

Query:  DGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVNS
        DGLKANLISVSQL            N VVINKDNQILMNGS QA+NCYHWI NNSEVCHLNKEDQT LWH KLGHIDLKSID+TVK +VVIGVPNI+VNS
Subjt:  DGLKANLISVSQLY-----------NYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVNS

Query:  KLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLIL
        KLVCGDCLTEKQ KASHKSLKECSTNRVLELLHMDL G LQTESLGGKKYVFV EEDFSRFTW+RFLKDKYDTPKV ISLCLIL
Subjt:  KLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLIL

A0A5D3CS19 Gag-pol polyprotein (e value: 3.3e-145; % identity: 47.28)
Query:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL
        MI V+GVS+ KPE+ WTDA+EQASVG ARA+N IFNGVD NVFKLIN C+TAKEAWK LEVAYEGTSK                    DE+VS+YNKRVL
Subjt:  MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSK--------------------DESVSDYNKRVL

Query:  EIVNESLMLGEKIPNSKIVW-----------------KEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAV--SDTEGNMNESIDL
        EIVN+ L+LGEKI  SKIV                  KEA DITTL LDELFGSLLTFEMAI+DRESKK KGI+FKS +  +K V  S  E N +ESI L
Subjt:  EIVNESLMLGEKIPNSKIVW-----------------KEAHDITTLKLDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAV--SDTEGNMNESIDL

Query:  LIKQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQ
        L KQFS + KKFK LNT            R D EN+ R+ N++S RRNSD+G+K E  G                                         
Subjt:  LIKQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREGRVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQ

Query:  ICDKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICH
                                           +LD +L+SGQN  +KYGLGFD   + +  T E K VP                            
Subjt:  ICDKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIKFVPASVNDKTDTVTVTKVVSPSAKTTKWICH

Query:  YFGQKGHIRPFCYKLRRDILYVDGLKANLISVSQLYNYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDV
                        +DI + DG    +I+   +    + +K+NQ+ M+G  +++NCYHW  N S +CHL K DQT LWH KLGHI L+S+DK ++ + 
Subjt:  YFGQKGHIRPFCYKLRRDILYVDGLKANLISVSQLYNYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDV

Query:  VIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKG
        V+G+P++++N K  CGDC   KQ K+SH  LKEC T RVLELLH+DL G +QTESLGGKKYV V  +D+SRFTW+ FLK K DT K+CISLCL LQ EKG
Subjt:  VIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQREKG

Query:  VKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK
         KI+RIRS+H KEF N DLNNFC  EGIHHE +APIT   NGVVE KN+TL +MA+VM+HAK
Subjt:  VKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAK

SwissProt top hits
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.3e-24; % identity: 29.96)
Query:  CYKLRRDILYVDGLKANLISVSQL----YNYVVIN------KDNQILMNGSPQANNCYHWIPNNSEVC--HLNKEDQTL---LWHIKLGHIDLKSIDKTV
        C  + +D+ +V  L+ NLIS   L    Y     N      K + ++  G  +          N+E+C   LN     +   LWH ++GH+  K +    
Subjt:  CYKLRRDILYVDGLKANLISVSQL----YNYVVIN------KDNQILMNGSPQANNCYHWIPNNSEVC--HLNKEDQTL---LWHIKLGHIDLKSIDKTV

Query:  KKDVVIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQ
        KK ++       V     C  CL  KQ + S ++  E   N +L+L++ D+ G ++ ES+GG KY     +D SR  W+  LK K    +V      +++
Subjt:  KKDVVIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCISLCLILQ

Query:  REKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAKK
        RE G K+ R+RS++G E+ + +   +C S GI HE + P T  HNGV E  N+T+ +  + ML   K
Subjt:  REKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAKK

P25384 Transposon Ty2-C Gag-Pol polyprotein (e value: 2.1e-11; % identity: 26.44)
Query:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE
        I N ++   +NK    L+ H  LGH + +SI K++KK+ V  +   ++     S   C DCL  K  K  H     LK   +    + LH D+ G +   
Subjt:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE

Query:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP
              Y     ++ +RF W+  L D+ +     V  S+   ++ +   +++ I+ + G E+ N  L+ F  + GI   Y+    S  +GV E  N+TL 
Subjt:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP

Query:  KMAQVMLH
           + +LH
Subjt:  KMAQVMLH

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein (e value: 2.1e-11; % identity: 26.44)
Query:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE
        I N ++   +NK    L+ H  LGH + +SI K++KK+ V  +   ++     S   C DCL  K  K  H     LK   +    + LH D+ G +   
Subjt:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE

Query:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP
              Y     ++ +RF W+  L D+ +     V  S+   ++ +   +++ I+ + G E+ N  L+ F  + GI   Y+    S  +GV E  N+TL 
Subjt:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP

Query:  KMAQVMLH
           + +LH
Subjt:  KMAQVMLH

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein (e value: 2.1e-11; % identity: 26.44)
Query:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE
        I N ++   +NK    L+ H  LGH + +SI K++KK+ V  +   ++     S   C DCL  K  K  H     LK   +    + LH D+ G +   
Subjt:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE

Query:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP
              Y     ++ +RF W+  L D+ +     V  S+   ++ +   +++ I+ + G E+ N  L+ F  + GI   Y+    S  +GV E  N+TL 
Subjt:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP

Query:  KMAQVMLH
           + +LH
Subjt:  KMAQVMLH

Q12491 Transposon Ty2-B Gag-Pol polyprotein (e value: 2.1e-11; % identity: 26.44)
Query:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE
        I N ++   +NK    L+ H  LGH + +SI K++KK+ V  +   ++     S   C DCL  K  K  H     LK   +    + LH D+ G +   
Subjt:  IPNNSEVCHLNKEDQTLLWHIKLGHIDLKSIDKTVKKDVVIGVPNINVN----SKLVCGDCLTEKQIKASH---KSLKECSTNRVLELLHMDLKGSLQTE

Query:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP
              Y     ++ +RF W+  L D+ +     V  S+   ++ +   +++ I+ + G E+ N  L+ F  + GI   Y+    S  +GV E  N+TL 
Subjt:  SLGGKKYVFVAEEDFSRFTWIRFLKDKYDTP--KVCISLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLP

Query:  KMAQVMLH
           + +LH
Subjt:  KMAQVMLH

Arabidopsis top hits
AT4G05360.1 Zinc knuckle (CCHC-type) family protein (e value: 3.3e-04; % identity: 29.93)
Query:  LKVLWKEDSEA----RTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMN--------------TTTEIKFVPASVND-KTDTVTVTKVVSPS
        LK   +++ EA     T K+++MLN+ T+ L  +L+ G+   +K GLGF     K +              T  E   V    +D +TD+ T T+  S +
Subjt:  LKVLWKEDSEA----RTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMN--------------TTTEIKFVPASVND-KTDTVTVTKVVSPS

Query:  AKTTKW----------ICHYFGQKGHIRPFCYKLRRD
         K  K           +CH+ G  GHIRP C++L R+
Subjt:  AKTTKW----------ICHYFGQKGHIRPFCYKLRRD


Sequences
CDS sequence
ATGATTACCGTGAGTGGTGTTTCGATTCTAAAACCTGAAGTTTATTGGACTGATGCTAAAGAGCAAGCTTCTGTTGGAAATGCCAGAGCACTTAACATGATATTTAATGG
TGTTGACCTGAACGTTTTCAAGTTAATAAATTCTTGCAGTACAGCCAAAGAAGCCTGGAAAACCTTGGAGGTAGCGTATGAAGGTACTTCCAAAGATGAATCAGTGTCTG
ATTACAATAAGAGAGTGCTTGAAATCGTAAATGAATCTTTGATGCTTGGTGAAAAAATACCTAACTCTAAAATAGTGTGGAAAGAAGCTCATGATATTACAACGTTGAAA
CTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGAGATGGCCATTGCTGATAGAGAAAGTAAGAAAGACAAGGGAATTTCTTTTAAATCGACACATGTAAGTGAGAAGGC
TGTAAGTGACACTGAAGGAAACATGAACGAATCAATAGACCTCCTGATCAAACAGTTTTCTAATGTGGTCAAGAAATTCAAAAACTTGAATACCATAGGATCAAACGCTC
CAAATATGATTAACTATCAAAGAAAAGATGGTGAGAACAATACGAGAAGGTTTAATGAAAATTCAAATAGGAGAAATAGTGATTATGGACGAAAAAAAGAGCGCGAAGGA
AGAGTTTTCAGATGTAGAGAATATGAAGATACTGGTGATGGTGAAGAAGCTAAAAGCATGAATGCATTTACAGTATGTGTTTCAGAAACTGACTCTGGAGATGAAAGTGA
ATGTTCTAGGCAGATTTGTGATAAAAACTTTACATTTGAAGAGCTCAAAGTCCTATGGAAAGAAGATTCTGAAGCCAGAACATTAAAGTCTATAAAGATGCTAAATTCAA
GAACTGAGAATTTAGATGTAATGTTAAATTCAGGACAGAATGGTTTAAACAAATATGGTCTTGGGTTTGATGCTTTTGCAAGAAAGATGAATACTACAACCGAAATAAAG
TTTGTACCTGCATCAGTTAATGACAAGACAGATACAGTCACGGTAACGAAAGTAGTTAGCCCTTCAGCTAAAACTACTAAATGGATCTGTCACTACTTTGGTCAGAAAGG
CCATATTAGACCTTTTTGCTATAAACTACGGAGAGATATATTATATGTGGATGGCTTAAAAGCAAATCTGATTAGTGTAAGTCAGCTATACAATTATGTGGTAATTAATA
AAGATAATCAGATTCTTATGAATGGTAGTCCGCAAGCGAATAACTGCTATCACTGGATCCCCAATAATTCAGAAGTTTGTCATTTGAATAAAGAAGATCAAACCTTGTTG
TGGCACATAAAGCTAGGACACATCGACCTGAAAAGCATAGACAAGACTGTAAAAAAGGATGTTGTGATAGGTGTTCCAAATATTAATGTAAATAGCAAATTGGTTTGTGG
AGATTGTCTAACTGAGAAGCAAATTAAAGCATCCCACAAAAGCCTAAAGGAATGTTCCACTAATAGAGTCCTTGAACTTCTACATATGGATCTTAAAGGATCATTGCAGA
CTGAAAGTCTTGGTGGAAAGAAATATGTGTTTGTAGCTGAAGAAGATTTTTCTCGATTTACATGGATTCGATTTTTGAAAGACAAGTATGATACTCCCAAAGTCTGCATC
AGCTTGTGCTTGATTTTGCAGCGAGAAAAAGGAGTGAAAATTGTTAGAATCAGAAGTAATCATGGCAAAGAATTTAAAAATGGAGATCTCAATAACTTTTGTGATTCTGA
AGGAATACACCATGAATACTCTGCTCCTATAACTTCTCTACATAATGGAGTTGTTGAAAGTAAAAACAAAACATTACCGAAGATGGCTCAAGTCATGTTACATGCCAAAA
AAAAAATTTACCCTCACATATTCAGGGGGAGCATGAAGTCTGCAGTTATTTGTCATATCAAATAG
mRNA sequence
ATGATTACCGTGAGTGGTGTTTCGATTCTAAAACCTGAAGTTTATTGGACTGATGCTAAAGAGCAAGCTTCTGTTGGAAATGCCAGAGCACTTAACATGATATTTAATGG
TGTTGACCTGAACGTTTTCAAGTTAATAAATTCTTGCAGTACAGCCAAAGAAGCCTGGAAAACCTTGGAGGTAGCGTATGAAGGTACTTCCAAAGATGAATCAGTGTCTG
ATTACAATAAGAGAGTGCTTGAAATCGTAAATGAATCTTTGATGCTTGGTGAAAAAATACCTAACTCTAAAATAGTGTGGAAAGAAGCTCATGATATTACAACGTTGAAA
CTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGAGATGGCCATTGCTGATAGAGAAAGTAAGAAAGACAAGGGAATTTCTTTTAAATCGACACATGTAAGTGAGAAGGC
TGTAAGTGACACTGAAGGAAACATGAACGAATCAATAGACCTCCTGATCAAACAGTTTTCTAATGTGGTCAAGAAATTCAAAAACTTGAATACCATAGGATCAAACGCTC
CAAATATGATTAACTATCAAAGAAAAGATGGTGAGAACAATACGAGAAGGTTTAATGAAAATTCAAATAGGAGAAATAGTGATTATGGACGAAAAAAAGAGCGCGAAGGA
AGAGTTTTCAGATGTAGAGAATATGAAGATACTGGTGATGGTGAAGAAGCTAAAAGCATGAATGCATTTACAGTATGTGTTTCAGAAACTGACTCTGGAGATGAAAGTGA
ATGTTCTAGGCAGATTTGTGATAAAAACTTTACATTTGAAGAGCTCAAAGTCCTATGGAAAGAAGATTCTGAAGCCAGAACATTAAAGTCTATAAAGATGCTAAATTCAA
GAACTGAGAATTTAGATGTAATGTTAAATTCAGGACAGAATGGTTTAAACAAATATGGTCTTGGGTTTGATGCTTTTGCAAGAAAGATGAATACTACAACCGAAATAAAG
TTTGTACCTGCATCAGTTAATGACAAGACAGATACAGTCACGGTAACGAAAGTAGTTAGCCCTTCAGCTAAAACTACTAAATGGATCTGTCACTACTTTGGTCAGAAAGG
CCATATTAGACCTTTTTGCTATAAACTACGGAGAGATATATTATATGTGGATGGCTTAAAAGCAAATCTGATTAGTGTAAGTCAGCTATACAATTATGTGGTAATTAATA
AAGATAATCAGATTCTTATGAATGGTAGTCCGCAAGCGAATAACTGCTATCACTGGATCCCCAATAATTCAGAAGTTTGTCATTTGAATAAAGAAGATCAAACCTTGTTG
TGGCACATAAAGCTAGGACACATCGACCTGAAAAGCATAGACAAGACTGTAAAAAAGGATGTTGTGATAGGTGTTCCAAATATTAATGTAAATAGCAAATTGGTTTGTGG
AGATTGTCTAACTGAGAAGCAAATTAAAGCATCCCACAAAAGCCTAAAGGAATGTTCCACTAATAGAGTCCTTGAACTTCTACATATGGATCTTAAAGGATCATTGCAGA
CTGAAAGTCTTGGTGGAAAGAAATATGTGTTTGTAGCTGAAGAAGATTTTTCTCGATTTACATGGATTCGATTTTTGAAAGACAAGTATGATACTCCCAAAGTCTGCATC
AGCTTGTGCTTGATTTTGCAGCGAGAAAAAGGAGTGAAAATTGTTAGAATCAGAAGTAATCATGGCAAAGAATTTAAAAATGGAGATCTCAATAACTTTTGTGATTCTGA
AGGAATACACCATGAATACTCTGCTCCTATAACTTCTCTACATAATGGAGTTGTTGAAAGTAAAAACAAAACATTACCGAAGATGGCTCAAGTCATGTTACATGCCAAAA
AAAAAATTTACCCTCACATATTCAGGGGGAGCATGAAGTCTGCAGTTATTTGTCATATCAAATAG
Protein sequence
MITVSGVSILKPEVYWTDAKEQASVGNARALNMIFNGVDLNVFKLINSCSTAKEAWKTLEVAYEGTSKDESVSDYNKRVLEIVNESLMLGEKIPNSKIVWKEAHDITTLK
LDELFGSLLTFEMAIADRESKKDKGISFKSTHVSEKAVSDTEGNMNESIDLLIKQFSNVVKKFKNLNTIGSNAPNMINYQRKDGENNTRRFNENSNRRNSDYGRKKEREG
RVFRCREYEDTGDGEEAKSMNAFTVCVSETDSGDESECSRQICDKNFTFEELKVLWKEDSEARTLKSIKMLNSRTENLDVMLNSGQNGLNKYGLGFDAFARKMNTTTEIK
FVPASVNDKTDTVTVTKVVSPSAKTTKWICHYFGQKGHIRPFCYKLRRDILYVDGLKANLISVSQLYNYVVINKDNQILMNGSPQANNCYHWIPNNSEVCHLNKEDQTLL
WHIKLGHIDLKSIDKTVKKDVVIGVPNINVNSKLVCGDCLTEKQIKASHKSLKECSTNRVLELLHMDLKGSLQTESLGGKKYVFVAEEDFSRFTWIRFLKDKYDTPKVCI
SLCLILQREKGVKIVRIRSNHGKEFKNGDLNNFCDSEGIHHEYSAPITSLHNGVVESKNKTLPKMAQVMLHAKKKIYPHIFRGSMKSAVICHIK