; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0017665 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0017665
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr03:9828576..9831565
RNA-Seq ExpressionPay0017665
SyntenyPay0017665
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.2e-20056.99Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILELITSKFEALKMTEDETV
        MIFFIK LDGKAWRALVG YEPPM+TVNGV VPKPEIDWTD EE+ASVGNARAINAIFNGVDLNVFKLIN  TTAKE W                   TV
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILELITSKFEALKMTEDETV

Query:  SEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKSVYDQENTVNQSGNEA
        SEYNERVL+IANDSLLL EKI ESKIV KVL                   ITTL LDELFGSLLTFEM +SDRESKKGK   FKS YDQE TVNQSGNEA
Subjt:  SEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKSVYDQENTVNQSGNEA

Query:  NQDESIALLTKQFSKIARK-RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEA------
        NQDESIALL KQFSK+ +K ++ +  +K E  GR     + E     +AECP YL RQKK YCATLSDED D+DEDDHG+NAFTACITEINSEA      
Subjt:  NQDESIALLTKQFSKIARK-RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEA------

Query:  ------------------DSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVS
                          DSE     +ERIQDLMDENE+LMG                                       G+      +SDLILQ+RVS
Subjt:  ------------------DSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVS

Query:  RLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTG---------------
        RLLQK+GH RSFCYKLLR+RRHQQ+PK  NQQNQ R  KRNN+VRGTH IWRV TS KCNVAFTTVQ HVDAW  +    R++                 
Subjt:  RLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTG---------------

Query:  NRSFF--------------TELEECASGHV-----------TFGDGAKGKIIAKGNID-----------KKKSDTGKLCISLCLNLQREKGQKIIRIRSD
        N  FF                L+EC    V              +  +GK      +D           K+KSDT KLCISLC+NLQREKGQKII++RSD
Subjt:  NRSFF--------------TELEECASGHV-----------TFGDGAKGKIIAKGNID-----------KKKSDTGKLCISLCLNLQREKGQKIIRIRSD

Query:  HGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA---------------------------------SDQR
        HGK+FDNEDLNN CQT GIHHEF APIT +QNGVVE+KNRTLQEM+RVMIHA NLPLNF AEA                                 SDQ 
Subjt:  HGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA---------------------------------SDQR

Query:  IFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
        IFLGYS NSRAYRVFNIKS TVME INVVVNDFESN+NQFNIEDDET+VT EVTSTPL EMPK +
Subjt:  IFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.1e-21457.91Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIK LDGKAWRALV  YEP MIT+NGVSVPKPEIDWTD EE+ASVGNARAINAIFNGV+L+VFKLINS  TAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLL EKIPESKIV KVL                   I TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL
        +YDQENTVNQSGNEANQDESI LLTKQFSK+ARK                               RN+DHGKKKEDVGRSFRCREC+G      EC    
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL

Query:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL
                      D+D D++       T    +I  + DSE     +ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKS KMLN GTDSLDSIL
Subjt:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL

Query:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN
        S GQNGSSKY        R  +++                                      ++GHIRSFCYKLLRDRRHQQRPK  NQQN+Y TIKRN+
Subjt:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN

Query:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK
        DVRGTHWIWRV  S KCNVAFTTVQTHVDA                                                          LCLNLQREKGQK
Subjt:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK

Query:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------
        IIRIRSDHGK+FDNEDLNN CQT GIHH+F  PITP+QNGVVE +N TLQEM+RVMIHAKNLPLNFWAEA                              
Subjt:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------

Query:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
                                   S Q IFLGYSQNSRAYRVFNIKS TVMETINVVVNDFESN+NQFNIEDDET+VT EVTSTPL EMPK E
Subjt:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

KAA0059847.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.0e-26172.77Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIKILDGKAWR +VG YEPPMITVNGVSVPKPEIDWTD EEKASVGNARAINA+FNGVDLN+FKLINSYTTAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLLAEKIPESKIVCKVL                   ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC
        VYDQENTVNQS   +  D   + + +                                                       DE+L  +E           
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC

Query:  ITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVSRLLQKKGHIR
          +I  + DSE     +ERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVSRLLQKKGHIR
Subjt:  ITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVSRLLQKKGHIR

Query:  SFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGK
        SFCYKLLRDRRHQQRPK GN+QNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSR MTGNRSFFTELEEC SGHVTF DGAKGK
Subjt:  SFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGK

Query:  IIAKGNIDKK-----------------------KSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQK
        IIAKGNIDK+                       KSDTGKLCISLCLNLQREKGQKIIRIR+DH K+FDNEDLNNLCQT GIHHEFAAPITP QNGVVEQK
Subjt:  IIAKGNIDKK-----------------------KSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQK

Query:  NRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
        N+TLQEM+RVMIHAK+LPLNFWAEA +    + +++NSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVT EVTSTPLYEMPKDE
Subjt:  NRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.1e-21457.91Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIK LDGKAWRALV  YEP MIT+NGVSVPKPEIDWTD EE+ASVGNARAINAIFNGV+L+VFKLINS  TAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLL EKIPESKIV KVL                   I TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL
        +YDQENTVNQSGNEANQDESI LLTKQFSK+ARK                               RN+DHGKKKEDVGRSFRCREC+G      EC    
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL

Query:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL
                      D+D D++       T    +I  + DSE     +ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKS KMLN GTDSLDSIL
Subjt:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL

Query:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN
        S GQNGSSKY        R  +++                                      ++GHIRSFCYKLLRDRRHQQRPK  NQQN+Y TIKRN+
Subjt:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN

Query:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK
        DVRGTHWIWRV  S KCNVAFTTVQTHVDA                                                          LCLNLQREKGQK
Subjt:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK

Query:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------
        IIRIRSDHGK+FDNEDLNN CQT GIHH+F  PITP+QNGVVE +N TLQEM+RVMIHAKNLPLNFWAEA                              
Subjt:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------

Query:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
                                   S Q IFLGYSQNSRAYRVFNIKS TVMETINVVVNDFESN+NQFNIEDDET+VT EVTSTPL EMPK E
Subjt:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

XP_016903608.1 PREDICTED: uncharacterized protein LOC107992254 [Cucumis melo]0.0e+0084.89Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIKILDGKAWR +VG YEPPMITVNGVSVPKPEIDWTD EEKASVGNARAINA+FNGVDLN+FKLINSYTTAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLLAEKIPESKIVCKVL                   ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC
        VYDQENTVNQSGNEANQDES+ALLTKQFSK+ARKRN+DH KKKEDVG SFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDE+ DNDEDDHGINAFTAC
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC

Query:  ITEINSEADSEKTQKPE------------------------ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGS
        ITEINSEADSE +   E                        ERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGS
Subjt:  ITEINSEADSEKTQKPE------------------------ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGS

Query:  SKYSSDLILQIRVSRLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGN
        SKYSSDLILQIRVSRLLQKKGHIRSFCYKLLRDRRHQQRPK GN+QNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSR MTGN
Subjt:  SKYSSDLILQIRVSRLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGN

Query:  RSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQ
        RSFFTELEEC SGHVTF DGAKGKIIAKGNIDK+KSDTGKLCISLCLNLQREKGQKIIRIR+DH K+FDNEDLNNLCQT GIHHEFAAPITP QNGVVEQ
Subjt:  RSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQ

Query:  KNRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
        KN+TLQEM+RVMIHAK+LPLNFWAEA           NSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVT EVTSTPLYEMPKDE
Subjt:  KNRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

TrEMBL top hitse value%identityAlignment
A0A1S4E5V5 uncharacterized protein LOC1079922540.0e+0084.89Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIKILDGKAWR +VG YEPPMITVNGVSVPKPEIDWTD EEKASVGNARAINA+FNGVDLN+FKLINSYTTAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLLAEKIPESKIVCKVL                   ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC
        VYDQENTVNQSGNEANQDES+ALLTKQFSK+ARKRN+DH KKKEDVG SFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDE+ DNDEDDHGINAFTAC
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC

Query:  ITEINSEADSEKTQKPE------------------------ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGS
        ITEINSEADSE +   E                        ERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGS
Subjt:  ITEINSEADSEKTQKPE------------------------ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGS

Query:  SKYSSDLILQIRVSRLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGN
        SKYSSDLILQIRVSRLLQKKGHIRSFCYKLLRDRRHQQRPK GN+QNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSR MTGN
Subjt:  SKYSSDLILQIRVSRLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGN

Query:  RSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQ
        RSFFTELEEC SGHVTF DGAKGKIIAKGNIDK+KSDTGKLCISLCLNLQREKGQKIIRIR+DH K+FDNEDLNNLCQT GIHHEFAAPITP QNGVVEQ
Subjt:  RSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQ

Query:  KNRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
        KN+TLQEM+RVMIHAK+LPLNFWAEA           NSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVT EVTSTPLYEMPKDE
Subjt:  KNRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

A0A5A7U931 Gag-pol polyprotein1.5e-21457.91Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIK LDGKAWRALV  YEP MIT+NGVSVPKPEIDWTD EE+ASVGNARAINAIFNGV+L+VFKLINS  TAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLL EKIPESKIV KVL                   I TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL
        +YDQENTVNQSGNEANQDESI LLTKQFSK+ARK                               RN+DHGKKKEDVGRSFRCREC+G      EC    
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL

Query:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL
                      D+D D++       T    +I  + DSE     +ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKS KMLN GTDSLDSIL
Subjt:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL

Query:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN
        S GQNGSSKY        R  +++                                      ++GHIRSFCYKLLRDRRHQQRPK  NQQN+Y TIKRN+
Subjt:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN

Query:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK
        DVRGTHWIWRV  S KCNVAFTTVQTHVDA                                                          LCLNLQREKGQK
Subjt:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK

Query:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------
        IIRIRSDHGK+FDNEDLNN CQT GIHH+F  PITP+QNGVVE +N TLQEM+RVMIHAKNLPLNFWAEA                              
Subjt:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------

Query:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
                                   S Q IFLGYSQNSRAYRVFNIKS TVMETINVVVNDFESN+NQFNIEDDET+VT EVTSTPL EMPK E
Subjt:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

A0A5D3DCZ8 Gag-pol polyprotein1.5e-21457.91Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIK LDGKAWRALV  YEP MIT+NGVSVPKPEIDWTD EE+ASVGNARAINAIFNGV+L+VFKLINS  TAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLL EKIPESKIV KVL                   I TLKLDELFGSLLTFEMAISDRESKKGK IAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL
        +YDQENTVNQSGNEANQDESI LLTKQFSK+ARK                               RN+DHGKKKEDVGRSFRCREC+G      EC    
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARK-------------------------------RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYL

Query:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL
                      D+D D++       T    +I  + DSE     +ERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKS KMLN GTDSLDSIL
Subjt:  IRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSIL

Query:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN
        S GQNGSSKY        R  +++                                      ++GHIRSFCYKLLRDRRHQQRPK  NQQN+Y TIKRN+
Subjt:  SSGQNGSSKYSSDLILQIRVSRLL-------------------------------------QKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNN

Query:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK
        DVRGTHWIWRV  S KCNVAFTTVQTHVDA                                                          LCLNLQREKGQK
Subjt:  DVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQK

Query:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------
        IIRIRSDHGK+FDNEDLNN CQT GIHH+F  PITP+QNGVVE +N TLQEM+RVMIHAKNLPLNFWAEA                              
Subjt:  IIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA------------------------------

Query:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
                                   S Q IFLGYSQNSRAYRVFNIKS TVMETINVVVNDFESN+NQFNIEDDET+VT EVTSTPL EMPK E
Subjt:  ---------------------------SDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

A0A5D3DMG6 Gag-pol polyprotein9.6e-26272.77Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI
        MIFFIKILDGKAWR +VG YEPPMITVNGVSVPKPEIDWTD EEKASVGNARAINA+FNGVDLN+FKLINSYTTAKE WKILE               LI
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILE---------------LI

Query:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
        TSKFEALKMTEDETVSEYNERVL+IANDSLLLAEKIPESKIVCKVL                   ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS
Subjt:  TSKFEALKMTEDETVSEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKS

Query:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC
        VYDQENTVNQS   +  D   + + +                                                       DE+L  +E           
Subjt:  VYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTAC

Query:  ITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVSRLLQKKGHIR
          +I  + DSE     +ERIQDLMDENERLMGIISSLKVKLK+VQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVSRLLQKKGHIR
Subjt:  ITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVSRLLQKKGHIR

Query:  SFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGK
        SFCYKLLRDRRHQQRPK GN+QNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSR MTGNRSFFTELEEC SGHVTF DGAKGK
Subjt:  SFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTGNRSFFTELEECASGHVTFGDGAKGK

Query:  IIAKGNIDKK-----------------------KSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQK
        IIAKGNIDK+                       KSDTGKLCISLCLNLQREKGQKIIRIR+DH K+FDNEDLNNLCQT GIHHEFAAPITP QNGVVEQK
Subjt:  IIAKGNIDKK-----------------------KSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQK

Query:  NRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
        N+TLQEM+RVMIHAK+LPLNFWAEA +    + +++NSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVT EVTSTPLYEMPKDE
Subjt:  NRTLQEMSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

A0A5D3DSN1 Gag-pol polyprotein5.6e-20156.99Show/hide
Query:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILELITSKFEALKMTEDETV
        MIFFIK LDGKAWRALVG YEPPM+TVNGV VPKPEIDWTD EE+ASVGNARAINAIFNGVDLNVFKLIN  TTAKE W                   TV
Subjt:  MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILELITSKFEALKMTEDETV

Query:  SEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKSVYDQENTVNQSGNEA
        SEYNERVL+IANDSLLL EKI ESKIV KVL                   ITTL LDELFGSLLTFEM +SDRESKKGK   FKS YDQE TVNQSGNEA
Subjt:  SEYNERVLDIANDSLLLAEKIPESKIVCKVLH------------------ITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKSVYDQENTVNQSGNEA

Query:  NQDESIALLTKQFSKIARK-RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEA------
        NQDESIALL KQFSK+ +K ++ +  +K E  GR     + E     +AECP YL RQKK YCATLSDED D+DEDDHG+NAFTACITEINSEA      
Subjt:  NQDESIALLTKQFSKIARK-RNNDHGKKKEDVGRSFRCRECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEA------

Query:  ------------------DSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVS
                          DSE     +ERIQDLMDENE+LMG                                       G+      +SDLILQ+RVS
Subjt:  ------------------DSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTDSLDSILSSGQNGSSKYSSDLILQIRVS

Query:  RLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTG---------------
        RLLQK+GH RSFCYKLLR+RRHQQ+PK  NQQNQ R  KRNN+VRGTH IWRV TS KCNVAFTTVQ HVDAW  +    R++                 
Subjt:  RLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNMTG---------------

Query:  NRSFF--------------TELEECASGHV-----------TFGDGAKGKIIAKGNID-----------KKKSDTGKLCISLCLNLQREKGQKIIRIRSD
        N  FF                L+EC    V              +  +GK      +D           K+KSDT KLCISLC+NLQREKGQKII++RSD
Subjt:  NRSFF--------------TELEECASGHV-----------TFGDGAKGKIIAKGNID-----------KKKSDTGKLCISLCLNLQREKGQKIIRIRSD

Query:  HGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA---------------------------------SDQR
        HGK+FDNEDLNN CQT GIHHEF APIT +QNGVVE+KNRTLQEM+RVMIHA NLPLNF AEA                                 SDQ 
Subjt:  HGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIHAKNLPLNFWAEA---------------------------------SDQR

Query:  IFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE
        IFLGYS NSRAYRVFNIKS TVME INVVVNDFESN+NQFNIEDDET+VT EVTSTPL EMPK +
Subjt:  IFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1236.13Show/hide
Query:  VTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIH
        VTF D A  K+     I K K    ++       ++RE G+K+ R+RSD+G ++ + +    C + GI HE   P TP+ NGV E+ NRT+ E  R M+ 
Subjt:  VTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEMSRVMIH

Query:  AKNLPLNFWAEASDQRIFL
           LP +FW EA     +L
Subjt:  AKNLPLNFWAEASDQRIFL

P22382 Gag-Pol polyprotein5.0e-0532.88Show/hide
Query:  SDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEM
        ++TGK      L L  +    I ++ +D+G  F ++++  +C   GI H F  P  P+  GVVE KN+ L+E+
Subjt:  SDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQEM

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATTTTTCATTAAAATCCTGGATGGAAAAGCATGGAGAGCACTTGTTGGTTGTTATGAGCCTCCAATGATCACTGTGAATGGAGTATCAGTGCCAAAACCTGAAAT
TGACTGGACAGATGTTGAAGAAAAAGCTTCGGTTGGAAATGCAAGAGCCATAAATGCTATCTTCAATGGTGTCGATTTAAACGTATTTAAACTTATTAATTCCTACACTA
CTGCTAAAGAGACGTGGAAAATACTAGAATTGATAACTTCAAAATTCGAAGCCTTGAAAATGACTGAAGATGAGACAGTCTCTGAATACAATGAGAGGGTCTTGGACATA
GCTAATGATTCGTTACTACTTGCTGAAAAGATTCCTGAGTCTAAAATTGTTTGCAAAGTGTTGCATATAACCACGTTAAAACTTGACGAACTATTTGGGTCACTACTTAC
GTTTGAAATGGCTATTTCGGATAGAGAAAGTAAGAAAGGTAAGAGAATAGCATTCAAATCAGTTTATGACCAGGAGAACACTGTAAATCAGTCCGGTAATGAAGCTAATC
AAGATGAGTCAATAGCTCTCTTAACGAAGCAATTCTCTAAGATCGCCAGAAAAAGAAATAACGACCATGGAAAGAAAAAAGAGGATGTAGGGAGGTCGTTTAGATGTAGA
GAATGTGAGGGGCTTGGTCATTATCAGGCCGAATGTCCCGCTTATCTCATAAGACAAAAGAAAAATTATTGTGCTACTTTGTCTGATGAGGATTTAGATAATGATGAAGA
TGATCATGGCATAAATGCGTTCACTGCGTGCATTACAGAAATCAATTCAGAAGCTGATAGTGAGAAGACTCAGAAGCCAGAAGAAAGAATTCAAGATTTAATGGATGAAA
ATGAAAGATTGATGGGGATTATATCATCTCTGAAAGTAAAGTTGAAAGAAGTTCAGAATGTGTATGATCAGACAATTAAGTCTGTGAAAATGTTGAATTATGGAACTGAC
AGCTTAGACTCAATCCTGAGTTCAGGGCAAAATGGTTCAAGTAAATATAGCTCGGATTTGATACTTCAAATAAGGGTGTCAAGATTACTCCAGAAAAAAGGTCATATACG
GTCATTCTGCTACAAATTACTGAGAGATAGAAGACATCAGCAGAGGCCAAAACGTGGAAACCAGCAAAATCAGTATAGGACCATCAAAAGGAACAATGATGTAAGGGGAA
CTCACTGGATCTGGAGGGTGATGACTTCTGGGAAGTGCAATGTAGCATTTACAACAGTCCAAACCCATGTTGATGCTTGGTACTTTGACAGTGGATGCTCAAGAAATATG
ACTGGCAATCGATCTTTCTTTACTGAGTTAGAAGAATGTGCCTCAGGACATGTCACTTTTGGAGATGGGGCCAAAGGAAAAATTATTGCAAAAGGAAACATTGACAAAAA
AAAATCAGATACGGGTAAACTATGTATTAGTCTATGTTTGAACTTGCAACGTGAGAAAGGCCAAAAGATAATCAGGATTCGTAGTGATCATGGGAAGAAATTTGATAATG
AAGATCTGAATAACTTATGTCAGACTGCAGGAATCCATCATGAATTTGCAGCTCCCATAACTCCTCGCCAAAATGGAGTAGTTGAACAGAAAAACAGAACGTTACAAGAG
ATGTCTCGAGTCATGATACATGCTAAAAATTTGCCTTTGAATTTTTGGGCGGAAGCTTCTGATCAAAGGATCTTTCTTGGTTATTCTCAAAATAGTCGAGCGTACAGAGT
CTTCAATATTAAATCTAGAACAGTCATGGAAACAATCAATGTTGTGGTTAATGATTTTGAGTCTAATATCAATCAGTTTAATATTGAGGATGATGAGACCTATGTGACAC
TTGAAGTTACTTCTACTCCCCTTTATGAAATGCCTAAAGATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGATATTTTTCATTAAAATCCTGGATGGAAAAGCATGGAGAGCACTTGTTGGTTGTTATGAGCCTCCAATGATCACTGTGAATGGAGTATCAGTGCCAAAACCTGAAAT
TGACTGGACAGATGTTGAAGAAAAAGCTTCGGTTGGAAATGCAAGAGCCATAAATGCTATCTTCAATGGTGTCGATTTAAACGTATTTAAACTTATTAATTCCTACACTA
CTGCTAAAGAGACGTGGAAAATACTAGAATTGATAACTTCAAAATTCGAAGCCTTGAAAATGACTGAAGATGAGACAGTCTCTGAATACAATGAGAGGGTCTTGGACATA
GCTAATGATTCGTTACTACTTGCTGAAAAGATTCCTGAGTCTAAAATTGTTTGCAAAGTGTTGCATATAACCACGTTAAAACTTGACGAACTATTTGGGTCACTACTTAC
GTTTGAAATGGCTATTTCGGATAGAGAAAGTAAGAAAGGTAAGAGAATAGCATTCAAATCAGTTTATGACCAGGAGAACACTGTAAATCAGTCCGGTAATGAAGCTAATC
AAGATGAGTCAATAGCTCTCTTAACGAAGCAATTCTCTAAGATCGCCAGAAAAAGAAATAACGACCATGGAAAGAAAAAAGAGGATGTAGGGAGGTCGTTTAGATGTAGA
GAATGTGAGGGGCTTGGTCATTATCAGGCCGAATGTCCCGCTTATCTCATAAGACAAAAGAAAAATTATTGTGCTACTTTGTCTGATGAGGATTTAGATAATGATGAAGA
TGATCATGGCATAAATGCGTTCACTGCGTGCATTACAGAAATCAATTCAGAAGCTGATAGTGAGAAGACTCAGAAGCCAGAAGAAAGAATTCAAGATTTAATGGATGAAA
ATGAAAGATTGATGGGGATTATATCATCTCTGAAAGTAAAGTTGAAAGAAGTTCAGAATGTGTATGATCAGACAATTAAGTCTGTGAAAATGTTGAATTATGGAACTGAC
AGCTTAGACTCAATCCTGAGTTCAGGGCAAAATGGTTCAAGTAAATATAGCTCGGATTTGATACTTCAAATAAGGGTGTCAAGATTACTCCAGAAAAAAGGTCATATACG
GTCATTCTGCTACAAATTACTGAGAGATAGAAGACATCAGCAGAGGCCAAAACGTGGAAACCAGCAAAATCAGTATAGGACCATCAAAAGGAACAATGATGTAAGGGGAA
CTCACTGGATCTGGAGGGTGATGACTTCTGGGAAGTGCAATGTAGCATTTACAACAGTCCAAACCCATGTTGATGCTTGGTACTTTGACAGTGGATGCTCAAGAAATATG
ACTGGCAATCGATCTTTCTTTACTGAGTTAGAAGAATGTGCCTCAGGACATGTCACTTTTGGAGATGGGGCCAAAGGAAAAATTATTGCAAAAGGAAACATTGACAAAAA
AAAATCAGATACGGGTAAACTATGTATTAGTCTATGTTTGAACTTGCAACGTGAGAAAGGCCAAAAGATAATCAGGATTCGTAGTGATCATGGGAAGAAATTTGATAATG
AAGATCTGAATAACTTATGTCAGACTGCAGGAATCCATCATGAATTTGCAGCTCCCATAACTCCTCGCCAAAATGGAGTAGTTGAACAGAAAAACAGAACGTTACAAGAG
ATGTCTCGAGTCATGATACATGCTAAAAATTTGCCTTTGAATTTTTGGGCGGAAGCTTCTGATCAAAGGATCTTTCTTGGTTATTCTCAAAATAGTCGAGCGTACAGAGT
CTTCAATATTAAATCTAGAACAGTCATGGAAACAATCAATGTTGTGGTTAATGATTTTGAGTCTAATATCAATCAGTTTAATATTGAGGATGATGAGACCTATGTGACAC
TTGAAGTTACTTCTACTCCCCTTTATGAAATGCCTAAAGATGAATAG
Protein sequenceShow/hide protein sequence
MIFFIKILDGKAWRALVGCYEPPMITVNGVSVPKPEIDWTDVEEKASVGNARAINAIFNGVDLNVFKLINSYTTAKETWKILELITSKFEALKMTEDETVSEYNERVLDI
ANDSLLLAEKIPESKIVCKVLHITTLKLDELFGSLLTFEMAISDRESKKGKRIAFKSVYDQENTVNQSGNEANQDESIALLTKQFSKIARKRNNDHGKKKEDVGRSFRCR
ECEGLGHYQAECPAYLIRQKKNYCATLSDEDLDNDEDDHGINAFTACITEINSEADSEKTQKPEERIQDLMDENERLMGIISSLKVKLKEVQNVYDQTIKSVKMLNYGTD
SLDSILSSGQNGSSKYSSDLILQIRVSRLLQKKGHIRSFCYKLLRDRRHQQRPKRGNQQNQYRTIKRNNDVRGTHWIWRVMTSGKCNVAFTTVQTHVDAWYFDSGCSRNM
TGNRSFFTELEECASGHVTFGDGAKGKIIAKGNIDKKKSDTGKLCISLCLNLQREKGQKIIRIRSDHGKKFDNEDLNNLCQTAGIHHEFAAPITPRQNGVVEQKNRTLQE
MSRVMIHAKNLPLNFWAEASDQRIFLGYSQNSRAYRVFNIKSRTVMETINVVVNDFESNINQFNIEDDETYVTLEVTSTPLYEMPKDE