CuGenDBv2

IVF0017273 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0017273
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Gag/pol protein
Genome location: chr08:26483286..26488237
RNA-Seq Expression: IVF0017273
Synteny: IVF0017273
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.33e-165; % identity: 64.81)
Query:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------
        KVQTFKARLVAKGYTQREGVDYEET SPVAMLKSIRILL IATFYDYEIW+MDV TA+LN NLEESI+M+Q EGFI  G EQK+C L +SIY        
Subjt:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------

Query:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML
                                               DDILLI NDVGYLTD+K WLA +FQMKDLG+AQYVLGIQI+R+RK++TLA+SQA+YIDKML
Subjt:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML

Query:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT
         RY MQNSKKGLL +R+G+HLSK+QCPKTPQEVEDMR+ PYASAVGSL+                                         T+DYMLVYG 
Subjt:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT

Query:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP
        KDLILTGYTDSDFQT++D+RKSTS SVFTLNGGA+VWRS+KQ C+ DSTMEAEYVAAC+AAKEAVWLRKFL D+E+VPNM+LPITLYCDNSGAVANS+EP
Subjt:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP

Query:  RSHKRGKHMERK
        RSHKR KH+ERK
Subjt:  RSHKRGKHMERK

KAA0047406.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.71e-167; % identity: 72.36)
Query:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY----DDIL
        KVQTFKARLVAKGYTQREGVD+EET S V+MLKSIRILL IATFYDYEI QMDVKT  LN NLEESIYMAQLEGFI+NG EQK+ N   +      DDIL
Subjt:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY----DDIL

Query:  LIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYAS
        LI NDVGYLT+IKKWLA +FQM DLGDAQ+ LGIQIV NRK++T+AMSQASYIDKMLSRYKMQNSKKGL+ Y+YGI LSK+QCPKTPQEVEDMRK  YAS
Subjt:  LIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYAS

Query:  AVGSLI-----------------------------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQS
         VGSL+                                         TKDYMLVYGTKDLILTGYTDSDFQT++D R STSGSVFTLNGG VVWRSVKQ+
Subjt:  AVGSLI-----------------------------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQS

Query:  CMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK
        C+ +STME EYV  C+AAKEAVWLRKFLTD+EIVPNMHLPITLYCDNSGA+ANSREPRSHKR KH+E K
Subjt:  CMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK

KAA0065524.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0; % identity: 79.56)
Query:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI
        MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEG                     
Subjt:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI

Query:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
            NDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
Subjt:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA

Query:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
        SAVGSL+                   TKDYMLVYGTKDLILTGYTDSDFQTDRDARKS SGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
Subjt:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA

Query:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK----------------------------------------------------
        VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK                                                    
Subjt:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK----------------------------------------------------

Query:  ---------------------GRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVSETHTTYMDFNERDDVPLARLLKKGLFS
                             GRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVSETHTTYMDFNERDDVPLARLLKKGLFS
Subjt:  ---------------------GRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVSETHTTYMDFNERDDVPLARLLKKGLFS

Query:  NVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDKTTENVGRNDVPVDESIIIDAHVESTNT
        NVVPAKS+DPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDAS+TDHRLVPDSNPIDKTTENVGRNDVPVDESIIIDAHVESTNT
Subjt:  NVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDKTTENVGRNDVPVDESIIIDAHVESTNT

Query:  CATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKYVVQRRIADEVNISDKHHSCLSIMGLIE
        CATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKYVVQRRIADEVNISDKHHSCLSIMGLIE
Subjt:  CATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKYVVQRRIADEVNISDKHHSCLSIMGLIE

Query:  KAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKTLSLSYRLFQGSHVLDIEHDMQPSRDPR
        KA                                                         VLTTLDAPRPEPKTLSLSYRLFQGSHVLDIEHDMQPSRDPR
Subjt:  KAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKTLSLSYRLFQGSHVLDIEHDMQPSRDPR

Query:  IFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
        IFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
Subjt:  IFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.15e-167; % identity: 66.02)
Query:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------
        KVQTFKARLVAKGYTQ+EG+DYEE  S  AM+KSIRILL IATFYDYEIWQMDVKT +LN NLEESIYM Q E FI+ G EQK+C LQKSIY        
Subjt:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------

Query:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML
                                               DDILLI NDVG+L DIKKWLAM+FQMKDLG+AQYVLG+QIVRNRK++TLAMSQ SYIDKML
Subjt:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML

Query:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT
        SRYKM NSKKGLL YRYGIHLSK+QCPKTPQEVEDM   PYASAVGSL+                                         TKDYMLVYG+
Subjt:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT

Query:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP
        KDLILTGYTD  FQTD+DARKSTSG VFT+NGGAVVWRS+KQSC+ DSTMEAEYVA C+AAKEAVWL+KFLTD+E+VPNMHLP TLYCDNSGAV NSREP
Subjt:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP

Query:  RSHKRGKHMERK
        RSHKRGKH+ERK
Subjt:  RSHKRGKHMERK

TYK29294.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0; % identity: 87.22)
Query:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI
        MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEG                     
Subjt:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI

Query:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
            NDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
Subjt:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA

Query:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
        SAVGSL+                   TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
Subjt:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA

Query:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERKGRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVS
        VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERKGRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVS
Subjt:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERKGRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVS

Query:  ETHTTYMDFNERDDVPLARLLKKGLFSNVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDK
        ETHTTYMDFNERDDVPLARLLKKGLFSNVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDK
Subjt:  ETHTTYMDFNERDDVPLARLLKKGLFSNVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDK

Query:  TTENVGRNDVPVDESIIIDAHVESTNTCATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKY
        TTENVGRNDVPVDESIIIDAHVESTNTCATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKY
Subjt:  TTENVGRNDVPVDESIIIDAHVESTNTCATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKY

Query:  VVQRRIADEVNISDKHHSCLSIMGLIEKAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKT
        VVQRRIADEVNISDKHHSCLSIMGLIEKA                                                         VLTTLDAPRPEPKT
Subjt:  VVQRRIADEVNISDKHHSCLSIMGLIEKAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKT

Query:  LSLSYRLFQGSHVLDIEHDMQPSRDPRIFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
        LSLSYRLFQGSHVLDIEHDMQPSRDPRIFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
Subjt:  LSLSYRLFQGSHVLDIEHDMQPSRDPRIFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TZD0 Gag/pol protein (e value: 7.2e-142; % identity: 65.78)
Query:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------
        KVQTFKARLVAKGYTQREGVDYEET SPVAMLKSIRILL IATFYDYEIWQMDVKTA+LN NLEESI+M+Q EGFI  G EQK+C L +SIY        
Subjt:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------

Query:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML
                                               DDILLI NDVGYLTD+K WLA +FQMKDLG+AQYVLGIQI+R+RK++TLA+SQA+YIDK+L
Subjt:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML

Query:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT
         RY MQNSKKGLL +R+G+HLSK+Q PKTPQEVEDMR+ PYASAVGSL+                                         T+DYMLVYG 
Subjt:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT

Query:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP
        KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNGGAVVWRS+KQ C+ DSTMEAEYVAAC+AAKEAVWLRKFL D+E+VPNM+LPITLYCDNSGAVANS+EP
Subjt:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP

Query:  RSHKRGKHMERK
        RSHKRGKH+ERK
Subjt:  RSHKRGKHMERK

A0A5A7UYE8 Gag/pol protein (e value: 7.2e-142; % identity: 65.78)
Query:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------
        KVQTFKARLVAKGYTQREGVDYEET SPVAMLKSIRILL IATFYDYEIWQMDVKTA+LN NLEESI+M+Q EGFI  G EQK+C L +SIY        
Subjt:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------

Query:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML
                                               DDILLI NDVGYLTD+K WLA +FQMKDLG+AQYVLGIQI+R+RK++TLA+SQA+YIDK+L
Subjt:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML

Query:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT
         RY MQNSKKGLL +R+G+HLSK+Q PKTPQEVEDMR+ PYASAVGSL+                                         T+DYMLVYG 
Subjt:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT

Query:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP
        KDLILTGYTDSDFQTD+D+RKSTSGSVFTLNGGAVVWRS+KQ C+ DSTMEAEYVAAC+AAKEAVWLRKFL D+E+VPNM+LPITLYCDNSGAVANS+EP
Subjt:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP

Query:  RSHKRGKHMERK
        RSHKRGKH+ERK
Subjt:  RSHKRGKHMERK

A0A5A7VE22 Gag/pol protein (e value: 0.0e+00; % identity: 79.56)
Query:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI
        MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEG                     
Subjt:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI

Query:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
            NDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
Subjt:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA

Query:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
        SAVGSL+                   TKDYMLVYGTKDLILTGYTDSDFQTDRDARKS SGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
Subjt:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA

Query:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK----------------------------------------------------
        VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK                                                    
Subjt:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERK----------------------------------------------------

Query:  ---------------------GRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVSETHTTYMDFNERDDVPLARLLKKGLFS
                             GRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVSETHTTYMDFNERDDVPLARLLKKGLFS
Subjt:  ---------------------GRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVSETHTTYMDFNERDDVPLARLLKKGLFS

Query:  NVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDKTTENVGRNDVPVDESIIIDAHVESTNT
        NVVPAKS+DPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDAS+TDHRLVPDSNPIDKTTENVGRNDVPVDESIIIDAHVESTNT
Subjt:  NVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDKTTENVGRNDVPVDESIIIDAHVESTNT

Query:  CATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKYVVQRRIADEVNISDKHHSCLSIMGLIE
        CATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKYVVQRRIADEVNISDKHHSCLSIMGLIE
Subjt:  CATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKYVVQRRIADEVNISDKHHSCLSIMGLIE

Query:  KAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKTLSLSYRLFQGSHVLDIEHDMQPSRDPR
        KA                                                         VLTTLDAPRPEPKTLSLSYRLFQGSHVLDIEHDMQPSRDPR
Subjt:  KAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKTLSLSYRLFQGSHVLDIEHDMQPSRDPR

Query:  IFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
        IFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
Subjt:  IFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE

A0A5D3E0G7 Gag/pol protein (e value: 0.0e+00; % identity: 87.22)
Query:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI
        MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEG                     
Subjt:  MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDI

Query:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
            NDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA
Subjt:  LLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYA

Query:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
        SAVGSL+                   TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA
Subjt:  SAVGSLI-------------------TKDYMLVYGTKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEA

Query:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERKGRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVS
        VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERKGRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVS
Subjt:  VWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERKGRYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVS

Query:  ETHTTYMDFNERDDVPLARLLKKGLFSNVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDK
        ETHTTYMDFNERDDVPLARLLKKGLFSNVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDK
Subjt:  ETHTTYMDFNERDDVPLARLLKKGLFSNVVPAKSVDPITSTHSHESSSYKDIFVPTPSHSPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDK

Query:  TTENVGRNDVPVDESIIIDAHVESTNTCATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKY
        TTENVGRNDVPVDESIIIDAHVESTNTCATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKY
Subjt:  TTENVGRNDVPVDESIIIDAHVESTNTCATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTKTRRKKIPSNIPSVPIDGISFHLEESVQRWKY

Query:  VVQRRIADEVNISDKHHSCLSIMGLIEKAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKT
        VVQRRIADEVNISDKHHSCLSIMGLIEKA                                                         VLTTLDAPRPEPKT
Subjt:  VVQRRIADEVNISDKHHSCLSIMGLIEKAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINPAVINEFLVLTTLDAPRPEPKT

Query:  LSLSYRLFQGSHVLDIEHDMQPSRDPRIFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
        LSLSYRLFQGSHVLDIEHDMQPSRDPRIFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE
Subjt:  LSLSYRLFQGSHVLDIEHDMQPSRDPRIFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPSTSTGKHGLE

E2GK51 Gag/pol protein (Fragment) (e value: 4.5e-144; % identity: 67.23)
Query:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------
        KVQTFKARLVAKGYTQ+EGVDYEET SPVAMLKSIRILL IATFY+YEIWQMDVKTA+LN NLEESIYM Q EGFI    EQK+C LQKSIY        
Subjt:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------

Query:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML
                                               DDILLI NDV YLTD+KKWL  +FQMKDLG+AQY+LGIQIVRNRK++TLAMSQASYIDK+L
Subjt:  ---------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKML

Query:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT
        SRYKMQNSKKG L +R+GIHLSK+QCPKTPQEVEDMR  PY+SAVGSL+                                         T++YMLVYG 
Subjt:  SRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYGT

Query:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP
        KDLILTGYTDSDFQ+D+DARKSTSGSVFTLNGGAVVWRSVKQ+C+ DSTMEAEYVAAC+AAKEAVWLRKFLTD+E+VPNMHLPITLYCDNSGAVANS+EP
Subjt:  KDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREP

Query:  RSHKRGKHMERK
        RSHKRGKH+ERK
Subjt:  RSHKRGKHMERK

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 8.7e-36; % identity: 27.74)
Query:  FKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY------------
        +KARLVA+G+TQ+  +DYEET +PVA + S R +L +   Y+ ++ QMDVKTA+LN  L+E IYM   +G   N     +C L K+IY            
Subjt:  FKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY------------

Query:  -------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSR
                                             DD+++   D+  + + K++L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ K+LS+
Subjt:  -------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSR

Query:  YKMQN----------------------------SKKGLLLY---------RYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLITKDYMLVYGTKDLI--
        + M+N                            S  G L+Y            +++  +   K   E+    K       G++   D  L++  K+L   
Subjt:  YKMQN----------------------------SKKGLLLY---------RYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLITKDYMLVYGTKDLI--

Query:  --LTGYTDSDFQTDRDARKSTSGSVFTL-NGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPR
          + GY DSD+      RKST+G +F + +   + W + +Q+ +  S+ EAEY+A  +A +EA+WL+  LT + I   +  PI +Y DN G ++ +  P 
Subjt:  --LTGYTDSDFQTDRDARKSTSGSVFTL-NGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPR

Query:  SHKRGKHMERK
         HKR KH++ K
Subjt:  SHKRGKHMERK

P0CV72 Secreted RxLR effector protein 161 (e value: 6.3e-10; % identity: 52.38)
Query:  LTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWL
        L GY+D+D+  D ++R+STSG +F LNGG V WRS KQ  +  S+ E EY+A  +A +EAVWL
Subjt:  LTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.6e-61; % identity: 35.28)
Query:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------
        K+  +KARLV KG+ Q++G+D++E  SPV  + SIR +L +A   D E+ Q+DVKTA+L+ +LEE IYM Q EGF   G +  +C L KS+Y        
Subjt:  KVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY--------

Query:  ----------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKM
                                                DD+L++  D G +  +K  L+  F MKDLG AQ +LG++IVR R SR L +SQ  YI+++
Subjt:  ----------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKM

Query:  LSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYG
        L R+ M+N+K         + LSKK CP T +E  +M K PY+SAVGSL+                                         T    L +G
Subjt:  LSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----------------------------------------TKDYMLVYG

Query:  TKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSRE
          D IL GYTD+D   D D RKS++G +FT +GGA+ W+S  Q C+  ST EAEY+AA +  KE +WL++FL ++ +    ++   +YCD+  A+  S+ 
Subjt:  TKDLILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSRE

Query:  PRSHKRGKHME
           H R KH++
Subjt:  PRSHKRGKHME

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.1e-17; % identity: 32.68)
Query:  FKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY------------
        +KARLVAKGY QR G+DY ET SPV    SIRI+L +A    + I Q+DV  A+L   L + +YM+Q  GFI       +C L+K++Y            
Subjt:  FKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY------------

Query:  -----------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYK
                                           DDIL+  ND   L +    L+  F +KD  +  Y LGI+    R    L +SQ  YI  +L+R  
Subjt:  -----------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYK

Query:  MQNSK
        M  +K
Subjt:  MQNSK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.4e-17; % identity: 30.74)
Query:  FKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY------------
        +KARLVAKGY QR G+DY ET SPV    SIRI+L +A    + I Q+DV  A+L   L + +YM+Q  GF+       +C L+K+IY            
Subjt:  FKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIY------------

Query:  -----------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYK
                                           DDIL+  ND   L      L+  F +K+  D  Y LGI+    R  + L +SQ  Y   +L+R  
Subjt:  -----------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYK

Query:  MQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSL
        M  +K           L+     K P   E      Y   VGSL
Subjt:  MQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSL

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 3.2e-33; % identity: 25.8)
Query:  VQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLE----QKLCNLQKSIY-----
        ++ +KARLVAKGYTQ+EG+D+ ET SPV  L S++++L I+  Y++ + Q+D+  A+LN +L+E IYM    G+     +      +C L+KSIY     
Subjt:  VQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLE----QKLCNLQKSIY-----

Query:  ------------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYID
                                                  DDI++  N+   + ++K  L   F+++DLG  +Y LG++I R+  +  + + Q  Y  
Subjt:  ------------------------------------------DDILLIRNDVGYLTDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYID

Query:  KMLSRYKMQNSKK-----------------------------GLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----TKDYMLVYGTK-DL
         +L    +   K                              G L+Y     L          +  +  +  +  AV  ++     T    L Y ++ ++
Subjt:  KMLSRYKMQNSKK-----------------------------GLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLI-----TKDYMLVYGTK-DL

Query:  ILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSH
         L  ++D+ FQ+ +D R+ST+G    L    + W+S KQ  ++ S+ EAEY A   A  E +WL +F  ++++   +  P  L+CDN+ A+  +     H
Subjt:  ILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSH

Query:  KRGKHME
        +R KH+E
Subjt:  KRGKHME


Sequences
CDS sequence
ATGAGACCAAGCTGGAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAAAGAGAAGGAGTGGACTATGAAGAAACATTATCTCCTGTTGCCATGCT
TAAGTCGATTAGAATACTCTTATTCATTGCCACCTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTATTTGAATGAAAATCTTGAGGAGAGTATCTATA
TGGCTCAACTAGAGGGGTTTATTAAAAATGGTCTAGAACAAAAACTTTGTAATCTTCAGAAATCCATTTATGATGATATTCTACTCATTAGGAATGACGTAGGTTATCTT
ACTGACATCAAGAAATGGCTAGCTATGGAATTTCAAATGAAAGATTTGGGAGATGCACAATATGTTCTTGGAATTCAAATTGTTAGGAACCGTAAAAGTAGAACACTGGC
CATGTCTCAAGCATCTTATATTGACAAAATGTTGTCAAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGCTATACAGATATGGAATTCATTTGTCAAAGAAACAAT
GTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAAATTTCCCTATGCTTCCGCTGTGGGAAGTTTGATAACGAAAGACTACATGCTTGTTTATGGTACTAAGGATCTG
ATCCTTACTGGATACACTGATTCTGATTTCCAAACGGATAGAGATGCTAGAAAGTCTACATCAGGATCAGTATTCACTCTAAACGGAGGAGCAGTAGTATGGAGAAGCGT
AAAACAAAGTTGTATGACCGACTCTACAATGGAAGCTGAATATGTTGCAGCTTGCAAAGCAGCAAAGGAAGCAGTATGGCTAAGAAAATTTTTAACTGATATGGAAATTG
TTCCAAATATGCATCTGCCAATCACTTTATACTGTGACAATAGTGGTGCAGTTGCAAATTCACGAGAACCTAGAAGCCATAAACGGGGAAAGCATATGGAACGCAAGGGT
CGCTATTTCAAAAGCACTCCTCCACGAAGACCGTATCGCCTTCCATCGAAGAAACCTCAGGTAAATGTCTCTAAAAGCTCTTATTTGCATGTGCATGCTGAGTTTATTGC
TGAATGTGTTGCGGAGGATGTTGAAACTGCGCCAAGTGTATCTGAGACTCACACTACCTATATGGATTTCAATGAGCGTGATGATGTTCCTTTAGCTAGATTGTTGAAGA
AAGGGTTATTTTCCAATGTTGTCCCCGCTAAATCTGTTGATCCTATCACATCAACTCATTCTCATGAGAGCTCTTCCTATAAAGATATTTTTGTTCCAACACCTAGTCAT
TCGCCTACGACAAATGAAGGAGCTGGTCAATCGGGTCCATCTCCACTGGTTAAGTCATCGATTCAGGTCGATGCTTCTATTACTGATCATCGTTTAGTGCCTGATTCTAA
TCCTATTGATAAGACTACTGAGAATGTGGGAAGGAATGATGTTCCTGTTGATGAGTCTATTATTATTGATGCACATGTTGAATCTACTAATACTTGTGCAACCGATACTA
TTGAACCAGATGTTAATGATGAGCTTCAACCCGAGACTCAACAATCTCCTGGTGTATCTAGGCCAAAGGGAAAGAAGTTTCAGCAAAATCGACGAAATATTACTACAAAG
ACTAGAAGGAAGAAGATACCTTCAAACATTCCATCTGTTCCCATTGATGGAATCTCGTTTCATCTCGAAGAAAGTGTTCAAAGATGGAAATATGTTGTGCAGAGAAGGAT
TGCAGATGAGGTAAATATTTCTGATAAGCATCATTCTTGTTTGAGTATCATGGGTCTTATTGAAAAGGCAGGTCTCTTTAAGACAATATCAAATGTCGGGCCCTTCTATC
CTCGATTGATTAGGAAATTCATTGTAAATTTTCCTTCTGATTTTAATGATCCCAGTAGTCTAGACTATCAGACAGTTCACATTCGGGGTTTAAAATTCAAGATTAATCCT
GCAGTAATTAATGAGTTTCTGGTCCTAACGACGCTTGATGCCCCTAGACCTGAACCAAAGACTTTATCTCTAAGTTACAGACTCTTTCAGGGAAGTCATGTTCTAGATAT
AGAACATGACATGCAACCTTCAAGGGATCCTCGTATTTTTGACACAGAAGATTTGGATGAGGGTGCTGAAGGCTTCTTTGTTCATCAAAACTTAGCCTCTAGAATTGTTA
ATACACTTACAGCTGAGTCTCGCGCTCTCTCTACTTCCATTAACTTGTTCTCTGAACGGCGGGTGGAGGTTGATTTGTTTATTCGCCACTTGAAAACGTTGACTCCTTCT
ACTAGTACTGGGAAGCATGGTCTTGAGTGA
mRNA sequence
ATGAGACCAAGCTGGAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAAAGAGAAGGAGTGGACTATGAAGAAACATTATCTCCTGTTGCCATGCT
TAAGTCGATTAGAATACTCTTATTCATTGCCACCTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCCTATTTGAATGAAAATCTTGAGGAGAGTATCTATA
TGGCTCAACTAGAGGGGTTTATTAAAAATGGTCTAGAACAAAAACTTTGTAATCTTCAGAAATCCATTTATGATGATATTCTACTCATTAGGAATGACGTAGGTTATCTT
ACTGACATCAAGAAATGGCTAGCTATGGAATTTCAAATGAAAGATTTGGGAGATGCACAATATGTTCTTGGAATTCAAATTGTTAGGAACCGTAAAAGTAGAACACTGGC
CATGTCTCAAGCATCTTATATTGACAAAATGTTGTCAAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGCTATACAGATATGGAATTCATTTGTCAAAGAAACAAT
GTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAAATTTCCCTATGCTTCCGCTGTGGGAAGTTTGATAACGAAAGACTACATGCTTGTTTATGGTACTAAGGATCTG
ATCCTTACTGGATACACTGATTCTGATTTCCAAACGGATAGAGATGCTAGAAAGTCTACATCAGGATCAGTATTCACTCTAAACGGAGGAGCAGTAGTATGGAGAAGCGT
AAAACAAAGTTGTATGACCGACTCTACAATGGAAGCTGAATATGTTGCAGCTTGCAAAGCAGCAAAGGAAGCAGTATGGCTAAGAAAATTTTTAACTGATATGGAAATTG
TTCCAAATATGCATCTGCCAATCACTTTATACTGTGACAATAGTGGTGCAGTTGCAAATTCACGAGAACCTAGAAGCCATAAACGGGGAAAGCATATGGAACGCAAGGGT
CGCTATTTCAAAAGCACTCCTCCACGAAGACCGTATCGCCTTCCATCGAAGAAACCTCAGGTAAATGTCTCTAAAAGCTCTTATTTGCATGTGCATGCTGAGTTTATTGC
TGAATGTGTTGCGGAGGATGTTGAAACTGCGCCAAGTGTATCTGAGACTCACACTACCTATATGGATTTCAATGAGCGTGATGATGTTCCTTTAGCTAGATTGTTGAAGA
AAGGGTTATTTTCCAATGTTGTCCCCGCTAAATCTGTTGATCCTATCACATCAACTCATTCTCATGAGAGCTCTTCCTATAAAGATATTTTTGTTCCAACACCTAGTCAT
TCGCCTACGACAAATGAAGGAGCTGGTCAATCGGGTCCATCTCCACTGGTTAAGTCATCGATTCAGGTCGATGCTTCTATTACTGATCATCGTTTAGTGCCTGATTCTAA
TCCTATTGATAAGACTACTGAGAATGTGGGAAGGAATGATGTTCCTGTTGATGAGTCTATTATTATTGATGCACATGTTGAATCTACTAATACTTGTGCAACCGATACTA
TTGAACCAGATGTTAATGATGAGCTTCAACCCGAGACTCAACAATCTCCTGGTGTATCTAGGCCAAAGGGAAAGAAGTTTCAGCAAAATCGACGAAATATTACTACAAAG
ACTAGAAGGAAGAAGATACCTTCAAACATTCCATCTGTTCCCATTGATGGAATCTCGTTTCATCTCGAAGAAAGTGTTCAAAGATGGAAATATGTTGTGCAGAGAAGGAT
TGCAGATGAGGTAAATATTTCTGATAAGCATCATTCTTGTTTGAGTATCATGGGTCTTATTGAAAAGGCAGGTCTCTTTAAGACAATATCAAATGTCGGGCCCTTCTATC
CTCGATTGATTAGGAAATTCATTGTAAATTTTCCTTCTGATTTTAATGATCCCAGTAGTCTAGACTATCAGACAGTTCACATTCGGGGTTTAAAATTCAAGATTAATCCT
GCAGTAATTAATGAGTTTCTGGTCCTAACGACGCTTGATGCCCCTAGACCTGAACCAAAGACTTTATCTCTAAGTTACAGACTCTTTCAGGGAAGTCATGTTCTAGATAT
AGAACATGACATGCAACCTTCAAGGGATCCTCGTATTTTTGACACAGAAGATTTGGATGAGGGTGCTGAAGGCTTCTTTGTTCATCAAAACTTAGCCTCTAGAATTGTTA
ATACACTTACAGCTGAGTCTCGCGCTCTCTCTACTTCCATTAACTTGTTCTCTGAACGGCGGGTGGAGGTTGATTTGTTTATTCGCCACTTGAAAACGTTGACTCCTTCT
ACTAGTACTGGGAAGCATGGTCTTGAGTGA
Protein sequence
MRPSWKVQTFKARLVAKGYTQREGVDYEETLSPVAMLKSIRILLFIATFYDYEIWQMDVKTAYLNENLEESIYMAQLEGFIKNGLEQKLCNLQKSIYDDILLIRNDVGYL
TDIKKWLAMEFQMKDLGDAQYVLGIQIVRNRKSRTLAMSQASYIDKMLSRYKMQNSKKGLLLYRYGIHLSKKQCPKTPQEVEDMRKFPYASAVGSLITKDYMLVYGTKDL
ILTGYTDSDFQTDRDARKSTSGSVFTLNGGAVVWRSVKQSCMTDSTMEAEYVAACKAAKEAVWLRKFLTDMEIVPNMHLPITLYCDNSGAVANSREPRSHKRGKHMERKG
RYFKSTPPRRPYRLPSKKPQVNVSKSSYLHVHAEFIAECVAEDVETAPSVSETHTTYMDFNERDDVPLARLLKKGLFSNVVPAKSVDPITSTHSHESSSYKDIFVPTPSH
SPTTNEGAGQSGPSPLVKSSIQVDASITDHRLVPDSNPIDKTTENVGRNDVPVDESIIIDAHVESTNTCATDTIEPDVNDELQPETQQSPGVSRPKGKKFQQNRRNITTK
TRRKKIPSNIPSVPIDGISFHLEESVQRWKYVVQRRIADEVNISDKHHSCLSIMGLIEKAGLFKTISNVGPFYPRLIRKFIVNFPSDFNDPSSLDYQTVHIRGLKFKINP
AVINEFLVLTTLDAPRPEPKTLSLSYRLFQGSHVLDIEHDMQPSRDPRIFDTEDLDEGAEGFFVHQNLASRIVNTLTAESRALSTSINLFSERRVEVDLFIRHLKTLTPS
TSTGKHGLE