; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0241221 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0241221
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr09:3263714..3265825
RNA-Seq ExpressionCmc09g0241221
SyntenyCmc09g0241221
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-30979.23Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISAR VGD KLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAF  KNG HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINLDRI RLVKNGLLN+L++ SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKAR  FEYFISF
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
        IDDY RYGYLYLMEHKS+ALEKFK+YK EVENLLSKKIKILRSDRGGEYMDLRFQDYM++H IQSQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAVETAV+ILNNVPSKSVSETPFELWR RKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------
        MR+HKPRSKLVL+EATD STRVVDEVG SSRVD+TTTS                                                              
Subjt:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------

Query:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG
                                 GCKWIYKRKRDSAGKV TFKARLVAKGYTQRE VDYEETFSPVAMLKSIRILLSIA FYDYEIWQMDVKT FLNG
Subjt:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLN SIY LKQASRS NIRFDT IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI N+V YLTDVK WLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA

Query:  QFQ
        QFQ
Subjt:  QFQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-30578.24Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISAR VGD KLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAF  KNG HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINLDRI RLVK+GLLN+L++ SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKAR  FEYFISF
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
        IDDY RYGYLYLMEHKS+ALEKFK+YK EVENLLSKKIKI RSDRGGEYMDL FQDYM++H IQSQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAVETAV+ILNNVPSKSVSETPFELWR RKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------
        MR+HKPRSKLVL+EATD STRVVDEVG SSRVD+TTTS                                                              
Subjt:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------

Query:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG
                                 GCKWIYKRKRDSAGKV TFKARLVAKGYT++E VDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKT FLNG
Subjt:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLN SIY LKQASRS NIRFDT IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI N+V YLTDVK WLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA

Query:  QFQ
        QFQ
Subjt:  QFQ

KAA0037509.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-27595.61Show/hide
Query:  AKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEP
        AKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEP
Subjt:  AKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEP

Query:  LELIHSDLCGSMNVKAREDFEYFISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQ
        LELIHSDLCGSMNVKAREDFEYFISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQ
Subjt:  LELIHSDLCGSMNVKAREDFEYFISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQ

Query:  QNGVSERRNRTLLDMGYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFV
        QNG            GYAVETAVYILNNVPSKSVSET FELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFV
Subjt:  QNGVSERRNRTLLDMGYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFV

Query:  STNATFLKEDHMRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS---------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSP
        STNATFLKEDHMRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS         GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSP
Subjt:  STNATFLKEDHMRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS---------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSP

Query:  VAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKIN
        VAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKIN
Subjt:  VAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKIN

Query:  K
        K
Subjt:  K

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-24763.93Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA  VG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ F  KNG  ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
          TQNKR +ISP  N +LWHLRLGHINL+RIERLVKNGLL+ELE +SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCG MNVKAR  FEYFI+F
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
         DDY RYGY+YLM+HKS+ALEKFK+YKAEVEN LSK IK  RSDRGGEYMDL+FQ+Y+++  I SQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAV+TAVYILN VPSKSVSETP +LW  RK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------
        +R+HKPRSK+VLN    E T+ STRVV+E         VG S+R  Q  +                                                  
Subjt:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------

Query:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT
                                     GCKWIYKRKR + GKV TFKARLVAKGYTQ E VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKT 
Subjt:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLN SIY LKQASRS NIRFDT IKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLI N++  LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT

Query:  WLAAQFQ
        WLA QFQ
Subjt:  WLAAQFQ

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-24763.93Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA  VG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ F  KNG  ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
          TQNKR +ISP  N +LWHLRLGHINL+RIERLVKNGLL+ELE +SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCG MNVKAR  FEYFI+F
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
         DDY RYGY+YLM+HKS+ALEKFK+YKAEVEN LSK IK  RSDRGGEYMDL+FQ+Y+++  I SQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAV+TAVYILN VPSKSVSETP +LW  RK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------
        +R+HKPRSK+VLN    E T+ STRVV+E         VG S+R  Q  +                                                  
Subjt:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------

Query:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT
                                     GCKWIYKRKR + GKV TFKARLVAKGYTQ E VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKT 
Subjt:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLN SIY LKQASRS NIRFDT IKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLI N++  LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT

Query:  WLAAQFQ
        WLA QFQ
Subjt:  WLAAQFQ

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein7.5e-24863.93Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA  VG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ F  KNG  ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
          TQNKR +ISP  N +LWHLRLGHINL+RIERLVKNGLL+ELE +SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCG MNVKAR  FEYFI+F
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
         DDY RYGY+YLM+HKS+ALEKFK+YKAEVEN LSK IK  RSDRGGEYMDL+FQ+Y+++  I SQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAV+TAVYILN VPSKSVSETP +LW  RK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------
        +R+HKPRSK+VLN    E T+ STRVV+E         VG S+R  Q  +                                                  
Subjt:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------

Query:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT
                                     GCKWIYKRKR + GKV TFKARLVAKGYTQ E VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKT 
Subjt:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLN SIY LKQASRS NIRFDT IKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLI N++  LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT

Query:  WLAAQFQ
        WLA QFQ
Subjt:  WLAAQFQ

A0A5A7T2V9 Gag/pol protein1.3e-30578.24Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISAR VGD KLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAF  KNG HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINLDRI RLVK+GLLN+L++ SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKAR  FEYFISF
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
        IDDY RYGYLYLMEHKS+ALEKFK+YK EVENLLSKKIKI RSDRGGEYMDL FQDYM++H IQSQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAVETAV+ILNNVPSKSVSETPFELWR RKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------
        MR+HKPRSKLVL+EATD STRVVDEVG SSRVD+TTTS                                                              
Subjt:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------

Query:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG
                                 GCKWIYKRKRDSAGKV TFKARLVAKGYT++E VDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKT FLNG
Subjt:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLN SIY LKQASRS NIRFDT IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI N+V YLTDVK WLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA

Query:  QFQ
        QFQ
Subjt:  QFQ

A0A5A7T820 Gag/pol protein8.5e-27695.61Show/hide
Query:  AKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEP
        AKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEP
Subjt:  AKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEP

Query:  LELIHSDLCGSMNVKAREDFEYFISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQ
        LELIHSDLCGSMNVKAREDFEYFISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQ
Subjt:  LELIHSDLCGSMNVKAREDFEYFISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQ

Query:  QNGVSERRNRTLLDMGYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFV
        QNG            GYAVETAVYILNNVPSKSVSET FELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFV
Subjt:  QNGVSERRNRTLLDMGYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFV

Query:  STNATFLKEDHMRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS---------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSP
        STNATFLKEDHMRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS         GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSP
Subjt:  STNATFLKEDHMRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS---------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSP

Query:  VAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKIN
        VAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKIN
Subjt:  VAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKIN

Query:  K
        K
Subjt:  K

A0A5A7TZD0 Gag/pol protein6.7e-31079.23Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISAR VGD KLFF NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAF  KNG HICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
         NTQNKRQRISPNNNTYLWHLRLGHINLDRI RLVKNGLLN+L++ SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCG MNVKAR  FEYFISF
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
        IDDY RYGYLYLMEHKS+ALEKFK+YK EVENLLSKKIKILRSDRGGEYMDLRFQDYM++H IQSQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAVETAV+ILNNVPSKSVSETPFELWR RKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------
        MR+HKPRSKLVL+EATD STRVVDEVG SSRVD+TTTS                                                              
Subjt:  MRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTS--------------------------------------------------------------

Query:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG
                                 GCKWIYKRKRDSAGKV TFKARLVAKGYTQRE VDYEETFSPVAMLKSIRILLSIA FYDYEIWQMDVKT FLNG
Subjt:  -------------------------GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNG

Query:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA
        NLEESIFMSQPEGFITQGQEQKVCKLN SIY LKQASRS NIRFDT IKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLI N+V YLTDVK WLAA
Subjt:  NLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAA

Query:  QFQ
        QFQ
Subjt:  QFQ

A0A5D3CPJ6 Gag/pol protein7.5e-24863.93Show/hide
Query:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MT++VGTG V+SA  VG  +L  +  F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ F  KNG  ICSAKLENNLYVLR   +KA+LN EMF+T
Subjt:  MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF
          TQNKR +ISP  N +LWHLRLGHINL+RIERLVKNGLL+ELE +SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HSDLCG MNVKAR  FEYFI+F
Subjt:  TNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISF

Query:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------
         DDY RYGY+YLM+HKS+ALEKFK+YKAEVEN LSK IK  RSDRGGEYMDL+FQ+Y+++  I SQLSAP T QQNGVSERRNRTLLDM           
Subjt:  IDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDM-----------

Query:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
            GYAV+TAVYILN VPSKSVSETP +LW  RK SL HFRIWGCPAHVL  NPKKLEPRS+LC FVGYPK TRGG F+DP++N+VFVSTNATFL+EDH
Subjt:  ----GYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------
        +R+HKPRSK+VLN    E T+ STRVV+E         VG S+R  Q  +                                                  
Subjt:  MRDHKPRSKLVLN----EATDGSTRVVDE---------VGLSSRVDQTTT--------------------------------------------------

Query:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT
                                     GCKWIYKRKR + GKV TFKARLVAKGYTQ E VDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKT 
Subjt:  ----------------------------SGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTT

Query:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT
        FLNGNLEE+I+M QPEGFI  GQEQK+CKLN SIY LKQASRS NIRFDT IKSYGFDQ VDEPCVYK+I    VAFLVLYVDDILLI N++  LTD+K 
Subjt:  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKT

Query:  WLAAQFQ
        WLA QFQ
Subjt:  WLAAQFQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.5e-6126.08Show/hide
Query:  GDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTTNTQNK
        G+ I A   G  +L  +++ + LE++    +   NL+SV  L E   SI F  +    SKNG  +  ++ + NN+          V+N + + + N ++K
Subjt:  GDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTTNTQNK

Query:  RQRISPNNNTYLWHLRLGHIN------LDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGSMNVKAREDFEYFI
               NN  LWH R GHI+      + R        LLN LE  S   CE CL GK  + PF     +   K PL ++HSD+CG +     +D  YF+
Subjt:  RQRISPNNNTYLWHLRLGHIN------LDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGSMNVKAREDFEYFI

Query:  SFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLD----------
         F+D +  Y   YL+++KS     F+D+ A+ E   + K+  L  D G EY+    + + +K  I   L+ P T Q NGVSER  RT+ +          
Subjt:  SFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLD----------

Query:  -----MGYAVETAVYILNNVPSKSV---SETPFELWRERKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFD-------------
              G AV TA Y++N +PS+++   S+TP+E+W  +KP L H R++G   +V + N + K + +S    FVGY  E  G   +D             
Subjt:  -----MGYAVETAVYILNNVPSKSV---SETPFELWRERKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFD-------------

Query:  ------------------------------PQENRVFVST----------NATFLKE-------------------------------DHMRDHKPRSKL
                                      P ++R  + T          N  FLK+                                 ++D K  +K 
Subjt:  ------------------------------PQENRVFVST----------NATFLKE-------------------------------DHMRDHKPRSKL

Query:  VLNEA----------------------------------------------------------------TDGSTRVV---------------DEV-----
         LNE+                                                                 D S   V               DE+     
Subjt:  VLNEA----------------------------------------------------------------TDGSTRVV---------------DEV-----

Query:  ---------------------GLSSRVDQTTTSGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMD
                              ++ R +       +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+ I Y+ ++ QMD
Subjt:  ---------------------GLSSRVDQTTTSGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMD

Query:  VKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIRNNVRY
        VKT FLNG L+E I+M  P+G         VCKLN +IY LKQA+R     F+  +K   F  +  + C+Y   K N  +  +++LYVDD+++   ++  
Subjt:  VKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIRNNVRY

Query:  LTDVKTWLAAQFQ
        + + K +L  +F+
Subjt:  LTDVKTWLAAQFQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-8128.69Show/hide
Query:  GTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRTTN--T
        G GD+     VG T        + L+++  VP ++ NL+S   L    Y   F+ N+ +    G+ +                AK V    ++RT     
Subjt:  GTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRTTN--T

Query:  QNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISFIDD
        Q +        +  LWH R+GH++   ++ L K  L++  +  ++ PC+ CL GK  +  F     R    L+L++SD+CG M +++    +YF++FIDD
Subjt:  QNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISFIDD

Query:  YLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLD---------------
          R  ++Y+++ K +  + F+ + A VE    +K+K LRSD GGEY    F++Y   H I+ + + P T Q NGV+ER NRT+++               
Subjt:  YLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLD---------------

Query:  MGYAVETAVYILNNVPSKSVS-ETPFELWRERKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH
         G AV+TA Y++N  PS  ++ E P  +W  ++ S SH +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + +V  S +  F +E  
Subjt:  MGYAVETAVYILNNVPSKSVS-ETPFELWRERKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDH

Query:  MRDHKPRSKLVLN----------------EATDGSTRVVDEVG---------------------------------------------------------
        +R     S+ V N                 + + +T  V E G                                                         
Subjt:  MRDHKPRSKLVLN----------------EATDGSTRVVDEVG---------------------------------------------------------

Query:  ---------------------LSSRVDQTTTSG---------------CKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILL
                             +   ++    +G               CKW++K K+D   K+  +KARLV KG+ Q++ +D++E FSPV  + SIR +L
Subjt:  ---------------------LSSRVDQTTTSG---------------CKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILL

Query:  SIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVY-KKINKGKVAFLVLY
        S+A   D E+ Q+DVKT FL+G+LEE I+M QPEGF   G++  VCKLN S+Y LKQA R   ++FD+ +KS  + +   +PCVY K+ ++     L+LY
Subjt:  SIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVY-KKINKGKVAFLVLY

Query:  VDDILLIRNNVRYLTDVKTWLAAQF
        VDD+L++  +   +  +K  L+  F
Subjt:  VDDILLIRNNVRYLTDVKTWLAAQF

Q12491 Transposon Ty2-B Gag-Pol polyprotein5.0e-1526.28Show/hide
Query:  ISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRI
        I    +G+    F+N           P I  +L+S+S L     +  F+ N      +G  +       + Y L  ++   + +H    T N  NK +  
Subjt:  ISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRI

Query:  SPNNNTY-LWHLRLGHINLDRIERLVKNGLLN-------ELENDSLPPCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGSMNVKAREDFEYF
        S N   Y L H  LGH N   I++ +K   +        E  N S   C  CL GK TK     KG R K     EP + +H+D+ G ++   +    YF
Subjt:  SPNNNTY-LWHLRLGHINLDRIERLVKNGLLN-------ELENDSLPPCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGSMNVKAREDFEYF

Query:  ISFIDDYLRYGYLYLM--EHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLD
        ISF D+  R+ ++Y +    +   L  F    A ++N  + ++ +++ DRG EY +     +     I +  +  + ++ +GV+ER NRTLL+
Subjt:  ISFIDDYLRYGYLYLM--EHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.4e-5122.48Show/hide
Query:  VGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMF
        V  G  I     G T L  +++ + L N+  VP I +NL+SV  L          +  +F + +  T   G  +   K ++ LY      ++ V    +F
Subjt:  VGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMF

Query:  RTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELE-NDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYF
         + +++              WH RLGH     +  ++ N  L+ L  +     C  CL  K  K PF+     +  PLE I+SD+  S  + + +++ Y+
Subjt:  RTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELE-NDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYF

Query:  ISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDMG-------
        + F+D + RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +H I    S P T + NG+SER++R +++ G       
Subjt:  ISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDMG-------

Query:  --------YAVETAVYILNNVPSKSVS-ETPFELWRERKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT
                YA   AVY++N +P+  +  E+PF+      P+    R++GC  +  +   N  KL+ +SR C F+GY       L    Q +R+++S +  
Subjt:  --------YAVETAVYILNNVPSKSVS-ETPFELWRERKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT

Query:  F---------------------------------------------LKEDH-------------------------------------------------
        F                                               + H                                                 
Subjt:  F---------------------------------------------LKEDH-------------------------------------------------

Query:  --------------------------------------------------------------------------------MRDH-------------KPR
                                                                                        +  H              P+
Subjt:  --------------------------------------------------------------------------------MRDH-------------KPR

Query:  SKLVLNEATDGSTRVVDEV--------GLSSRVD----------------QTTTSGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAM
          L ++ A +   R   +          + S ++                  T  GC+WI+ +K +S G ++ +KARLVAKGY QR  +DY ETFSPV  
Subjt:  SKLVLNEATDGSTRVVDEV--------GLSSRVD----------------QTTTSGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAM

Query:  LKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGK
          SIRI+L +A+   + I Q+DV   FL G L + ++MSQP GFI + +   VCKL  ++Y LKQA R+  +     + + GF  +V +  ++       
Subjt:  LKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGK

Query:  VAFLVLYVDDILLIRNNVRYLTDVKTWLAAQF
        + ++++YVDDIL+  N+   L +    L+ +F
Subjt:  VAFLVLYVDDILLIRNNVRYLTDVKTWLAAQF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-5122.49Show/hide
Query:  VGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMF
        +  G  I     G   L   ++ + L  +  VP I +NL+SV  L          +  +F + +  T   G  +   K ++ LY      ++AV    MF
Subjt:  VGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMF

Query:  RTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELE-NDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYF
         +  ++              WH RLGH +L  +  ++ N  L  L  +  L  C  C   K  K PF+     + +PLE I+SD+  S  + + +++ Y+
Subjt:  RTTNTQNKRQRISPNNNTYLWHLRLGHINLDRIERLVKNGLLNELE-NDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYF

Query:  ISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDMG-------
        + F+D + RY +LY ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +H I    S P T + NG+SER++R +++MG       
Subjt:  ISFIDDYLRYGYLYLMEHKSKALEKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDMG-------

Query:  --------YAVETAVYILNNVPSKSVS-ETPFELWRERKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT
                YA   AVY++N +P+  +  ++PF+    + P+    +++GC  +  +   N  KLE +S+ C F+GY       L       R++ S +  
Subjt:  --------YAVETAVYILNNVPSKSVS-ETPFELWRERKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT

Query:  F-------------------------------------------------------------------------------------------------LK
        F                                                                                                   
Subjt:  F-------------------------------------------------------------------------------------------------LK

Query:  EDHMRDHKPRSKLVLNEATDGS---------------------------------------------------------------------TRVVDEV--
        + H   +   +  +LN     S                                                                     TR  D +  
Subjt:  EDHMRDHKPRSKLVLNEATDGS---------------------------------------------------------------------TRVVDEV--

Query:  -------------------------------GLSSRVD----------------QTTTSGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFS
                                        + S ++                  T  GC+WI+ +K +S G ++ +KARLVAKGY QR  +DY ETFS
Subjt:  -------------------------------GLSSRVD----------------QTTTSGCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFS

Query:  PVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKI
        PV    SIRI+L +A+   + I Q+DV   FL G L + ++MSQP GF+ + +   VC+L  +IY LKQA R+  +   T + + GF  ++ +  ++   
Subjt:  PVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKI

Query:  NKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAAQF
            + ++++YVDDIL+  N+   L      L+ +F
Subjt:  NKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAAQF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-3641.76Show/hide
Query:  GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFIT-QGQE---Q
        GCKW+YK K +S G +  +KARLVAKGYTQ+E +D+ ETFSPV  L S++++L+I+  Y++ + Q+D+   FLNG+L+E I+M  P G+   QG      
Subjt:  GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFIT-QGQE---Q

Query:  KVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAAQFQ
         VC L  SIY LKQASR   ++F  T+  +GF Q+  +   + KI       +++YVDDI++  NN   + ++K+ L + F+
Subjt:  KVCKLNLSIYALKQASRSLNIRFDTTIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAAQFQ

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.9e-0940Show/hide
Query:  NNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNV
        + T LWH RL H++   +E LVK G L+  +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G+ +V
Subjt:  NNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.5e-0948.21Show/hide
Query:  GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIA
        GCKW++K K  S G +   KARLVAKG+ Q E + + ET+SPV    +IR +L++A
Subjt:  GCKWIYKRKRDSAGKVHTFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTCAAGGTTGGAACAGGAGATGTTATTTCAGCTCGTGTAGTAGGAGATACTAAGTTGTTTTTCGAAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCACTTCTAAGAATGGTGCACATATTTGTT
CAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTCAGAACTACTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGAAAGATTGGTAAAGAATGGACTTCTAAACGAGTTAGAAAATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTTCGATGAACGTAAAAGCTAGAGAGGATTTTGAATACTTCATCTCTTTTATAGATGATTATTTAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTAAAGCTCTT
GAAAAGTTCAAGGATTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAGGACTA
TATGTTAAAACATGAAATCCAATCTCAACTCTCAGCACCTAGCACAGCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACCTTGTTAGACATGGGGTATGCAGTAG
AGACTGCGGTTTATATCTTGAATAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGTTATGGAGAGAACGTAAACCTAGTTTAAGTCACTTCAGAATTTGGGGT
TGTCCAGCACACGTGTTGGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTCGATCC
ACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGAAAGAAGACCACATGAGAGATCATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGGAT
CAACAAGGGTTGTTGATGAAGTTGGTCTCTCGTCAAGAGTTGATCAAACCACCACATCAGGGTGTAAGTGGATCTATAAGAGAAAAAGAGATTCAGCTGGGAAGGTACAT
ACCTTCAAAGCTAGACTTGTAGCAAAAGGATATACCCAAAGGGAATGGGTTGACTATGAGGAAACTTTTTCCCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTC
CATCGCCATATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTACTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAA
CCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCTATCCATTTATGCGTTGAAACAAGCATCTAGATCTTTGAACATTAGGTTTGATACTACGATCAAATCTTACGGT
TTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGTAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTAGGAATAATGTGAG
ATACCTTACTGACGTTAAAACTTGGCTAGCAGCCCAATTCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACACTCAAGGTTGGAACAGGAGATGTTATTTCAGCTCGTGTAGTAGGAGATACTAAGTTGTTTTTCGAAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCACTTCTAAGAATGGTGCACATATTTGTT
CAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTCAGAACTACTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGAAAGATTGGTAAAGAATGGACTTCTAAACGAGTTAGAAAATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTTCGATGAACGTAAAAGCTAGAGAGGATTTTGAATACTTCATCTCTTTTATAGATGATTATTTAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTAAAGCTCTT
GAAAAGTTCAAGGATTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCCAGGACTA
TATGTTAAAACATGAAATCCAATCTCAACTCTCAGCACCTAGCACAGCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGAACCTTGTTAGACATGGGGTATGCAGTAG
AGACTGCGGTTTATATCTTGAATAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGTTATGGAGAGAACGTAAACCTAGTTTAAGTCACTTCAGAATTTGGGGT
TGTCCAGCACACGTGTTGGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTCGATCC
ACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGAAAGAAGACCACATGAGAGATCATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGGAT
CAACAAGGGTTGTTGATGAAGTTGGTCTCTCGTCAAGAGTTGATCAAACCACCACATCAGGGTGTAAGTGGATCTATAAGAGAAAAAGAGATTCAGCTGGGAAGGTACAT
ACCTTCAAAGCTAGACTTGTAGCAAAAGGATATACCCAAAGGGAATGGGTTGACTATGAGGAAACTTTTTCCCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTC
CATCGCCATATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTACTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAA
CCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCTATCCATTTATGCGTTGAAACAAGCATCTAGATCTTTGAACATTAGGTTTGATACTACGATCAAATCTTACGGT
TTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGTAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTAGGAATAATGTGAG
ATACCTTACTGACGTTAAAACTTGGCTAGCAGCCCAATTCCAATGA
Protein sequenceShow/hide protein sequence
MTLKVGTGDVISARVVGDTKLFFENKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFTSKNGAHICSAKLENNLYVLRPNEAKAVLNHEMFRTTNTQNKRQRI
SPNNNTYLWHLRLGHINLDRIERLVKNGLLNELENDSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGSMNVKAREDFEYFISFIDDYLRYGYLYLMEHKSKAL
EKFKDYKAEVENLLSKKIKILRSDRGGEYMDLRFQDYMLKHEIQSQLSAPSTAQQNGVSERRNRTLLDMGYAVETAVYILNNVPSKSVSETPFELWRERKPSLSHFRIWG
CPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLKEDHMRDHKPRSKLVLNEATDGSTRVVDEVGLSSRVDQTTTSGCKWIYKRKRDSAGKVH
TFKARLVAKGYTQREWVDYEETFSPVAMLKSIRILLSIAIFYDYEIWQMDVKTTFLNGNLEESIFMSQPEGFITQGQEQKVCKLNLSIYALKQASRSLNIRFDTTIKSYG
FDQNVDEPCVYKKINKGKVAFLVLYVDDILLIRNNVRYLTDVKTWLAAQFQ