; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0106531 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0106531
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr04:24541400..24542815
RNA-Seq ExpressionCmc04g0106531
SyntenyCmc04g0106531
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042206.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.3e-14459.38Show/hide
Query:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC
        +R L+K+I+NEA+VGIP+L++NGKFFCGDCQ+GKQ + +H+ LKECY  RVLELLH+DLMG MQTESL GK+YVLVVVDDY  +TWV FLK K+DT+++C
Subjt:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC

Query:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNV---KYFHVFGS
         +LC+ LQREKG+KI ++RSDHGKEF+NED N+F   +GIHHEF AP T QQN VV+RKNR LQEMARVMIHA NLPL F AEA        +  H    
Subjt:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNV---KYFHVFGS

Query:  TCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMN-DEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSS
        TCYILADREY +KWD + +QGIFLGYS NS  Y +FN + G+VME INVV+ND +S + Q N ++DET    E  +T   E+ K D+   D  K+    +
Subjt:  TCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMN-DEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSS

Query:  EEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIG
        +E+I  ++ L+PSAHVKKNH +SSI+GDPSAG+ T+ KEK                      + LKDEY +N MQEELLQF+RNN+W LV KP+  N+IG
Subjt:  EEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIG

Query:  TKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        TKW+FKNKTDE+  V +N+ARLVAQGY QV+GVDF++TFAPVARLEAI
Subjt:  TKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.4e-15773.17Show/hide
Query:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC
        MRGLEK+IKN+A+VGIPNL+VNG FFC DCQIGKQ R+THKSLKECYTNRVLELLHMDLMG MQT+SLG                      GKTDTVEIC
Subjt:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC

Query:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA----------------
        KNLCLKLQRE+ KKITRIRSDHGKEF+NE FNSF L EG HHEFSAP TPQQN VV+RKN+ LQEMARVMIHAKNLPLCF+AEA                
Subjt:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA----------------

Query:  -----------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEART
                   RK NVKYFHVFGSTCYILADREY +KWDAR EQGIFL YSQ S  Y ++NNR  SVMETIN  INDLDSAIK MND EDETPNMSE RT
Subjt:  -----------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEART

Query:  TSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQE
        TSTVE SKADN SD PGKSL+KSSEEII KK ELIPSAHV+KNHPA SIIGDPSAGM TR+K+KIDY+KMVA+LCY STIEP T+DS LK+EY LNAMQE
Subjt:  TSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQE

Query:  ELLQFRRNNV
        ELLQF+RNNV
Subjt:  ELLQFRRNNV

KAA0065371.1 putative mitochondrial protein [Cucumis melo var. makuwa]2.1e-16179.38Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVV
        MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDH                                  
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVV

Query:  QRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIK
                                                      ADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIK
Subjt:  QRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIK

Query:  QMNDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFT
        QMNDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFT
Subjt:  QMNDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFT

Query:  IDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        IDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
Subjt:  IDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

TYK07876.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.0e-13664.54Show/hide
Query:  RVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTT
        RVLELLHMDLMG MQTESLGGKRYVLVV DDYSRYTWVCFLKG+TD VEICKNLCLKLQREK                           GIHHEFSAP T
Subjt:  RVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTT

Query:  PQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------------------------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLG
        PQQN VV+RK R LQEMA VMIHAKNLPLCFWA+                            RKPNVKYFHVFGSTCYILADREYRQKWDAR EQGIFLG
Subjt:  PQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------------------------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLG

Query:  YSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSI
        YSQNS  Y +FNNR  SVM+TINVVI  LDS IKQMND ED+TPNMSEARTTS                                               
Subjt:  YSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSI

Query:  IGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQ
          DPS  M TR+K+KIDY+KMVADLCYTSTIEP T+DS +KDEY L AMQEELLQFRRNNVW LVSKPEGV+VI TKW+ KNK DE  CVTKNKARLVAQ
Subjt:  IGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQ

Query:  GYTQVEGVDFDETFAPVARLEAI
        GYTQVEGVDFDETFA +ARLEAI
Subjt:  GYTQVEGVDFDETFAPVARLEAI

TYK30677.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.5e-14063.86Show/hide
Query:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC
        MRGLEKIIKNEA+VGIPNL+VNG FFCGDCQIGKQ R +HKSLKECYTNRVLELLHM+LMG MQTESLGGKR  +                         
Subjt:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC

Query:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCY
                        R R+            +  L+E                                          W + RK NVKYFHVFGSTCY
Subjt:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCY

Query:  ILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQM-NDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEI
        ILADREYRQKWD + EQGIF GYSQNS  Y +FNN  GS ++TINVVINDLDSAIKQ+ N+EDETPNMSEARTTS++EV KADNP  D  KSLEKSS+E 
Subjt:  ILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQM-NDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEI

Query:  ITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKW
        ITKKSELIPSA VKKNHP SSIIGDPSA M TR+KEKIDYMKMVADLCY ST EP T++S L+DEY LNAMQEELLQFRRNNVW LVSKPEGVNVIGTKW
Subjt:  ITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKW

Query:  VFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVA
        VFKNK DEAGCVTKNKARLVA GYTQVEG+DFDETFA VA
Subjt:  VFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVA

TrEMBL top hitse value%identityAlignment
A0A5A7TNK7 Gag-pol polyprotein6.9e-15873.17Show/hide
Query:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC
        MRGLEK+IKN+A+VGIPNL+VNG FFC DCQIGKQ R+THKSLKECYTNRVLELLHMDLMG MQT+SLG                      GKTDTVEIC
Subjt:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC

Query:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA----------------
        KNLCLKLQRE+ KKITRIRSDHGKEF+NE FNSF L EG HHEFSAP TPQQN VV+RKN+ LQEMARVMIHAKNLPLCF+AEA                
Subjt:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA----------------

Query:  -----------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEART
                   RK NVKYFHVFGSTCYILADREY +KWDAR EQGIFL YSQ S  Y ++NNR  SVMETIN  INDLDSAIK MND EDETPNMSE RT
Subjt:  -----------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEART

Query:  TSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQE
        TSTVE SKADN SD PGKSL+KSSEEII KK ELIPSAHV+KNHPA SIIGDPSAGM TR+K+KIDY+KMVA+LCY STIEP T+DS LK+EY LNAMQE
Subjt:  TSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQE

Query:  ELLQFRRNNV
        ELLQF+RNNV
Subjt:  ELLQFRRNNV

A0A5A7VI01 Putative mitochondrial protein1.0e-16179.38Show/hide
Query:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVV
        MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDH                                  
Subjt:  MDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVV

Query:  QRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIK
                                                      ADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIK
Subjt:  QRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIK

Query:  QMNDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFT
        QMNDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFT
Subjt:  QMNDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFT

Query:  IDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        IDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
Subjt:  IDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

A0A5D3CBZ0 Gag-pol polyprotein2.0e-13664.54Show/hide
Query:  RVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTT
        RVLELLHMDLMG MQTESLGGKRYVLVV DDYSRYTWVCFLKG+TD VEICKNLCLKLQREK                           GIHHEFSAP T
Subjt:  RVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTT

Query:  PQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------------------------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLG
        PQQN VV+RK R LQEMA VMIHAKNLPLCFWA+                            RKPNVKYFHVFGSTCYILADREYRQKWDAR EQGIFLG
Subjt:  PQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------------------------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLG

Query:  YSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSI
        YSQNS  Y +FNNR  SVM+TINVVI  LDS IKQMND ED+TPNMSEARTTS                                               
Subjt:  YSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMND-EDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSI

Query:  IGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQ
          DPS  M TR+K+KIDY+KMVADLCYTSTIEP T+DS +KDEY L AMQEELLQFRRNNVW LVSKPEGV+VI TKW+ KNK DE  CVTKNKARLVAQ
Subjt:  IGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQ

Query:  GYTQVEGVDFDETFAPVARLEAI
        GYTQVEGVDFDETFA +ARLEAI
Subjt:  GYTQVEGVDFDETFAPVARLEAI

A0A5D3DSN1 Gag-pol polyprotein2.6e-14459.38Show/hide
Query:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC
        +R L+K+I+NEA+VGIP+L++NGKFFCGDCQ+GKQ + +H+ LKECY  RVLELLH+DLMG MQTESL GK+YVLVVVDDY  +TWV FLK K+DT+++C
Subjt:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC

Query:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNV---KYFHVFGS
         +LC+ LQREKG+KI ++RSDHGKEF+NED N+F   +GIHHEF AP T QQN VV+RKNR LQEMARVMIHA NLPL F AEA        +  H    
Subjt:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNV---KYFHVFGS

Query:  TCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMN-DEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSS
        TCYILADREY +KWD + +QGIFLGYS NS  Y +FN + G+VME INVV+ND +S + Q N ++DET    E  +T   E+ K D+   D  K+    +
Subjt:  TCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMN-DEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSS

Query:  EEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIG
        +E+I  ++ L+PSAHVKKNH +SSI+GDPSAG+ T+ KEK                      + LKDEY +N MQEELLQF+RNN+W LV KP+  N+IG
Subjt:  EEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIG

Query:  TKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        TKW+FKNKTDE+  V +N+ARLVAQGY QV+GVDF++TFAPVARLEAI
Subjt:  TKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

A0A5D3E3N1 Gag-pol polyprotein1.7e-14063.86Show/hide
Query:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC
        MRGLEKIIKNEA+VGIPNL+VNG FFCGDCQIGKQ R +HKSLKECYTNRVLELLHM+LMG MQTESLGGKR  +                         
Subjt:  MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEIC

Query:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCY
                        R R+            +  L+E                                          W + RK NVKYFHVFGSTCY
Subjt:  KNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCY

Query:  ILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQM-NDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEI
        ILADREYRQKWD + EQGIF GYSQNS  Y +FNN  GS ++TINVVINDLDSAIKQ+ N+EDETPNMSEARTTS++EV KADNP  D  KSLEKSS+E 
Subjt:  ILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQM-NDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEI

Query:  ITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKW
        ITKKSELIPSA VKKNHP SSIIGDPSA M TR+KEKIDYMKMVADLCY ST EP T++S L+DEY LNAMQEELLQFRRNNVW LVSKPEGVNVIGTKW
Subjt:  ITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKW

Query:  VFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVA
        VFKNK DEAGCVTKNKARLVA GYTQVEG+DFDETFA VA
Subjt:  VFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVA

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-1244Show/hide
Query:  AMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEA
        A+  EL   + NN W +  +PE  N++ ++WVF  K +E G   + KARLVA+G+TQ   +D++ETFAPVAR+ +
Subjt:  AMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-4629.13Show/hide
Query:  CGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEF
        C  C  GKQ R + ++  E   N +L+L++ D+ G M+ ES+GG +Y +  +DD SR  WV  LK K    ++ +     ++RE G+K+ R+RSD+G E+
Subjt:  CGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQREKGKKITRIRSDHGKEF

Query:  NNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------RKPN----------------VKYFH--VFGSTC
         + +F  +    GI HE + P TPQ N V +R NR + E  R M+    LP  FW EA         R P+                V Y H  VFG   
Subjt:  NNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------RKPN----------------VKYFH--VFGSTC

Query:  YILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMNDEDE------TPNMSEARTTSTVEVSKADNPSDDPGKSLE
        +    +E R K D +    IF+GY      Y +++     V+ + +VV    +S ++   D  E       PN     +TS    S A++ +D+  +  E
Subjt:  YILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMNDEDE------TPNMSEARTTSTVEVSKADNPSDDPGKSLE

Query:  KSSEEIITKKSELIPSAHVKKNHPASS------IIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYR---LNAMQEELLQFRRNNVWM
        +  E  + ++ E +     +  HP         +       + +R+    +Y+ +  D       EP ++   L    +   + AMQEE+   ++N  + 
Subjt:  KSSEEIITKKSELIPSAHVKKNHPASS------IIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYR---LNAMQEELLQFRRNNVWM

Query:  LVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        LV  P+G   +  KWVFK K D    + + KARLV +G+ Q +G+DFDE F+PV ++ +I
Subjt:  LVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

P92520 Uncharacterized mitochondrial protein AtMg008206.7e-1744.07Show/hide
Query:  MHTRKKEKIDYMKMVADLCYTSTI--EPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQV
        M TR K  I+ +     L  T+TI  EP ++   LKD     AMQEEL    RN  W+LV  P   N++G KWVFK K    G + + KARLVA+G+ Q 
Subjt:  MHTRKKEKIDYMKMVADLCYTSTI--EPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQV

Query:  EGVDFDETFAPVARLEAI
        EG+ F ET++PV R   I
Subjt:  EGVDFDETFAPVARLEAI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.3e-2323.44Show/hide
Query:  LEKIIKNEALVGIPNLNVNGKFF-CGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKN
        L  +I N +L     LN + KF  C DC I K  +    S     + R LE ++ D+       S    RY ++ VD ++RYTW+  LK K+   E    
Subjt:  LEKIIKNEALVGIPNLNVNGKFF-CGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKN

Query:  LCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA------------------
            L+     +I    SD+G EF       +F   GI H  S P TP+ N + +RK+R + E    ++   ++P  +W  A                  
Subjt:  LCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA------------------

Query:  ---------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFN-----------------------------------------
                   PN     VFG  CY       + K D +  Q +FLGYS     Y   +                                         
Subjt:  ---------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFN-----------------------------------------

Query:  -----------------------------NRPGSVMETINVVINDLDS-------------AIKQMNDEDET-PNMSEARTTSTVEVSKADNPSDDPGKS
                                     + P +      V  ++LDS             A +Q   +  T P  ++ +T S+   S+ +  ++ P + 
Subjt:  -----------------------------NRPGSVMETINVVINDLDS-------------AIKQMNDEDET-PNMSEARTTSTVEVSKADNPSDDPGKS

Query:  LEKSSEEIITKKSELIPSAHVKKNH----PASSIIGDP----------------SAGMHTRKKEKI--DYMKMVADLCYTSTIEPFTIDSTLKDEYRLNA
         +  S    +  S   P+     +     P S +I  P                +  M TR K  I     K    +   +  EP T    LKDE   NA
Subjt:  LEKSSEEIITKKSELIPSAHVKKNH----PASSIIGDP----------------SAGMHTRKKEKI--DYMKMVADLCYTSTIEPFTIDSTLKDEYRLNA

Query:  MQEELLQFRRNNVWMLVSKPEG-VNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        M  E+     N+ W LV  P   V ++G +W+F  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +I
Subjt:  MQEELLQFRRNNVWMLVSKPEG-VNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.1e-2223.16Show/hide
Query:  LEKIIKNEALVGIPNLNVNGKFF-CGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKT---DTVEI
        L  +I N +L   P LN + K   C DC I K  +    S     +++ LE ++ D+       S+   RY ++ VD ++RYTW+  LK K+   DT  I
Subjt:  LEKIIKNEALVGIPNLNVNGKFF-CGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKT---DTVEI

Query:  CKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------------
         K+L   ++     +I  + SD+G EF       +    GI H  S P TP+ N + +RK+R + EM   ++   ++P  +W  A               
Subjt:  CKNLCLKLQREKGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEA---------------

Query:  ------------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVIND---------LDSAIKQMNDEDE
                    + PN +   VFG  CY       R K + + +Q  F+GYS     Y   +   G +  + +V  ++            +  Q    D 
Subjt:  ------------RKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIFLGYSQNSPTYTIFNNRPGSVMETINVVIND---------LDSAIKQMNDEDE

Query:  TPNM------------------------------SEARTTSTVEVSKADNPSD----------------------------------------DPGKSLE
         PN                               S      T +VS ++ PS                                         +P     
Subjt:  TPNM------------------------------SEARTTSTVEVSKADNPSD----------------------------------------DPGKSLE

Query:  KSSEEIITKKSELIPSAHV--------KKNHPASSIIGDP---------------------SAGMHTRKKEKI--DYMKMVADLCYTSTIEPFTIDSTLK
         S  +        I S H+        + N P+SS    P                     +  M TR K+ I     K        +  EP T    +K
Subjt:  KSSEEIITKKSELIPSAHV--------KKNHPASSIIGDP---------------------SAGMHTRKKEKI--DYMKMVADLCYTSTIEPFTIDSTLK

Query:  DEYRLNAMQEELLQFRRNNVWMLV-SKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        D+    AM  E+     N+ W LV   P  V ++G +W+F  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +I
Subjt:  DEYRLNAMQEELLQFRRNNVWMLV-SKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.3e-1229.89Show/hide
Query:  TTSTVEVSKADNPSDD-PGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAM
        ++S++++  + N  +D P  S+  S     T+K   +   +   +  AS  I D S  +   K   + +  +V   C     EP T +   +      AM
Subjt:  TTSTVEVSKADNPSDD-PGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMHTRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAM

Query:  QEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
         +E+      + W + + P     IG KWV+K K +  G + + KARLVA+GYTQ EG+DF ETF+PV +L ++
Subjt:  QEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.8e-1844.07Show/hide
Query:  MHTRKKEKIDYMKMVADLCYTSTI--EPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQV
        M TR K  I+ +     L  T+TI  EP ++   LKD     AMQEEL    RN  W+LV  P   N++G KWVFK K    G + + KARLVA+G+ Q 
Subjt:  MHTRKKEKIDYMKMVADLCYTSTI--EPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQV

Query:  EGVDFDETFAPVARLEAI
        EG+ F ET++PV R   I
Subjt:  EGVDFDETFAPVARLEAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGCTTGGAAAAAATTATTAAAAATGAAGCACTTGTGGGCATTCCTAATTTAAATGTAAATGGGAAATTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGATAAG
GGCTACTCATAAAAGTCTAAAAGAATGTTATACAAATAGAGTCTTGGAATTGTTACATATGGATCTCATGGGTCTGATGCAAACAGAAAGTCTGGGAGGAAAGAGGTATG
TGCTGGTTGTAGTTGATGATTACTCAAGATATACTTGGGTTTGCTTTCTTAAAGGAAAAACAGATACTGTTGAAATATGCAAAAATCTGTGTTTGAAGCTTCAACGTGAA
AAAGGGAAGAAGATAACGAGGATCCGAAGTGATCATGGTAAAGAGTTTAATAATGAAGACTTTAACAGTTTTTTTCTGTTCGAAGGAATACACCATGAATTTTCTGCACC
TACAACTCCTCAACAAAATGATGTAGTACAAAGAAAGAACAGGATGTTACAAGAAATGGCACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAG
CTAGAAAGCCAAATGTTAAGTACTTCCATGTGTTTGGAAGTACATGTTATATCTTAGCTGATAGGGAATACCGTCAGAAATGGGATGCAAGGTTAGAACAAGGAATCTTT
CTTGGGTACTCTCAGAATAGTCCGACCTATACAATCTTCAATAACAGACCTGGGAGTGTTATGGAAACAATCAACGTGGTTATAAATGATCTCGATTCAGCTATCAAACA
GATGAATGATGAAGATGAGACTCCAAACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCTGATGATCCAGGCAAAAGTTTGGAAA
AATCATCAGAAGAAATCATCACTAAAAAATCAGAACTAATTCCATCTGCTCATGTGAAGAAAAATCATCCAGCAAGCTCTATAATAGGTGATCCATCGGCTGGGATGCAT
ACCAGAAAGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGTTATACTTCCACCATTGAACCTTTTACTATTGACTCTACTCTCAAGGATGAGTATCGGCT
AAATGCTATGCAAGAGGAGCTACTCCAATTTAGACGAAACAATGTCTGGATGTTAGTTTCAAAGCCAGAAGGTGTAAACGTTATTGGCACCAAATGGGTATTTAAAAATA
AAACTGACGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGA
CTTGAAGCCATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGCTTGGAAAAAATTATTAAAAATGAAGCACTTGTGGGCATTCCTAATTTAAATGTAAATGGGAAATTCTTCTGTGGAGACTGTCAAATTGGCAAGCAGATAAG
GGCTACTCATAAAAGTCTAAAAGAATGTTATACAAATAGAGTCTTGGAATTGTTACATATGGATCTCATGGGTCTGATGCAAACAGAAAGTCTGGGAGGAAAGAGGTATG
TGCTGGTTGTAGTTGATGATTACTCAAGATATACTTGGGTTTGCTTTCTTAAAGGAAAAACAGATACTGTTGAAATATGCAAAAATCTGTGTTTGAAGCTTCAACGTGAA
AAAGGGAAGAAGATAACGAGGATCCGAAGTGATCATGGTAAAGAGTTTAATAATGAAGACTTTAACAGTTTTTTTCTGTTCGAAGGAATACACCATGAATTTTCTGCACC
TACAACTCCTCAACAAAATGATGTAGTACAAAGAAAGAACAGGATGTTACAAGAAATGGCACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAAG
CTAGAAAGCCAAATGTTAAGTACTTCCATGTGTTTGGAAGTACATGTTATATCTTAGCTGATAGGGAATACCGTCAGAAATGGGATGCAAGGTTAGAACAAGGAATCTTT
CTTGGGTACTCTCAGAATAGTCCGACCTATACAATCTTCAATAACAGACCTGGGAGTGTTATGGAAACAATCAACGTGGTTATAAATGATCTCGATTCAGCTATCAAACA
GATGAATGATGAAGATGAGACTCCAAACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCTGATGATCCAGGCAAAAGTTTGGAAA
AATCATCAGAAGAAATCATCACTAAAAAATCAGAACTAATTCCATCTGCTCATGTGAAGAAAAATCATCCAGCAAGCTCTATAATAGGTGATCCATCGGCTGGGATGCAT
ACCAGAAAGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGTTATACTTCCACCATTGAACCTTTTACTATTGACTCTACTCTCAAGGATGAGTATCGGCT
AAATGCTATGCAAGAGGAGCTACTCCAATTTAGACGAAACAATGTCTGGATGTTAGTTTCAAAGCCAGAAGGTGTAAACGTTATTGGCACCAAATGGGTATTTAAAAATA
AAACTGACGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGA
CTTGAAGCCATTTGA
Protein sequenceShow/hide protein sequence
MRGLEKIIKNEALVGIPNLNVNGKFFCGDCQIGKQIRATHKSLKECYTNRVLELLHMDLMGLMQTESLGGKRYVLVVVDDYSRYTWVCFLKGKTDTVEICKNLCLKLQRE
KGKKITRIRSDHGKEFNNEDFNSFFLFEGIHHEFSAPTTPQQNDVVQRKNRMLQEMARVMIHAKNLPLCFWAEARKPNVKYFHVFGSTCYILADREYRQKWDARLEQGIF
LGYSQNSPTYTIFNNRPGSVMETINVVINDLDSAIKQMNDEDETPNMSEARTTSTVEVSKADNPSDDPGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMH
TRKKEKIDYMKMVADLCYTSTIEPFTIDSTLKDEYRLNAMQEELLQFRRNNVWMLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVAR
LEAI