CuGenDBv2

Pay0012399 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0012399
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Histone-lysine N-methyltransferase SETD1B-like isoform X2
Genome location: chr04:31177452..31181229
RNA-Seq Expression: Pay0012399
Synteny: Pay0012399
Gene Ontology terms: GO:0032259 - methylation (biological process); GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: NA
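The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, it could be split into its parts in Python as shown below. The function name `parse_location` is hypothetical and not part of CuGenDB, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr04:31177452..31181229")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 3778 bases, which covers the mRNA plus any introns.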


Homology
GenBank top hits | e value | % identity
KAA0043909.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] | 3.3e-261 | 100
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM
        LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM

Query:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Subjt:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

KAG6580678.1 hypothetical protein SDJN03_20680, partial [Cucurbita argyrosperma subsp. sororia] | 1.7e-153 | 68.16
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MA+K HLHELLK+DQ PFLL+NFI DRRSLLKR S KS F L   KPIS S DF   FCRS CFFSF HSPDL  SSPLF F SPVKTPCR+ N +F HV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PA TAGLLLEAALRIQKQSTAA+SKS GKSN LG LGSFLKRLTHR R RKREI  D R N  R  PPLPA      NE ENDSV R          +SN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LC+SPFRFVLQSS SPGHRTPE SSP SSPAR +HQ  D ESL+KL  EDEEEEKEQSSPVSVLDPPFE+ DEG++     EDDYNL+RS+AIVQKAKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM
        LLKKLRRFERLAEL+ +ELETFLL DED+DEDEL D  DI HL ++      DI +HN   N SSRFQ  P R    L+ NL+T+EER++V IE      
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM

Query:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        KRV +R +LWK VD+NAID++  +DLK EVDGW+RN E RGEI I+IE+AIFSLLVEEMQ+ELHCLAH
Subjt:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

XP_011651995.1 uncharacterized protein LOC105434967 [Cucumis sativus] | 7.3e-245 | 93.86
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPI HS DFSAKFCRSTCFFSFNHSPDLANSSP FGFQSPVKTPCR+PNPVFFHV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE ENDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQKAKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELS--DGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKR
        LLKKLRRFERLAELDP+ELETFLL+DEDQDEDELS  DGDDIDHLKEEVE+YEKDIKQHNKEGNDSSRFQ   RPSRDTK LVCNLIT+EERN+V IEK 
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELS--DGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKR

Query:  EETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        EETMKRVYMR DLWKRVDSNAID+MVGKDLKEEVDGWN NKEPRGEI +EIEVAIFSLLVEEMQSELHCL H
Subjt:  EETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

XP_022144766.1 uncharacterized protein LOC111014376 [Momordica charantia] | 1.4e-166 | 70.82
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        M  ++HLHELLK+DQEPF+L+NFI DRRSLLKR S KS+ HLK  KPIS + DF  KFC+S CFFSF+ SPDL   SPLF FQSPV    R+PN +F HV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE----KENDSVFRLSNVTGFDF
        PARTAG+LLEAALRIQKQSTAARSK  GK+NGLGLLGSFLKRLTHR R+RKREI GDGR ND   G PLPAKMAIEENE     EN SV   +N+T F F
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE----KENDSVFRLSNVTGFDF

Query:  CESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQK
        CESN CDSPFRFVLQSS S GHRTPE SSP +SP R DHQ NDVESL+KLP EDEEEEKEQSSPVS+LDPPFEDDDEG++EDGEDED Y+LERS+ IVQK
Subjt:  CESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQK

Query:  AKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEK
        AKHQLLKKLRRFE+LAELDP+ELE+FLL  E   EDEL D DDIDHLKE  EEYE  + +QH+ E N SS FQ  P R    LV N IT E+R+    + 
Subjt:  AKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEK

Query:  REETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        REE  K VY+R DLWKRVDSNAID  VG+DLK E+DGWNRN++ RGE+ IEIE+AIFSLLV EMQ+EL CL H
Subjt:  REETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

XP_038903007.1 uncharacterized protein LOC120089713 [Benincasa hispida] | 5.1e-214 | 84.76
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL NPKPISHS DF AKFCRS CFFSFNHSPDL NSSPLFGFQSPVKTPCR+PNP+F HV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQST ARSKS GKSNGLG+LGSFLKRLTHR R+RKREI GDGR NDPRDGPPLPAKMAIEENE ENDSV RLSNVTGFDFC+SN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LCDSPFRFVLQSS SPGH+TPEL+SP SSPARLDHQANDVE L+KLP EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDDYNLERSFAIVQ+AKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREE
        LLKKLRRFERLAELDP+ELETFLL DED+DEDE  D DDIDHLKEE E+Y+KDIK+H+ E NDSSRFQ  +RP+RD   LVCNL+TEEER++V IEKREE
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREE

Query:  TMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELH
         MK +Y+R DLWKRVDSNAI+VMVG+DLKEEVDGW RNKE R EI IEIEVAIFSLLVEEMQ ELH
Subjt:  TMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELH
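Each alignment block above consists of a Query line, an unlabeled match line, and a Subjt line. In BLAST-style pairwise output the match line repeats the residue where the two sequences agree, shows `+` for a conservative substitution, and is blank otherwise. The following is a rough sketch of how a percent identity could be derived from such lines, assuming that convention applies here and that identity is counted over alignment columns. The helper name `percent_identity` is hypothetical, and the input is a toy fragment rather than data from this record.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose match-line character is a residue letter."""
    if not query:
        raise ValueError("empty alignment")
    # Trailing blanks of the match line are often stripped; pad it back to the query width.
    midline = midline.ljust(len(query))[:len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)

# Toy fragment: 5 identical columns out of 7.
print(round(percent_identity("MARKQHL", "MA+K HL"), 2))
```

To score a whole hit this way, the Query and match lines of all its blocks would first be concatenated in order.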

TrEMBL top hits | e value | % identity
A0A0A0LAR8 Uncharacterized protein | 3.5e-245 | 93.86
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPI HS DFSAKFCRSTCFFSFNHSPDLANSSP FGFQSPVKTPCR+PNPVFFHV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE ENDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQKAKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELS--DGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKR
        LLKKLRRFERLAELDP+ELETFLL+DEDQDEDELS  DGDDIDHLKEEVE+YEKDIKQHNKEGNDSSRFQ   RPSRDTK LVCNLIT+EERN+V IEK 
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELS--DGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKR

Query:  EETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        EETMKRVYMR DLWKRVDSNAID+MVGKDLKEEVDGWN NKEPRGEI +EIEVAIFSLLVEEMQSELHCL H
Subjt:  EETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

A0A5D3DNQ5 Histone-lysine N-methyltransferase SETD1B-like isoform X2 | 1.6e-261 | 100
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM
        LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM

Query:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Subjt:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

A0A6J1CUE0 uncharacterized protein LOC111014376 | 6.6e-167 | 70.82
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        M  ++HLHELLK+DQEPF+L+NFI DRRSLLKR S KS+ HLK  KPIS + DF  KFC+S CFFSF+ SPDL   SPLF FQSPV    R+PN +F HV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE----KENDSVFRLSNVTGFDF
        PARTAG+LLEAALRIQKQSTAARSK  GK+NGLGLLGSFLKRLTHR R+RKREI GDGR ND   G PLPAKMAIEENE     EN SV   +N+T F F
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE----KENDSVFRLSNVTGFDF

Query:  CESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQK
        CESN CDSPFRFVLQSS S GHRTPE SSP +SP R DHQ NDVESL+KLP EDEEEEKEQSSPVS+LDPPFEDDDEG++EDGEDED Y+LERS+ IVQK
Subjt:  CESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQK

Query:  AKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEK
        AKHQLLKKLRRFE+LAELDP+ELE+FLL  E   EDEL D DDIDHLKE  EEYE  + +QH+ E N SS FQ  P R    LV N IT E+R+    + 
Subjt:  AKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEK

Query:  REETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        REE  K VY+R DLWKRVDSNAID  VG+DLK E+DGWNRN++ RGE+ IEIE+AIFSLLV EMQ+EL CL H
Subjt:  REETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

A0A6J1FAX4 uncharacterized protein LOC111442411 | 3.5e-152 | 67.95
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MA+K HLHELLK+DQ PFLL+NFI DRRSLLKR S KS F L   KPIS S D    FCRS CFFSF HSPDL  SSPLF F SPVKTPCR+ N +F HV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PA TAGLLLEAALRIQKQSTAA+SKS GKSN LG LGSFLKRLTHR R RKREI  DGR N  R  PPLP       NE ENDSV R          +SN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LC+SPFRFVLQSS SPGHRTPE SSP SSPAR +HQ  D ESL+KL  EDEEEEKEQSSPVSVLDPPFE+ DEG++     EDDYNL+RS+AIVQKAKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM
        LLKKLRRFERLAELD +ELETFLL DED+DEDEL D  DI HL ++      DI +HN   N SSRFQ  P R    L+ NL+T+EER++V IE      
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM

Query:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        KRV +R +LWK VD+NAID++  +DLK EVDGW+RN E RGEI I++E+AIFSLLVEEMQ+ELHCLAH
Subjt:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

A0A6J1J5Y5 uncharacterized protein LOC111481647 | 8.1e-149 | 66.88
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MA+K HLHELLK+DQ PFLL+NFI DRRSLLK  + KS F L   KPIS S DF   FCRS CFFSF HSPDL  SSPLF F SPVKTPC + N  F HV
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        PA TAGLLLEAALRIQKQSTAA SKS GKSNGLG LGSFLKRLTHR R RKREI  DGR N  R  PPLPA         ENDSV R          +SN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
        LC+SPFRFVLQSS S GHRTPE SSP SSPAR +HQ  D ESL+KL  EDEEEEKEQSSPVSVLDPPFE+ +EG++     EDDYNL+RS+AIVQKAKHQ
Subjt:  LCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ

Query:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM
        LLKKLRRFERLAELD +ELETFLL DED+DEDEL+D  DI HL ++      DI +H    N SSRFQ  P R    L+ NL+T++ER++V IE      
Subjt:  LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM

Query:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
        KRV +R +LWK VD+NAIDV++ +DLK EVDGW+RN E RGEI I+IE+AIFSLLVEEMQ+ELH LAH
Subjt:  KRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT2G36420.1 unknown protein | 5.7e-54 | 36.97
Query:  RKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPA
        +K+HLHE L+ DQEPF L+++I + RS +  S  +     K+    +  P   +  C ++CFF+ + SPD    SPLF  +SP K   R    VF  +PA
Subjt:  RKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPA

Query:  RTAGLLLEAALRIQKQST--AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN
        RTA +LL+AA RIQKQ +  A  +K+  + NG G+ GS LK LT+R  ++ R  + DG              +++E   +   S  R   V   D C   
Subjt:  RTAGLLLEAALRIQKQST--AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESN

Query:  LCDSPFRFVLQSS-SSPGHRTPELSSPVSSPARL---DHQANDVESLQKLPAED----EEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFA
         C+SPF FVLQ++ SS GH+TP  +S  +SPAR    D  +++ ESL+K+  ++    EEE+KEQ SPVSVLDP  E++++ +    E +   NL  SF 
Subjt:  LCDSPFRFVLQSS-SSPGHRTPELSSPVSSPARL---DHQANDVESLQKLPAED----EEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFA

Query:  IVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLK--EEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNI
        IVQ+AK +LLKKLRRFE+LA LDP+ELE  +  +ED++E+E  + ++ D+++  +  EEYE           D      R SR           E+E+  
Subjt:  IVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLK--EEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNI

Query:  VAIEKREETMKRVYMRPDLWKRVDSNA---IDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSEL
           +K +E  K+  M  + W RV   A   +D +V KDL+EE   W R+     E   ++E +IF +L++E   EL
Subjt:  VAIEKREETMKRVYMRPDLWKRVDSNA---IDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSEL

AT5G03670.1 unknown protein | 5.3e-60 | 36.45
Query:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV
        MA ++HL +LL++DQEPF L ++I+DRR  +  ++  +H  +K  +PIS +    ++FCR+ CFFS   SPD    SPLF     +K+P RS N +F ++
Subjt:  MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHV

Query:  PARTAGLLLEAALRIQKQST-AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGD---GRINDP------RDGPPLPAKMAI---EENEKENDS--V
        PARTA +LLEAA+RIQKQS+  +++++    N  G+ GS LK+LT+R   +KREI G    GR++        R   P+  K+     + NE+EN S   
Subjt:  PARTAGLLLEAALRIQKQST-AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGD---GRINDP------RDGPPLPAKMAI---EENEKENDS--V

Query:  FRLSNVTGF-----------------------DFCES--------------------------NLCDSPFRFVLQS-SSSPGHRTPELSSPVSSPA----
         ++++ T F                       DF  S                            C+SPF FVLQ+  S+ G RTP  SSP +SP     
Subjt:  FRLSNVTGF-----------------------DFCES--------------------------NLCDSPFRFVLQS-SSSPGHRTPELSSPVSSPA----

Query:  RLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDE
         ++ ++ +VE L+KL  E+EEEEKEQSSPVSVLDPPF+DDDE         DD N+  SF  VQKAKH LL+KL RFE+LA LDP+ELE   ++D++ +E
Subjt:  RLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDE

Query:  DELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM-KRVYMRPDLWKRVDSNAIDVMVGKDLKEEV
        +E  + +++  L       ++ +K + +E               + L+ +L  EE  + +  E     + KRV  R   W+ V+SN ID+MV  D + E 
Subjt:  DELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM-KRVYMRPDLWKRVDSNAIDVMVGKDLKEEV

Query:  DG-W-NRNKEPRGEIGIEIEVAIFSLLVEEMQSEL
         G W ++N     E  ++IE  IF  LVEE+  ++
Subjt:  DG-W-NRNKEPRGEIGIEIEVAIFSLLVEEMQSEL


Sequences
CDS sequence
ATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGCGCTCTTCCTTCAA
ATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTTCAGCTAAATTTTGCAGGAGCACTTGTTTTTTCTCTTTCAACCATTCCCCTGATCTTG
CTAACTCATCCCCGCTTTTTGGGTTTCAGTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAA
GCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTTTGACTCATCGGAG
CCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAGAAAGAGAACGACT
CTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACCCGGTCACCGGACG
CCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGGAAGAGAAAGAACA
GAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGCAGCTTCGCCATTG
TACAAAAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGACGAAGACCAAGAT
GAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATGACAGTTCAAGGTT
CCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACAATGAAGAGGGTGT
ACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAATAAGGAGCCGAGA
GGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAA
mRNA sequence
ATTTGGCGCAACCCTGATTTTGATTCAGCCAAGTCCTTCTCTCCACATTCCCCTCTCTCTGTTTCTTTTGCTCTGTTCTATTTGTGTCAACTTCTTCCACACTCACCTTT
TTCTCCACTGAAACTTCAAAAATTTCTTCAACTTCTTCTTACTCCAAATTCATTCCTTTAACCCCAATCCACAATTCTCCATTTAGCTTTCCTCTACTCTTTTGCAGACA
TTCTGCCACTCCCATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGC
GCTCTTCCTTCAAATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTTCAGCTAAATTTTGCAGGAGCACTTGTTTTTTCTCTTTCAACCAT
TCCCCTGATCTTGCTAACTCATCCCCGCTTTTTGGGTTTCAGTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGG
ACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTT
TGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAG
AAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACC
CGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGG
AAGAGAAAGAACAGAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGC
AGCTTCGCCATTGTACAAAAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGA
CGAAGACCAAGATGAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATG
ACAGTTCAAGGTTCCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACA
ATGAAGAGGGTGTACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAA
TAAGGAGCCGAGAGGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAAACTGCAAGCA
ATTGGAGTGAGTCCACAAAAAAAAATTTAAAATTTCCACAAAATAATCTCTAGATTTTTTAATCACTCTTAGGAATATATAATCTGACTTCAAGAGCATAGGGTTTAAGT
TTAACACCTCTGAAGTAGAAAGATTAGGATGAAAGGGACATCACTATGATTTGTAAATTCATTATCTCTCCATATATTATCATCATCTTATAATATCTATAATTTTAATT
TTTGAAATGCTTATCTTCCCCCCTTTTCCTTTTTTATTTTTTGGGAAAAATAAAATGAAAAAGACATGTCAGTTAAAAAGTGAAGGGATGGTTAATGGCAG
Protein sequence
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLE
AALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRT
PELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQD
EDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPR
GEIGIEIEVAIFSLLVEEMQSELHCLAH
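
The CDS and protein sequences listed above should correspond under the standard genetic code. A minimal sketch of that check follows, assuming NCBI translation table 1 applies to this nuclear gene. The function name `translate` is hypothetical, and only the first 30 bases of the CDS are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS sequence listed above.
print(translate("ATGGCTCGAAAGCAACACTTACACGAGCTT"))
```

The printed peptide can be compared with the first ten residues of the protein sequence (MARKQHLHEL). Passing the full CDS, with its line breaks, would allow the same comparison over the whole protein.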