CuGenDBv2

CcUC10G204590 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC10G204590
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Histone-lysine N-methyltransferase SETD1B-like protein
Genome location: CicolChr10:31903599..31907191
RNA-Seq Expression: CcUC10G204590
Synteny: CcUC10G204590
Gene Ontology terms: GO:0032259 - methylation (biological process); GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: NA
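
The genome location field can be split into a sequence name and a coordinate range. The sketch below is only an illustration: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'name:start..end' string into its parts.

    Assumption (not stated on the page): coordinates are 1-based and
    inclusive, so the span length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    name = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return name, start, end, end - start + 1


if __name__ == "__main__":
    name, start, end, length = parse_location("CicolChr10:31903599..31907191")
    print(name, start, end, length)
```

Under the inclusive-coordinate assumption, the gene spans 3593 bp on CicolChr10.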


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043909.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa]; e value: 1.6e-208; % identity: 75.86
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPISHS  F AKFCRS CFFSFNHSPDL NSSPLFGFQSPVKTPCR+PNP+F HV
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKS GKSNGLGLLGSFLKRLT R R+RKREI GDGR NDPRDGPP+PAKMAI+ENE  NDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN

Query:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV
        +CDSPFRFVLQSS SPGHRTP+LSSP SSPARLDH                                                    Q NDVESL+KLP 
Subjt:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV

Query:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEE--DEDELDDDDEINHLKEE-E
        EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDDYNLERSFAIVQKA+HQLLKKLRRFERLAELDP+ELETFLL +E  DEDEL D D+I+HLKEE E
Subjt:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEE--DEDELDDDDEINHLKEE-E

Query:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE
        +YEKDIKQ+N E NDSSRFQ  +RP+RD K L+CNLITEEER++VAI+KREETMKRVYMR DLWKRVDS+ ID+MVG+DLK E+DGWN NKE RGEIGIE
Subjt:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE

Query:  IEFAIFNLLVEEMQTELHCLTH
        IE AIF+LLVEEMQ+ELHCL H
Subjt:  IEFAIFNLLVEEMQTELHCLTH

XP_011651995.1 uncharacterized protein LOC105434967 [Cucumis sativus]; e value: 3.7e-213; % identity: 76.15
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPI HSS F AKFCRS CFFSFNHSPDL NSSP FGFQSPVKTPCRNPNP+F HV
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKS GKSNGLGLLGSFLKRLT R RARKREI GDGR NDPRDGPP+PAKMAI+ENE  NDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN

Query:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV
        +CDSPFRFVLQSSPSPGHRTP+LSSPASSPARLDH                                                    Q NDVESL+KLP 
Subjt:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV

Query:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELD----DDDEINHLKEE
        EDEEEEKEQSSPVSVLDPPFEDDDEGH+EDGEDEDDYNLERSFAIVQKA+HQLLKKLRRFERLAELDP+ELETFLL +ED+DE +    D D+I+HLKEE
Subjt:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELD----DDDEINHLKEE

Query:  -EDYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIG
         E YEKDIKQ+N E NDSSRFQIP+RP+RD KTL+CNLIT+EER+LV I+K EETMKRVYMR DLWKRVDS+ ID+MVG+DLK E+DGWNINKE RGEI 
Subjt:  -EDYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIG

Query:  IEIEFAIFNLLVEEMQTELHCLTH
        +EIE AIF+LLVEEMQ+ELHCLTH
Subjt:  IEIEFAIFNLLVEEMQTELHCLTH

XP_022144766.1 uncharacterized protein LOC111014376 [Momordica charantia]; e value: 5.7e-174; % identity: 67.62
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP
        M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPS KS+ HL   KPIS +  FP KFC+SACFFSF+ SPDL   SPLF FQSPV    RNPN IFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP

Query:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENE----NGNDSVFRLSNVTGFDFC
        ARTAG+LLEAALRIQKQSTAARSK  GK+NGLGLLGSFLKRLT RGRARKREIDGDGRRND   G P+PAKMAI+ENE    N N SV   +N+T F FC
Subjt:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENE----NGNDSVFRLSNVTGFDFC

Query:  ESNVCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKK
        ESN CDSPFRFVLQSSPS GHRTP+ SSPA+SP R DH                                                    Q NDVESLKK
Subjt:  ESNVCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKK

Query:  LPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEE
        LPVEDEEEEKEQSSPVS+LDPPFEDDDEGHYEDGEDED Y+LERS+ IVQKA+HQLLKKLRRFE+LAELDPVELE+FLLK E EDELDDDD+I+HLKEEE
Subjt:  LPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEE

Query:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE
            + +Q++ EAN SS FQIPHR       L+ N IT E+RD    D REE  K VY+RSDLWKRVDS+ ID  VGQDLK E+DGWN N++QRGE+ IE
Subjt:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE

Query:  IEFAIFNLLVEEMQTELHCLTH
        IE AIF+LLV EMQTEL CLTH
Subjt:  IEFAIFNLLVEEMQTELHCLTH

XP_023526007.1 uncharacterized protein LOC111789613 [Cucurbita pepo subsp. pepo]; e value: 1.3e-157; % identity: 64.29
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP
        MAQKHLHELLKEDQ PFLL NFIADRRSLLKRPS KS F LN  KPIS SS F   FCRSACFFSF HSPDL+ SSPLF FQSPVKTPCRN N IFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP

Query:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV
        A TAGLLLEAALRIQKQSTAA+S+SLGKSNGLG LGSFLKRLT RGR RKREI  DGR+N  R  PP+PA     ENEN NDSV R          +SN+
Subjt:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV

Query:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE
        C SPFRFVLQSSPSPGHRTP+ SSP SSPAR +H                                                    QV D ESLKK  VE
Subjt:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE

Query:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEEDYEK
        DEEEEKEQSSPVSVLDPPFE+ DEGHY     EDDYNL+RS+AIVQKA+HQLLKKLRRFERLAELD VELETFLLK+EDEDEL+DD  I HL ++E +  
Subjt:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEEDYEK

Query:  DIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIEFA
        DI ++N   N SSRFQIP       K L+ NL+T++ERD+V I+      KRV +RS LWK VD++ ID++  QDLK E+DGW+ N EQRGEI IEIE A
Subjt:  DIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIEFA

Query:  IFNLLVEEMQTELHCLTH
        IF+LLVEEMQTELHCL H
Subjt:  IFNLLVEEMQTELHCLTH

XP_038903007.1 uncharacterized protein LOC120089713 [Benincasa hispida]; e value: 3.9e-223; % identity: 79.77
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP
        MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNN KPISHSS FPAKFCRSACFFSFNHSPDL+NSSPLFGFQSPVKTPCRNPNPIFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP

Query:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV
        ARTAGLLLEAALRIQKQST ARSKSLGKSNGLG+LGSFLKRLT RGRARKREIDGDGR+NDPRDGPP+PAKMAI+ENEN NDSV RLSNVTGFDFC+SN+
Subjt:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV

Query:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE
        CDSPFRFVLQSSPSPGH+TP+L+SPASSPARLDH                                                    Q NDVE LKKLPVE
Subjt:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE

Query:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEEDYEK
        DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQ+A+HQLLKKLRRFERLAELDPVELETFLLK+EDEDE +DDD+I+HLKEEEDY+K
Subjt:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEEDYEK

Query:  DIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIEFA
        DIK+++ EANDSSRFQIPHRPARDM TL+CNL+TEEERDLV I+KREE MK +Y+RSDLWKRVDS+ I++MVGQDLK E+DGW  NKEQR EI IEIE A
Subjt:  DIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIEFA

Query:  IFNLLVEEMQTELH
        IF+LLVEEMQ ELH
Subjt:  IFNLLVEEMQTELH
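
In BLAST-style output such as the alignments above, the unlabeled middle row marks each aligned column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, the function below tallies those marks for one alignment segment. The name `midline_stats` is hypothetical, and the result covers only the segment passed in, so it will not reproduce the full-alignment % identity reported for a hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment segment.

    `query` and `midline` are the residue strings only, with the
    'Query:' label and leading padding already removed.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for mark in midline if mark.isalpha())
    positive = sum(1 for mark in midline if mark == "+")
    columns = len(query)
    return {
        "columns": columns,
        "identical": identical,
        "positive": positive,
        "other": columns - identical - positive,
        "percent_identity": 100.0 * identical / columns if columns else 0.0,
    }


if __name__ == "__main__":
    # Final segment of the Benincasa hispida hit shown above.
    print(midline_stats("IFNLLVEEMQTELH", "IF+LLVEEMQ ELH"))
```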

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LAR8 Uncharacterized protein; e value: 1.8e-213; % identity: 76.15
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPI HSS F AKFCRS CFFSFNHSPDL NSSP FGFQSPVKTPCRNPNP+F HV
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKS GKSNGLGLLGSFLKRLT R RARKREI GDGR NDPRDGPP+PAKMAI+ENE  NDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN

Query:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV
        +CDSPFRFVLQSSPSPGHRTP+LSSPASSPARLDH                                                    Q NDVESL+KLP 
Subjt:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV

Query:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELD----DDDEINHLKEE
        EDEEEEKEQSSPVSVLDPPFEDDDEGH+EDGEDEDDYNLERSFAIVQKA+HQLLKKLRRFERLAELDP+ELETFLL +ED+DE +    D D+I+HLKEE
Subjt:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELD----DDDEINHLKEE

Query:  -EDYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIG
         E YEKDIKQ+N E NDSSRFQIP+RP+RD KTL+CNLIT+EER+LV I+K EETMKRVYMR DLWKRVDS+ ID+MVG+DLK E+DGWNINKE RGEI 
Subjt:  -EDYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIG

Query:  IEIEFAIFNLLVEEMQTELHCLTH
        +EIE AIF+LLVEEMQ+ELHCLTH
Subjt:  IEIEFAIFNLLVEEMQTELHCLTH

A0A5D3DNQ5 Histone-lysine N-methyltransferase SETD1B-like isoform X2; e value: 7.7e-209; % identity: 75.86
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV
        MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL N KPISHS  F AKFCRS CFFSFNHSPDL NSSPLFGFQSPVKTPCR+PNP+F HV
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHV

Query:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN
        PARTAGLLLEAALRIQKQSTAARSKS GKSNGLGLLGSFLKRLT R R+RKREI GDGR NDPRDGPP+PAKMAI+ENE  NDSVFRLSNVTGFDFCESN
Subjt:  PARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESN

Query:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV
        +CDSPFRFVLQSS SPGHRTP+LSSP SSPARLDH                                                    Q NDVESL+KLP 
Subjt:  VCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPV

Query:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEE--DEDELDDDDEINHLKEE-E
        EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDDYNLERSFAIVQKA+HQLLKKLRRFERLAELDP+ELETFLL +E  DEDEL D D+I+HLKEE E
Subjt:  EDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEE--DEDELDDDDEINHLKEE-E

Query:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE
        +YEKDIKQ+N E NDSSRFQ  +RP+RD K L+CNLITEEER++VAI+KREETMKRVYMR DLWKRVDS+ ID+MVG+DLK E+DGWN NKE RGEIGIE
Subjt:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE

Query:  IEFAIFNLLVEEMQTELHCLTH
        IE AIF+LLVEEMQ+ELHCL H
Subjt:  IEFAIFNLLVEEMQTELHCLTH

A0A6J1CUE0 uncharacterized protein LOC111014376; e value: 2.8e-174; % identity: 67.62
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP
        M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPS KS+ HL   KPIS +  FP KFC+SACFFSF+ SPDL   SPLF FQSPV    RNPN IFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP

Query:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENE----NGNDSVFRLSNVTGFDFC
        ARTAG+LLEAALRIQKQSTAARSK  GK+NGLGLLGSFLKRLT RGRARKREIDGDGRRND   G P+PAKMAI+ENE    N N SV   +N+T F FC
Subjt:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENE----NGNDSVFRLSNVTGFDFC

Query:  ESNVCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKK
        ESN CDSPFRFVLQSSPS GHRTP+ SSPA+SP R DH                                                    Q NDVESLKK
Subjt:  ESNVCDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKK

Query:  LPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEE
        LPVEDEEEEKEQSSPVS+LDPPFEDDDEGHYEDGEDED Y+LERS+ IVQKA+HQLLKKLRRFE+LAELDPVELE+FLLK E EDELDDDD+I+HLKEEE
Subjt:  LPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEE

Query:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE
            + +Q++ EAN SS FQIPHR       L+ N IT E+RD    D REE  K VY+RSDLWKRVDS+ ID  VGQDLK E+DGWN N++QRGE+ IE
Subjt:  DYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIE

Query:  IEFAIFNLLVEEMQTELHCLTH
        IE AIF+LLV EMQTEL CLTH
Subjt:  IEFAIFNLLVEEMQTELHCLTH

A0A6J1FAX4 uncharacterized protein LOC111442411; e value: 4.1e-154; % identity: 63.85
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP
        MAQKHLHELLKEDQ PFLL NFIADRRSLLKRPS KS F LN SKPIS SS     FCRSACFFSF HSPDL  SSPLF F SPVKTPCRN N IFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP

Query:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV
        A TAGLLLEAALRIQKQSTAA+SKSLGKSN LG LGSFLKRLT RGR RKREI  DGR+N  R  PP+P       NEN NDSV R          +SN+
Subjt:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV

Query:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE
        C+SPFRFVLQSSPSPGHRTP+ SSP SSPAR +H                                                    QV D ESLKKL VE
Subjt:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE

Query:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLK--EEDEDELDDDDEINHLKEEEDY
        DEEEEKEQSSPVSVLDPPFE+ DEGHY     EDDYNL+RS+AIVQKA+HQLLKKLRRFERLAELD VELETFLLK  +EDEDELDDD +I HL ++E +
Subjt:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLK--EEDEDELDDDDEINHLKEEEDY

Query:  EKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIE
          DI ++N   N SSRFQIP       K L+ NL+T+EERD+V I+      KRV +RS+LWK VD++ IDM+  QDLK E+DGW+ N EQRGEI I++E
Subjt:  EKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIE

Query:  FAIFNLLVEEMQTELHCLTH
         AIF+LLVEEMQTELHCL H
Subjt:  FAIFNLLVEEMQTELHCLTH

A0A6J1J5Y5 uncharacterized protein LOC111481647; e value: 2.4e-149; % identity: 62.12
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP
        MAQKHLHELLKEDQ PFLL NFIADRRSLLK P+ KS F LN SKPIS SS F   FCRSACFFSF HSPDL+ SSPLF F SPVKTPC N N  FLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVP

Query:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV
        A TAGLLLEAALRIQKQSTAA SKSLGKSNGLG LGSFLKRLT RGR RKREI  DGR+N  R  PP+PA        N NDSV R          +SN+
Subjt:  ARTAGLLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNV

Query:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE
        C+SPFRFVLQSSPS GHRTP+ SSP SSPAR +H                                                    QV D ESLKKL VE
Subjt:  CDSPFRFVLQSSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVE

Query:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLK--EEDEDELDDDDEINHLKEEEDY
        DEEEEKEQSSPVSVLDPPFE+ +EGHY     EDDYNL+RS+AIVQKA+HQLLKKLRRFERLAELD VELETFLLK  +EDEDEL+DD +I HL ++E +
Subjt:  DEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLK--EEDEDELDDDDEINHLKEEEDY

Query:  EKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIE
          DI ++    N SSRFQIP       K L+ NL+T++ERD+V I+      KRV +RS+LWK VD++ ID+++ QDLK E+DGW+ N EQRGEI I+IE
Subjt:  EKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIE

Query:  FAIFNLLVEEMQTELHCLTH
         AIF+LLVEEMQTELH L H
Subjt:  FAIFNLLVEEMQTELHCLTH

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G36420.1 unknown protein; e value: 1.4e-45; % identity: 32.06
Query:  QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKF-CRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPA
        +KHLHE L++DQEPF L ++I + RS +      S   +   K  + ++  P  F C ++CFF+ + SPD    SPLF  +SP K   R+   +FL +PA
Subjt:  QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKF-CRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPA

Query:  RTAGLLLEAALRIQKQST--AARSKSLGKSNGLGLLGSFLKRLTLR-GRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCES
        RTA +LL+AA RIQKQ +  A  +K+  + NG G+ GS LK LT R  + R    DG+               ++++       S  R   V   D C  
Subjt:  RTAGLLLEAALRIQKQST--AARSKSLGKSNGLGLLGSFLKRLTLR-GRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCES

Query:  NVCDSPFRFVLQSSP-SPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKL
          C+SPF FVLQ++P S GH+TP  +S A+SPAR   +                                                 +   ++ ESL+K+
Subjt:  NVCDSPFRFVLQSSP-SPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKL

Query:  PVED----EEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLK
          ++    EEE+KEQ SPVSVLDP  E++++  +   E +   NL  SF IVQ+A+ +LLKKLRRFE+LA LDPVELE  + +EEDE             
Subjt:  PVED----EEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLK

Query:  EEEDYEKDIKQNNTEANDSSR--FQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWK--RVDSSTIDMMVGQDLKAEIDGWNINKEQ
        EEE+YE+  + +N    DS      +    AR+ +        E+E+      K+ +  ++ +   + W+        +D +V +DL+ E   W  +  +
Subjt:  EEEDYEKDIKQNNTEANDSSR--FQIPHRPARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWK--RVDSSTIDMMVGQDLKAEIDGWNINKEQ

Query:  RGEIGIEIEFAIFNLLVEEMQTEL
          E   ++E +IF +L++E   EL
Subjt:  RGEIGIEIEFAIFNLLVEEMQTEL

AT5G03670.1 unknown protein; e value: 1.5e-60; % identity: 35.05
Query:  AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPA
        +Q+HL +LL+EDQEPF L ++I+DRR  +   +  +H  +   +PIS ++G P++FCR+ACFFS   SPD    SPLF     +K+P R+ N IF+++PA
Subjt:  AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPA

Query:  RTAGLLLEAALRIQKQST-AARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDG-----------------------------DGRRNDPRDGPPVPAK
        RTA +LLEAA+RIQKQS+  +++++    N  G+ GS LK+LT R   +KREI G                               +RN+  +      K
Subjt:  RTAGLLLEAALRIQKQST-AARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDG-----------------------------DGRRNDPRDGPPVPAK

Query:  MAIQ-----------------------------------ENENGNDSVFRLSNVTGFDFCE-SNVCDSPFRFVLQSSPS-PGHRTPDLSSPASSPARLDH
        +A +                                      NG+D    + N  G D  E    C+SPF FVLQ+ PS  G RTP+ SSPA+SP     
Subjt:  MAIQ-----------------------------------ENENGNDSVFRLSNVTGFDFCE-SNVCDSPFRFVLQSSPS-PGHRTPDLSSPASSPARLDH

Query:  QVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDED
                          R +C  M          ++ Y                +VE LKKL +E+EEEEKEQSSPVSVLDPPF+DDDE  +      D
Subjt:  QVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDED

Query:  DYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLIT
        D N+  SF  VQKA+H LL+KL RFE+LA LDP+ELE  +  +E E+E ++++E   +K     E  I Q   +       ++P      ++ L+ +L  
Subjt:  DYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDSSRFQIPHRPARDMKTLLCNLIT

Query:  EE-ERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDG-W-NINKEQRGEIGIEIEFAIFNLLVEEMQTEL
        EE   D+    +     KRV  R   W+ V+S+TIDMMV  D + E  G W + N     E  ++IEF IF  LVEE+  ++
Subjt:  EE-ERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDG-W-NINKEQRGEIGIEIEFAIFNLLVEEMQTEL


Sequences
CDS sequence
ATGGCTCAAAAGCACTTACACGAGCTTTTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCTGACAGACGCTCCCTTCTCAAACGCCCTTCTCTC
AAATCCCATTTCCATCTTAACAATTCAAAACCCATCTCCCATTCCTCTGGTTTTCCAGCTAAATTTTGCAGGAGCGCTTGTTTTTTCTCTTTCAATCACTCCCCT
GATCTCGTAAACTCATCTCCGCTCTTTGGATTTCAGTCGCCGGTCAAAACCCCTTGTCGAAACCCCAATCCCATTTTTCTTCATGTTCCGGCTAGAACGGCTGGA
CTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCTCTGGGGAAATCGAATGGATTAGGGCTTCTGGGTTCTTTTCTTAAG
CGTTTGACTCTTCGTGGCCGTGCTCGGAAGCGAGAGATCGACGGCGATGGTCGGAGAAATGACCCCCGCGATGGCCCGCCAGTGCCGGCGAAAATGGCGATTCAG
GAGAACGAGAATGGGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATGTATGCGATAGCCCTTTTCGATTTGTGCTTCAA
TCGAGCCCTTCCCCCGGTCACCGGACGCCGGACCTCTCTTCGCCGGCGTCTTCTCCGGCTCGCCTAGACCATCAGGTTTTACTTAGTTTTATTACTGTTTTTGGG
TTTCATCTTTCAATGTCTTGGCGTAGAAACTGCATTTTTATGAACTGGGTGTCACCGGCAAAAACCAACTGTCGGCAGTGTTATTTTGTGAACTATTCTTTGCCT
TGTTTTCACACAAATTTACAGGTCAATGATGTAGAGAGCTTGAAAAAATTGCCGGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCGGTGTTG
GATCCTCCATTTGAGGACGACGACGAAGGACATTACGAGGATGGTGAGGATGAGGATGATTACAATTTGGAACGCAGCTTTGCCATTGTACAAAAGGCGAGGCAT
CAACTACTGAAAAAACTTCGAAGATTCGAGAGATTAGCAGAACTAGACCCAGTAGAACTCGAGACATTTCTACTAAAAGAGGAGGACGAAGACGAACTCGATGAT
GACGATGAAATCAATCATCTCAAGGAAGAAGAAGACTACGAAAAAGACATCAAACAAAACAACACAGAAGCCAATGACAGTTCAAGGTTCCAAATTCCTCACCGA
CCCGCAAGAGATATGAAGACACTCCTCTGCAATCTCATTACTGAGGAAGAGAGAGACTTGGTTGCGATCGACAAGAGAGAAGAGACGATGAAGAGAGTATACATG
AGATCGGATTTGTGGAAACGGGTGGACTCGAGCACAATCGACATGATGGTGGGGCAAGATTTGAAAGCAGAGATTGATGGGTGGAACATAAATAAGGAGCAGAGA
GGAGAAATAGGCATAGAAATAGAGTTCGCAATCTTCAACTTGTTGGTGGAGGAAATGCAAACTGAACTACATTGCTTAACTCATTAA
mRNA sequence
TTACTGAAAATCCATATCCTTAACCCCAAACCAACAATTCTCGATTTCATTCCCCCATTTTAGCTTTCATCTTCTCTTTTGCAGAGCTACTGCCACTCCCATGGC
TCAAAAGCACTTACACGAGCTTTTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCTGACAGACGCTCCCTTCTCAAACGCCCTTCTCTCAAATC
CCATTTCCATCTTAACAATTCAAAACCCATCTCCCATTCCTCTGGTTTTCCAGCTAAATTTTGCAGGAGCGCTTGTTTTTTCTCTTTCAATCACTCCCCTGATCT
CGTAAACTCATCTCCGCTCTTTGGATTTCAGTCGCCGGTCAAAACCCCTTGTCGAAACCCCAATCCCATTTTTCTTCATGTTCCGGCTAGAACGGCTGGACTTCT
TTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCTCTGGGGAAATCGAATGGATTAGGGCTTCTGGGTTCTTTTCTTAAGCGTTT
GACTCTTCGTGGCCGTGCTCGGAAGCGAGAGATCGACGGCGATGGTCGGAGAAATGACCCCCGCGATGGCCCGCCAGTGCCGGCGAAAATGGCGATTCAGGAGAA
CGAGAATGGGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATGTATGCGATAGCCCTTTTCGATTTGTGCTTCAATCGAG
CCCTTCCCCCGGTCACCGGACGCCGGACCTCTCTTCGCCGGCGTCTTCTCCGGCTCGCCTAGACCATCAGGTTTTACTTAGTTTTATTACTGTTTTTGGGTTTCA
TCTTTCAATGTCTTGGCGTAGAAACTGCATTTTTATGAACTGGGTGTCACCGGCAAAAACCAACTGTCGGCAGTGTTATTTTGTGAACTATTCTTTGCCTTGTTT
TCACACAAATTTACAGGTCAATGATGTAGAGAGCTTGAAAAAATTGCCGGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCGGTGTTGGATCC
TCCATTTGAGGACGACGACGAAGGACATTACGAGGATGGTGAGGATGAGGATGATTACAATTTGGAACGCAGCTTTGCCATTGTACAAAAGGCGAGGCATCAACT
ACTGAAAAAACTTCGAAGATTCGAGAGATTAGCAGAACTAGACCCAGTAGAACTCGAGACATTTCTACTAAAAGAGGAGGACGAAGACGAACTCGATGATGACGA
TGAAATCAATCATCTCAAGGAAGAAGAAGACTACGAAAAAGACATCAAACAAAACAACACAGAAGCCAATGACAGTTCAAGGTTCCAAATTCCTCACCGACCCGC
AAGAGATATGAAGACACTCCTCTGCAATCTCATTACTGAGGAAGAGAGAGACTTGGTTGCGATCGACAAGAGAGAAGAGACGATGAAGAGAGTATACATGAGATC
GGATTTGTGGAAACGGGTGGACTCGAGCACAATCGACATGATGGTGGGGCAAGATTTGAAAGCAGAGATTGATGGGTGGAACATAAATAAGGAGCAGAGAGGAGA
AATAGGCATAGAAATAGAGTTCGCAATCTTCAACTTGTTGGTGGAGGAAATGCAAACTGAACTACATTGCTTAACTCATTAACTGCAATTGAGTGAGACCATCAC
AAAAAACAATAAAATAATCTCTAGATTTTGAATTACTTTTAGGAATTTATAATCTAACTTTAAGAGCATAGAAGTAGAAAGATTAGGATGAAAGGGACATCATTA
TGATCTGTAAATTCATTATCCCACCATATATATTATTATCATCTACCATTTTAATTTTTGAAATGCTTTTTTCTCCTCTATTTAAAA
Protein sequence
MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNSKPISHSSGFPAKFCRSACFFSFNHSPDLVNSSPLFGFQSPVKTPCRNPNPIFLHVPARTAG
LLLEAALRIQKQSTAARSKSLGKSNGLGLLGSFLKRLTLRGRARKREIDGDGRRNDPRDGPPVPAKMAIQENENGNDSVFRLSNVTGFDFCESNVCDSPFRFVLQ
SSPSPGHRTPDLSSPASSPARLDHQVLLSFITVFGFHLSMSWRRNCIFMNWVSPAKTNCRQCYFVNYSLPCFHTNLQVNDVESLKKLPVEDEEEEKEQSSPVSVL
DPPFEDDDEGHYEDGEDEDDYNLERSFAIVQKARHQLLKKLRRFERLAELDPVELETFLLKEEDEDELDDDDEINHLKEEEDYEKDIKQNNTEANDSSRFQIPHR
PARDMKTLLCNLITEEERDLVAIDKREETMKRVYMRSDLWKRVDSSTIDMMVGQDLKAEIDGWNINKEQRGEIGIEIEFAIFNLLVEEMQTELHCLTH
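
The CDS and protein sequences listed above can be checked against each other by translating the CDS codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` and the "X" placeholder for unrecognised codons are illustrative choices, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons ordered by the bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the CDS can be pasted in
    as it is displayed. Codons with unexpected characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 45 nucleotides of the CDS shown above.
    print(translate("ATGGCTCAAAAGCACTTACACGAGCTTTTGAAAGAGGATCAAGAG"))
```

Translating the first 45 nucleotides of the CDS gives MAQKHLHELLKEDQE, which matches the start of the listed protein sequence. Passing the full CDS should allow the same comparison over the whole length.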