CuGenDBv2

CmoCh06G004560 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh06G004560
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Histone-lysine N-methyltransferase SETD1B-like protein
Genome location: Cmo_Chr06:2177504..2180821
RNA-Seq Expression: CmoCh06G004560
Synteny: CmoCh06G004560
Gene Ontology terms: NA
InterPro domains: NA
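
The genome location in this record follows a chromosome:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the locus can be split into its parts and its span computed as follows. The function name parse_location is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Cmo_Chr06:2177504..2180821")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 3318 bp; with a half-open convention it would be one base shorter.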


Homology
GenBank top hits (e value, % identity, alignment)
KAG6596552.1 hypothetical protein SDJN03_09732, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.5e-221; % identity: 82.4
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
        MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP

Query:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR
        ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDH LPPINEKDSVSRQSNVTSSDFC+SPFRFVLQSSPSAGHR
Subjt:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR

Query:  TPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPL
        TPEFSSPPSSPARHDHQ                                                                                   
Subjt:  TPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPL

Query:  FVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD
         VNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDY+MERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD
Subjt:  FVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD

Query:  IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI
        IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGD+AIEIEVEIFRLLVEEMQTEVDCFI
Subjt:  IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI

XP_022945267.1 uncharacterized protein LOC111449564 [Cucurbita moschata]; e value: 1.3e-223; % identity: 83.23
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
        MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP

Query:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR
        ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR
Subjt:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR

Query:  TPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPL
        TPEFSSPPSSPARHDHQ                                                                                   
Subjt:  TPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPL

Query:  FVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD
         VNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD
Subjt:  FVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD

Query:  IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI
        IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI
Subjt:  IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI

Query:  K
        K
Subjt:  K

XP_023005858.1 uncharacterized protein LOC111498735 [Cucurbita maxima]; e value: 5.0e-199; % identity: 76.54
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
        MAQKHLHELLKEDQEPFLLTNFIA+RRVLKRPSPKSHLLHLNK KPISHF+DFPASFCKGACFLSFN SPDLRNPSPLFQFQSPVKSPCRNSNA+FLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP

Query:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINE--KDSVSRQSNVTSSDFCESPFRFVLQSSPSAG
        ATTA LLLEAALRIQKQST ARSNGFGLLGSFLKRFT+RGRSRKREIDGGCRRNDP    +    NE   DSVSRQSNVTSSDFC+SPFRFVLQSSPSAG
Subjt:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINE--KDSVSRQSNVTSSDFCESPFRFVLQSSPSAG

Query:  HRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDK
        HRTPEFSSPPSSPAR DHQ                                                                                 
Subjt:  HRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDK

Query:  PLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDD
           VNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDY+MERSYAIV+KAKHQLLKKLRRFERLAELDPVELETFLLKDEEG+LDD
Subjt:  PLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDD

Query:  DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC
        D  DHL+EEEC+SHNFDRSNNEKDMKQHGI+ NVERVYMRWDLWKEVESSAIDVMA EDLRAEVD GWKRNGE RGDIAIEIEVEIFRLLVEEMQTEVDC
Subjt:  DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC

Query:  FIK
        FIK
Subjt:  FIK

XP_023539063.1 uncharacterized protein LOC111799817 [Cucurbita pepo subsp. pepo]; e value: 1.1e-211; % identity: 79.25
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
        MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHF+DFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP

Query:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----INEKDSVSRQSNVTSSDFCESPFRFVLQSSP
        ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCR+NDP DDHLLP       N+ DSVSRQSNVTSSDFC+SPFRFVLQSSP
Subjt:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----INEKDSVSRQSNVTSSDFCESPFRFVLQSSP

Query:  SAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQE
        SAGHRTPEFSSPPSSPARHDHQ                                                                              
Subjt:  SAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQE

Query:  KDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGE
              VNDVESLKKLPVQDEEEEKEQSSPVSVLDPPF+DDEEGRYEDGEDDDDY+MERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLL+DEEGE
Subjt:  KDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGE

Query:  LDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTE
        LDD+DIDHLKEEECESHN DRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMA EDLRAEVDDGWKRNGE RGDIAIEIEVEIFRLLVEEMQTE
Subjt:  LDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTE

Query:  VDCFIK
        VD FIK
Subjt:  VDCFIK

XP_038903007.1 uncharacterized protein LOC120089713 [Benincasa hispida]; e value: 1.2e-139; % identity: 56.29
Query:  MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHV
        MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPS KSH  HLN  KPISH SDFPA FC+ ACF SFN SPDL N SPLF FQSPVK+PCRN N +FLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHV

Query:  PATTAGLLLEAALRIQKQSTAAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-------INEKDSVSRQSNVTSSDF----
        PA TAGLLLEAALRIQKQST AR      SNG G+LGSFLKR THRGR+RKREIDG  R+NDPRD   LP         NE DSVSR SNVT  DF    
Subjt:  PATTAGLLLEAALRIQKQSTAAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-------INEKDSVSRQSNVTSSDF----

Query:  -CESPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYS
         C+SPFRFVLQSSPS GH+TPE +SP SSPAR DHQ                                                                
Subjt:  -CESPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYS

Query:  THFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDP
                             NDVE LKKLPV+DEEEEKEQSSPVSVLDPPFEDD+EG YEDGED+DDY +ERS+AIV++AKHQLLKKLRRFERLAELDP
Subjt:  THFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDP

Query:  VELETFLLK--DEEGELDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGN---------------------------------------VERVYMRWD
        VELETFLLK  DE+ + DDDDIDHLKEEE         + +KD+K+H I+ N                                       ++ +Y+R D
Subjt:  VELETFLLK--DEEGELDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGN---------------------------------------VERVYMRWD

Query:  LWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEV
        LWK V+S+AI+VM G+DL+ EV DGWKRN E R +IAIEIEV IF LLVEEMQ E+
Subjt:  LWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LAR8 Uncharacterized protein; e value: 2.1e-131; % identity: 54.35
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLH
        MA+K HLHELLK+DQEPFLL+NFI DRR +LKR S KSH  HL   KPI H SDF A FC+  CF SFN SPDL N SP F FQSPVK+PCRN N VF H
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLH

Query:  VPATTAGLLLEAALRIQKQSTAAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-------INEKDSVSRQSNVTSSDFCE-
        VPA TAGLLLEAALRIQKQSTAAR      SNG GLLGSFLKR THR R+RKREI G  R NDPRD   LP          E DSV R SNVT  DFCE 
Subjt:  VPATTAGLLLEAALRIQKQSTAAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-------INEKDSVSRQSNVTSSDFCE-

Query:  ----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFY
            SPFRFVLQSSPS GHRTPE SSP SSPAR DHQ                                                               
Subjt:  ----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFY

Query:  STHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELD
                              NDVESL+KLP +DEEEEKEQSSPVSVLDPPFEDD+EG +EDGED+DDY +ERS+AIV+KAKHQLLKKLRRFERLAELD
Subjt:  STHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELD

Query:  PVELETFLLKDE---EGELDD---DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGN---------------------------------------VERV
        P+ELETFLL DE   E EL D   DDIDHLKEE            EKD+KQH  +GN                                       ++RV
Subjt:  PVELETFLLKDE---EGELDD---DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGN---------------------------------------VERV

Query:  YMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC
        YMR DLWK V+S+AID+M G+DL+ EV DGW  N E RG+IA+EIEV IF LLVEEMQ+E+ C
Subjt:  YMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC

A0A5D3DNQ5 Histone-lysine N-methyltransferase SETD1B-like isoform X2; e value: 1.6e-131; % identity: 55.28
Query:  MAQK-HLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLH
        MA+K HLHELLK+DQEPFLL+NFI DRR +LKR S KSH  HL   KPISH  DF A FC+  CF SFN SPDL N SPLF FQSPVK+PCR+ N VF H
Subjt:  MAQK-HLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLH

Query:  VPATTAGLLLEAALRIQKQSTAAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-------INEKDSVSRQSNVTSSDFCE-
        VPA TAGLLLEAALRIQKQSTAAR      SNG GLLGSFLKR THR RSRKREI G  R NDPRD   LP          E DSV R SNVT  DFCE 
Subjt:  VPATTAGLLLEAALRIQKQSTAAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-------INEKDSVSRQSNVTSSDFCE-

Query:  ----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFY
            SPFRFVLQSS S GHRTPE SSP SSPAR DHQ                                                               
Subjt:  ----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFY

Query:  STHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELD
                              NDVESL+KLP +DEEEEKEQSSPVSVLDPPFEDD+EG +EDGED+DDY +ERS+AIV+KAKHQLLKKLRRFERLAELD
Subjt:  STHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELD

Query:  PVELETFLLKDE---EGELDD-DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGN-------------------------------------VERVYMRW
        P+ELETFLL DE   E EL D DDIDHLKEE  E         EKD+KQH  +GN                                     ++RVYMR 
Subjt:  PVELETFLLKDE---EGELDD-DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGN-------------------------------------VERVYMRW

Query:  DLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC
        DLWK V+S+AIDVM G+DL+ EV DGW RN E RG+I IEIEV IF LLVEEMQ+E+ C
Subjt:  DLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC

A0A6J1CUE0 uncharacterized protein LOC111014376; e value: 5.8e-137; % identity: 55.96
Query:  MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHV
        M QKHLHELLKEDQEPF+LTNFIADRR +LKRPSPKS+ LHL +RKPIS   DFP  FCK ACF SF++SPDLR  SPLF+FQSPV    RN NA+FLHV
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHV

Query:  PATTAGLLLEAALRIQKQSTAARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----------INEKDSVSRQSNVTS---
        PA TAG+LLEAALRIQKQSTAARS      NG GLLGSFLKR THRGR+RKREIDG  RRND      LP            +NE  SVS Q+N+TS   
Subjt:  PATTAGLLLEAALRIQKQSTAARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----------INEKDSVSRQSNVTS---

Query:  --SDFCESPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSL
          S+FC+SPFRFVLQSSPS+GHRTPEFSSP +SP R DHQ                                                            
Subjt:  --SDFCESPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSL

Query:  IFYSTHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLA
                                 NDVESLKKLPV+DEEEEKEQSSPVS+LDPPFEDD+EG YEDGED+D Y++ERSY IV+KAKHQLLKKLRRFE+LA
Subjt:  IFYSTHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLA

Query:  ELDPVELETFLLKDEEGEL-DDDDIDHLKEEECESHNFDRSNNEK-------------------DMKQHGIDGNVER----VYMRWDLWKEVESSAIDVM
        ELDPVELE+FLLK EE EL DDDDIDHLKEEE ESHNF++ + E                    + +   +  N E     VY+R DLWK V+S+AID  
Subjt:  ELDPVELETFLLKDEEGEL-DDDDIDHLKEEECESHNFDRSNNEK-------------------DMKQHGIDGNVER----VYMRWDLWKEVESSAIDVM

Query:  AGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC
         G+DL+ E+ DGW RN + RG++AIEIE+ IF LLV EMQTE+DC
Subjt:  AGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC

A0A6J1G0G0 uncharacterized protein LOC111449564; e value: 6.3e-224; % identity: 83.23
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
        MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP

Query:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR
        ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR
Subjt:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHR

Query:  TPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPL
        TPEFSSPPSSPARHDHQ                                                                                   
Subjt:  TPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPL

Query:  FVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD
         VNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD
Subjt:  FVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDD

Query:  IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI
        IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI
Subjt:  IDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFI

Query:  K
        K
Subjt:  K

A0A6J1L3C1 uncharacterized protein LOC111498735; e value: 2.4e-199; % identity: 76.54
Query:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
        MAQKHLHELLKEDQEPFLLTNFIA+RRVLKRPSPKSHLLHLNK KPISHF+DFPASFCKGACFLSFN SPDLRNPSPLFQFQSPVKSPCRNSNA+FLHVP
Subjt:  MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP

Query:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINE--KDSVSRQSNVTSSDFCESPFRFVLQSSPSAG
        ATTA LLLEAALRIQKQST ARSNGFGLLGSFLKRFT+RGRSRKREIDGGCRRNDP    +    NE   DSVSRQSNVTSSDFC+SPFRFVLQSSPSAG
Subjt:  ATTAGLLLEAALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINE--KDSVSRQSNVTSSDFCESPFRFVLQSSPSAG

Query:  HRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDK
        HRTPEFSSPPSSPAR DHQ                                                                                 
Subjt:  HRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDK

Query:  PLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDD
           VNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDY+MERSYAIV+KAKHQLLKKLRRFERLAELDPVELETFLLKDEEG+LDD
Subjt:  PLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDD

Query:  DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC
        D  DHL+EEEC+SHNFDRSNNEKDMKQHGI+ NVERVYMRWDLWKEVESSAIDVMA EDLRAEVD GWKRNGE RGDIAIEIEVEIFRLLVEEMQTEVDC
Subjt:  DDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC

Query:  FIK
        FIK
Subjt:  FIK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G36420.1 unknown protein; e value: 9.7e-44; % identity: 32.45
Query:  QKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASF--CKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP
        +KHLHE L++DQEPF L ++I + R     S     + + KRK   + + FP     C+ +CF + + SPD R  SPLF+ +SP K   R+   VFL +P
Subjt:  QKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASF--CKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVP

Query:  ATTAGLLLEAALRIQKQST--------AARSNGFGLLGSFLKRFTHR-GRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTS-SD--FCESPFR
        A TA +LL+AA RIQKQ +          R NGFG+ GS LK  T+R  + R    DG            L   +E  S SR+  +   SD  FCESPF 
Subjt:  ATTAGLLLEAALRIQKQST--------AARSNGFGLLGSFLKRFTHR-GRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTS-SD--FCESPFR

Query:  FVLQSSP-SAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFP
        FVLQ++P S+GH+TP F+S  +SPAR                                           S  D  +  T S E+   +            
Subjt:  FVLQSSP-SAGHRTPEFSSPPSSPARHDHQVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFP

Query:  GKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETF
              +E+DK                 EEE+KEQ SPVSVLDP  E++E+  +   E D    +  S+ IV++AK +LLKKLRRFE+LA LDPVELE  
Subjt:  GKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETF

Query:  LLKD--------EEGELDD-----------DDIDH--LKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGW
        + ++        EE E DD           +D+D    +E  C      + N+E+  K         R+   W +    E   +D +  +DLR E  + W
Subjt:  LLKD--------EEGELDD-----------DDIDH--LKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGW

Query:  KRNGEARGDIAIEIEVEIFRLLVEEMQTEV
         R+G    +   ++E  IF +L++E   E+
Subjt:  KRNGEARGDIAIEIEVEIFRLLVEEMQTEV

AT5G03670.1 unknown protein; e value: 2.7e-54; % identity: 32.29
Query:  AQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHL--NKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHV
        +Q+HL +LL+EDQEPF L ++I+DRR        +H+ HL   KR+PIS  +  P+ FC+ ACF S  +SPD +  SPLF+    +KSP R+ NA+F+++
Subjt:  AQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHL--NKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHV

Query:  PATTAGLLLEAALRIQKQS-------TAARSNGFGLLGSFLKRFTHRGRSRKREIDGG-----------------------------CRRNDPRD----D
        PA TA +LLEAA+RIQKQS       T    N FG+ GS LK+ T+R   +KREI GG                              +RN+  +     
Subjt:  PATTAGLLLEAALRIQKQS-------TAARSNGFGLLGSFLKRFTHRGRSRKREIDGG-----------------------------CRRNDPRD----D

Query:  HLLP--------------------------PINEKDSVSRQSNVTSSD----------------FCESPFRFVLQSSPS-AGHRTPEFSSPPSSPARHDH
        H +                            ++ + S+S  S    SD                FCESPF FVLQ+ PS  G RTP FSSP +SP    H
Subjt:  HLLP--------------------------PINEKDSVSRQSNVTSSD----------------FCESPFRFVLQSSPS-AGHRTPEFSSPPSSPARHDH

Query:  QVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDE
        ++E                                                SYE                                  VE LKKL +++E
Subjt:  QVEILILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDE

Query:  EEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDDIDHLKE-EECE-----
        EEEKEQSSPVSVLDPPF+DD+E  +      DD  +  S+  V+KAKH LL+KL RFE+LA LDP+ELE  +   E  E ++++ + +K    CE     
Subjt:  EEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDDIDHLKE-EECE-----

Query:  --SHNFDR------------SNNEKDMKQHGIDGNVE------RVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEAR-GDIAIEIEVEIFRLLV
             F+             S+   +     IDG  E      RV  R   W++VES+ ID+M   D R E    W+   +A   +  ++IE EIF  LV
Subjt:  --SHNFDR------------SNNEKDMKQHGIDGNVE------RVYMRWDLWKEVESSAIDVMAGEDLRAEVDDGWKRNGEAR-GDIAIEIEVEIFRLLV

Query:  EEMQTEV
        EE+  ++
Subjt:  EEMQTEV
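
In the alignment blocks above, the unlabeled middle line marks each column: a letter where the aligned residues are identical, a '+' for a similar substitution, and a space for a mismatch or gap (the usual BLAST pairwise layout). The following is a minimal sketch of how a percent identity can be derived from such a midline, assuming identity is counted as identical columns over a chosen denominator. The function name percent_identity and the short example midline are hypothetical and not taken from this record, and the denominator the database actually uses is not stated on the page.

```python
def percent_identity(midline, denominator=None):
    """Return identical columns as a percentage of `denominator`.

    `midline` is the middle line of a BLAST-style alignment block.
    If `denominator` is omitted, the midline length is used (an assumption).
    """
    if denominator is None:
        denominator = len(midline)
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    # Letters mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / denominator

# Hypothetical 10-column midline: 8 identities, one mismatch, one '+'.
print(percent_identity("MAQK LHE+L"))
```

Passing the query length as the denominator instead of the alignment length gives a coverage-weighted identity, which can differ markedly when a long stretch of the query has no counterpart in the hit.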


Sequences
CDS sequence
ATGGCTCAAAAACACTTACACGAGCTTCTCAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATAGCCGACAGACGTGTTCTTAAGCGCCCTTCCCCCAAATCTCA
CCTTCTTCACCTCAATAAACGAAAACCCATTTCCCATTTCTCTGATTTTCCGGCGAGTTTTTGTAAGGGTGCTTGTTTTTTATCGTTTAATGATTCTCCTGATCTTAGAA
ACCCTTCGCCGCTCTTTCAATTTCAGTCTCCGGTGAAGAGTCCTTGCCGGAATTCCAATGCTGTGTTTCTCCATGTTCCGGCTACAACGGCGGGGCTTCTTCTGGAAGCT
GCTTTGAGGATTCAGAAACAGTCAACGGCCGCGAGATCGAATGGATTTGGGCTTTTGGGTTCTTTTCTTAAGCGGTTTACTCATCGTGGCCGTTCTCGGAAGCGGGAGAT
TGACGGTGGTTGCCGGAGAAATGACCCCCGCGACGACCACCTATTGCCGCCGATTAACGAGAAGGACTCTGTTTCTCGGCAGAGTAATGTAACGAGCTCGGACTTCTGCG
AAAGCCCTTTTCGATTTGTGCTTCAATCAAGTCCCTCCGCTGGTCACCGGACGCCGGAATTTTCTTCTCCGCCGTCTTCTCCGGCTCGACACGACCATCAGGTTGAAATA
CTCATTCTATTACTGTTCTTTTGCAATTTTATGACATGGGTTCATCGGAAAAACACCAACTGTCGGCAATTTTGTGTAAGTTTTCTGAAATTACCGCCGGAATTAGCGTC
AAAACACGACAACCCCACCAAAACGACAGTGAGTTATGAGGAACAAACTGCTAAATCTCTCATTTTCTATTCCACACACTTCTTTTTTCCCGGAAAATTTTTGACCTTTC
AAGAAAAGGACAAACCCCTTTTTGTCAATGACGTTGAGAGCTTGAAGAAATTGCCCGTTCAGGATGAGGAGGAAGAGAAAGAACAAAGCAGTCCTGTGTCTGTGTTGGAT
CCTCCGTTCGAGGACGACGAAGAGGGTCGCTATGAGGACGGTGAGGATGACGACGATTACGAAATGGAGCGCAGCTACGCCATTGTCGAAAAGGCAAAGCATCAGCTACT
GAAAAAGCTTCGGAGATTCGAGAGACTAGCAGAGCTAGACCCCGTAGAACTCGAGACGTTTCTACTAAAGGATGAGGAAGGCGAACTTGACGACGACGACATTGATCATC
TCAAGGAAGAAGAGTGCGAAAGCCATAACTTCGATCGGTCTAACAACGAAAAGGACATGAAACAACACGGCATAGATGGCAATGTGGAGAGAGTTTACATGAGATGGGAT
TTGTGGAAAGAGGTGGAGTCGAGCGCCATCGACGTGATGGCGGGGGAAGATTTGAGAGCGGAGGTTGACGACGGGTGGAAGAGAAATGGGGAGGCAAGAGGAGACATAGC
CATAGAAATAGAGGTTGAGATCTTCAGGTTGCTGGTGGAGGAAATGCAAACAGAAGTAGATTGCTTCATTAAGTGA
mRNA sequence
ATGGCTCAAAAACACTTACACGAGCTTCTCAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATAGCCGACAGACGTGTTCTTAAGCGCCCTTCCCCCAAATCTCA
CCTTCTTCACCTCAATAAACGAAAACCCATTTCCCATTTCTCTGATTTTCCGGCGAGTTTTTGTAAGGGTGCTTGTTTTTTATCGTTTAATGATTCTCCTGATCTTAGAA
ACCCTTCGCCGCTCTTTCAATTTCAGTCTCCGGTGAAGAGTCCTTGCCGGAATTCCAATGCTGTGTTTCTCCATGTTCCGGCTACAACGGCGGGGCTTCTTCTGGAAGCT
GCTTTGAGGATTCAGAAACAGTCAACGGCCGCGAGATCGAATGGATTTGGGCTTTTGGGTTCTTTTCTTAAGCGGTTTACTCATCGTGGCCGTTCTCGGAAGCGGGAGAT
TGACGGTGGTTGCCGGAGAAATGACCCCCGCGACGACCACCTATTGCCGCCGATTAACGAGAAGGACTCTGTTTCTCGGCAGAGTAATGTAACGAGCTCGGACTTCTGCG
AAAGCCCTTTTCGATTTGTGCTTCAATCAAGTCCCTCCGCTGGTCACCGGACGCCGGAATTTTCTTCTCCGCCGTCTTCTCCGGCTCGACACGACCATCAGGTTGAAATA
CTCATTCTATTACTGTTCTTTTGCAATTTTATGACATGGGTTCATCGGAAAAACACCAACTGTCGGCAATTTTGTGTAAGTTTTCTGAAATTACCGCCGGAATTAGCGTC
AAAACACGACAACCCCACCAAAACGACAGTGAGTTATGAGGAACAAACTGCTAAATCTCTCATTTTCTATTCCACACACTTCTTTTTTCCCGGAAAATTTTTGACCTTTC
AAGAAAAGGACAAACCCCTTTTTGTCAATGACGTTGAGAGCTTGAAGAAATTGCCCGTTCAGGATGAGGAGGAAGAGAAAGAACAAAGCAGTCCTGTGTCTGTGTTGGAT
CCTCCGTTCGAGGACGACGAAGAGGGTCGCTATGAGGACGGTGAGGATGACGACGATTACGAAATGGAGCGCAGCTACGCCATTGTCGAAAAGGCAAAGCATCAGCTACT
GAAAAAGCTTCGGAGATTCGAGAGACTAGCAGAGCTAGACCCCGTAGAACTCGAGACGTTTCTACTAAAGGATGAGGAAGGCGAACTTGACGACGACGACATTGATCATC
TCAAGGAAGAAGAGTGCGAAAGCCATAACTTCGATCGGTCTAACAACGAAAAGGACATGAAACAACACGGCATAGATGGCAATGTGGAGAGAGTTTACATGAGATGGGAT
TTGTGGAAAGAGGTGGAGTCGAGCGCCATCGACGTGATGGCGGGGGAAGATTTGAGAGCGGAGGTTGACGACGGGTGGAAGAGAAATGGGGAGGCAAGAGGAGACATAGC
CATAGAAATAGAGGTTGAGATCTTCAGGTTGCTGGTGGAGGAAATGCAAACAGAAGTAGATTGCTTCATTAAGTGATGGAAATGATTAAAGTCCATAGATTATATTTAAA
TTTGCATAAATAATGTCTAGATTATAAATTAGATTAGAAATATAATCTGACTTTAAGAGAATAGGCTTGAGTTTAACTTTACCTCCCTCTTTAAGTACAAAGATATTAGC
ACTATGATTTGTAAATTTATTATCCTTCCATATATTATCATCCT
Protein sequence
MAQKHLHELLKEDQEPFLLTNFIADRRVLKRPSPKSHLLHLNKRKPISHFSDFPASFCKGACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEA
ALRIQKQSTAARSNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPINEKDSVSRQSNVTSSDFCESPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVEI
LILLLFFCNFMTWVHRKNTNCRQFCVSFLKLPPELASKHDNPTKTTVSYEEQTAKSLIFYSTHFFFPGKFLTFQEKDKPLFVNDVESLKKLPVQDEEEEKEQSSPVSVLD
PPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLLKKLRRFERLAELDPVELETFLLKDEEGELDDDDIDHLKEEECESHNFDRSNNEKDMKQHGIDGNVERVYMRWD
LWKEVESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDCFIK
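
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and reads the sequence in frame from its first base; the names CODON_TABLE and translate are hypothetical helpers, not part of the database. It translates only the first 24 and last 12 nucleotides of the CDS listed above, which should correspond to the start and end of the protein sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGCTCAAAAACACTTACACGAG"))  # first 24 nt of the CDS -> MAQKHLHE
print(translate("TTCATTAAGTGA"))              # last 12 nt of the CDS  -> FIK
```

The two fragments give MAQKHLHE and FIK, matching the first eight and last three residues of the protein sequence; passing the full CDS to translate would allow the same comparison over the whole length.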