; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002099 (gene) of Snake gourd v1 genome

Gene IDTan0002099
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG10:19340939..19357819
RNA-Seq ExpressionTan0002099
SyntenyTan0002099
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597677.1 Protein IQ-DOMAIN 14, partial [Cucurbita argyrosperma subsp. sororia]4.0e-24392.75Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEVETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK+AELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLWKIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGDSTEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKG
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        KQSEGT L SVPTTQPDPMGKAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

KAG7029121.1 hypothetical protein SDJN02_10306, partial [Cucurbita argyrosperma subsp. argyrosperma]4.0e-24392.75Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEVETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK+AELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLWKIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGDSTEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKG
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        KQSEGT L SVPTTQPDPMGKAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

XP_022932640.1 uncharacterized protein LOC111439133 isoform X1 [Cucurbita moschata]8.9e-24392.54Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEVETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK+AELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLWKIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT ++SIP PQIGDSTEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKG
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        KQSEGT L SVPTTQPDPMGKAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

XP_022972330.1 uncharacterized protein LOC111470900 [Cucurbita maxima]3.6e-24492.96Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEVETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK+AELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVK+EKNA  DDKFGINNDSVLEFRDTN+SLEQVLWKIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGDSTEDIV NVLIHNE+AEAERN    IV QPVEKHEE EKG
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        KQSEGT L SVPTTQPDPMGKAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

XP_023540257.1 uncharacterized protein LOC111800683 isoform X1 [Cucurbita pepo subsp. pepo]9.5e-24593.18Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEVETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK+AELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLWKIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGDSTEDIV NVLIHNE+AEAERN    IV QPVEKHE+ EKG
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        KQSEGT L SVPTTQPDPMGKAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

TrEMBL top hitse value%identityAlignment
A0A1S3BXR7 uncharacterized protein LOC103494594 isoform X13.1e-22586.78Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        ME+HSK   RED EVDIIE SNKTDPKFCGKEDPDATEYSSSF ETSDADNCSGF+EGEVETQFFGDIGLPPTFGSFS  L IRKRKLT HWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQ K    DPTME+F SK FPFSS YYRRKAMKRRKRK+ ED  DISSYMS HNLFSY+ENKR+ELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVK+EKNA SDDKFGINNDS+LE RDT+NSLEQVLWKIE VHSRLHKLKGQMDKVMSKNAA FSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
         VMCAS Q ISECDIG+LMKPESAISS+G+AILVPDIIESTVG LTAT DVS+PQPQIGDSTE IVDNVL HNEV EAERN DS++V QPVEKH E EK 
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
         Q EGT L S PTTQPDPMGKAL+S+EQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

A0A5A7TML9 Uncharacterized protein1.1e-22586.99Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        ME+HSK   RED EVDIIE SNKTDPKFCGKEDPDATEYSSSF ETSDADNCSGF+EGEVETQFFGDIGLPPTFGSFS  L IRKRKLT HWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQ K    DPTMEDF SK FPFSS YYRRKAMKRRKRK+ ED  DISSYMS HNLFSY+ENKR+ELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVK+EKNA SDDKFGINNDS+LE RDT+NSLEQVLWKIE VHSRLHKLKGQMDKVMSKNAA FSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
         VMCAS Q ISECDIG+LMKPESAISS+G+AILVPDIIESTVG LTAT DVS+PQPQIGDSTE IVDNVL HNEV EAERN DS++V QPVEKH E EK 
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
         Q EGT L S PTTQPDPMGKAL+S+EQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

A0A6J1E2X4 uncharacterized protein LOC1110255453.0e-22886.78Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        MET SK KCRED EV+IIECSN+TDPK  GKEDPDATEYSSSFAETSDAD+CSGF+EG+VETQFFGDIGLPP FGSFS  LPIRKRKLT+HWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKE+ESQ L+YSRALAVYEQGK+S LDPT+E+FSSK  P SS YYRRKAMKRRKRKR EDTNDI SYM+QHNLFSY+ENKR+ELDG  VAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVK+EKNA SDDKFGINNDSVLEFR++N+SLEQVLWKIEAVHSRLHKLKGQMD VMSKNAAKFSSSENLSLLAPCEAQTSSAP+PTFSAGNGE+SV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIG+LMKPESAISSYGEAILVPDIIESTVGLLTAT DVS+PQPQIGDSTEDIVDNVLIHNEVAEAERN  SRIVVQPVEKH E EK 
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        K  EGT L S+PTTQ DPMGKA++ +EQS LKKCLASDINFPRNKRKRGERKAGPGSWNK+HSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

A0A6J1F2Q4 uncharacterized protein LOC111439133 isoform X14.3e-24392.54Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEVETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK+AELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLWKIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT ++SIP PQIGDSTEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKG
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        KQSEGT L SVPTTQPDPMGKAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

A0A6J1I8A3 uncharacterized protein LOC1114709001.8e-24492.96Show/hide
Query:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM
        METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEVETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLM
Subjt:  METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLM

Query:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD
        WRCKWTELRIKEIESQ LKYSRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK+AELDGTSVAD
Subjt:  WRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVAD

Query:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
        EFANPVK+EKNA  DDKFGINNDSVLEFRDTN+SLEQVLWKIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Subjt:  EFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV

Query:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG
        GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGDSTEDIV NVLIHNE+AEAERN    IV QPVEKHEE EKG
Subjt:  GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKG

Query:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
        KQSEGT L SVPTTQPDPMGKAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Subjt:  KQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G50040.1 unknown protein1.6e-2727.55Show/hide
Query:  SSSFAETSDADNCSGFNEG-EVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPT
        SSSF ++  A +   F  G E ++    D  LP T    +  L + K+K  + W+   +P+MWRCKW EL++KEI+SQ   Y + +  Y   K   L+ +
Subjt:  SSSFAETSDADNCSGFNEG-EVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPT

Query:  -MEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK-RAELDGTSVADEFANPVKM--EKNAVSDDKFGINNDSVLEFRDTNNS
         +E F  K+ PF     RR   KR +RKR E+T D+++YMS HNLFSY + +    + G  +  +F    K   +++A+ DD        + E   +++ 
Subjt:  -MEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENK-RAELDGTSVADEFANPVKM--EKNAVSDDKFGINNDSVLEFRDTNNS

Query:  LEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQT---SSAPSPTFSAGNGELSVGVMCASAQHISECDIGELMKPESAISSYGEA
        L + L KI+    +  +L+ ++D++M  +    +SS    ++APC   +   +        A    +  G  C  A HI       LM P++ I   G+ 
Subjt:  LEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQT---SSAPSPTFSAGNGELSVGVMCASAQHISECDIGELMKPESAISSYGEA

Query:  ILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVL------------IHNEVAEAERNP
        +     I S++       D+ + +P++ D   +  D  L            +  E A+AE +P
Subjt:  ILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVL------------IHNEVAEAERNP

AT3G59670.1 unknown protein3.2e-9747.3Show/hide
Query:  EDFEVDIIEC-SNKTDPKFCGKEDPDATEYSSSFAETSD------ADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRC
        E+ +VDI+E   NKT       EDP+ATEYSSSF++T+        D  +G  E EVE+ ++ +  L P + SFS     RK++LTNHW+ FIRPLMWR 
Subjt:  EDFEVDIIEC-SNKTDPKFCGKEDPDATEYSSSFAETSD------ADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRC

Query:  KWTELRIKEIESQTLKYSRALAVYEQGK-NSGLDPTMEDFSS---KAFPFSSP-YYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSV
        KW ELRI+E+ES+ L+Y + L +Y+Q K  + +DP++ +      K+ PFS+P Y +R A KRRKRK+ E T+DI+SYM+ HNLFSY E KR   DG  +
Subjt:  KWTELRIKEIESQTLKYSRALAVYEQGK-NSGLDPTMEDFSS---KAFPFSSP-YYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSV

Query:  ADEFANPVKMEKNAVSDDKFGINN-DSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSA-GNG
        AD+F +    +  + S++   +++ DS+   RD ++ LE+VLWKIE VHS++H+LK Q+D V+SKN A+FSSSENLSLLA      SSAPSPT SA GNG
Subjt:  ADEFANPVKMEKNAVSDDKFGINN-DSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSA-GNG

Query:  E-LSVGVMCASAQHISECDIGELM-KPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKH
        + +S G +  ++QH+++  +G+++   E  ISSYG+A  +PDIIESTVGL  A ADV++   QIGDS EDI+DN+LI N VAE E N D    +     H
Subjt:  E-LSVGVMCASAQHISECDIGELM-KPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKH

Query:  EEQEKGKQSEGTCLGSVPTTQ------PDPMGKALISDEQSALKKCLASDINFPRNKRKR-GERKAGPGSWNKKHSSEPDSQ
        +E EK ++ EGT +  +  T+       +     L   E S L+ CLAS++  PRNKR R GERKA   SW KKH S+P+SQ
Subjt:  EEQEKGKQSEGTCLGSVPTTQ------PDPMGKALISDEQSALKKCLASDINFPRNKRKR-GERKAGPGSWNKKHSSEPDSQ

AT4G37440.1 unknown protein1.7e-3731.65Show/hide
Query:  EDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRP-LMWRCKWTELR
        ++ EVDI+EC++  + +  G +D     YSSSF  T         N+ EV++    +  LP         L +RKRKLT+HW+ F++P LMWRCKW EL+
Subjt:  EDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRP-LMWRCKWTELR

Query:  IKEIESQTLKYSRALAVYEQGKNSGLDPT-MEDFSSKAFPFSSPYYRRKA--MKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVADEFANPV
         KE+++Q  KY + +  Y Q K   L+    E+   KA P   P Y +K   MKR+ RKR E+T D++SY S HNLFSYY+ +++  D  ++ D   N  
Subjt:  IKEIESQTLKYSRALAVYEQGKNSGLDPT-MEDFSSKAFPFSSPYYRRKA--MKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVADEFANPV

Query:  KMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSVGVMCAS
        K  K+A  +  F       LEFR+ +  LEQ+L KIEA  S    LK ++DKV+S+N + F  +  ++ L   +  TSS       A   E    +    
Subjt:  KMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSVGVMCAS

Query:  AQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGT
           ISE    E     +++SS+    + P+  E+T  LL   +++   + + G S   I D  L+  E A  E  P      +PV K   + +       
Subjt:  AQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGT

Query:  CLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKK
            + T +     +  +S E+      +AS   F   KRKRG+R++G     ++
Subjt:  CLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKK

AT4G37440.2 unknown protein7.5e-3837.99Show/hide
Query:  EDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRP-LMWRCKWTELR
        ++ EVDI+EC++  + +  G +D     YSSSF  T         N+ EV++    +  LP         L +RKRKLT+HW+ F++P LMWRCKW EL+
Subjt:  EDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRP-LMWRCKWTELR

Query:  IKEIESQTLKYSRALAVYEQGKNSGLDPT-MEDFSSKAFPFSSPYYRRKA--MKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVADEFANPV
         KE+++Q  KY + +  Y Q K   L+    E+   KA P   P Y +K   MKR+ RKR E+T D++SY S HNLFSYY+ +++  D  ++ D   N  
Subjt:  IKEIESQTLKYSRALAVYEQGKNSGLDPT-MEDFSSKAFPFSSPYYRRKA--MKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVADEFANPV

Query:  KMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSS
        K  K+A  +  F       LEFR+ +  LEQ+L KIEA  S    LK ++DKV+S+N + F  +  ++ L   +  TSS
Subjt:  KMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTCATAGCAAAGGGAAGTGCCGGGAGGATTTCGAGGTTGATATAATTGAATGTTCAAATAAAACTGATCCCAAATTTTGTGGGAAAGAAGACCCTGATGCAAC
TGAATATTCAAGCTCATTTGCTGAGACATCTGATGCGGATAATTGTTCAGGATTCAACGAAGGAGAAGTAGAAACTCAATTTTTTGGAGACATTGGCTTGCCACCCACAT
TTGGTTCATTTAGTGGTGCACTTCCAATAAGGAAGAGGAAGTTAACAAATCACTGGCAAAACTTCATCCGCCCTCTAATGTGGCGTTGCAAGTGGACAGAATTGAGAATT
AAGGAAATTGAGTCACAGACATTAAAATATTCCAGAGCACTTGCAGTGTATGAACAAGGAAAAAATTCAGGCCTTGATCCAACAATGGAAGATTTTTCTTCAAAAGCATT
TCCATTTTCTAGCCCATATTATAGAAGAAAGGCAATGAAGAGAAGAAAACGAAAGAGAGCTGAAGATACAAATGATATATCATCCTATATGTCACAACATAACCTTTTCT
CCTACTATGAAAATAAGAGAGCTGAACTAGATGGTACTTCTGTAGCTGATGAATTTGCTAATCCAGTGAAAATGGAGAAAAATGCTGTTTCAGACGACAAATTTGGGATT
AATAATGACTCTGTTCTCGAGTTCAGAGACACTAATAATTCTTTGGAGCAAGTACTCTGGAAAATTGAAGCGGTGCACTCTCGACTTCACAAGCTAAAGGGTCAAATGGA
TAAGGTGATGTCCAAAAATGCTGCAAAATTTTCTTCCTCAGAGAATCTGAGTCTTCTTGCACCTTGTGAGGCACAGACCAGCTCTGCCCCTAGTCCTACGTTCTCTGCTG
GTAATGGAGAATTATCAGTTGGAGTTATGTGTGCCTCGGCCCAACATATATCAGAGTGTGATATTGGTGAGCTGATGAAGCCTGAAAGTGCTATTTCAAGCTATGGGGAG
GCAATACTAGTTCCTGATATTATTGAAAGTACAGTAGGTCTTTTGACTGCTACTGCTGATGTTTCAATTCCTCAACCACAAATTGGGGACTCAACTGAGGATATTGTCGA
TAATGTTCTGATACACAATGAGGTGGCTGAGGCAGAGAGAAACCCAGATAGCAGGATTGTTGTGCAGCCCGTGGAAAAACATGAAGAACAAGAAAAAGGCAAGCAAAGTG
AAGGTACCTGTCTTGGCTCAGTTCCAACTACACAACCTGATCCTATGGGAAAAGCTTTGATCTCCGACGAACAATCAGCTCTTAAGAAATGTTTAGCTTCAGATATCAAT
TTCCCTAGGAACAAGAGAAAGCGAGGGGAAAGAAAAGCAGGCCCGGGTAGTTGGAACAAGAAACATTCGAGCGAACCCGATAGCCAGTAA
mRNA sequenceShow/hide mRNA sequence
TTCGCTTTAGGTTTATAACGTTAACTTCTCTGCTCTCTCTCTCTCTCTCTCTCTTCTTCTAGCCGCGCCGAAATTTCTCTCTCCGCACTCTCTCCCGCGCTCACGCCTCT
CTCTCCTCTCTCTTTCTCTCTTCCGTTTCATTGCTCCTCCATAGCTTTTTTTCACTCCCTCGCCTTTCTACATTGCTCTTTCTTGCCTCCTCTGCTTCTTCCTTCTTCGG
TCTGCTGCATTTTCTTTCTCTGTTGCTGTCCTCTGACTCTCGCTATAATATAATTGTGCTAAATCATTGATGGACCTCTATCTGCACAAGACAATGGATTAAAGAAAAAT
GCTGCACGTATTAAGTTGGAGATTTTCTTGGTAAATTCTAGTGGGAAAGGCAGGGGCAGGAGCTAGTCTGGGAAACCATTTCTAGGAATGGAAACTCATAGCAAAGGGAA
GTGCCGGGAGGATTTCGAGGTTGATATAATTGAATGTTCAAATAAAACTGATCCCAAATTTTGTGGGAAAGAAGACCCTGATGCAACTGAATATTCAAGCTCATTTGCTG
AGACATCTGATGCGGATAATTGTTCAGGATTCAACGAAGGAGAAGTAGAAACTCAATTTTTTGGAGACATTGGCTTGCCACCCACATTTGGTTCATTTAGTGGTGCACTT
CCAATAAGGAAGAGGAAGTTAACAAATCACTGGCAAAACTTCATCCGCCCTCTAATGTGGCGTTGCAAGTGGACAGAATTGAGAATTAAGGAAATTGAGTCACAGACATT
AAAATATTCCAGAGCACTTGCAGTGTATGAACAAGGAAAAAATTCAGGCCTTGATCCAACAATGGAAGATTTTTCTTCAAAAGCATTTCCATTTTCTAGCCCATATTATA
GAAGAAAGGCAATGAAGAGAAGAAAACGAAAGAGAGCTGAAGATACAAATGATATATCATCCTATATGTCACAACATAACCTTTTCTCCTACTATGAAAATAAGAGAGCT
GAACTAGATGGTACTTCTGTAGCTGATGAATTTGCTAATCCAGTGAAAATGGAGAAAAATGCTGTTTCAGACGACAAATTTGGGATTAATAATGACTCTGTTCTCGAGTT
CAGAGACACTAATAATTCTTTGGAGCAAGTACTCTGGAAAATTGAAGCGGTGCACTCTCGACTTCACAAGCTAAAGGGTCAAATGGATAAGGTGATGTCCAAAAATGCTG
CAAAATTTTCTTCCTCAGAGAATCTGAGTCTTCTTGCACCTTGTGAGGCACAGACCAGCTCTGCCCCTAGTCCTACGTTCTCTGCTGGTAATGGAGAATTATCAGTTGGA
GTTATGTGTGCCTCGGCCCAACATATATCAGAGTGTGATATTGGTGAGCTGATGAAGCCTGAAAGTGCTATTTCAAGCTATGGGGAGGCAATACTAGTTCCTGATATTAT
TGAAAGTACAGTAGGTCTTTTGACTGCTACTGCTGATGTTTCAATTCCTCAACCACAAATTGGGGACTCAACTGAGGATATTGTCGATAATGTTCTGATACACAATGAGG
TGGCTGAGGCAGAGAGAAACCCAGATAGCAGGATTGTTGTGCAGCCCGTGGAAAAACATGAAGAACAAGAAAAAGGCAAGCAAAGTGAAGGTACCTGTCTTGGCTCAGTT
CCAACTACACAACCTGATCCTATGGGAAAAGCTTTGATCTCCGACGAACAATCAGCTCTTAAGAAATGTTTAGCTTCAGATATCAATTTCCCTAGGAACAAGAGAAAGCG
AGGGGAAAGAAAAGCAGGCCCGGGTAGTTGGAACAAGAAACATTCGAGCGAACCCGATAGCCAGTAAGTTATTGTAGCTGTCCATTAGAGGTCTGGAGGTGGTCAGACGT
AAATTTTCCACTTTGCCATTAGCCACTGGCATAGGTACGTAATGACCACTGCTCAGGTTTAGTCTTTATTAAGCAATTTCCAGAATGAAGCAGCTCTCAGTTGCTGCTGC
CACAATGTGTTAGAGAGAGATCCTGAGTTTTTCTCGTGTAAGTTCTGTCTGTATCATGTTTATTGGGAACTCATCTCTCTCGTACATCATTTTAGATCTCAATAAGATAA
CTTTCGCCAA
Protein sequenceShow/hide protein sequence
METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRI
KEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGI
NNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSVGVMCASAQHISECDIGELMKPESAISSYGE
AILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDIN
FPRNKRKRGERKAGPGSWNKKHSSEPDSQ