CuGenDBv2

MS023563 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS023563
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: protein CHUP1, chloroplastic
Genome location: scaffold1258:139071..141618
RNA-Seq Expression: MS023563
Synteny: MS023563
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process)
                     GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like
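The genome location field has the form `scaffold:start..end`. A minimal Python sketch for splitting it into its parts follows. The names `Location` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'scaffold:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold1258:139071..141618")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under that coordinate assumption the gene spans 2548 bp on the scaffold, which is longer than the 1233 nt CDS listed further down.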


Homology
GenBank top hits | e value | % identity | Alignment
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 1.5e-161 | 73.99
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQD        
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR

Query:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSE
                 RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDSE
Subjt:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSE

Query:  AIAAFEELKKVGLS-SQRK
        AI AFE +K+VGL  +QRK
Subjt:  AIAAFEELKKVGLS-SQRK
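In these alignment blocks the middle row carries a letter where the two sequences match, a `+` where the substitution is conservative, and a space otherwise. The sketch below shows one way a percent identity could be derived from the Query and middle rows. It is a hedged illustration: the function name `percent_identity` is hypothetical, and dividing by the full aligned length (gaps included) is an assumption about how the % identity column on this page is defined.

```python
def percent_identity(query_rows, midline_rows):
    """Estimate percent identity from BLAST-style alignment rows.

    query_rows   : aligned query segments (gap characters included)
    midline_rows : the matching middle rows, one per query segment
    """
    # Assumed denominator: total aligned length, gaps included.
    aligned_length = sum(len(row) for row in query_rows)
    if aligned_length == 0:
        raise ValueError("empty alignment")
    # A letter in the middle row marks an identical residue pair;
    # '+' (conservative substitution) and spaces are not identities.
    identities = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identities / aligned_length


# Toy rows, not taken from the alignments on this page.
print(round(percent_identity(["MKT-LLV"], ["MK  L+V"]), 2))
```

The rows must be passed without the `Query:` prefix and with leading spaces of the middle row preserved, otherwise the columns no longer line up.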

XP_022150972.1 protein CHUP1, chloroplastic [Momordica charantia] | 3.8e-213 | 95.38
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
        MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ

Query:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
        RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
Subjt:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE

Query:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFK
        GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQD               
Subjt:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFK

Query:  FCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEE
          RLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDS+AIAAFE 
Subjt:  FCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEE

Query:  LKKVGLSSQRK
        LKKVGLSSQRK
Subjt:  LKKVGLSSQRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata] | 9.8e-161 | 74.22
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQD        
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR

Query:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSE
                 RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDSE
Subjt:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSE

Query:  AIAAFEELKKVGLS-SQRK
        AI AFE +K+VGL  +QRK
Subjt:  AIAAFEELKKVGLS-SQRK

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima] | 5.2e-162 | 74.11
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI
        VN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQD      
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI

Query:  LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFD
                   RLEQSVSNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFD
Subjt:  LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFD

Query:  SEAIAAFEELKKVG-LSSQRK
        SEAI AFE +K+VG L SQRK
Subjt:  SEAIAAFEELKKVG-LSSQRK

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] | 6.1e-163 | 74.35
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS IQSLKAHNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI
        VN LI+EVE AAPRDI EVERFV WLD ELGSLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQD      
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI

Query:  LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFD
                   RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFD
Subjt:  LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFD

Query:  SEAIAAFEELKKVGLS-SQRK
        SEAI AFE +K+VGL  +QRK
Subjt:  SEAIAAFEELKKVGLS-SQRK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LVK7 Uncharacterized protein | 5.2e-160 | 73.93
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK
        MP+EEDE LAMEI  L+KEL+I++ KS FLEKENQELRQEL RL+SQIQS KA NN+RKS+LWKKF++S+D     A+SPP      A DKRE+TK SPK
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK

Query:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG
        Q  W  VKES RM  G PA  P PPPPPLPTKLL GSKAVRRVPEVLELYR+LTKRDAQKENK AHGG PAVAFTKNMIGEIENRSAYL+AIKSEVETHG
Subjt:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG

Query:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC
        +FVNWLIKEVE  APRDI+EVERFV WLD +L SLVDERAVLK+FPRWPE KADALREAAFSYRDLK LES+VC FRDNPKEEM VVLKRAQALQD    
Subjt:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC

Query:  TILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGG
                     R+EQSVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+K+S+LRLAKEYM RITRELQS + T Q +NL LQG RFAYRVHQYAGG
Subjt:  TILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGG

Query:  FDSEAIAAFEELKKVGLSSQRK
        FDSE I AFE LKK GLSSQRK
Subjt:  FDSEAIAAFEELKKVGLSSQRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X1 | 1.2e-159 | 74.17
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK
        MP+E+DEELAMEI  L+K+L+I++ KS FLE+ENQELR EL RLKSQIQSLKA NN+RKS+LWKKF++SMD     A+SPP      A DKRE TK  PK
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMD-----AESPP------ATDKREATKSSPK

Query:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG
        Q  W  VKESQRM    PA AP PPPPPLP KLL GSKAVRRVPEVL+LYR+LTKRDAQKENK AHGG P VAFTKNMIGEIENRSAYL+AIKSEVETHG
Subjt:  QPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHG

Query:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC
        EFVNWLIKEVE  APRDI+E E+FV WLD +L SLVDERAVLKHFPRWPE KADALREAAFSYRDLKSLES+VC FRDNPKEEM VVLKRAQALQD    
Subjt:  EFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRREC

Query:  TILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGG
                     R+EQSVSN+E+TREF+C KY+ F+IPC+WMF+S L  Q+KLS+LRLAKEYM RITREL+S + T QA+NL LQGVRFAYRVHQYAGG
Subjt:  TILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGG

Query:  FDSEAIAAFEELKKVGLSSQRK
        FDSEAI AFE LKK GLSSQRK
Subjt:  FDSEAIAAFEELKKVGLSSQRK

A0A6J1DC83 protein CHUP1, chloroplastic | 1.8e-213 | 95.38
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
        MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQ

Query:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
        RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE
Subjt:  RMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVE

Query:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFK
        GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQD               
Subjt:  GAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFK

Query:  FCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEE
          RLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDS+AIAAFE 
Subjt:  FCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEE

Query:  LKKVGLSSQRK
        LKKVGLSSQRK
Subjt:  LKKVGLSSQRK

A0A6J1G8X0 protein CHUP1, chloroplastic | 4.7e-161 | 74.22
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW
        MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA-------ESPPATDKREATKSSPKQPVW

Query:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  VAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQD        
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR

Query:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSE
                 RLEQSVSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDSE
Subjt:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSE

Query:  AIAAFEELKKVGLS-SQRK
        AI AFE +K+VGL  +QRK
Subjt:  AIAAFEELKKVGLS-SQRK

A0A6J1K8G4 protein CHUP1, chloroplastic | 2.5e-162 | 74.11
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDA---------ESPPATDKREATKSSPKQP

Query:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEF

Query:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI
        VN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQD      
Subjt:  VNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTI

Query:  LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFD
                   RLEQSVSNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFD
Subjt:  LRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFD

Query:  SEAIAAFEELKKVG-LSSQRK
        SEAI AFE +K+VG L SQRK
Subjt:  SEAIAAFEELKKVG-LSSQRK

SwissProt top hits | e value | % identity | Alignment
Q9LI74 Protein CHUP1, chloroplastic | 4.5e-68 | 47.44
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +        
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR

Query:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF
                 ++EQSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Subjt:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF

Query:  DSEAIAAFEELK
        D+E++ AFEEL+
Subjt:  DSEAIAAFEELK

Arabidopsis top hits | e value | % identity | Alignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown | 4.6e-100 | 48.07
Query:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKR--EATKSSPKQPVWVAVKE
        +P  ED+    ++  L KELQ  + ++D LEKEN ELRQE+ RL++Q+ +LK+H N+RKS+LWKK  +S D  +   ++ +  E+ KS+ K      V+ 
Subjt:  MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKR--EATKSSPKQPVWVAVKE

Query:  SQRMP--EGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLI
            P  +G       PPPPPLP+K   G ++VRR PEV+E YR+LTKR++   NK    G  + AF +NMIGEIENRS YL+ IKS+ + H + ++ LI
Subjt:  SQRMP--EGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLI

Query:  KEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDE
         +VE A   DI+EVE FV W+D EL SLVDERAVLKHFP+WPE K D+LREAA +Y+  K+L +E+ SF+DNPK+ +   L+R Q+LQD           
Subjt:  KEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDE

Query:  NFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIA
              RLE+SV+N EK R+ +  +Y++F+IP EWM ++GL+GQ+K SSLRLA+EYM+RI +EL+S + + +  NL+LQGVRFAY +HQ+AGGFD E ++
Subjt:  NFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIA

Query:  AFEELKKVGLSSQR
         F ELKK+     R
Subjt:  AFEELKKVGLSSQR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | 3.2e-69 | 47.44
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +        
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR

Query:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF
                 ++EQSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Subjt:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF

Query:  DSEAIAAFEELK
        D+E++ AFEEL+
Subjt:  DSEAIAAFEELK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | 3.2e-69 | 47.44
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +        
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR

Query:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF
                 ++EQSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Subjt:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF

Query:  DSEAIAAFEELK
        D+E++ AFEEL+
Subjt:  DSEAIAAFEELK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein | 3.2e-69 | 47.44
Query:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN
        P G P P P    PPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV 
Subjt:  PEGAPAPAPA---PPPPPLPTKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVN

Query:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR
         L  EV  ++  DI ++  FV+WLD EL  LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       LK+   L +        
Subjt:  WLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILR

Query:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF
                 ++EQSV  + +TR+ + ++Y+ F IP +W+ ++G+VG++KLSS++LAK+YM+R+  EL S+  + +  N   LLLQGVRFA+RVHQ+AGGF
Subjt:  SDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADN---LLLQGVRFAYRVHQYAGGF

Query:  DSEAIAAFEELK
        D+E++ AFEEL+
Subjt:  DSEAIAAFEELK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 8.2e-73 | 46.31
Query:  AESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVA-------FT
        A+ PP    +++    P  P    +++    P  + AP P PPPPP P  L   S  VRRVPEV+E Y SL +RD+    + + GG  A A         
Subjt:  AESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVA-------FT

Query:  KNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCS
        ++MIGEIENRS YL AIK++VET G+F+ +LIKEV  AA  DI +V  FV WLD EL  LVDERAVLKHF  WPE KADALREAAF Y DLK L SE   
Subjt:  KNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCS

Query:  FRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSID
        FR++P++     LK+ QAL                 F +LE  V ++ + RE +  K+++F+IP +WM E+G+  Q+KL+S++LA +YM+R++ EL++I+
Subjt:  FRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCEWMFESGLVGQMKLSSLRLAKEYMRRITRELQSID

Query:  -NTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELK
            + + L++QGVRFA+RVHQ+AGGFD+E + AFEEL+
Subjt:  -NTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELK


Sequences
CDS sequence
ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGT
CGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCT
CCGGCGCCGCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGC
GCAAAAGGAAAACAAGGCCGCCCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAG
AGGTGGAAACTCACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAG
CTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAG
CCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAATGTACAATCTTGAGAA
GTGATGAGAATTTTTTCAAATTTTGCAGGCTGGAGCAGAGTGTGAGCAATGTGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAA
TGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAGTTAAGCTCATTGAGGCTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCA
ACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACAGGGTTCACCAGTATGCAGGCGGTTTCGATTCAGAAGCTATAGCAGCATTTGAAGAACTGAAGAAAG
TTGGGCTGAGTAGTCAAAGAAAA
mRNA sequence
ATGCCGCAGGAAGAAGATGAAGAATTGGCCATGGAGATCACCAGCTTGAGAAAAGAACTGCAAATTGCTGTGGACAAATCAGATTTTCTAGAGAAAGAAAATCAAGAACT
CAGACAAGAATTGGGTCGTCTCAAATCGCAGATTCAGTCTCTCAAAGCTCACAACAATGACAGAAAATCCCTTCTCTGGAAGAAATTTTACAACTCCATGGATGCAGAGT
CGCCGCCGGCGACTGACAAACGGGAGGCGACCAAATCATCGCCGAAACAGCCTGTTTGGGTCGCCGTGAAAGAGAGCCAGAGAATGCCGGAGGGGGCACCGGCTCCGGCT
CCGGCGCCGCCGCCGCCGCCGCTTCCGACGAAGCTGCTCGCCGGATCAAAGGCAGTGCGGCGAGTACCGGAAGTGTTGGAGCTGTACCGCTCGCTGACGAAACGAGATGC
GCAAAAGGAAAACAAGGCCGCCCACGGCGGATTTCCGGCGGTGGCATTCACCAAAAATATGATCGGAGAAATCGAAAACCGGTCAGCGTATCTCACTGCGATAAAATCAG
AGGTGGAAACTCACGGAGAGTTCGTGAACTGGCTGATCAAGGAAGTGGAAGGGGCAGCACCAAGGGACATAACAGAGGTGGAGAGGTTCGTGAACTGGCTGGACAGAGAG
CTGGGGTCGCTGGTAGATGAGAGGGCAGTGCTGAAGCACTTCCCACGGTGGCCTGAGGGGAAGGCGGATGCACTGCGGGAGGCAGCATTCAGTTACAGGGACCTGAAGAG
CCTGGAGAGTGAAGTATGTTCCTTCAGAGACAATCCGAAGGAGGAGATGGGTGTGGTACTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAATGTACAATCTTGAGAA
GTGATGAGAATTTTTTCAAATTTTGCAGGCTGGAGCAGAGTGTGAGCAATGTGGAGAAGACGAGGGAGTTCAGTTGTAACAAGTACAGAAATTTTAGAATACCCTGCGAA
TGGATGTTCGAATCTGGACTTGTCGGTCAGATGAAGTTAAGCTCATTGAGGCTGGCCAAGGAATACATGCGAAGGATAACAAGAGAACTCCAATCAATCGATAACACGCA
ACAAGCAGATAATCTTCTTCTTCAAGGGGTTCGATTTGCTTACAGGGTTCACCAGTATGCAGGCGGTTTCGATTCAGAAGCTATAGCAGCATTTGAAGAACTGAAGAAAG
TTGGGCTGAGTAGTCAAAGAAAA
Protein sequence
MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKSLLWKKFYNSMDAESPPATDKREATKSSPKQPVWVAVKESQRMPEGAPAPA
PAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITEVERFVNWLDRE
LGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNPKEEMGVVLKRAQALQDRRECTILRSDENFFKFCRLEQSVSNVEKTREFSCNKYRNFRIPCE
WMFESGLVGQMKLSSLRLAKEYMRRITRELQSIDNTQQADNLLLQGVRFAYRVHQYAGGFDSEAIAAFEELKKVGLSSQRK
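The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal Python sketch that assumes the standard genetic code, which is the usual choice for a nuclear plant gene but is not stated on this page. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing anything other than A, C, G, T raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS listed above.
print(translate("ATGCCGCAGGAAGAAGATGAAGAATTGGCC"))  # MPQEEDEELA
```

Passing the full CDS with its line breaks intact works as well, because whitespace is stripped before translation. The listed CDS is 1233 nt and the protein 411 residues, so the CDS as shown carries no terminal stop codon.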