; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg25537 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg25537
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDUF3685 domain-containing protein
Genome locationCarg_Chr17:8967089..8975867
RNA-Seq ExpressionCarg25537
SyntenyCarg25537
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022552 - Uncharacterised protein family Ycf55


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575973.1 putative protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.64Show/hide
Query:  MSSGVASSVSPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKFRPNGSLRFIASCSSGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQ
        MSSGVASSVSPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKFRPNGSLRFIASCSSGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQ
Subjt:  MSSGVASSVSPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKFRPNGSLRFIASCSSGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQ

Query:  AERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVIQDPPRGDEEPNNQLRSFDKLRPPVICFS
        AERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQ SSPNAYSETVV QDPPR              L PP     
Subjt:  AERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVIQDPPRGDEEPNNQLRSFDKLRPPVICFS

Query:  FKLAASASASIYDGRECGCRTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASE
        F +A +   +     + G RT ++A +      S KR   + +ESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASE
Subjt:  FKLAASASASIYDGRECGCRTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASE

Query:  VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLS
        VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASDNPTFSGS MKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLS
Subjt:  VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLS

Query:  LNAIAYCYEFCSLREICKS-----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKE
        LNAI    E   LR +  S      +VNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKE
Subjt:  LNAIAYCYEFCSLREICKS-----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKE

Query:  ELYAELMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVS
        ELYAELMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVS
Subjt:  ELYAELMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVS

Query:  MYEDRFDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISF
        MYEDRFDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISF
Subjt:  MYEDRFDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISF

Query:  FLVCLIGRSLGLIYTGIRQSLRWK
        FLVCLIGRSLGLIYTGIRQSLRWK
Subjt:  FLVCLIGRSLGLIYTGIRQSLRWK

KAG7014497.1 putative protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MSSGVASSVSPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKFRPNGSLRFIASCSSGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQ
        MSSGVASSVSPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKFRPNGSLRFIASCSSGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQ
Subjt:  MSSGVASSVSPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKFRPNGSLRFIASCSSGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQ

Query:  AERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVIQDPPRGDEEPNNQLRSFDKLRPPVICFS
        AERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVIQDPPRGDEEPNNQLRSFDKLRPPVICFS
Subjt:  AERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVIQDPPRGDEEPNNQLRSFDKLRPPVICFS

Query:  FKLAASASASIYDGRECGCRTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASE
        FKLAASASASIYDGRECGCRTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASE
Subjt:  FKLAASASASIYDGRECGCRTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASE

Query:  VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLS
        VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLS
Subjt:  VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLS

Query:  LNAIAYCYEFCSLREICKSRKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAE
        LNAIAYCYEFCSLREICKSRKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAE
Subjt:  LNAIAYCYEFCSLREICKSRKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAE

Query:  LMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDR
        LMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDR
Subjt:  LMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDR

Query:  FDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCL
        FDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCL
Subjt:  FDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCL

Query:  IGRSLGLIYTGIRQSLRWK
        IGRSLGLIYTGIRQSLRWK
Subjt:  IGRSLGLIYTGIRQSLRWK

XP_022953638.1 uncharacterized protein LOC111456110 isoform X1 [Cucurbita moschata]4.2e-25292.69Show/hide
Query:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
        RT ++A +      S KR   + +ESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
Subjt:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA

Query:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS
        GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAI    E   LR +  S
Subjt:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS

Query:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY
              +VNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR RDY
Subjt:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY

Query:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
        CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
Subjt:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL

Query:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR
        PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR
Subjt:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR

Query:  QSLRWK
        QSLRWK
Subjt:  QSLRWK

XP_022953640.1 uncharacterized protein LOC111456110 isoform X2 [Cucurbita moschata]1.7e-25392.87Show/hide
Query:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
        RT ++A +      S KR   + +ESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
Subjt:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA

Query:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS
        GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAI    E   LR +  S
Subjt:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS

Query:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC
              +VNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC
Subjt:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC

Query:  YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP
        YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP
Subjt:  YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP

Query:  GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ
        GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ
Subjt:  GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ

Query:  SLRWK
        SLRWK
Subjt:  SLRWK

XP_023549012.1 uncharacterized protein LOC111807500 isoform X2 [Cucurbita pepo subsp. pepo]1.0e-25095.02Show/hide
Query:  QESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASD
        +ESSCGSYKFTRISTWRRRALSGFRGSNLIV PAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASD
Subjt:  QESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISNTNSASD

Query:  NPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS-----RKVNNLSTATIQNMDDLS
        NPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPL LNAI    E   LR +  S      +VNNLSTATIQNMDDLS
Subjt:  NPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS-----RKVNNLSTATIQNMDDLS

Query:  IIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFA
        IIFSKFIQKSS PVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFA
Subjt:  IIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFA

Query:  DGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLS
        DGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQW YQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLS
Subjt:  DGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLS

Query:  SELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        SELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDIT PMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Subjt:  SELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK

TrEMBL top hitse value%identityAlignment
A0A6J1GNK4 uncharacterized protein LOC111456110 isoform X39.3e-22191.8Show/hide
Query:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
        RT ++A +      S KR   + +ESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
Subjt:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA

Query:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS
        GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAI    E   LR +  S
Subjt:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS

Query:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY
              +VNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR RDY
Subjt:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY

Query:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
        CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
Subjt:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL

Query:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGW
        PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGW
Subjt:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGW

A0A6J1GNV1 uncharacterized protein LOC111456110 isoform X12.1e-25292.69Show/hide
Query:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
        RT ++A +      S KR   + +ESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
Subjt:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA

Query:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS
        GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAI    E   LR +  S
Subjt:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS

Query:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY
              +VNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR RDY
Subjt:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY

Query:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
        CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
Subjt:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL

Query:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR
        PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR
Subjt:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR

Query:  QSLRWK
        QSLRWK
Subjt:  QSLRWK

A0A6J1GQ87 uncharacterized protein LOC111456110 isoform X28.3e-25492.87Show/hide
Query:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
        RT ++A +      S KR   + +ESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
Subjt:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA

Query:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS
        GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAI    E   LR +  S
Subjt:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS

Query:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC
              +VNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC
Subjt:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC

Query:  YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP
        YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP
Subjt:  YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP

Query:  GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ
        GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ
Subjt:  GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ

Query:  SLRWK
        SLRWK
Subjt:  SLRWK

A0A6J1JVH5 uncharacterized protein LOC111488215 isoform X25.6e-25091.68Show/hide
Query:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
        RT ++A +      S KR   + ++SSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHA LRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
Subjt:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA

Query:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS
        GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFE RKSVENEVLEFAESHAKQPLSLNAI    E   LR +  S
Subjt:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS

Query:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC
              +VNNLSTATIQNMDDLSIIFSKFIQKSS PVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC
Subjt:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYC

Query:  YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP
        YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP
Subjt:  YYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELP

Query:  GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ
        GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ
Subjt:  GSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQ

Query:  SLRWK
        SLRWK
Subjt:  SLRWK

A0A6J1JWY6 uncharacterized protein LOC111488215 isoform X11.4e-24891.5Show/hide
Query:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
        RT ++A +      S KR   + ++SSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHA LRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA
Subjt:  RTMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMA

Query:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS
        GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFE RKSVENEVLEFAESHAKQPLSLNAI    E   LR +  S
Subjt:  GVIPVPKSNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKS

Query:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY
              +VNNLSTATIQNMDDLSIIFSKFIQKSS PVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR RDY
Subjt:  -----RKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPR-RDY

Query:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
        CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL
Subjt:  CYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIEL

Query:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR
        PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDIT P+IRVVIDKISSGISFFLVCLIGRSLGLIYTGIR
Subjt:  PGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIR

Query:  QSLRWK
        QSLRWK
Subjt:  QSLRWK

SwissProt top hitse value%identityAlignment
P73628 Thylakoid protein sll17695.6e-0534.74Show/hide
Query:  VLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQAERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRS
        VL AFFLG+AFAE L+E+VE  V   LSE+G+  AE+++ +  F  EV  RA     +         GP+S+  +  T++   +  AS   + ++
Subjt:  VLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQAERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRS

Q8LDV3 Uncharacterized protein At4g13200, chloroplastic5.2e-1942.2Show/hide
Query:  SPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKF-----RPNGSLRFIASCS-----SGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRL
        SPS FS+  + ++ T   S +L   RS+     L+      R + SLR  ++CS     SG+ +N++VLDAFFLGKA AE + ER+ESTVGEVLS IG+ 
Subjt:  SPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKF-----RPNGSLRFIASCS-----SGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRL

Query:  QAERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVI
        QAE+Q+Q+ + QEEV++RAKKAKE+A R+  E QG ++S      + +T +P A       S+    S+T+++
Subjt:  QAERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVI

Arabidopsis top hitse value%identityAlignment
AT4G13200.1 unknown protein3.7e-2042.2Show/hide
Query:  SPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKF-----RPNGSLRFIASCS-----SGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRL
        SPS FS+  + ++ T   S +L   RS+     L+      R + SLR  ++CS     SG+ +N++VLDAFFLGKA AE + ER+ESTVGEVLS IG+ 
Subjt:  SPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKF-----RPNGSLRFIASCS-----SGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRL

Query:  QAERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVI
        QAE+Q+Q+ + QEEV++RAKKAKE+A R+  E QG ++S      + +T +P A       S+    S+T+++
Subjt:  QAERQQQIIDFQEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVI

AT5G48830.1 unknown protein4.3e-10150.11Show/hide
Query:  SLVNVDGTTASE-VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISN-TNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENE
        SL + DG   S  V   DQ+LL  SIFLTYMAGVIPV K++   +  S       +  T   SG +TD + + K   DVVK K+LD LDA +R  ++ ++
Subjt:  SLVNVDGTTASE-VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISN-TNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENE

Query:  VLEFAESHAKQPLSLNAIAYCYEFCSLRE-ICKSRKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNI
        VL+      K PLSL AI+   +   L     K  +  N  + TI N D+    F++ ++++    C +WLK EL ++N DS  A   L+   L  +D I
Subjt:  VLEFAESHAKQPLSLNAIAYCYEFCSLRE-ICKSRKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNI

Query:  LPGIKKSGKEELYAELMHFLSFG-PRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQ
           I+KSGKE+L+AE ++F  FG P + +C YD S F  HG++ILED +IT ADG+AS+YLE ISVDS F +E+++ GL++C+LS+RALQ+LRNEVA+ Q
Subjt:  LPGIKKSGKEELYAELMHFLSFG-PRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQ

Query:  WLYQNIEAIVSMYEDRFDLCTLSSQQI-ELPGSRQANIDNWWMKHIL-RRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMI
        WL+QN+EA+VSMYEDRFDL  L +Q I  L GS      +WW K  L + +   SS LRY +I  F++PVKRTKEL+AL GWRYYFSL +ELSDI  P+I
Subjt:  WLYQNIEAIVSMYEDRFDLCTLSSQQI-ELPGSRQANIDNWWMKHIL-RRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMI

Query:  RVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        RVV+DK+SS ISFFLV LIGRS+GLI+TGIRQSLRWK
Subjt:  RVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK

AT5G48830.2 unknown protein3.7e-9749.1Show/hide
Query:  SLVNVDGTTASE-VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISN-TNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENE
        SL + DG   S  V   DQ+LL  SIFLTYMAGVIPV K++   +  S       +  T   SG +TD + + K   DVVK K+LD LDA +R  ++ ++
Subjt:  SLVNVDGTTASE-VLFVDQLLLMTSIFLTYMAGVIPVPKSNQPGNIISN-TNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENE

Query:  VLEFAESHAKQPLSLNAIAYCYEFCSLRE-ICKSRKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSS-------KAFLSLMSEK
        VL+      K PLSL AI+   +   L     K  +  N  + TI N D+    F++ ++++    C +WLK EL ++N DS        +A   L+   
Subjt:  VLEFAESHAKQPLSLNAIAYCYEFCSLRE-ICKSRKVNNLSTATIQNMDDLSIIFSKFIQKSSLPVCMSWLKNELSMKNNDSS-------KAFLSLMSEK

Query:  LKAEDNILPGIKKSGKEELYAELMHFLSFG-PRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLR
        L  +D I   I+KSGKE+L+AE ++F  FG P + +C YD S F  HG++ILED +IT ADG+AS+YLE ISVDS F +E+++ GL++C+LS+RALQ+LR
Subjt:  LKAEDNILPGIKKSGKEELYAELMHFLSFG-PRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLALCTLSTRALQRLR

Query:  NEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQI-ELPGSRQANIDNWWMKHIL-RRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELS
        NEVA+ QWL+QN+EA+VSMYEDRFDL  L +Q I  L GS      +WW K  L + +   SS LRY +I  F++PVKRTKEL+AL GW YYFSL +ELS
Subjt:  NEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQI-ELPGSRQANIDNWWMKHIL-RRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELS

Query:  DITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
        DI  P+IRVV+DK+SS ISFFLV LIGRS+GLI+TGIRQSLRWK
Subjt:  DITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAGCGGGGTTGCATCTTCAGTTTCCCCATCCCTTTTCTCGGTTGTACCCACTAAATCAAAGATTACCCATTGTTCATCTGCCTCTCTCGTTCGATTTCGATCCTC
ATCCACTTCCTCAAATCTCAAATTTCGACCGAATGGGTCTCTCAGATTCATAGCCAGTTGCAGCTCCGGCGATGGTGACAACAGGACTGTTCTAGATGCCTTTTTCTTAG
GAAAAGCTTTTGCAGAAACCTTAACTGAGCGTGTCGAGTCAACTGTTGGAGAGGTTTTGAGTGAGATTGGTAGGCTGCAAGCTGAACGACAACAACAAATTATTGATTTC
CAGGAGGAGGTTATAGACAGAGCTAAAAAGGCCAAGGAAAAAGCAGAACGTGATGCCAAAGAAGCACAAGGACCCATCTCCTCTTCGATAATATCGGCTACAATCGAAGT
TACTTCTTCTCCAACTGCTTCGAGCAATGGACAACAACGCTCAAGTCCGAACGCATATTCAGAAACTGTGGTGATTCAAGATCCTCCTCGTGGTGACGAAGAACCGAACA
ACCAGTTACGATCGTTCGATAAGTTGAGGCCCCCTGTGATTTGCTTCAGTTTCAAGCTCGCTGCCTCCGCCTCCGCCTCCATTTACGATGGCAGAGAATGTGGTTGTCGC
ACCATGTATCAAGCTGCAAATTGGAAGAACTCCCTTCGAAGCGAAAAGCGCCGCACCGTGCAGTTTCAAGAATCCTCTTGTGGGAGCTATAAGTTCACGAGGATCTCAAC
TTGGAGAAGGCGTGCGCTTAGTGGTTTTCGGGGCTCAAACTTAATTGTAAGTCCTGCTCCCAGGAAGATCTTCAGAGAGCATGCTTACCTAAGGTCTTTGGTAAACGTTG
ATGGAACAACAGCCTCTGAGGTACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTTCTAACATATATGGCTGGCGTAATACCCGTACCGAAGTCTAATCAACCT
GGAAATATCATCTCTAATACCAATTCAGCCTCAGATAACCCAACATTTTCTGGTAGTGGCATGAAGACTGATGATCAAATTAATTCGAAGTATGCATTAGATGTAGTTAA
AGGAAAGATTTTGGATTTTCTAGATGCTTTTGAACGTAGGAAAAGTGTGGAAAACGAGGTACTCGAATTTGCTGAAAGTCATGCCAAGCAACCCCTAAGCTTGAATGCCA
TTGCCTACTGTTATGAGTTCTGCTCCTTACGTGAGATCTGCAAATCAAGGAAAGTCAACAATCTTTCCACTGCTACTATTCAGAACATGGATGATTTGTCTATAATATTT
TCTAAATTTATTCAAAAATCCTCCCTACCAGTATGCATGTCTTGGCTGAAAAACGAACTGTCGATGAAAAATAATGATTCTAGTAAGGCATTTCTTTCATTGATGTCTGA
GAAGCTTAAAGCTGAAGACAACATTTTACCAGGAATTAAGAAGTCTGGCAAGGAAGAGTTGTATGCAGAATTGATGCACTTTCTTAGTTTTGGTCCTCGCAGGGATTATT
GCTATTATGACTATAGCTTGTTTGTCAAGCATGGGATTTCAATATTAGAAGATTTGCTGATAACCTTTGCTGACGGGATTGCAAGTATGTATCTGGAATTTATTTCTGTT
GACAGCAGTTTCTTTGATGAAGTGGATAATATTGGCTTGGCATTATGTACCTTATCAACACGAGCACTCCAAAGATTGCGTAATGAGGTGGCCATGAACCAATGGTTGTA
TCAAAACATCGAGGCAATTGTATCGATGTATGAAGACCGATTTGATCTATGTACACTTAGTAGTCAACAGATTGAGCTACCAGGCAGTAGACAGGCCAATATTGATAATT
GGTGGATGAAACATATCCTCAGAAGAAGAGAAACATTGTCTTCTGAGTTGCGTTATGTTGTGATAGACTCCTTCGCCATGCCTGTAAAACGGACCAAGGAGTTGAGAGCT
TTAAGGGGATGGAGGTATTACTTCAGCCTGTTGATTGAATTATCGGACATTACAACGCCAATGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCGTTCTTTCT
GGTCTGCTTGATTGGAAGATCTTTAGGGCTCATCTACACAGGAATCAGGCAGTCACTAAGGTGGAAATGA
mRNA sequenceShow/hide mRNA sequence
AATCCACGAAAGCGGCAATTGGGCATGTCGAACAAGGAATGAGCAAGAACAGGGAGGGTTCTGGGTCACTGTCATTTCATTTCCCCGCCAAAGATCTTTCCTTATAGATT
CCTGTTTGGCTGTCGAGAAACTCAGAGAGAAACGTTGAGAAAGTGCGCCGGGTAATTTTCCATTCCGTCGCACAAAATGAGTAGCGGGGTTGCATCTTCAGTTTCCCCAT
CCCTTTTCTCGGTTGTACCCACTAAATCAAAGATTACCCATTGTTCATCTGCCTCTCTCGTTCGATTTCGATCCTCATCCACTTCCTCAAATCTCAAATTTCGACCGAAT
GGGTCTCTCAGATTCATAGCCAGTTGCAGCTCCGGCGATGGTGACAACAGGACTGTTCTAGATGCCTTTTTCTTAGGAAAAGCTTTTGCAGAAACCTTAACTGAGCGTGT
CGAGTCAACTGTTGGAGAGGTTTTGAGTGAGATTGGTAGGCTGCAAGCTGAACGACAACAACAAATTATTGATTTCCAGGAGGAGGTTATAGACAGAGCTAAAAAGGCCA
AGGAAAAAGCAGAACGTGATGCCAAAGAAGCACAAGGACCCATCTCCTCTTCGATAATATCGGCTACAATCGAAGTTACTTCTTCTCCAACTGCTTCGAGCAATGGACAA
CAACGCTCAAGTCCGAACGCATATTCAGAAACTGTGGTGATTCAAGATCCTCCTCGTGGTGACGAAGAACCGAACAACCAGTTACGATCGTTCGATAAGTTGAGGCCCCC
TGTGATTTGCTTCAGTTTCAAGCTCGCTGCCTCCGCCTCCGCCTCCATTTACGATGGCAGAGAATGTGGTTGTCGCACCATGTATCAAGCTGCAAATTGGAAGAACTCCC
TTCGAAGCGAAAAGCGCCGCACCGTGCAGTTTCAAGAATCCTCTTGTGGGAGCTATAAGTTCACGAGGATCTCAACTTGGAGAAGGCGTGCGCTTAGTGGTTTTCGGGGC
TCAAACTTAATTGTAAGTCCTGCTCCCAGGAAGATCTTCAGAGAGCATGCTTACCTAAGGTCTTTGGTAAACGTTGATGGAACAACAGCCTCTGAGGTACTTTTTGTTGA
TCAATTGCTTCTGATGACCAGTATATTTCTAACATATATGGCTGGCGTAATACCCGTACCGAAGTCTAATCAACCTGGAAATATCATCTCTAATACCAATTCAGCCTCAG
ATAACCCAACATTTTCTGGTAGTGGCATGAAGACTGATGATCAAATTAATTCGAAGTATGCATTAGATGTAGTTAAAGGAAAGATTTTGGATTTTCTAGATGCTTTTGAA
CGTAGGAAAAGTGTGGAAAACGAGGTACTCGAATTTGCTGAAAGTCATGCCAAGCAACCCCTAAGCTTGAATGCCATTGCCTACTGTTATGAGTTCTGCTCCTTACGTGA
GATCTGCAAATCAAGGAAAGTCAACAATCTTTCCACTGCTACTATTCAGAACATGGATGATTTGTCTATAATATTTTCTAAATTTATTCAAAAATCCTCCCTACCAGTAT
GCATGTCTTGGCTGAAAAACGAACTGTCGATGAAAAATAATGATTCTAGTAAGGCATTTCTTTCATTGATGTCTGAGAAGCTTAAAGCTGAAGACAACATTTTACCAGGA
ATTAAGAAGTCTGGCAAGGAAGAGTTGTATGCAGAATTGATGCACTTTCTTAGTTTTGGTCCTCGCAGGGATTATTGCTATTATGACTATAGCTTGTTTGTCAAGCATGG
GATTTCAATATTAGAAGATTTGCTGATAACCTTTGCTGACGGGATTGCAAGTATGTATCTGGAATTTATTTCTGTTGACAGCAGTTTCTTTGATGAAGTGGATAATATTG
GCTTGGCATTATGTACCTTATCAACACGAGCACTCCAAAGATTGCGTAATGAGGTGGCCATGAACCAATGGTTGTATCAAAACATCGAGGCAATTGTATCGATGTATGAA
GACCGATTTGATCTATGTACACTTAGTAGTCAACAGATTGAGCTACCAGGCAGTAGACAGGCCAATATTGATAATTGGTGGATGAAACATATCCTCAGAAGAAGAGAAAC
ATTGTCTTCTGAGTTGCGTTATGTTGTGATAGACTCCTTCGCCATGCCTGTAAAACGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGAGGTATTACTTCAGCCTGTTGA
TTGAATTATCGGACATTACAACGCCAATGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCGTTCTTTCTGGTCTGCTTGATTGGAAGATCTTTAGGGCTCATC
TACACAGGAATCAGGCAGTCACTAAGGTGGAAATGAAGGTTCGATAGTTCTGATCTATATTTGGTTGTTTCTCCACTATTTTGGTTTCCTTCTCATTTCTTTTATGTATC
AACTTGATTTTGGTAGTTCATTACAATTTTTGAGATTATATCAATTGCAATATGAGATAGTATAGCATGTTGAAGTATATTTTTGCTTTCCATCGGTTTTGTTATTCCCT
AGTCGTAAGGGAGTGA
Protein sequenceShow/hide protein sequence
MSSGVASSVSPSLFSVVPTKSKITHCSSASLVRFRSSSTSSNLKFRPNGSLRFIASCSSGDGDNRTVLDAFFLGKAFAETLTERVESTVGEVLSEIGRLQAERQQQIIDF
QEEVIDRAKKAKEKAERDAKEAQGPISSSIISATIEVTSSPTASSNGQQRSSPNAYSETVVIQDPPRGDEEPNNQLRSFDKLRPPVICFSFKLAASASASIYDGRECGCR
TMYQAANWKNSLRSEKRRTVQFQESSCGSYKFTRISTWRRRALSGFRGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPKSNQP
GNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENEVLEFAESHAKQPLSLNAIAYCYEFCSLREICKSRKVNNLSTATIQNMDDLSIIF
SKFIQKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHFLSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISV
DSSFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNWWMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRA
LRGWRYYFSLLIELSDITTPMIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK