; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G006050 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G006050
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionmethyl-CpG-binding domain-containing protein 11-like isoform X4
Genome locationCmo_Chr08:3674415..3681293
RNA-Seq ExpressionCmoCh08G006050
SyntenyCmoCh08G006050
Gene Ontology termsGO:0022900 - electron transport chain (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR001739 - Methyl-CpG DNA binding
IPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin
IPR016177 - DNA-binding domain superfamily
IPR039391 - Phytocyanin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593389.1 hypothetical protein SDJN03_12865, partial [Cucurbita argyrosperma subsp. sororia]5.5e-23797.48Show/hide
Query:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
        MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
Subjt:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT

Query:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV
        TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL     +GDDVVRV
Subjt:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV

Query:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
        TKRSFDLCSDDDDIGEDIDVSPARFF NAPG+YYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
Subjt:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW

Query:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
        AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVL APGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
Subjt:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA

Query:  SSPG--GSGSGSPFSSANTVAAALSATLFGLVLNFF
        SSPG  GSGSGSPFSSANTVAAALSATLFGLVLNFF
Subjt:  SSPG--GSGSGSPFSSANTVAAALSATLFGLVLNFF

KAG7025736.1 hypothetical protein SDJN02_12234 [Cucurbita argyrosperma subsp. argyrosperma]6.1e-23693.67Show/hide
Query:  KLTEEELKMAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSL--------------VFNFTTDRDDVTRVPKASFD
        +LTEE+LKMAAKVALVLG ALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSL              VFNFTTDRDDVTRVPKASFD
Subjt:  KLTEEELKMAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSL--------------VFNFTTDRDDVTRVPKASFD

Query:  MCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATG
        MCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTS RAPVTHVVGDATGWRIPQGGNMFYVNWATG
Subjt:  MCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATG

Query:  KEFVVGDSL-----SGDDVVRVTKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVT
        KEFVVGDSL     +GDDVVRVTKRSFDLCSDDDDIGEDIDVSPARFF NAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVT
Subjt:  KEFVVGDSL-----SGDDVVRVTKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVT

Query:  HVVGDAVGWTVPQGGAAFYTNWAAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAIN
        HVVGDAVGWTVPQGGAAFYTNWAAGKTFAVGDSLVFNF+SEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVL APGEHYYISTENQDCELGQKLAIN
Subjt:  HVVGDAVGWTVPQGGAAFYTNWAAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAIN

Query:  VVASRSNVPATSIATSPSSGPASSPG--GSGSGSPFSSANTVAAALSATLFGLVLNFF
        VVASRSNVPATSIATSPSSGPASSPG  GSGSGSPFSSANTVAAALSATLFGLVLNFF
Subjt:  VVASRSNVPATSIATSPSSGPASSPG--GSGSGSPFSSANTVAAALSATLFGLVLNFF

XP_022960373.1 uncharacterized protein LOC111461118 [Cucurbita moschata]4.0e-24098.62Show/hide
Query:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
        MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
Subjt:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT

Query:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV
        TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL     +GDDVVRV
Subjt:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV

Query:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
        TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
Subjt:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW

Query:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
        AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
Subjt:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA

Query:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF
        SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF
Subjt:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF

XP_023004735.1 uncharacterized protein LOC111497949 [Cucurbita maxima]4.1e-22492.4Show/hide
Query:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
        MAAKVALVLG ALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWA GK FTVGDSLVFNFTTDRDDVTRVPKASF++CSDDNEIGDSIEIGPAT+ L+
Subjt:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT

Query:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV
        TAGE+YFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGN+FYVNWATGKEFVVGDSL     +GDDVVRV
Subjt:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV

Query:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
        TKRSFDLCSDDDDIGEDIDVSPA    +A GEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPK APVTHVVGDAVGWTVPQGGAAFYTNW
Subjt:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW

Query:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
        AA  TFAVGDSLVFNFR EVHDV+RVTKRSFDICSDDDEIGDSIDSSPAT+VL +PG HYYISTENQDCELGQKLAINVVA+RSNVPATSIATSPSSGPA
Subjt:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA

Query:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF
        S+PGGSGSGSPFSSANTVAAALSATLFGLVLNFF
Subjt:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF

XP_023513444.1 uncharacterized protein LOC111778056 [Cucurbita pepo subsp. pepo]3.8e-23094.53Show/hide
Query:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
        MA KVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
Subjt:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT

Query:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV
        TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL     +GDDVVRV
Subjt:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV

Query:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
        TKRSFDLCSDDDDIGEDIDVSPARFF NAPGEYYFISSED HCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
Subjt:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW

Query:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSS---
        AAGKTF VGDSLVFNFR EVHDV+RVTKRSFDICSDDDEIGDSIDSSPAT+VL APGEHYYISTENQDCELGQKLAINVVA+RSN PATSIATSPSS   
Subjt:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSS---

Query:  --GPASSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF
          GPAS+PG  GSGSPFSSANTVAAALSATLFGLVLNFF
Subjt:  --GPASSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF

TrEMBL top hitse value%identityAlignment
A0A6J1H6K1 methyl-CpG-binding domain-containing protein 11-like isoform X27.6e-176100Show/hide
Query:  MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE
        MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE
Subjt:  MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE

Query:  KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK
        KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK
Subjt:  KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK

Query:  EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN
        EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN
Subjt:  EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN

Query:  IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET
        IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET
Subjt:  IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET

A0A6J1H7R0 methyl-CpG-binding domain-containing protein 11-like isoform X44.0e-177100Show/hide
Query:  MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE
        MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE
Subjt:  MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE

Query:  KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK
        KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK
Subjt:  KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK

Query:  EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN
        EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN
Subjt:  EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN

Query:  IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIETGK
        IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIETGK
Subjt:  IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIETGK

A0A6J1H804 methyl-CpG-binding domain-containing protein 11-like isoform X17.6e-176100Show/hide
Query:  MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE
        MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE
Subjt:  MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISE

Query:  KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK
        KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK
Subjt:  KAKTTPPPQEDPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDK

Query:  EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN
        EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN
Subjt:  EIETAKDEQAENVVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASN

Query:  IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET
        IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET
Subjt:  IKENGVAGTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET

A0A6J1H8Q4 uncharacterized protein LOC1114611182.0e-24098.62Show/hide
Query:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
        MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
Subjt:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT

Query:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV
        TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL     +GDDVVRV
Subjt:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV

Query:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
        TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
Subjt:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW

Query:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
        AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
Subjt:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA

Query:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF
        SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF
Subjt:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF

A0A6J1KVF8 uncharacterized protein LOC1114979492.0e-22492.4Show/hide
Query:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT
        MAAKVALVLG ALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWA GK FTVGDSLVFNFTTDRDDVTRVPKASF++CSDDNEIGDSIEIGPAT+ L+
Subjt:  MAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLT

Query:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV
        TAGE+YFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGN+FYVNWATGKEFVVGDSL     +GDDVVRV
Subjt:  TAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL-----SGDDVVRV

Query:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW
        TKRSFDLCSDDDDIGEDIDVSPA    +A GEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPK APVTHVVGDAVGWTVPQGGAAFYTNW
Subjt:  TKRSFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNW

Query:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA
        AA  TFAVGDSLVFNFR EVHDV+RVTKRSFDICSDDDEIGDSIDSSPAT+VL +PG HYYISTENQDCELGQKLAINVVA+RSNVPATSIATSPSSGPA
Subjt:  AAGKTFAVGDSLVFNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPA

Query:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF
        S+PGGSGSGSPFSSANTVAAALSATLFGLVLNFF
Subjt:  SSPGGSGSGSPFSSANTVAAALSATLFGLVLNFF

SwissProt top hitse value%identityAlignment
P29602 Cucumber peeling cupredoxin2.0e-2448.03Show/hide
Query:  TVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRV-PKASFDMCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLA
        TVH+VGD+TGW +P + +FY++WA GKTF VGDSL FNF  +  +V  +  K SFD C+  N   D     P   RL   G HYF+ +  THC  GQKL+
Subjt:  TVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRV-PKASFDMCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLA

Query:  INVTAAPAA----PITPTPPSTKAPPP
        INV AA A     P + +PPS+  PPP
Subjt:  INVTAAPAA----PITPTPPSTKAPPP

P42849 Umecyanin3.8e-1542.73Show/hide
Query:  VGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTA
        VG    W+ P    FY  WATGKTF VGD L F+F     DV  V K +FD C  +N I   +   P  + L T G  Y+I +   HC+ GQKL+INV  
Subjt:  VGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTA

Query:  APAAPITPTP
        A  A    TP
Subjt:  APAAPITPTP

Q07488 Blue copper protein9.7e-1941.61Show/hide
Query:  VGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTA
        VGD T W  P   +FY  WATGKTF VGD L F+F   R DV  V +A+F+ C  +  I   + + P  + L T G  YFI +   HC+ GQKL+I V A
Subjt:  VGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGDSIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTA

Query:  A-------PAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGN
        A       P A  TP P ST   P T G  P T     A G   P G +
Subjt:  A-------PAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGN

Q9LW00 Methyl-CpG-binding domain-containing protein 111.2e-2948.16Show/hide
Query:  GGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRAR
        GG ++ VS  LPAP SWKKL  P K G+ +K EV+F+APTGEEISNRKQLEQYLKSHP + A+++FDW+T  TPRRSARISEK K TP P ++PP+KR R
Subjt:  GGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRAR

Query:  -KSPGSKK-KEAKNSEGVKE--TDVKDVEMSEKEH-AEIKKEKDK----EPKMVDETKE----KETENAK--DEEPEKDIETAKDEQAKKEDETKDKEIE
         KSP SKK  E + SEG  E  + VKD EM+  E  AE +   DK    E + V++ KE    +ET NA    EE E   E A D    K  ET DKE +
Subjt:  -KSPGSKK-KEAKNSEGVKE--TDVKDVEMSEKEH-AEIKKEKDK----EPKMVDETKE----KETENAK--DEEPEKDIETAKDEQAKKEDETKDKEIE

Query:  TAKDEQAENVVETKDEPGIENVPGNTTITK----NGQLDAEDGKE
        T   E+    VE K     +    +   T+    NG     +GKE
Subjt:  TAKDEQAENVVETKDEPGIENVPGNTTITK----NGQLDAEDGKE

Q9XI36 Methyl-CpG-binding domain-containing protein 101.5e-2735.71Show/hide
Query:  VSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRARKSPGSK
        VS  LPAP SWKKL  PK+ GTPRK E++F+APTGEEIS+RKQLEQYLK+HP +  +S+F+W+TGETPRRS+RIS+K K T P  +  P  + R+S  +K
Subjt:  VSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRARKSPGSK

Query:  KKEAKNSEGVKETDVK---DVEMSEK-EHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKD--------IETAKDEQAKKEDETKDKEIETAKDEQAEN
        K   + +E  +E  VK   DV+   K E+AE +KEK+KE   V E  E E EN + E+ E +         E  K+ Q +  +  K+KE E A+ E  E 
Subjt:  KKEAKNSEGVKETDVK---DVEMSEK-EHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKD--------IETAKDEQAKKEDETKDKEIETAKDEQAEN

Query:  VVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQE-----KADASNIKENGVA
         V    +  +E               AE+  +VE      ++  +       V     ++K EN+     E   E      E      A+A   KE+   
Subjt:  VVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQE-----KADASNIKENGVA

Query:  GTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET
            +E    + +D Q+ D +  +    KE+E  E+
Subjt:  GTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET

Arabidopsis top hitse value%identityAlignment
AT1G15340.1 methyl-CPG-binding domain 101.1e-2835.71Show/hide
Query:  VSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRARKSPGSK
        VS  LPAP SWKKL  PK+ GTPRK E++F+APTGEEIS+RKQLEQYLK+HP +  +S+F+W+TGETPRRS+RIS+K K T P  +  P  + R+S  +K
Subjt:  VSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRARKSPGSK

Query:  KKEAKNSEGVKETDVK---DVEMSEK-EHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKD--------IETAKDEQAKKEDETKDKEIETAKDEQAEN
        K   + +E  +E  VK   DV+   K E+AE +KEK+KE   V E  E E EN + E+ E +         E  K+ Q +  +  K+KE E A+ E  E 
Subjt:  KKEAKNSEGVKETDVK---DVEMSEK-EHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKD--------IETAKDEQAKKEDETKDKEIETAKDEQAEN

Query:  VVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQE-----KADASNIKENGVA
         V    +  +E               AE+  +VE      ++  +       V     ++K EN+     E   E      E      A+A   KE+   
Subjt:  VVETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQE-----KADASNIKENGVA

Query:  GTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET
            +E    + +D Q+ D +  +    KE+E  E+
Subjt:  GTPASEGTITENDDAQKHDIQAKDRVNKKESEVIET

AT1G15340.2 methyl-CPG-binding domain 106.9e-1230.93Show/hide
Query:  VSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHP------VDVALS---DFDWSTGETPRRSA-RISEKAKTTPPPQEDPPR
        VS  LPAP SWKKL  PK+ GTPRK E++F+APTGEEIS+RKQLEQYLK +        + A+    D D   G+T    A +  EK   T   + +   
Subjt:  VSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHP------VDVALS---DFDWSTGETPRRSA-RISEKAKTTPPPQEDPPR

Query:  KRARKSPGSK-KKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDKEIETAKDEQAENV
            K+   K  KE + +E  KE         + E AE +KEK+ E K   E KE E    K E  E D    + +    E   +  ++E  KD + +  
Subjt:  KRARKSPGSK-KKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDKEIETAKDEQAENV

Query:  VETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAE--KANESCCMKQEKADASNIKENGV--AGT
         E   E  +E  P        G +  E   E       NV   E     ++ A  G + K  +   TEAE  K N++    ++K +A+  KEN    +  
Subjt:  VETKDEPGIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAE--KANESCCMKQEKADASNIKENGV--AGT

Query:  PASEGTITENDDAQKHDIQAKDRVNKKESEVIE
          +E  + E    + +D++A+D     E+  ++
Subjt:  PASEGTITENDDAQKHDIQAKDRVNKKESEVIE

AT1G45063.1 copper ion binding;electron carriers8.1e-2131.87Show/hide
Query:  AKVALVLGFALFLF--LHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKA-SFDMCSDDNEIGDSI-EIGPATMR
        A++  +  F + +F  L    + TV+ VGDS GW        Y  W   K   VGDSL+F +  + +DVT+V     ++ C  D+    ++   G   + 
Subjt:  AKVALVLGFALFLF--LHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKA-SFDMCSDDNEIGDSI-EIGPATMR

Query:  LTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL---SGDDVVRV
         T  G +YFI+S  T C  GQ+L + V   P++P +P P  +K  P         + VGD+  W +      FY NW+  K+F VGD L     ++V  V
Subjt:  LTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL---SGDDVVRV

Query:  TKRSFDL----CSDDDDI-----GEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPP
         + S DL    C     I     G DI           PG +YFISSE GHC  G KL + V    + P   P
Subjt:  TKRSFDL----CSDDDDI-----GEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPP

AT1G45063.2 copper ion binding;electron carriers8.1e-2131.87Show/hide
Query:  AKVALVLGFALFLF--LHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKA-SFDMCSDDNEIGDSI-EIGPATMR
        A++  +  F + +F  L    + TV+ VGDS GW        Y  W   K   VGDSL+F +  + +DVT+V     ++ C  D+    ++   G   + 
Subjt:  AKVALVLGFALFLF--LHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKA-SFDMCSDDNEIGDSI-EIGPATMR

Query:  LTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL---SGDDVVRV
         T  G +YFI+S  T C  GQ+L + V   P++P +P P  +K  P         + VGD+  W +      FY NW+  K+F VGD L     ++V  V
Subjt:  LTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSL---SGDDVVRV

Query:  TKRSFDL----CSDDDDI-----GEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPP
         + S DL    C     I     G DI           PG +YFISSE GHC  G KL + V    + P   P
Subjt:  TKRSFDL----CSDDDDI-----GEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPP

AT3G15790.1 methyl-CPG-binding domain 118.6e-3148.16Show/hide
Query:  GGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRAR
        GG ++ VS  LPAP SWKKL  P K G+ +K EV+F+APTGEEISNRKQLEQYLKSHP + A+++FDW+T  TPRRSARISEK K TP P ++PP+KR R
Subjt:  GGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQEDPPRKRAR

Query:  -KSPGSKK-KEAKNSEGVKE--TDVKDVEMSEKEH-AEIKKEKDK----EPKMVDETKE----KETENAK--DEEPEKDIETAKDEQAKKEDETKDKEIE
         KSP SKK  E + SEG  E  + VKD EM+  E  AE +   DK    E + V++ KE    +ET NA    EE E   E A D    K  ET DKE +
Subjt:  -KSPGSKK-KEAKNSEGVKE--TDVKDVEMSEKEH-AEIKKEKDK----EPKMVDETKE----KETENAK--DEEPEKDIETAKDEQAKKEDETKDKEIE

Query:  TAKDEQAENVVETKDEPGIENVPGNTTITK----NGQLDAEDGKE
        T   E+    VE K     +    +   T+    NG     +GKE
Subjt:  TAKDEQAENVVETKDEPGIENVPGNTTITK----NGQLDAEDGKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACGAAGTGGGAGTGCAGAGCAAAGAAGAGCTTCAAAACCCAGTTGAAGGAGGAGCTAAAGATGAAGTTTCTGCTCATCTTCCAGCTCCACCTTCTTGGAAGAA
ACTGTTATCACCCAAGAAGGGAGGTACACCAAGAAAGAACGAAGTCATATTCATAGCACCGACTGGCGAGGAGATCAGTAATCGAAAACAGCTCGAGCAGTACTTGAAAT
CACACCCTGTAGATGTTGCATTATCGGATTTTGATTGGAGTACTGGTGAGACGCCTCGGAGATCTGCTAGGATTAGTGAGAAGGCTAAGACAACGCCTCCCCCACAGGAA
GATCCCCCAAGGAAACGAGCTAGAAAGTCTCCCGGCTCGAAAAAGAAGGAAGCTAAGAACTCTGAGGGTGTAAAGGAAACTGATGTTAAAGATGTCGAAATGTCCGAGAA
GGAGCACGCTGAAATCAAGAAAGAAAAGGATAAAGAGCCTAAAATGGTTGATGAAACAAAAGAAAAAGAAACTGAAAATGCCAAGGATGAAGAGCCTGAAAAAGACATTG
AAACTGCCAAGGATGAACAGGCCAAGAAGGAAGACGAAACAAAAGACAAGGAAATTGAAACGGCCAAGGATGAACAGGCCGAGAACGTTGTCGAAACAAAAGACGAACCC
GGCATTGAGAATGTGCCCGGTAACACGACGATAACCAAAAACGGTCAGTTAGATGCTGAAGATGGCAAAGAAGTAGAAGATCAAAGTCATGGTAATGTTCAAAATCAAGA
GGTAGCCACAGCTGTTGAAAGTGTGGCATTGGTGGGTGGACAAGATAAAGGGGAAAACAGACCACAAACTGAAGCTGAGAAAGCTAATGAATCATGCTGTATGAAACAGG
AAAAGGCAGATGCTAGTAACATCAAGGAAAATGGTGTTGCAGGAACTCCAGCATCCGAGGGAACGATCACGGAAAACGACGATGCACAAAAGCACGACATCCAGGCAAAG
GACAGAGTAAACAAAAAGGAGAGCGAGGTGATCGAAACCGGTAAATTAACAGAGGAAGAACTAAAAATGGCCGCCAAAGTCGCTCTTGTTCTGGGTTTTGCTCTGTTTTT
GTTCCTTCATCACTCCGCCGCTCAGACCGTTCATGTCGTCGGTGACTCCACCGGCTGGAGAATCCCTCCTACCGCTGATTTCTACGCTAAATGGGCCACTGGTAAAACTT
TCACCGTCGGTGATTCTTTGGTGTTTAATTTCACTACGGATAGGGATGATGTTACGAGAGTACCGAAAGCGTCGTTTGATATGTGCAGTGACGACAATGAAATCGGCGAC
TCCATTGAAATTGGACCAGCAACCATGCGTCTCACAACTGCAGGGGAACATTATTTCATCAGCTCTGAGGATACTCACTGTCAACAAGGTCAAAAGTTAGCTATCAATGT
CACCGCCGCCCCCGCCGCCCCGATAACTCCAACGCCGCCTTCTACCAAAGCTCCTCCCCCAACCTCCGGACGCGCTCCCGTGACCCATGTCGTCGGAGACGCTACTGGCT
GGCGCATTCCTCAGGGCGGCAACATGTTCTACGTCAACTGGGCTACCGGAAAGGAATTCGTTGTCGGCGATTCTCTCTCCGGGGACGATGTAGTTAGAGTGACGAAGAGA
TCCTTCGATTTATGTAGCGACGACGATGACATCGGCGAGGACATCGACGTTAGCCCTGCAAGGTTCTTCTTCAACGCTCCCGGCGAGTATTATTTCATTAGCAGCGAGGA
TGGGCACTGCCAGCAAGGTCAGAAATTAGCAATCAATGTTACAGCCGCTGCCTCTGGACCTATGCCTCCACCGTCCAACGCTCGTCCACCACCCCCAAAGCCAGCTCCAG
TGACCCATGTCGTCGGAGACGCCGTCGGATGGACCGTCCCACAAGGGGGCGCTGCTTTCTACACCAACTGGGCTGCCGGCAAGACATTCGCCGTCGGCGATTCTCTAGTG
TTCAATTTCCGATCCGAAGTACACGATGTGCAAAGAGTAACAAAGAGATCGTTCGATATATGTAGCGACGACGACGAGATTGGCGACAGCATCGACTCGAGCCCTGCAAC
GATGGTGCTCGCCGCTCCCGGCGAGCATTACTACATCAGCACGGAGAACCAAGACTGTGAATTAGGTCAAAAATTAGCAATCAATGTTGTCGCCTCCAGATCCAACGTTC
CTGCAACCTCCATTGCAACATCTCCAAGCTCCGGTCCAGCGTCGAGCCCCGGCGGCAGCGGCAGCGGCTCACCGTTTTCCTCCGCTAACACCGTCGCCGCCGCTCTCTCC
GCCACATTGTTTGGCCTAGTTTTGAACTTCTTCTAG
mRNA sequenceShow/hide mRNA sequence
ACTTATTGGCGAAGCTAAGCTCTAAACTTTCATGAAGCTTCGACCCCTCTTGATTTAACCCATAAATGCGTCTTAATCTTAATCTTAATCCTAATCTAGAGAGAGAGTGA
AGAGAGAGAGAGAGAGTTTATAGAAGAGTTTTGACTCTCAAAGGCCTAAGCGAATTGAAAGTTTTGTGTTGTTTGCAAGTGGGAAAAAGAGATGGAAGACGAAGTGGGAG
TGCAGAGCAAAGAAGAGCTTCAAAACCCAGTTGAAGGAGGAGCTAAAGATGAAGTTTCTGCTCATCTTCCAGCTCCACCTTCTTGGAAGAAACTGTTATCACCCAAGAAG
GGAGGTACACCAAGAAAGAACGAAGTCATATTCATAGCACCGACTGGCGAGGAGATCAGTAATCGAAAACAGCTCGAGCAGTACTTGAAATCACACCCTGTAGATGTTGC
ATTATCGGATTTTGATTGGAGTACTGGTGAGACGCCTCGGAGATCTGCTAGGATTAGTGAGAAGGCTAAGACAACGCCTCCCCCACAGGAAGATCCCCCAAGGAAACGAG
CTAGAAAGTCTCCCGGCTCGAAAAAGAAGGAAGCTAAGAACTCTGAGGGTGTAAAGGAAACTGATGTTAAAGATGTCGAAATGTCCGAGAAGGAGCACGCTGAAATCAAG
AAAGAAAAGGATAAAGAGCCTAAAATGGTTGATGAAACAAAAGAAAAAGAAACTGAAAATGCCAAGGATGAAGAGCCTGAAAAAGACATTGAAACTGCCAAGGATGAACA
GGCCAAGAAGGAAGACGAAACAAAAGACAAGGAAATTGAAACGGCCAAGGATGAACAGGCCGAGAACGTTGTCGAAACAAAAGACGAACCCGGCATTGAGAATGTGCCCG
GTAACACGACGATAACCAAAAACGGTCAGTTAGATGCTGAAGATGGCAAAGAAGTAGAAGATCAAAGTCATGGTAATGTTCAAAATCAAGAGGTAGCCACAGCTGTTGAA
AGTGTGGCATTGGTGGGTGGACAAGATAAAGGGGAAAACAGACCACAAACTGAAGCTGAGAAAGCTAATGAATCATGCTGTATGAAACAGGAAAAGGCAGATGCTAGTAA
CATCAAGGAAAATGGTGTTGCAGGAACTCCAGCATCCGAGGGAACGATCACGGAAAACGACGATGCACAAAAGCACGACATCCAGGCAAAGGACAGAGTAAACAAAAAGG
AGAGCGAGGTGATCGAAACCGGTAAATTAACAGAGGAAGAACTAAAAATGGCCGCCAAAGTCGCTCTTGTTCTGGGTTTTGCTCTGTTTTTGTTCCTTCATCACTCCGCC
GCTCAGACCGTTCATGTCGTCGGTGACTCCACCGGCTGGAGAATCCCTCCTACCGCTGATTTCTACGCTAAATGGGCCACTGGTAAAACTTTCACCGTCGGTGATTCTTT
GGTGTTTAATTTCACTACGGATAGGGATGATGTTACGAGAGTACCGAAAGCGTCGTTTGATATGTGCAGTGACGACAATGAAATCGGCGACTCCATTGAAATTGGACCAG
CAACCATGCGTCTCACAACTGCAGGGGAACATTATTTCATCAGCTCTGAGGATACTCACTGTCAACAAGGTCAAAAGTTAGCTATCAATGTCACCGCCGCCCCCGCCGCC
CCGATAACTCCAACGCCGCCTTCTACCAAAGCTCCTCCCCCAACCTCCGGACGCGCTCCCGTGACCCATGTCGTCGGAGACGCTACTGGCTGGCGCATTCCTCAGGGCGG
CAACATGTTCTACGTCAACTGGGCTACCGGAAAGGAATTCGTTGTCGGCGATTCTCTCTCCGGGGACGATGTAGTTAGAGTGACGAAGAGATCCTTCGATTTATGTAGCG
ACGACGATGACATCGGCGAGGACATCGACGTTAGCCCTGCAAGGTTCTTCTTCAACGCTCCCGGCGAGTATTATTTCATTAGCAGCGAGGATGGGCACTGCCAGCAAGGT
CAGAAATTAGCAATCAATGTTACAGCCGCTGCCTCTGGACCTATGCCTCCACCGTCCAACGCTCGTCCACCACCCCCAAAGCCAGCTCCAGTGACCCATGTCGTCGGAGA
CGCCGTCGGATGGACCGTCCCACAAGGGGGCGCTGCTTTCTACACCAACTGGGCTGCCGGCAAGACATTCGCCGTCGGCGATTCTCTAGTGTTCAATTTCCGATCCGAAG
TACACGATGTGCAAAGAGTAACAAAGAGATCGTTCGATATATGTAGCGACGACGACGAGATTGGCGACAGCATCGACTCGAGCCCTGCAACGATGGTGCTCGCCGCTCCC
GGCGAGCATTACTACATCAGCACGGAGAACCAAGACTGTGAATTAGGTCAAAAATTAGCAATCAATGTTGTCGCCTCCAGATCCAACGTTCCTGCAACCTCCATTGCAAC
ATCTCCAAGCTCCGGTCCAGCGTCGAGCCCCGGCGGCAGCGGCAGCGGCTCACCGTTTTCCTCCGCTAACACCGTCGCCGCCGCTCTCTCCGCCACATTGTTTGGCCTAG
TTTTGAACTTCTTCTAGATACTTCCCTACTTTGACTGGTTCTCCTTGTTTAATAATTATTATATTTTATTATTTTCTTTTTGTTGTTATTATTTGAGTGCTTGGCTGTGT
CATTCCGGTCACGGCCATATTTATGTCGACGTGTCGTGCTGTATTTTGTCATTTATTTATAAATAAATAAATCATTTAATTTA
Protein sequenceShow/hide protein sequence
MEDEVGVQSKEELQNPVEGGAKDEVSAHLPAPPSWKKLLSPKKGGTPRKNEVIFIAPTGEEISNRKQLEQYLKSHPVDVALSDFDWSTGETPRRSARISEKAKTTPPPQE
DPPRKRARKSPGSKKKEAKNSEGVKETDVKDVEMSEKEHAEIKKEKDKEPKMVDETKEKETENAKDEEPEKDIETAKDEQAKKEDETKDKEIETAKDEQAENVVETKDEP
GIENVPGNTTITKNGQLDAEDGKEVEDQSHGNVQNQEVATAVESVALVGGQDKGENRPQTEAEKANESCCMKQEKADASNIKENGVAGTPASEGTITENDDAQKHDIQAK
DRVNKKESEVIETGKLTEEELKMAAKVALVLGFALFLFLHHSAAQTVHVVGDSTGWRIPPTADFYAKWATGKTFTVGDSLVFNFTTDRDDVTRVPKASFDMCSDDNEIGD
SIEIGPATMRLTTAGEHYFISSEDTHCQQGQKLAINVTAAPAAPITPTPPSTKAPPPTSGRAPVTHVVGDATGWRIPQGGNMFYVNWATGKEFVVGDSLSGDDVVRVTKR
SFDLCSDDDDIGEDIDVSPARFFFNAPGEYYFISSEDGHCQQGQKLAINVTAAASGPMPPPSNARPPPPKPAPVTHVVGDAVGWTVPQGGAAFYTNWAAGKTFAVGDSLV
FNFRSEVHDVQRVTKRSFDICSDDDEIGDSIDSSPATMVLAAPGEHYYISTENQDCELGQKLAINVVASRSNVPATSIATSPSSGPASSPGGSGSGSPFSSANTVAAALS
ATLFGLVLNFF