; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G03050 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G03050
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBlue copper protein
Genome locationClcChr09:2409756..2437912
RNA-Seq ExpressionClc09G03050
SyntenyClc09G03050
Gene Ontology termsGO:0022900 - electron transport chain (biological process)
GO:0046658 - anchored component of plasma membrane (cellular component)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin
IPR028871 - Blue (type 1) copper protein, binding site
IPR039391 - Phytocyanin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058752.1 mucin-5AC [Cucumis melo var. makuwa]2.4e-18679.13Show/hide
Query:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS
        GRFA+VLGFALFLFL+YSAAQTV+TVGDSAGW +P NG  FY +WAAGKTF VGDSL     TG+DEV+RV+K  FD+CSDD+EIGDSIETGPATIPL S
Subjt:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS

Query:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS
        PGEYYFISSEDRHCQQGQKLAINVTAAPG  +PPSS+ PP  P PGRAPVTHVVGD  GW +PQGGA+FYANW A + F+VGDSL+FNF T  DD+VRV+
Subjt:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS

Query:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA
        K+S     DD EIGDDID+GPATI L TPGEYYFISNED HCQQGQKLAINV+AAA GPM PPSSNPPPS     +PRPAPVTHIVG  VGWT+PPGGAA
Subjt:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA

Query:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP
        FYVNWTAGKTFAVGDSLVFNF+TDVHDVERVPK SFDICSDDNEIGDTIESGPAT+VL  PGEHYYIS ENQDC+LGQKLAINV A+RS GPVTSIS PP
Subjt:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP

Query:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL
        TSGPT  P  SG G PNSS NT+AAALSATVFGL+L
Subjt:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL

XP_004135850.1 early nodulin-like protein 2 [Cucumis sativus]7.1e-18678.21Show/hide
Query:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP
        RFA VL FALFLFLQYSAAQTV+TVGDS GW +P NG  FY TWAA K F VGDSLVFNFTT  DEV RV+K  FD+CSDD+EIGDSIETGPATI L +P
Subjt:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP

Query:  GEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTT-NDDVVRVSK
        GEY+FISSEDRHCQQGQKLAINVTAAPG  +PPSSN PP  P P RAPVTHVVGD AGW +P+GGA+FY+NWAA ++F+ GDSL+FNF T +DDVVRVSK
Subjt:  GEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTT-NDDVVRVSK

Query:  RS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFY
        +S     DD EIG+DID+GPATI L TPGEYYFISNED HCQQGQKLAINV+AA G MAPPSSNPPPS     +PRPAPVTHIVGD VGWT PPGGAAFY
Subjt:  RS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFY

Query:  VNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPTS
        VNWT GKTFAVGDS+VFNF T+VHDVERVPK SFDICSDDNEIG+TIESGPAT+VL  PGEHYYISTENQDCQLGQKLAINV ATRS GPVTS+S PPTS
Subjt:  VNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPTS

Query:  GPTSGPTPSGT--GPPNSSANTVAAALSATVFGLIL
        GPT+G +P GT  G P SSANT+AAA+SATVFGL L
Subjt:  GPTSGPTPSGT--GPPNSSANTVAAALSATVFGLIL

XP_008461119.1 PREDICTED: uncharacterized protein LOC103499800 [Cucumis melo]2.5e-19180.28Show/hide
Query:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS
        GRFA+VLGFALFLFL+YSAAQTV+TVGDSAGW +P NG  FY +WAAGKTF VGDSLVFNF TG+DEV+RV+K  FD+CSDD+EIGDSIETGPATIPL S
Subjt:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS

Query:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS
        PGEYYFISSEDRHCQQGQKLAINVTAAPG  +PPSS+ PP  P PGRAPVTHVVGD  GW +PQGGA+FYANW A + F+VGDSL+FNF T  DD+VRV+
Subjt:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS

Query:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA
        K+S     DD EIGDDID+GPATI L TPGEYYFISNED HCQQGQKLAINV+AAA GPM PPSSNPPPS     +PRPAPVTHIVG  VGWT+PPGGAA
Subjt:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA

Query:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP
        FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPK SFDICSDDNEIGDTIESGPAT+VL  PGEHYYIS ENQDC+LGQKLAINV A+RS GPVTSIS PP
Subjt:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP

Query:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL
        TSGPT  P  SG G PNSS NT+AAALSATVFGL+L
Subjt:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL

XP_023513444.1 uncharacterized protein LOC111778056 [Cucurbita pepo subsp. pepo]4.6e-17774.89Show/hide
Query:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP
        + ALVLGFALFLFL +SAAQTVH VGDS GWRIPP  ADFY  WA GKTF VGDSLVFNFTT  D+VTRV K SFD+CSDD+EIGDSIE GPAT+ L++ 
Subjt:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP

Query:  GEYYFISSEDRHCQQGQKLAINVTAAPGA---PTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSK
        GE+YFISSED HCQQGQKLAINVTAAP A   PTPPS+ APPPT GRAPVTHVVGDA GW +PQGG +FY NWA  + FVVGDSLLFNF   DDVVRV+K
Subjt:  GEYYFISSEDRHCQQGQKLAINVTAAPGA---PTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSK

Query:  R-----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAF
        R     SDDD+IG+DID  PA   L+ PGEYYFIS+EDRHCQQGQKLAINV+AAA GPM      PPPSNARPP P+PAPVTH+VGD VGWTVP GGAAF
Subjt:  R-----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAF

Query:  YVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPT
        Y NW AGKTF VGDSLVFNF+ +VHDVERV KRSFDICSDD+EIGD+I+S PATIVL  PGEHYYISTENQDC+LGQKLAINV ATRSN P TSI+  P+
Subjt:  YVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPT

Query:  SGPTSG---PTPSGTGPPNSSANTVAAALSATVFGLIL
        SGP++G     P G+G P SSANTVAAALSAT+FGL+L
Subjt:  SGPTSG---PTPSGTGPPNSSANTVAAALSATVFGLIL

XP_038899883.1 blue copper protein-like [Benincasa hispida]2.1e-22291.03Show/hide
Query:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS
        GRFALVLGFA+FLFLQYSAAQTVHTVGDSAGWRIPPNGA FYV WAAGKTFRVGDSLVFNFT+GLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPL S
Subjt:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS

Query:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSKR-
        PGEYYFISSEDRHC+QGQKLAINVTAAPGAPTPPSSNAPP TPGRAPVTHVVG AAGW VPQGGA+FY NWAA +TFVVGDSLLFNFTTNDDVVRVSKR 
Subjt:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSKR-

Query:  ----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFYVN
            SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAAGPMAPPSSNPPPSNARPP+PRPAPVTHIVGD VGWT PPGGAAFYVN
Subjt:  ----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFYVN

Query:  WTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPTSGP
        W AGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDD++IGD+IESGPATIVL++ GEHYYISTENQDCQLGQKLAINVTATR N PVTSIS PPTSGP
Subjt:  WTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPTSGP

Query:  TSGPTPS---GTGPPNSSANTVAAALSATVFGLIL
        T G +PS   GTGPPNSSANTVAAALSAT+FGL+L
Subjt:  TSGPTPS---GTGPPNSSANTVAAALSATVFGLIL

TrEMBL top hitse value%identityAlignment
A0A0A0K6B8 Uncharacterized protein3.4e-18678.21Show/hide
Query:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP
        RFA VL FALFLFLQYSAAQTV+TVGDS GW +P NG  FY TWAA K F VGDSLVFNFTT  DEV RV+K  FD+CSDD+EIGDSIETGPATI L +P
Subjt:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP

Query:  GEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTT-NDDVVRVSK
        GEY+FISSEDRHCQQGQKLAINVTAAPG  +PPSSN PP  P P RAPVTHVVGD AGW +P+GGA+FY+NWAA ++F+ GDSL+FNF T +DDVVRVSK
Subjt:  GEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTT-NDDVVRVSK

Query:  RS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFY
        +S     DD EIG+DID+GPATI L TPGEYYFISNED HCQQGQKLAINV+AA G MAPPSSNPPPS     +PRPAPVTHIVGD VGWT PPGGAAFY
Subjt:  RS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFY

Query:  VNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPTS
        VNWT GKTFAVGDS+VFNF T+VHDVERVPK SFDICSDDNEIG+TIESGPAT+VL  PGEHYYISTENQDCQLGQKLAINV ATRS GPVTS+S PPTS
Subjt:  VNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPTS

Query:  GPTSGPTPSGT--GPPNSSANTVAAALSATVFGLIL
        GPT+G +P GT  G P SSANT+AAA+SATVFGL L
Subjt:  GPTSGPTPSGT--GPPNSSANTVAAALSATVFGLIL

A0A1S3CDI6 uncharacterized protein LOC1034998001.2e-19180.28Show/hide
Query:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS
        GRFA+VLGFALFLFL+YSAAQTV+TVGDSAGW +P NG  FY +WAAGKTF VGDSLVFNF TG+DEV+RV+K  FD+CSDD+EIGDSIETGPATIPL S
Subjt:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS

Query:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS
        PGEYYFISSEDRHCQQGQKLAINVTAAPG  +PPSS+ PP  P PGRAPVTHVVGD  GW +PQGGA+FYANW A + F+VGDSL+FNF T  DD+VRV+
Subjt:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS

Query:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA
        K+S     DD EIGDDID+GPATI L TPGEYYFISNED HCQQGQKLAINV+AAA GPM PPSSNPPPS     +PRPAPVTHIVG  VGWT+PPGGAA
Subjt:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA

Query:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP
        FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPK SFDICSDDNEIGDTIESGPAT+VL  PGEHYYIS ENQDC+LGQKLAINV A+RS GPVTSIS PP
Subjt:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP

Query:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL
        TSGPT  P  SG G PNSS NT+AAALSATVFGL+L
Subjt:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL

A0A5D3CJG7 Mucin-5AC1.2e-18679.13Show/hide
Query:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS
        GRFA+VLGFALFLFL+YSAAQTV+TVGDSAGW +P NG  FY +WAAGKTF VGDSL     TG+DEV+RV+K  FD+CSDD+EIGDSIETGPATIPL S
Subjt:  GRFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSS

Query:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS
        PGEYYFISSEDRHCQQGQKLAINVTAAPG  +PPSS+ PP  P PGRAPVTHVVGD  GW +PQGGA+FYANW A + F+VGDSL+FNF T  DD+VRV+
Subjt:  PGEYYFISSEDRHCQQGQKLAINVTAAPGAPTPPSSNAPP--PTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNF-TTNDDVVRVS

Query:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA
        K+S     DD EIGDDID+GPATI L TPGEYYFISNED HCQQGQKLAINV+AAA GPM PPSSNPPPS     +PRPAPVTHIVG  VGWT+PPGGAA
Subjt:  KRS-----DDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAA

Query:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP
        FYVNWTAGKTFAVGDSLVFNF+TDVHDVERVPK SFDICSDDNEIGDTIESGPAT+VL  PGEHYYIS ENQDC+LGQKLAINV A+RS GPVTSIS PP
Subjt:  FYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPP

Query:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL
        TSGPT  P  SG G PNSS NT+AAALSATVFGL+L
Subjt:  TSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL

A0A6J1H8Q4 uncharacterized protein LOC1114611188.5e-17774.71Show/hide
Query:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP
        + ALVLGFALFLFL +SAAQTVH VGDS GWRIPP  ADFY  WA GKTF VGDSLVFNFTT  D+VTRV K SFD+CSDD+EIGDSIE GPAT+ L++ 
Subjt:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP

Query:  GEYYFISSEDRHCQQGQKLAINVTAAPGA---PTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSK
        GE+YFISSED HCQQGQKLAINVTAAP A   PTPPS+ APPPT GRAPVTHVVGDA GW +PQGG +FY NWA  + FVVGDSLLFNF+  DDVVRV+K
Subjt:  GEYYFISSEDRHCQQGQKLAINVTAAPGA---PTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSK

Query:  R-----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAF
        R     SDDD+IG+DID  PA    + PGEYYFIS+ED HCQQGQKLAINV+AAA GPM      PPPSNARPP P+PAPVTH+VGD VGWTVP GGAAF
Subjt:  R-----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAF

Query:  YVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPT
        Y NW AGKTFAVGDSLVFNF+++VHDV+RV KRSFDICSDD+EIGD+I+S PAT+VL  PGEHYYISTENQDC+LGQKLAINV A+RSN P TSI+  P+
Subjt:  YVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPT

Query:  SGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL
        SGP S P  SG+G P SSANTVAAALSAT+FGL+L
Subjt:  SGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL

A0A6J1KVF8 uncharacterized protein LOC1114979499.3e-17675.86Show/hide
Query:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP
        + ALVLG ALFLFL +SAAQTVH VGDS GWRIPP  ADFY  WA GK F VGDSLVFNFTT  D+VTRV K SF+LCSDD+EIGDSIE GPATI LS+ 
Subjt:  RFALVLGFALFLFLQYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSP

Query:  GEYYFISSEDRHCQQGQKLAINVTAAPGA---PTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSK
        GEYYFISSED HCQQGQKLAINVTAAP A   PTPPS+ APPPT GRAPVTHVVGDA GW +PQGG IFY NWA  + FVVGDSLLFNF   DDVVRV+K
Subjt:  GEYYFISSEDRHCQQGQKLAINVTAAPGA---PTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSK

Query:  R-----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAF
        R     SDDD+IG+DID  PATILLS  GEYYFIS+ED HCQQGQKLAINV+AAA GPM      PPPSNARPP P+ APVTH+VGD VGWTVP GGAAF
Subjt:  R-----SDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQQGQKLAINVSAAA-GPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAF

Query:  YVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPT
        Y NW A  TFAVGDSLVFNF+ +VHDVERV KRSFDICSDD+EIGD+I+S PATIVL  PG HYYISTENQDC+LGQKLAINV ATRSN P TSI+  P+
Subjt:  YVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPT

Query:  SGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL
        SGP S P  SG+G P SSANTVAAALSAT+FGL+L
Subjt:  SGPTSGPTPSGTGPPNSSANTVAAALSATVFGLIL

SwissProt top hitse value%identityAlignment
P29602 Cucumber peeling cupredoxin2.0e-2650.75Show/hide
Query:  TVHVVGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKV-TKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKL
        TVH+VGD+TGW+VP     FYS WA+   F VGDSL FNF  N H+V ++ TK SFDACN  N+  +V  T P   +LD  G+HYF+CTVG HC  GQKL
Subjt:  TVHVVGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKV-TKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKL

Query:  SVTVSASTPGGAMPPSSNTP----RPPAANTPSS
        S+ V A+    +MPP S++P     PP    P S
Subjt:  SVTVSASTPGGAMPPSSNTP----RPPAANTPSS

P42849 Umecyanin2.9e-1746.85Show/hide
Query:  VGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLSVTV-
        VG    W  P     FY  WA+   F VGD L F+F   MHDV  VTKD+FD C   N I + +TT P  + L+T G  Y+ICTVG HC  GQKLS+ V 
Subjt:  VGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLSVTV-

Query:  -SASTPGGAMP
         +    GGA P
Subjt:  -SASTPGGAMP

P80728 Mavicyanin4.2e-1644.76Show/hide
Query:  TVHVVGDSTGWT--VPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQK
        TVH VGDSTGWT  VP      Y+ WAS N F VGDSL FN+    H+VL+V ++ F +CNS++   +  T+G  ++ L   G  YF+C +  HC  GQK
Subjt:  TVHVVGDSTGWT--VPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQK

Query:  LSVTV
        + + V
Subjt:  LSVTV

Q07488 Blue copper protein7.7e-1839.88Show/hide
Query:  VGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLSVT-V
        VGD T WT P     FY+ WA+   F VGD L F+F    HDV  V++ +F+ C     I + +T  P  + L+T G  YFICTVG HC  GQKLS+T V
Subjt:  VGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLSVT-V

Query:  SASTPGGAMPPSSNTPRPPAANTPSSHDDACAPTPANSSSLSPPIHGGSPSAPAPSSSTAL-MATFYV
        +A   GGA P +  TP P   +TPS+       TP  +   + P      + PA +++++L  ATF V
Subjt:  SASTPGGAMPPSSNTPRPPAANTPSSHDDACAPTPANSSSLSPPIHGGSPSAPAPSSSTAL-MATFYV

Q41001 Blue copper protein8.2e-2041.92Show/hide
Query:  TVHVVGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLS
        TV+ VGD++GW +     G YS WAS   FAVGDSL FN+G   H V +V +  + +C S N+I +  +TG  T+ L  AG HYFIC V  H  GG KLS
Subjt:  TVHVVGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLS

Query:  VTVSASTPGGAMPPSSNTPRPPAANTPSSHDDACAPTPANSSSLSPPIHGGSPSAPAPSSSTALMAT
        + V AS+   A P  S TP      +PSS D     TPA +++ + P      SA + S   AL  T
Subjt:  VTVSASTPGGAMPPSSNTPRPPAANTPSSHDDACAPTPANSSSLSPPIHGGSPSAPAPSSSTALMAT

Arabidopsis top hitse value%identityAlignment
AT1G45063.1 copper ion binding;electron carriers8.8e-1732.23Show/hide
Query:  VGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTN-DDVVRVSKRSD----DDEIGDDIDN-GPATILLSTPGEYYFISNEDRHCQQGQKLAINVS
        VGD+ GW         Y +W   +   VGDSL+F +  N +DV +VS   +    D      + N G   +  + PG YYFI++    C  GQ+L + V 
Subjt:  VGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTN-DDVVRVSKRSD----DDEIGDDIDN-GPATILLSTPGEYYFISNEDRHCQQGQKLAINVS

Query:  AAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFYVNWTAGKTFAVGDSLVFNFQTDVHDVERVP-KRSFDICSDDNEIGDTIESGP
               P S +P P    P    P+   + VGD   W V    + FY NW+  K F VGD L+F +  +V+ V  +     F  C   + I    ++G 
Subjt:  AAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFYVNWTAGKTFAVGDSLVFNFQTDVHDVERVP-KRSFDICSDDNEIGDTIESGP

Query:  ATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTS
          I L +PG HY+IS+E   C  G KL + V  T  N P  S
Subjt:  ATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTS

AT1G45063.2 copper ion binding;electron carriers8.8e-1732.23Show/hide
Query:  VGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTN-DDVVRVSKRSD----DDEIGDDIDN-GPATILLSTPGEYYFISNEDRHCQQGQKLAINVS
        VGD+ GW         Y +W   +   VGDSL+F +  N +DV +VS   +    D      + N G   +  + PG YYFI++    C  GQ+L + V 
Subjt:  VGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTN-DDVVRVSKRSD----DDEIGDDIDN-GPATILLSTPGEYYFISNEDRHCQQGQKLAINVS

Query:  AAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFYVNWTAGKTFAVGDSLVFNFQTDVHDVERVP-KRSFDICSDDNEIGDTIESGP
               P S +P P    P    P+   + VGD   W V    + FY NW+  K F VGD L+F +  +V+ V  +     F  C   + I    ++G 
Subjt:  AAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFYVNWTAGKTFAVGDSLVFNFQTDVHDVERVP-KRSFDICSDDNEIGDTIESGP

Query:  ATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTS
          I L +PG HY+IS+E   C  G KL + V  T  N P  S
Subjt:  ATIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTS

AT2G25060.1 early nodulin-like protein 141.7e-1533.57Show/hide
Query:  MADRLSLVLPFALFLLLQYSAAQTVHTVGDSAGWRIPPNGAAFYVNWAAGKTFCVGDSLVFNFTTGLDEVTRVSKRSFDLCSNNDEIGDSIETGPATILL
        MA     V  F+L  L   +AA  V   G S  W+IPP+ +  +  WA    F VGD +VF + +G D V  V+K +++ C+  + + +  + G   + L
Subjt:  MADRLSLVLPFALFLLLQYSAAQTVHTVGDSAGWRIPPNGAAFYVNWAAGKTFCVGDSLVFNFTTGLDEVTRVSKRSFDLCSNNDEIGDSIETGPATILL

Query:  SSPGEYYFISSEDRHCQQGQKLAINVIAAPGAPTPPSSNAPPP
           G +YFIS  + HC++GQKL++ VI+   +   P   AP P
Subjt:  SSPGEYYFISSEDRHCQQGQKLAINVIAAPGAPTPPSSNAPPP

AT2G31050.1 Cupredoxin superfamily protein2.5e-1637.97Show/hide
Query:  SLVLPFALFLLLQYSAAQTVHTVGDSAGWRIPPNGAAFYVNWAAGKTFCVGDSLVFNFTTGLDEVTRVSKRSFDLCSNNDEIGDSIETGPATILLSSPGE
        SL++  ALF +   S   TVH VGDS GW I    +  Y  WA+  TF VGDSLVF +     +VT V+   +++C  +  +    ETG   ++L+ PG 
Subjt:  SLVLPFALFLLLQYSAAQTVHTVGDSAGWRIPPNGAAFYVNWAAGKTFCVGDSLVFNFTTGLDEVTRVSKRSFDLCSNNDEIGDSIETGPATILLSSPGE

Query:  YYFISSEDRHCQQGQKLAINVIAA---------PGAPTPPSS-NAPPPTP-GRAPVTH
         +FI     HC  GQKL I+V+ A         PG   PPSS ++P  +P   +PV H
Subjt:  YYFISSEDRHCQQGQKLAINVIAA---------PGAPTPPSS-NAPPPTP-GRAPVTH

AT5G20230.1 blue-copper-binding protein5.5e-1939.88Show/hide
Query:  VGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLSVT-V
        VGD T WT P     FY+ WA+   F VGD L F+F    HDV  V++ +F+ C     I + +T  P  + L+T G  YFICTVG HC  GQKLS+T V
Subjt:  VGDSTGWTVPQGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLSVT-V

Query:  SASTPGGAMPPSSNTPRPPAANTPSSHDDACAPTPANSSSLSPPIHGGSPSAPAPSSSTAL-MATFYV
        +A   GGA P +  TP P   +TPS+       TP  +   + P      + PA +++++L  ATF V
Subjt:  SASTPGGAMPPSSNTPRPPAANTPSSHDDACAPTPANSSSLSPPIHGGSPSAPAPSSSTAL-MATFYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATAGGCTTTCTCTTGTTCTGCCTTTTGCTCTCTTTTTGTTGCTTCAATACTCCGCCGCTCAGACCGTTCATACCGTCGGTGACTCCGCCGGCTGGAGGATCCC
TCCAAACGGCGCCGCTTTCTATGTCAACTGGGCTGCTGGAAAAACTTTCTGCGTCGGTGATTCTCTCGTTTTCAATTTCACGACTGGTTTGGATGAAGTGACGAGAGTTT
CGAAGAGGTCGTTTGATTTGTGCAGTAACAACGATGAAATTGGAGATTCCATTGAAACAGGGCCTGCAACCATTCTTTTATCATCTCCAGGCGAGTATTATTTCATTAGC
TCTGAGGATAGGCACTGCCAGCAAGGTCAAAAGTTAGCCATCAATGTCATCGCTGCCCCTGGAGCTCCGACGCCGCCGTCTTCCAACGCTCCTCCACCAACCCCAGGACG
AGCTCCAGTGACCCATGTCGTCGGAGACGCCGCTGGGTGGGCGGTTCCTCAAGGCGGTGCGATCTTCTATGTCAACTGGGTTGGCAGAAAAACCTTCGTTCTGGGCGATT
CTCTATGTAAGGATAATCTATTCAATGTGATAAACAGAGGAAAGGGGAAAGAAAGGAAAAAAAATGGTCGGTTCGCTCTTGTTCTGGGTTTTGCTCTGTTTTTGTTTCTT
CAATACTCCGCCGCTCAGACCGTTCATACCGTCGGTGACTCTGCTGGCTGGAGAATCCCTCCAAACGGCGCCGATTTCTATGTCACCTGGGCTGCTGGAAAAACTTTCCG
AGTCGGTGATTCTCTCGTTTTCAATTTCACGACGGGTTTGGATGAAGTGACGAGAGTTTCGAAAAGGTCGTTTGATTTGTGCAGTGACGACGATGAAATTGGAGATTCCA
TTGAAACAGGGCCTGCAACCATTCCCTTATCATCACCAGGGGAGTATTATTTCATCAGCTCTGAAGATAGGCATTGCCAGCAAGGTCAAAAATTAGCCATCAATGTCACC
GCCGCCCCTGGAGCTCCGACGCCGCCGTCTTCCAACGCTCCTCCACCAACGCCAGGACGAGCTCCAGTGACCCACGTTGTCGGAGACGCCGCTGGGTGGGCGGTTCCTCA
AGGCGGAGCCATCTTCTACGCCAACTGGGCTGCCAGAAGAACGTTTGTTGTGGGCGATTCTTTATTGTTCAATTTTACAACCAACGATGATGTAGTTAGAGTATCGAAAC
GATCCGATGACGATGAGATCGGCGACGACATCGACAATGGGCCGGCGACCATCCTACTATCAACTCCCGGCGAGTATTACTTCATAAGCAATGAAGATAGGCACTGCCAG
CAAGGTCAGAAATTAGCAATCAACGTCTCTGCCGCCGCTGGACCAATGGCTCCGCCTTCATCGAATCCTCCACCTTCCAACGCTCGTCCGCCATCTCCACGCCCTGCCCC
CGTGACTCATATCGTCGGAGACCGCGTAGGCTGGACGGTTCCCCCAGGCGGCGCTGCTTTCTACGTCAATTGGACTGCCGGCAAGACATTCGCGGTTGGTGATTCTCTAG
TGTTCAACTTCCAAACTGATGTACACGATGTCGAAAGAGTACCAAAAAGATCCTTTGATATATGTAGTGACGACAACGAGATCGGTGACACCATTGAATCTGGCCCTGCA
ACCATCGTTCTCAAAGAACCCGGCGAGCATTACTACATTAGTACCGAGAACCAGGATTGTCAACTCGGACAAAAACTAGCAATTAATGTAACTGCCACCAGATCCAACGG
TCCTGTGACATCCATTTCAGCCCCCCCAACCTCCGGTCCAACCTCCGGTCCGACTCCCAGTGGCACCGGACCACCGAACTCCTCCGCCAACACTGTTGCTGCTGCCCTCT
CCGCCACAGTTTTTGGCCTTATTTTAAGGAAAAAAATGGCCGGTAGGTTCGCTCTTGTTCTGTTTTTTGCTCCGTTTTTGTTTCTTCAATACTCCGCCGCTCAGACCGTT
CATACCGTCGGTGACTCCGCTGGCTGGAGAATCCATCCGAACGACGCCGCATTCTACGTCAATTGGGCTGCCGGAAGAACTTTCCACGTCGGTGATTCTCTCGTTTTCAA
TTTCACGACGGATTTGGATGAAGTAACGAGAGTTTCCAAGAGGTCGTTTGATTTGTGCCGTGACGACGATGAAATTGGAGATTCCATTGAAACAGGGCCTGCAACCATTC
CTTTATCATCACCAGGGGAGTATTATTTCATCAGCCCTGAGGATAGGCATTGCCAGCAAGGTCAGAAATTAGCAATCAACGTCTCTGCCGCCGCTGGACCGATGGCCCCG
CCTTCATCAAATCCTCCACCTTCCAACGCTCGTCCGCCATCTCCACGCCCTGCCCCAGTGACCCATATCGTCGGAGACGGCGTAGGCTGGACCATTCCCCCGAGCGGCGC
CACTTTTTACGTCAACTGGGCTGCCGGCAAGACGTTCACCGTCGGTGATTCTCTAGAGATTGGTGGAACCATTGAATCTGGCCCAGCAACCATCCTTCTCACTGCTCCCC
GCGAGCATTACTACATAAGTACCGAGAATCAAGATTGTCAGTTCGGCCAAAAGTTAGCAATCAATGTAACTACCCCGGGATCGAAGGGTCCTGTTACATCCATTTCATCC
CTCCCAACCTCTGGTCCGACGACTGGTAGTTCACCATCTACACCTGGTGGCACCGGACCACCGAACTCCTCCGCCAACACTGTCGCCGCCGCTCTCTTCGCCACAGCCTT
GGGCCTCATAATGGCTGGAGGAGTTGGTTTTATTTTGGGTTTCCTTCTTCTTGTTTCTATTCTGCACCATGCCACCGCCCAGACTGTGCACGTCGTTGGCGACACCACCG
CATTTAAGTTTATGACCGGAGCACACGATGTAGCTAAAGTATCAAAAGAAGCCTTCGAAGCATGCAGTTCCGATAGCCCTATCGGCAATGTCCTCAGTACGGGCCCTGCG
ACGGTGAAACTCGACACTGCTGGCGTGCATTTCTTCATCTGTACTGTTGGTAGGCACTGTTCGGGAGGCCAAAAGTTATCCGTCACTGTCTCCGCCTCCGCCACTCCCGC
TGGTGCTGTGGCGCCGTCTCCAAACACTACTCATGGCGGTGCTGTGGCGTCGTCTCCAAACACTACTGATGAGCCACCGGCAACTCCAACTCCTGCAAATTCGCCGACCT
CGTCACTACCTAATGATGGTGGATCTGCTGCTGCAGCTCCGGCGCCTTCCTCCTCCACTGCAGTGATGGCATCAATTTATGTCACTTTGTCCGCCATTGTTATGAACATT
TCCGATAAGGGAATTGGCAATGTCCTAACGACGGGCCCTGCAACGGTGAAACTTGACACTGATGGCGTGCATTACTTCATTTGTACCGTTGGTAAACACTGTTTGGGAGG
CCAAAAGTTATCCATCACTGTTTCCGCCACTCCTGGGGGTGCCATGGCGCCTTCTTCAAACACTACTGATGAGCCACTGGCAACTCCAACTCCGGCAAATTCGCCGACCT
CGTCACTACTTAACGGCGCTGAATCTCCTGCAGCTCCCGCGCCTTCCTCCTCCACTGCCGTGATGGCAACTCTTTATGTCACTTTGTCTGCCATGGTTATGAACTTCTTT
GTCCTCAAAGTACCAAAAGAGTCATTCAATATGTGCAGTACATACAATGCCGTCGGAAATTTCATCACTACAGATCCTGAAACGGTGAAGCTCGGCACCATCGGCGTATA
TTACTTCATTTGTATCGTCGACAGGCACTGTTTTGAAGGCCAAAAGTTATCTGTCACCGTCTCGACCAGTACTCCTGGCGGTGCAATGCCGCCTTCTTCAAACACTGCTC
AGCCTCCTTCTGACGATGATGCATGTGCTCCAACTCCGGCAAATTCGTTGTCTTCATCCCTATCTGTGACAGTTCATGTTGTCGGAGACTCCACCGGTTGGACGGTTCCG
CAGGGCGGCGCTGGTTTCTATTCCGACTGGGCTTCTAGAAACAACTTTGCCGTTGGCGATTCTTTAACATTTAACTTCGGGACCAACATGCACGATGTTCTTAAAGTAAC
AAAAGATTCCTTCGATGCTTGCAATTCAAACAATGCCATCGGCAATGTTATCACTACAGGTCCTGCCACAGTGAAGCTTGACACCGCCGGTGTACATTACTTCATTTGTA
CTGTCGGCCAGCACTGTTTTGGAGGCCAAAAGTTATCCGTTACCGTCTCCGCCAGTACTCCCGGCGGTGCAATGCCGCCTTCTTCAAACACTCCTCGGCCACCCGCCGCA
AACACTCCTTCTTCTCATGATGATGCTTGTGCTCCAACTCCTGCAAATTCGTCGTCTTTGTCCCCACCTATCCACGGTGGATCTCCATCAGCTCCGGCGCCTTCCTCCTC
CACCGCGCTGATGGCAACTTTTTATGTCACTTTGTCCGCCATTCATCCATGGCGGCCGGACTTGGGTTTGTTTTGGGTTTTCTTGCGGTGGTTTTTCTTCAACATGCCAC
CGCTCAGACGGTTCATGTTGTCGGTGACAACACCGGTTGGACAGTTCCACAGGACGGCCCCGCTTTCTATTCCGGCTGGGCTGCTAACAAGAATTTCCGCCCACGATGTA
CTTAAAGTCTCAAAAGAATCGTTCGACAGATGTAATTTCACCGGCGACGATGACGATATCATAAGGACAGGTCCGACGACGGTGAGACTCAACGAGACGGACATGCATTA
CTTTATTTGCACCATCGGCACACATTGTTCGTTAGGCCAAAAGTTATCCATCAATGTCGTCGCCGCCACTGCCGGTGGTCCAATGTCGCCGTCACCGCCTTCCTCAGTAT
TTCCACCACCGTCTCCTTCCTCCTCCAATGCACTTATTGCAACCCTTTGTCTCAATTTCTCCGCCCTTCTTATGGCCTTCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATAGGCTTTCTCTTGTTCTGCCTTTTGCTCTCTTTTTGTTGCTTCAATACTCCGCCGCTCAGACCGTTCATACCGTCGGTGACTCCGCCGGCTGGAGGATCCC
TCCAAACGGCGCCGCTTTCTATGTCAACTGGGCTGCTGGAAAAACTTTCTGCGTCGGTGATTCTCTCGTTTTCAATTTCACGACTGGTTTGGATGAAGTGACGAGAGTTT
CGAAGAGGTCGTTTGATTTGTGCAGTAACAACGATGAAATTGGAGATTCCATTGAAACAGGGCCTGCAACCATTCTTTTATCATCTCCAGGCGAGTATTATTTCATTAGC
TCTGAGGATAGGCACTGCCAGCAAGGTCAAAAGTTAGCCATCAATGTCATCGCTGCCCCTGGAGCTCCGACGCCGCCGTCTTCCAACGCTCCTCCACCAACCCCAGGACG
AGCTCCAGTGACCCATGTCGTCGGAGACGCCGCTGGGTGGGCGGTTCCTCAAGGCGGTGCGATCTTCTATGTCAACTGGGTTGGCAGAAAAACCTTCGTTCTGGGCGATT
CTCTATGTAAGGATAATCTATTCAATGTGATAAACAGAGGAAAGGGGAAAGAAAGGAAAAAAAATGGTCGGTTCGCTCTTGTTCTGGGTTTTGCTCTGTTTTTGTTTCTT
CAATACTCCGCCGCTCAGACCGTTCATACCGTCGGTGACTCTGCTGGCTGGAGAATCCCTCCAAACGGCGCCGATTTCTATGTCACCTGGGCTGCTGGAAAAACTTTCCG
AGTCGGTGATTCTCTCGTTTTCAATTTCACGACGGGTTTGGATGAAGTGACGAGAGTTTCGAAAAGGTCGTTTGATTTGTGCAGTGACGACGATGAAATTGGAGATTCCA
TTGAAACAGGGCCTGCAACCATTCCCTTATCATCACCAGGGGAGTATTATTTCATCAGCTCTGAAGATAGGCATTGCCAGCAAGGTCAAAAATTAGCCATCAATGTCACC
GCCGCCCCTGGAGCTCCGACGCCGCCGTCTTCCAACGCTCCTCCACCAACGCCAGGACGAGCTCCAGTGACCCACGTTGTCGGAGACGCCGCTGGGTGGGCGGTTCCTCA
AGGCGGAGCCATCTTCTACGCCAACTGGGCTGCCAGAAGAACGTTTGTTGTGGGCGATTCTTTATTGTTCAATTTTACAACCAACGATGATGTAGTTAGAGTATCGAAAC
GATCCGATGACGATGAGATCGGCGACGACATCGACAATGGGCCGGCGACCATCCTACTATCAACTCCCGGCGAGTATTACTTCATAAGCAATGAAGATAGGCACTGCCAG
CAAGGTCAGAAATTAGCAATCAACGTCTCTGCCGCCGCTGGACCAATGGCTCCGCCTTCATCGAATCCTCCACCTTCCAACGCTCGTCCGCCATCTCCACGCCCTGCCCC
CGTGACTCATATCGTCGGAGACCGCGTAGGCTGGACGGTTCCCCCAGGCGGCGCTGCTTTCTACGTCAATTGGACTGCCGGCAAGACATTCGCGGTTGGTGATTCTCTAG
TGTTCAACTTCCAAACTGATGTACACGATGTCGAAAGAGTACCAAAAAGATCCTTTGATATATGTAGTGACGACAACGAGATCGGTGACACCATTGAATCTGGCCCTGCA
ACCATCGTTCTCAAAGAACCCGGCGAGCATTACTACATTAGTACCGAGAACCAGGATTGTCAACTCGGACAAAAACTAGCAATTAATGTAACTGCCACCAGATCCAACGG
TCCTGTGACATCCATTTCAGCCCCCCCAACCTCCGGTCCAACCTCCGGTCCGACTCCCAGTGGCACCGGACCACCGAACTCCTCCGCCAACACTGTTGCTGCTGCCCTCT
CCGCCACAGTTTTTGGCCTTATTTTAAGGAAAAAAATGGCCGGTAGGTTCGCTCTTGTTCTGTTTTTTGCTCCGTTTTTGTTTCTTCAATACTCCGCCGCTCAGACCGTT
CATACCGTCGGTGACTCCGCTGGCTGGAGAATCCATCCGAACGACGCCGCATTCTACGTCAATTGGGCTGCCGGAAGAACTTTCCACGTCGGTGATTCTCTCGTTTTCAA
TTTCACGACGGATTTGGATGAAGTAACGAGAGTTTCCAAGAGGTCGTTTGATTTGTGCCGTGACGACGATGAAATTGGAGATTCCATTGAAACAGGGCCTGCAACCATTC
CTTTATCATCACCAGGGGAGTATTATTTCATCAGCCCTGAGGATAGGCATTGCCAGCAAGGTCAGAAATTAGCAATCAACGTCTCTGCCGCCGCTGGACCGATGGCCCCG
CCTTCATCAAATCCTCCACCTTCCAACGCTCGTCCGCCATCTCCACGCCCTGCCCCAGTGACCCATATCGTCGGAGACGGCGTAGGCTGGACCATTCCCCCGAGCGGCGC
CACTTTTTACGTCAACTGGGCTGCCGGCAAGACGTTCACCGTCGGTGATTCTCTAGAGATTGGTGGAACCATTGAATCTGGCCCAGCAACCATCCTTCTCACTGCTCCCC
GCGAGCATTACTACATAAGTACCGAGAATCAAGATTGTCAGTTCGGCCAAAAGTTAGCAATCAATGTAACTACCCCGGGATCGAAGGGTCCTGTTACATCCATTTCATCC
CTCCCAACCTCTGGTCCGACGACTGGTAGTTCACCATCTACACCTGGTGGCACCGGACCACCGAACTCCTCCGCCAACACTGTCGCCGCCGCTCTCTTCGCCACAGCCTT
GGGCCTCATAATGGCTGGAGGAGTTGGTTTTATTTTGGGTTTCCTTCTTCTTGTTTCTATTCTGCACCATGCCACCGCCCAGACTGTGCACGTCGTTGGCGACACCACCG
CATTTAAGTTTATGACCGGAGCACACGATGTAGCTAAAGTATCAAAAGAAGCCTTCGAAGCATGCAGTTCCGATAGCCCTATCGGCAATGTCCTCAGTACGGGCCCTGCG
ACGGTGAAACTCGACACTGCTGGCGTGCATTTCTTCATCTGTACTGTTGGTAGGCACTGTTCGGGAGGCCAAAAGTTATCCGTCACTGTCTCCGCCTCCGCCACTCCCGC
TGGTGCTGTGGCGCCGTCTCCAAACACTACTCATGGCGGTGCTGTGGCGTCGTCTCCAAACACTACTGATGAGCCACCGGCAACTCCAACTCCTGCAAATTCGCCGACCT
CGTCACTACCTAATGATGGTGGATCTGCTGCTGCAGCTCCGGCGCCTTCCTCCTCCACTGCAGTGATGGCATCAATTTATGTCACTTTGTCCGCCATTGTTATGAACATT
TCCGATAAGGGAATTGGCAATGTCCTAACGACGGGCCCTGCAACGGTGAAACTTGACACTGATGGCGTGCATTACTTCATTTGTACCGTTGGTAAACACTGTTTGGGAGG
CCAAAAGTTATCCATCACTGTTTCCGCCACTCCTGGGGGTGCCATGGCGCCTTCTTCAAACACTACTGATGAGCCACTGGCAACTCCAACTCCGGCAAATTCGCCGACCT
CGTCACTACTTAACGGCGCTGAATCTCCTGCAGCTCCCGCGCCTTCCTCCTCCACTGCCGTGATGGCAACTCTTTATGTCACTTTGTCTGCCATGGTTATGAACTTCTTT
GTCCTCAAAGTACCAAAAGAGTCATTCAATATGTGCAGTACATACAATGCCGTCGGAAATTTCATCACTACAGATCCTGAAACGGTGAAGCTCGGCACCATCGGCGTATA
TTACTTCATTTGTATCGTCGACAGGCACTGTTTTGAAGGCCAAAAGTTATCTGTCACCGTCTCGACCAGTACTCCTGGCGGTGCAATGCCGCCTTCTTCAAACACTGCTC
AGCCTCCTTCTGACGATGATGCATGTGCTCCAACTCCGGCAAATTCGTTGTCTTCATCCCTATCTGTGACAGTTCATGTTGTCGGAGACTCCACCGGTTGGACGGTTCCG
CAGGGCGGCGCTGGTTTCTATTCCGACTGGGCTTCTAGAAACAACTTTGCCGTTGGCGATTCTTTAACATTTAACTTCGGGACCAACATGCACGATGTTCTTAAAGTAAC
AAAAGATTCCTTCGATGCTTGCAATTCAAACAATGCCATCGGCAATGTTATCACTACAGGTCCTGCCACAGTGAAGCTTGACACCGCCGGTGTACATTACTTCATTTGTA
CTGTCGGCCAGCACTGTTTTGGAGGCCAAAAGTTATCCGTTACCGTCTCCGCCAGTACTCCCGGCGGTGCAATGCCGCCTTCTTCAAACACTCCTCGGCCACCCGCCGCA
AACACTCCTTCTTCTCATGATGATGCTTGTGCTCCAACTCCTGCAAATTCGTCGTCTTTGTCCCCACCTATCCACGGTGGATCTCCATCAGCTCCGGCGCCTTCCTCCTC
CACCGCGCTGATGGCAACTTTTTATGTCACTTTGTCCGCCATTCATCCATGGCGGCCGGACTTGGGTTTGTTTTGGGTTTTCTTGCGGTGGTTTTTCTTCAACATGCCAC
CGCTCAGACGGTTCATGTTGTCGGTGACAACACCGGTTGGACAGTTCCACAGGACGGCCCCGCTTTCTATTCCGGCTGGGCTGCTAACAAGAATTTCCGCCCACGATGTA
CTTAAAGTCTCAAAAGAATCGTTCGACAGATGTAATTTCACCGGCGACGATGACGATATCATAAGGACAGGTCCGACGACGGTGAGACTCAACGAGACGGACATGCATTA
CTTTATTTGCACCATCGGCACACATTGTTCGTTAGGCCAAAAGTTATCCATCAATGTCGTCGCCGCCACTGCCGGTGGTCCAATGTCGCCGTCACCGCCTTCCTCAGTAT
TTCCACCACCGTCTCCTTCCTCCTCCAATGCACTTATTGCAACCCTTTGTCTCAATTTCTCCGCCCTTCTTATGGCCTTCTTTTAG
Protein sequenceShow/hide protein sequence
MADRLSLVLPFALFLLLQYSAAQTVHTVGDSAGWRIPPNGAAFYVNWAAGKTFCVGDSLVFNFTTGLDEVTRVSKRSFDLCSNNDEIGDSIETGPATILLSSPGEYYFIS
SEDRHCQQGQKLAINVIAAPGAPTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYVNWVGRKTFVLGDSLCKDNLFNVINRGKGKERKKNGRFALVLGFALFLFL
QYSAAQTVHTVGDSAGWRIPPNGADFYVTWAAGKTFRVGDSLVFNFTTGLDEVTRVSKRSFDLCSDDDEIGDSIETGPATIPLSSPGEYYFISSEDRHCQQGQKLAINVT
AAPGAPTPPSSNAPPPTPGRAPVTHVVGDAAGWAVPQGGAIFYANWAARRTFVVGDSLLFNFTTNDDVVRVSKRSDDDEIGDDIDNGPATILLSTPGEYYFISNEDRHCQ
QGQKLAINVSAAAGPMAPPSSNPPPSNARPPSPRPAPVTHIVGDRVGWTVPPGGAAFYVNWTAGKTFAVGDSLVFNFQTDVHDVERVPKRSFDICSDDNEIGDTIESGPA
TIVLKEPGEHYYISTENQDCQLGQKLAINVTATRSNGPVTSISAPPTSGPTSGPTPSGTGPPNSSANTVAAALSATVFGLILRKKMAGRFALVLFFAPFLFLQYSAAQTV
HTVGDSAGWRIHPNDAAFYVNWAAGRTFHVGDSLVFNFTTDLDEVTRVSKRSFDLCRDDDEIGDSIETGPATIPLSSPGEYYFISPEDRHCQQGQKLAINVSAAAGPMAP
PSSNPPPSNARPPSPRPAPVTHIVGDGVGWTIPPSGATFYVNWAAGKTFTVGDSLEIGGTIESGPATILLTAPREHYYISTENQDCQFGQKLAINVTTPGSKGPVTSISS
LPTSGPTTGSSPSTPGGTGPPNSSANTVAAALFATALGLIMAGGVGFILGFLLLVSILHHATAQTVHVVGDTTAFKFMTGAHDVAKVSKEAFEACSSDSPIGNVLSTGPA
TVKLDTAGVHFFICTVGRHCSGGQKLSVTVSASATPAGAVAPSPNTTHGGAVASSPNTTDEPPATPTPANSPTSSLPNDGGSAAAAPAPSSSTAVMASIYVTLSAIVMNI
SDKGIGNVLTTGPATVKLDTDGVHYFICTVGKHCLGGQKLSITVSATPGGAMAPSSNTTDEPLATPTPANSPTSSLLNGAESPAAPAPSSSTAVMATLYVTLSAMVMNFF
VLKVPKESFNMCSTYNAVGNFITTDPETVKLGTIGVYYFICIVDRHCFEGQKLSVTVSTSTPGGAMPPSSNTAQPPSDDDACAPTPANSLSSSLSVTVHVVGDSTGWTVP
QGGAGFYSDWASRNNFAVGDSLTFNFGTNMHDVLKVTKDSFDACNSNNAIGNVITTGPATVKLDTAGVHYFICTVGQHCFGGQKLSVTVSASTPGGAMPPSSNTPRPPAA
NTPSSHDDACAPTPANSSSLSPPIHGGSPSAPAPSSSTALMATFYVTLSAIHPWRPDLGLFWVFLRWFFFNMPPLRRFMLSVTTPVGQFHRTAPLSIPAGLLTRISAHDV
LKVSKESFDRCNFTGDDDDIIRTGPTTVRLNETDMHYFICTIGTHCSLGQKLSINVVAATAGGPMSPSPPSSVFPPPSPSSSNALIATLCLNFSALLMAFF