; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy1G019930 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy1G019930
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionMucin-5AC
Genome locationchrH01:24943349..24945087
RNA-Seq ExpressionChy1G019930
SyntenyChy1G019930
Gene Ontology termsGO:0022900 - electron transport chain (biological process)
GO:0046658 - anchored component of plasma membrane (cellular component)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin
IPR039391 - Phytocyanin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058752.1 mucin-5AC [Cucumis melo var. makuwa]2.24e-27086.99Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MA RFA VL FALFLFL+YSAAQTVYTVGDS+GWTVPANG  FY++WAA K F+ GDSL      G DEV+RVTKMGFD+CSDDNEIGDSIETGPATI L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR
         +PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSS+VPPRTPAP RAPVTHVVGDT GWGIPQGGAMFY+NW AGKRF VGD LVFNF TREDD+VR
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR

Query:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAA-PGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV
        VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAA PGPM PPSSNPPPSTPRPAPVTHIVG SVGWT PPGGAAFYV
Subjt:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAA-PGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV

Query:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG
        NWTA KTFAVGDSLVFNF+T+VHDVERVPK SFD CSDDNEIG+TIESGPATVVLTTPGEHYYIS ENQDC+LGQKLAINVVA+RSTGPVTS+STPPTSG
Subjt:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG

Query:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        PT GGS    G G PNSS NTIAAALSATVFGL LSFF
Subjt:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

KAG6593389.1 hypothetical protein SDJN03_12865, partial [Cucurbita argyrosperma subsp. sororia]1.29e-20569.61Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MAA+ A VL FALFLFL +SAAQTV+ VGDS+GW +P   + FY  WA  K F  GDSLVFNFT   D+V RV K  FDMCSDDNEIGDSIE GPAT+ L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPR---SPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDD
        +T GE+YFISSED HCQQGQKLAINVTAAP      +PPS+  PP T    RAPVTHVVGD  GW IPQGG MFY NWA GK F VGD L+FNF    DD
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPR---SPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDD

Query:  VVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAP-GPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAA
        VVRVTK+SFDLC+DD +IG+DID  PA   L  PG+YYFIS+EDGHCQQGQKLAINVTAA  GPM PPS+  PP  P+PAPVTH+VGD+VGWT P GGAA
Subjt:  VVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAP-GPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAA

Query:  FYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPP
        FY NW A KTFAVGDSLVFNF++EVHDV+RV K SFD CSDD+EIG++I+S PAT+VLT PGEHYYISTENQDC+LGQKLAINVVA+RS  P TS++T P
Subjt:  FYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPP

Query:  TSGPTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        +SGP +     G+G+G P SSANT+AAALSAT+FGL L+FF
Subjt:  TSGPTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

XP_004135850.1 early nodulin-like protein 2 [Cucumis sativus]6.11e-29793.14Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MAARFAFVL FALFLFLQYSAAQTVYTVGDS GWTVPANGE+FYKTWAADKIFY GDSLVFNFT  +DEVARVTKMGFDMCSDDNEIGDSIETGPATISL
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR
         TPGEY+FISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPP+TPAP+RAPVTHVVGDTAGWGIP+GGA+FYSNWAAGK F  GD LVFNF T +DDVVR
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR

Query:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYVN
        V+KQSFDLCNDDGEIG+DIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPG MAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYVN
Subjt:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYVN

Query:  WTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSGP
        WT  KTFAVGDS+VFNF TEVHDVERVPKASFD CSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSGP
Subjt:  WTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSGP

Query:  TAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        TAGGSPFGTGAGQP SSANTIAAA+SATVFGLALSFF
Subjt:  TAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

XP_008461119.1 PREDICTED: uncharacterized protein LOC103499800 [Cucumis melo]1.42e-27688.13Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MA RFA VL FALFLFL+YSAAQTVYTVGDS+GWTVPANG  FY++WAA K F+ GDSLVFNF  G DEV+RVTKMGFD+CSDDNEIGDSIETGPATI L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR
         +PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSS+VPPRTPAP RAPVTHVVGDT GWGIPQGGAMFY+NW AGKRF VGD LVFNF TREDD+VR
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR

Query:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAA-PGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV
        VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAA PGPM PPSSNPPPSTPRPAPVTHIVG SVGWT PPGGAAFYV
Subjt:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAA-PGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV

Query:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG
        NWTA KTFAVGDSLVFNFQT+VHDVERVPK SFD CSDDNEIG+TIESGPATVVLTTPGEHYYIS ENQDC+LGQKLAINVVA+RSTGPVTS+STPPTSG
Subjt:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG

Query:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        PT GGS    G G PNSS NTIAAALSATVFGL LSFF
Subjt:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

XP_038899883.1 blue copper protein-like [Benincasa hispida]1.61e-24278.78Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MA RFA VL FA+FLFLQYSAAQTV+TVGDS+GW +P NG  FY  WAA K F  GDSLVFNFT G DEV RV+K  FD+CSDD+EIGDSIETGPATI L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR
         +PGEYYFISSEDRHC+QGQKLAINVTAAPG  +PPSSN PP TP   RAPVTHVVG  AGW +PQGGAMFY NWAAGK F VGD L+FNF T  DDVVR
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR

Query:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPGPMAPPSSNPPPS-----TPRPAPVTHIVGDSVGWTTPPGGA
        V+K+SFDLC+DD EIGDDID+GPATI L TPGEYYFISNED HCQQGQKLAINV+AA GPMAPPSSNPPPS     TPRPAPVTHIVGD+VGWTTPPGGA
Subjt:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPGPMAPPSSNPPPS-----TPRPAPVTHIVGDSVGWTTPPGGA

Query:  AFYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTP
        AFYVNW A KTFAVGDSLVFNFQT+VHDVERVPK SFD CSDD++IG++IESGPAT+VL   GEHYYISTENQDCQLGQKLAINV ATR   PVTS+STP
Subjt:  AFYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTP

Query:  PTSGPTAGGSPFGTGA-GQPNSSANTIAAALSATVFGLALSFF
        PTSGPT G SP   G  G PNSSANT+AAALSAT+FGL LSFF
Subjt:  PTSGPTAGGSPFGTGA-GQPNSSANTIAAALSATVFGLALSFF

TrEMBL top hitse value%identityAlignment
A0A0A0K6B8 Uncharacterized protein2.0e-23493.14Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MAARFAFVL FALFLFLQYSAAQTVYTVGDS GWTVPANGE+FYKTWAADKIFY GDSLVFNFT  +DEVARVTKMGFDMCSDDNEIGDSIETGPATISL
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR
         TPGEY+FISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPP+TPAP+RAPVTHVVGDTAGWGIP+GGA+FYSNWAAGK F  GD LVFNF T +DDVVR
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR

Query:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYVN
        V+KQSFDLCNDDGEIG+DIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPG MAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYVN
Subjt:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYVN

Query:  WTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSGP
        WT  KTFAVGDS+VFNF TEVHDVERVPKASFD CSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSGP
Subjt:  WTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSGP

Query:  TAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        TAGGSPFGTGAGQP SSANTIAAA+SATVFGLALSFF
Subjt:  TAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

A0A1S3CDI6 uncharacterized protein LOC1034998006.9e-21988.13Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MA RFA VL FALFLFL+YSAAQTVYTVGDS+GWTVPANG  FY++WAA K F+ GDSLVFNF  G DEV+RVTKMGFD+CSDDNEIGDSIETGPATI L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR
         +PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSS+VPPRTPAP RAPVTHVVGDT GWGIPQGGAMFY+NW AGKRF VGD LVFNF TREDD+VR
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR

Query:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV
        VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT AAPGPM PPSSNPPPSTPRPAPVTHIVG SVGWT PPGGAAFYV
Subjt:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV

Query:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG
        NWTA KTFAVGDSLVFNFQT+VHDVERVPK SFD CSDDNEIG+TIESGPATVVLTTPGEHYYIS ENQDC+LGQKLAINVVA+RSTGPVTS+STPPTSG
Subjt:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG

Query:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        PT GGS    G G PNSS NTIAAALSATVFGL LSFF
Subjt:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

A0A5D3CJG7 Mucin-5AC6.7e-21486.99Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MA RFA VL FALFLFL+YSAAQTVYTVGDS+GWTVPANG  FY++WAA K F+ GDSL      G DEV+RVTKMGFD+CSDDNEIGDSIETGPATI L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR
         +PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSS+VPPRTPAP RAPVTHVVGDT GWGIPQGGAMFY+NW AGKRF VGD LVFNF TREDD+VR
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVR

Query:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV
        VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT AAPGPM PPSSNPPPSTPRPAPVTHIVG SVGWT PPGGAAFYV
Subjt:  VTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYV

Query:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG
        NWTA KTFAVGDSLVFNF+T+VHDVERVPK SFD CSDDNEIG+TIESGPATVVLTTPGEHYYIS ENQDC+LGQKLAINVVA+RSTGPVTS+STPPTSG
Subjt:  NWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSG

Query:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        PT GGS    G G PNSS NTIAAALSATVFGL LSFF
Subjt:  PTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

A0A6J1H8Q4 uncharacterized protein LOC1114611189.8e-16569.84Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MAA+ A VL FALFLFL +SAAQTV+ VGDS+GW +P   + FY  WA  K F  GDSLVFNFT   D+V RV K  FDMCSDDNEIGDSIE GPAT+ L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPG---PRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDD
        +T GE+YFISSED HCQQGQKLAINVTAAP      +PPS+  PP  P   RAPVTHVVGD  GW IPQGG MFY NWA GK F VGD L+FNF +  DD
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPG---PRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDD

Query:  VVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAA
        VVRVTK+SFDLC+DD +IG+DID  PA      PGEYYFIS+EDGHCQQGQKLAINVT AA GPM PPS+  PP  P+PAPVTH+VGD+VGWT P GGAA
Subjt:  VVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAA

Query:  FYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPP
        FY NW A KTFAVGDSLVFNF++EVHDV+RV K SFD CSDD+EIG++I+S PAT+VL  PGEHYYISTENQDC+LGQKLAINVVA+RS  P TS++T P
Subjt:  FYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPP

Query:  TSGPTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        +SGP +  SP G+G+G P SSANT+AAALSAT+FGL L+FF
Subjt:  TSGPTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

A0A6J1KVF8 uncharacterized protein LOC1114979494.1e-16369.84Show/hide
Query:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL
        MAA+ A VL  ALFLFL +SAAQTV+ VGDS+GW +P   + FY  WA  K F  GDSLVFNFT   D+V RV K  F++CSDDNEIGDSIE GPATI L
Subjt:  MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISL

Query:  STPGEYYFISSEDRHCQQGQKLAINVTAAPG---PRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDD
        ST GEYYFISSED HCQQGQKLAINVTAAP      +PPS+  PP  P   RAPVTHVVGD  GW IPQGG +FY NWA GK F VGD L+FNF    DD
Subjt:  STPGEYYFISSEDRHCQQGQKLAINVTAAPG---PRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDD

Query:  VVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAA
        VVRVTK+SFDLC+DD +IG+DID  PATI L   GEYYFIS+EDGHCQQGQKLAINVT AA GPM PPS+  PP  P+ APVTH+VGD+VGWT P GGAA
Subjt:  VVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT-AAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAA

Query:  FYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPP
        FY NW A  TFAVGDSLVFNF+ EVHDVERV K SFD CSDD+EIG++I+S PAT+VLT+PG HYYISTENQDC+LGQKLAINVVATRS  P TS++T P
Subjt:  FYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPP

Query:  TSGPTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF
        +SGP +  +P G+G+G P SSANT+AAALSAT+FGL L+FF
Subjt:  TSGPTAGGSPFGTGAGQPNSSANTIAAALSATVFGLALSFF

SwissProt top hitse value%identityAlignment
A0A0M4FTF3 Blue copper protein2.4e-1434.44Show/hide
Query:  AAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHC-QQG
        A QT Y VGD  GWT+    ++ Y+ WA  K F  GD+LVF +TKG   V +V + GF  C       + + +G   I+L+ PG+ ++I     HC +  
Subjt:  AAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHC-QQG

Query:  QKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAM
        QKLAI V  AP  ++PP     P    P++    +    T+ + +  GGA+
Subjt:  QKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAM

P00302 Stellacyanin1.7e-1744.76Show/hide
Query:  TVYTVGDSSGWTVPANGEIFYK-TWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHCQQGQKL
        TVYTVGDS+GW VP  G++ Y   WA++K F+ GD LVF + +    V +VT+  +  C+D   I  S  TG   I+L T G+ Y+I    +HC  GQK+
Subjt:  TVYTVGDSSGWTVPANGEIFYK-TWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHCQQGQKL

Query:  AINVT
         INVT
Subjt:  AINVT

P29602 Cucumber peeling cupredoxin1.6e-1843.18Show/hide
Query:  TVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARV-TKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHCQQGQKL
        TV+ VGD++GW+VP++   FY  WAA K F  GDSL FNF      V  + TK  FD C+  N   D   T P    L   G +YF+ +   HC  GQKL
Subjt:  TVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARV-TKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHCQQGQKL

Query:  AINVTAA-------PGPRSPPSSNVPPRTPAP
        +INV AA       P   SPPSS +PP    P
Subjt:  AINVTAA-------PGPRSPPSSNVPPRTPAP

P42849 Umecyanin7.3e-1641.51Show/hide
Query:  VGDSVGWTTPPGGAAFYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVV
        VG  + W   P    FY+ W   KTF VGD L F+F   +HDV  V K +FD C  +N I   + + P  ++L T G  YYI T    C++GQKL+INVV
Subjt:  VGDSVGWTTPPGGAAFYVNWTASKTFAVGDSLVFNFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVV

Query:  ATRSTG
             G
Subjt:  ATRSTG

Q07488 Blue copper protein6.9e-1439.47Show/hide
Query:  VGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT
        VGD   W  P     FY+ WA GK F VGD L F+F     DV  V++ +F+ C  +  I   +   P  I L T G  YFI     HC+ GQKL+I V 
Subjt:  VGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVT

Query:  AA-------PGPMAPPSSNPPPS----TPRPAPVTHIVGDSVGWTTPPGGAA
        AA       PG  A P+    PS    TP  A  T     S G TTP G AA
Subjt:  AA-------PGPMAPPSSNPPPS----TPRPAPVTHIVGDSVGWTTPPGGAA

Arabidopsis top hitse value%identityAlignment
AT1G45063.1 copper ion binding;electron carriers1.3e-2834.77Show/hide
Query:  FAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVT-KMGFDMCSDDNEIGDSI-ETGPATISLST
        F+FV+     LF   SA  TVY VGDS GWT  A   ++Y  W  DK  + GDSL+F +    ++V +V+  + ++ C  D+    ++  TG   ++ + 
Subjt:  FAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVT-KMGFDMCSDDNEIGDSI-ETGPATISLST

Query:  PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSP---PSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVV
        PG YYFI+S    C  GQ+L + V   P   SP   PS  +P R          + VGD+  WG+      FY NW+  K+F VGD L+F +    + V 
Subjt:  PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSP---PSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVV

Query:  RVTKQ-SFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINV
         ++    F  C+    I      G   I L  PG +YFIS+E GHC  G KL + V
Subjt:  RVTKQ-SFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINV

AT1G45063.2 copper ion binding;electron carriers1.3e-2834.77Show/hide
Query:  FAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVT-KMGFDMCSDDNEIGDSI-ETGPATISLST
        F+FV+     LF   SA  TVY VGDS GWT  A   ++Y  W  DK  + GDSL+F +    ++V +V+  + ++ C  D+    ++  TG   ++ + 
Subjt:  FAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVT-KMGFDMCSDDNEIGDSI-ETGPATISLST

Query:  PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSP---PSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVV
        PG YYFI+S    C  GQ+L + V   P   SP   PS  +P R          + VGD+  WG+      FY NW+  K+F VGD L+F +    + V 
Subjt:  PGEYYFISSEDRHCQQGQKLAINVTAAPGPRSP---PSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVV

Query:  RVTKQ-SFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINV
         ++    F  C+    I      G   I L  PG +YFIS+E GHC  G KL + V
Subjt:  RVTKQ-SFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINV

AT2G23990.1 early nodulin-like protein 113.7e-1534.29Show/hide
Query:  APVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISN-EDGHCQQGQ
        A + +V G    W +P+      ++WA   RF VGD L+F + ++ D V++VTK++++ CN    + +  D G  T+ L   G YYFIS    G+C +G+
Subjt:  APVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISN-EDGHCQQGQ

Query:  KLAINVTA----APGPMA-----PPSSNPPPSTPRPAPVT
        K+ + V +     PGP A     PP  +  P+ P PAP T
Subjt:  KLAINVTA----APGPMA-----PPSSNPPPSTPRPAPVT

AT2G25060.1 early nodulin-like protein 144.9e-1532.58Show/hide
Query:  VVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINV
        V G +  W IP   +  ++ WA   RF VGD +VF + + +D V+ VTK++++ CN    + +  D G   + L   G +YFIS  +GHC++GQKL++ V
Subjt:  VVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVRVTKQSFDLCNDDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINV

Query:  TAAPGPMAPPSSNPPPSTPRPAPVTHIVGDSV
         +    +  P+ +P      PA     +  SV
Subjt:  TAAPGPMAPPSSNPPPSTPRPAPVTHIVGDSV

AT2G26720.1 Cupredoxin superfamily protein1.3e-1538.64Show/hide
Query:  TVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHCQQGQKLA
        TV+ VG++ GWT+   G+  Y+ WA+ ++F  GD+LVF + K   +V  VT   F+MC     +    +TG  +ISL+ PG  +FI     HC++GQKL 
Subjt:  TVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISLSTPGEYYFISSEDRHCQQGQKLA

Query:  INVTAA---------PGP-RSPPSSNVPPRTP
        I+V  A         PGP RS  SS+ P  +P
Subjt:  INVTAA---------PGP-RSPPSSNVPPRTP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTAGATTCGCTTTTGTTCTGGCTTTTGCACTGTTTTTATTTCTTCAATACTCCGCCGCTCAGACTGTTTATACCGTCGGTGACTCCTCTGGCTGGACA
GTACCTGCAAACGGCGAGATTTTCTATAAAACCTGGGCTGCCGATAAAATTTTCTATGCCGGTGATTCGCTCGTTTTCAATTTCACGAAGGGTGAGGATGAAGTG
GCGAGAGTTACGAAAATGGGGTTTGATATGTGCAGTGACGACAATGAAATTGGAGACTCCATTGAAACAGGGCCGGCAACCATTTCTTTATCAACTCCGGGCGAG
TACTATTTCATCAGCTCTGAGGATAGGCACTGCCAACAAGGTCAAAAATTAGCCATTAATGTCACCGCCGCCCCTGGACCTCGGTCGCCGCCCTCTTCCAACGTT
CCTCCACGAACCCCAGCCCCAAGACGAGCTCCGGTGACCCATGTCGTCGGTGACACCGCTGGGTGGGGGATTCCTCAGGGCGGCGCCATGTTCTACTCCAACTGG
GCTGCCGGAAAACGCTTCGCTGTTGGCGATATTCTAGTGTTCAATTTCGGAACCCGCGAAGATGACGTAGTCAGAGTAACAAAACAATCGTTCGATTTATGTAAC
GATGACGGTGAGATCGGTGACGATATCGACCATGGGCCAGCAACCATCCCGCTGTTAACTCCCGGCGAGTATTACTTCATAAGCAATGAGGATGGGCACTGCCAG
CAAGGTCAGAAGTTAGCGATCAATGTCACAGCCGCCCCCGGACCAATGGCTCCACCTTCTTCAAATCCTCCACCATCTACCCCACGCCCTGCCCCCGTCACCCAT
ATCGTCGGAGACTCCGTAGGCTGGACCACTCCACCAGGCGGCGCCGCTTTCTACGTCAACTGGACTGCCAGCAAGACATTCGCCGTCGGTGATTCTCTTGTGTTC
AACTTCCAAACCGAAGTACACGACGTTGAGAGAGTACCGAAAGCATCCTTCGACAGATGTAGCGATGACAACGAGATCGGAGAGACCATTGAATCTGGGCCTGCA
ACAGTTGTTCTCACTACTCCCGGCGAACATTACTACATTAGTACTGAGAATCAAGATTGTCAACTGGGCCAAAAATTAGCCATCAATGTAGTGGCCACCAGATCC
ACTGGTCCTGTTACATCCGTTTCAACCCCTCCAACCTCCGGACCAACCGCTGGCGGTTCACCCTTTGGTACTGGTGCCGGACAACCCAACTCCTCTGCTAACACC
ATCGCCGCCGCTCTCTCCGCCACAGTCTTTGGCCTTGCTCTTAGCTTCTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTAGATTCGCTTTTGTTCTGGCTTTTGCACTGTTTTTATTTCTTCAATACTCCGCCGCTCAGACTGTTTATACCGTCGGTGACTCCTCTGGCTGGACA
GTACCTGCAAACGGCGAGATTTTCTATAAAACCTGGGCTGCCGATAAAATTTTCTATGCCGGTGATTCGCTCGTTTTCAATTTCACGAAGGGTGAGGATGAAGTG
GCGAGAGTTACGAAAATGGGGTTTGATATGTGCAGTGACGACAATGAAATTGGAGACTCCATTGAAACAGGGCCGGCAACCATTTCTTTATCAACTCCGGGCGAG
TACTATTTCATCAGCTCTGAGGATAGGCACTGCCAACAAGGTCAAAAATTAGCCATTAATGTCACCGCCGCCCCTGGACCTCGGTCGCCGCCCTCTTCCAACGTT
CCTCCACGAACCCCAGCCCCAAGACGAGCTCCGGTGACCCATGTCGTCGGTGACACCGCTGGGTGGGGGATTCCTCAGGGCGGCGCCATGTTCTACTCCAACTGG
GCTGCCGGAAAACGCTTCGCTGTTGGCGATATTCTAGTGTTCAATTTCGGAACCCGCGAAGATGACGTAGTCAGAGTAACAAAACAATCGTTCGATTTATGTAAC
GATGACGGTGAGATCGGTGACGATATCGACCATGGGCCAGCAACCATCCCGCTGTTAACTCCCGGCGAGTATTACTTCATAAGCAATGAGGATGGGCACTGCCAG
CAAGGTCAGAAGTTAGCGATCAATGTCACAGCCGCCCCCGGACCAATGGCTCCACCTTCTTCAAATCCTCCACCATCTACCCCACGCCCTGCCCCCGTCACCCAT
ATCGTCGGAGACTCCGTAGGCTGGACCACTCCACCAGGCGGCGCCGCTTTCTACGTCAACTGGACTGCCAGCAAGACATTCGCCGTCGGTGATTCTCTTGTGTTC
AACTTCCAAACCGAAGTACACGACGTTGAGAGAGTACCGAAAGCATCCTTCGACAGATGTAGCGATGACAACGAGATCGGAGAGACCATTGAATCTGGGCCTGCA
ACAGTTGTTCTCACTACTCCCGGCGAACATTACTACATTAGTACTGAGAATCAAGATTGTCAACTGGGCCAAAAATTAGCCATCAATGTAGTGGCCACCAGATCC
ACTGGTCCTGTTACATCCGTTTCAACCCCTCCAACCTCCGGACCAACCGCTGGCGGTTCACCCTTTGGTACTGGTGCCGGACAACCCAACTCCTCTGCTAACACC
ATCGCCGCCGCTCTCTCCGCCACAGTCTTTGGCCTTGCTCTTAGCTTCTTCTAA
Protein sequenceShow/hide protein sequence
MAARFAFVLAFALFLFLQYSAAQTVYTVGDSSGWTVPANGEIFYKTWAADKIFYAGDSLVFNFTKGEDEVARVTKMGFDMCSDDNEIGDSIETGPATISLSTPGE
YYFISSEDRHCQQGQKLAINVTAAPGPRSPPSSNVPPRTPAPRRAPVTHVVGDTAGWGIPQGGAMFYSNWAAGKRFAVGDILVFNFGTREDDVVRVTKQSFDLCN
DDGEIGDDIDHGPATIPLLTPGEYYFISNEDGHCQQGQKLAINVTAAPGPMAPPSSNPPPSTPRPAPVTHIVGDSVGWTTPPGGAAFYVNWTASKTFAVGDSLVF
NFQTEVHDVERVPKASFDRCSDDNEIGETIESGPATVVLTTPGEHYYISTENQDCQLGQKLAINVVATRSTGPVTSVSTPPTSGPTAGGSPFGTGAGQPNSSANT
IAAALSATVFGLALSFF