; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G10980 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G10980
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPollen Ole e 1 allergen and extensin family protein
Genome locationClcChr01:16059946..16061463
RNA-Seq ExpressionClc01G10980
SyntenyClc01G10980
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583745.1 hypothetical protein SDJN03_19677, partial [Cucurbita argyrosperma subsp. sororia]7.3e-8881.96Show/hide
Query:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
        M+ RV SFH+S +LFWL  FFFF+LFGHGFPM VE AE E P+SDLL RDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDE QVHAWPI+GAMVGVNC 
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH

Query:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQAC
        + GKNSKS +WV+GVTDEFGDF+IDIPSHLHA +SFE  CSIKILRTPKN  C+PAHLAG +QLQLSSFGGGIRTYTSG+LRLQHQTS PLQAC
Subjt:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQAC

XP_004139527.1 uncharacterized protein LOC101215830 [Cucumis sativus]7.2e-10488.94Show/hide
Query:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
        M+NRVWSFHSSVNLFWLCIFFF+ L GHGFPM +ESAEDE PVSDLLSRD WR+IAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH

Query:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG
        + GKNSKSSDWVHGVTDEFGDF+IDIPSHLHA +SFENVCSIKILRTPKN HC+PAHLAGRK LQLSSFGGGIRTYTSGVLRLQHQTS PLQAC NEGR 
Subjt:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG

Query:  SGGHQISW
         GG Q SW
Subjt:  SGGHQISW

XP_008463598.1 PREDICTED: uncharacterized protein LOC103501709 [Cucumis melo]6.8e-10287.5Show/hide
Query:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
        MMNRVWSFH SVNLFWLCIFFF+ L GHGFPM  ESAEDE PVSDLL+RD WR+IAGYGEERLSTVLVTGSVLCE+CLHGDEPQVHAWPI+GAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH

Query:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG
        + GKNSKSSDWVHGVTDEFGDF+IDIPS LHA QSFENVCSIKILRTPKN HC+PAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS PLQAC NEGR 
Subjt:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG

Query:  SGGHQISW
          G Q SW
Subjt:  SGGHQISW

XP_022142515.1 uncharacterized protein LOC111012615 [Momordica charantia]3.9e-8977.51Show/hide
Query:  MMNRVWSFHSSVNLFWLC-IFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNC
        MM R   FH SV  FW    FFFF+L GHGFP+ VESAE + PVSDLLSRDDWRQ+AGYGEERLSTVLVTGSVLCEACLHGDEPQ+H+WPI GAMVGV+C
Subjt:  MMNRVWSFHSSVNLFWLC-IFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNC

Query:  HDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGR
        H+NG+NSKSSDW HGVTDEFGDFIIDIPSHLHA +SFE VCSIKIL+TPKNA C+PAH AGR+QLQLSSFGGGIRTYTSG L+LQH+TS PLQ C N  +
Subjt:  HDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGR

Query:  GSGGHQISW
        GSG  Q  W
Subjt:  GSGGHQISW

XP_038895694.1 uncharacterized protein LOC120083866 [Benincasa hispida]7.0e-10790.34Show/hide
Query:  MNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHD
        MNRVWS HSSVNLFW+CIFFFF LFGHGFP+GVE+AE+E PVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH+
Subjt:  MNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHD

Query:  NGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRGS
        NGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA +SFENVCSIKIL+TPKN HC+PAHLAG KQLQLSSFGGGIRTYTSGVLRLQHQTS PLQACTNEGRG 
Subjt:  NGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRGS

Query:  GGHQISW
           Q SW
Subjt:  GGHQISW

TrEMBL top hitse value%identityAlignment
A0A0A0LT20 Uncharacterized protein3.5e-10488.94Show/hide
Query:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
        M+NRVWSFHSSVNLFWLCIFFF+ L GHGFPM +ESAEDE PVSDLLSRD WR+IAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH

Query:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG
        + GKNSKSSDWVHGVTDEFGDF+IDIPSHLHA +SFENVCSIKILRTPKN HC+PAHLAGRK LQLSSFGGGIRTYTSGVLRLQHQTS PLQAC NEGR 
Subjt:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG

Query:  SGGHQISW
         GG Q SW
Subjt:  SGGHQISW

A0A1S3CJM3 uncharacterized protein LOC1035017093.3e-10287.5Show/hide
Query:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
        MMNRVWSFH SVNLFWLCIFFF+ L GHGFPM  ESAEDE PVSDLL+RD WR+IAGYGEERLSTVLVTGSVLCE+CLHGDEPQVHAWPI+GAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH

Query:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG
        + GKNSKSSDWVHGVTDEFGDF+IDIPS LHA QSFENVCSIKILRTPKN HC+PAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS PLQAC NEGR 
Subjt:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG

Query:  SGGHQISW
          G Q SW
Subjt:  SGGHQISW

A0A5D3E5G2 Pollen_Ole_e_I domain-containing protein3.3e-10287.5Show/hide
Query:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
        MMNRVWSFH SVNLFWLCIFFF+ L GHGFPM  ESAEDE PVSDLL+RD WR+IAGYGEERLSTVLVTGSVLCE+CLHGDEPQVHAWPI+GAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH

Query:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG
        + GKNSKSSDWVHGVTDEFGDF+IDIPS LHA QSFENVCSIKILRTPKN HC+PAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS PLQAC NEGR 
Subjt:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRG

Query:  SGGHQISW
          G Q SW
Subjt:  SGGHQISW

A0A6J1CL55 uncharacterized protein LOC1110126151.9e-8977.51Show/hide
Query:  MMNRVWSFHSSVNLFWLC-IFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNC
        MM R   FH SV  FW    FFFF+L GHGFP+ VESAE + PVSDLLSRDDWRQ+AGYGEERLSTVLVTGSVLCEACLHGDEPQ+H+WPI GAMVGV+C
Subjt:  MMNRVWSFHSSVNLFWLC-IFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNC

Query:  HDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGR
        H+NG+NSKSSDW HGVTDEFGDFIIDIPSHLHA +SFE VCSIKIL+TPKNA C+PAH AGR+QLQLSSFGGGIRTYTSG L+LQH+TS PLQ C N  +
Subjt:  HDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGR

Query:  GSGGHQISW
        GSG  Q  W
Subjt:  GSGGHQISW

A0A6J1EGW2 uncharacterized protein LOC1114341954.6e-8881.96Show/hide
Query:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH
        M+ RV SFH+SV+LFWL  FFFF+LFGHGFPM VE AE E P+SDLL RDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDE QVHAWPI+GAMVGVNC 
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCH

Query:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHAQ-SFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQAC
        + GKNSKS +WV+GVTDEFGDF+IDIPSHLHA  SFE  CSIKILRTPKN  C+PAHLAG +QLQLSS GGGIRTYTSG+LRLQHQTS PLQAC
Subjt:  DNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHAQ-SFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQAC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G40113.1 Pollen Ole e 1 allergen and extensin family protein1.8e-2034.43Show/hide
Query:  CIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTD
        C    F  F   F          A  S L S  +   +AGYGE +LS+V++TGS+LC              P+ GA V + CH   K  + S W+  VT+
Subjt:  CIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTD

Query:  EFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAH-CQPAHLAG--RKQLQLSSFGGGIRTYTSGVLRL----QHQTSGPLQA
        +FG+F+I +PSHLHA    E  C +K +  PK+ H C          K ++L S   G R YTSG ++L      +TS P +A
Subjt:  EFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAH-CQPAHLAG--RKQLQLSSFGGGIRTYTSGVLRL----QHQTSGPLQA

AT4G17215.1 Pollen Ole e 1 allergen and extensin family protein4.3e-2240.28Show/hide
Query:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK
        SD  SRD+  ++AGYGE++LS+VL+T S+L  +          + PI GA +G  CH    + + S W+  VT+E G F+ID+PSHLHA    +  C IK
Subjt:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK

Query:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQT
         L  PK   C      G   +QL S   G R YT+G + LQ  T
Subjt:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQT

AT5G15780.1 Pollen Ole e 1 allergen and extensin family protein1.5e-0629.41Show/hide
Query:  STVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTDEFGDFIIDIPSHL--HAQSFENVCSIKILRT--PKNAHCQPAHLAG
        S+ +V G+V C+ C +G   +     I GA+V V C D  +NSK S      TD+ G+F + +P  +  H +  +  CS+K+L +  P  +    A  + 
Subjt:  STVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTDEFGDFIIDIPSHL--HAQSFENVCSIKILRT--PKNAHCQPAHLAG

Query:  RKQLQLSSFGGGIRTYTSG
         K+L+ +  G   R +++G
Subjt:  RKQLQLSSFGGGIRTYTSG

AT5G47635.1 Pollen Ole e 1 allergen and extensin family protein1.7e-2641.38Show/hide
Query:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK
        ++L +R +  ++AGYGE++LS+V++TGS+LC+       P +H+ PI GA V + CH   K  + S W+  VTDE G+F ID+PS LHA    EN C IK
Subjt:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK

Query:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS
         +  P+   C        K ++L S   G R YTSG +RLQ  +S
Subjt:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAACAGAGTTTGGAGCTTCCACAGTTCTGTCAACCTATTTTGGCTTTGTATCTTCTTCTTCTTCGTCCTTTTTGGCCATGGATTTCCGATGGGAGTCGAATCTGC
TGAGGACGAGGCCCCAGTATCTGACCTTTTGAGTCGAGATGATTGGAGGCAGATAGCTGGATATGGTGAGGAGAGATTGTCCACAGTTTTGGTCACAGGCTCTGTTCTTT
GTGAAGCTTGTTTGCATGGTGATGAACCTCAAGTTCATGCATGGCCTATTAAAGGTGCCATGGTGGGTGTGAATTGCCACGACAATGGAAAGAACAGCAAATCTTCTGAT
TGGGTACATGGAGTCACTGATGAATTTGGAGACTTTATTATTGATATTCCATCCCATCTTCATGCCCAAAGCTTCGAAAATGTCTGTTCCATCAAGATTCTTCGGACACC
AAAAAACGCACACTGCCAACCTGCTCATTTAGCTGGCAGGAAGCAGCTGCAATTATCGTCATTCGGAGGTGGCATCCGTACATATACTTCTGGCGTCCTCAGGCTGCAGC
ACCAAACATCTGGACCTCTGCAAGCTTGTACAAATGAGGGGAGAGGGAGTGGTGGCCATCAGATCTCATGGTAG
mRNA sequenceShow/hide mRNA sequence
CTCATATCAAGAAAGCAGCCATGTGGTCAAATTCACATAAAATACTTTTAAAAACAAAGCATCTTTTTCATTTGGTAAAAGGCCAACTTACGAGGGAAGATGGCAAAGAT
CCAACATAGAGAGAAACTGCCAAGTCAATGTCAAACCCAGAAAGGGTATTTCTGTAAATTCGACGTGTAGAGAAACTGCAAGTAATCTCTTTCGGTATACATTATATAAA
CCAATGAATATGTATCTAGAAATGGTGTCAGGTTGATTTGTGTGATCATATTGATAAAGTTTCAGTTTCCTAAAACAAAAGCAAGAACATGATGAACAGAGTTTGGAGCT
TCCACAGTTCTGTCAACCTATTTTGGCTTTGTATCTTCTTCTTCTTCGTCCTTTTTGGCCATGGATTTCCGATGGGAGTCGAATCTGCTGAGGACGAGGCCCCAGTATCT
GACCTTTTGAGTCGAGATGATTGGAGGCAGATAGCTGGATATGGTGAGGAGAGATTGTCCACAGTTTTGGTCACAGGCTCTGTTCTTTGTGAAGCTTGTTTGCATGGTGA
TGAACCTCAAGTTCATGCATGGCCTATTAAAGGTGCCATGGTGGGTGTGAATTGCCACGACAATGGAAAGAACAGCAAATCTTCTGATTGGGTACATGGAGTCACTGATG
AATTTGGAGACTTTATTATTGATATTCCATCCCATCTTCATGCCCAAAGCTTCGAAAATGTCTGTTCCATCAAGATTCTTCGGACACCAAAAAACGCACACTGCCAACCT
GCTCATTTAGCTGGCAGGAAGCAGCTGCAATTATCGTCATTCGGAGGTGGCATCCGTACATATACTTCTGGCGTCCTCAGGCTGCAGCACCAAACATCTGGACCTCTGCA
AGCTTGTACAAATGAGGGGAGAGGGAGTGGTGGCCATCAGATCTCATGGTAGACTGAAGCCTAGTAAAGTGAATGAATTGGTCGCAGACCTTAATATATGGCTCATGTAG
ATGGCGCTGAAGTCACAACGATGAGTTATTTGTTCGTAGTTAGCACGATAAATCAAGTTTGCATTGATATGTATTGTTTGTGTATAAAAACATCTAGATTATGTAATACA
CCAATCTGTTTCATACCATACATGACAGATACCTCGACTGCATCGAGTCTTCCAGACAGTTGAATTTACTCGCAAGGTTGTGTAATTGTACTCATCAAAAATCAATTGGG
AGCCAAAGAGATTATGTGTTCTCTAACAATCATTGCATAGCCTGATGGATGATCAAATCCCACACAAAAAATGAACTACATAGAAAGATCAGGCTATATAGTGAACGAGG
AAACCTTAGTGACGAATGGGAGTTACACAATGATTAAAGACGAAAAAACTGAAGTTGCTCTCAGAGTTTCTCCTCTTTCTTTTCCTCTACTTGAGAAGTTAT
Protein sequenceShow/hide protein sequence
MMNRVWSFHSSVNLFWLCIFFFFVLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDEPQVHAWPIKGAMVGVNCHDNGKNSKSSD
WVHGVTDEFGDFIIDIPSHLHAQSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQACTNEGRGSGGHQISW