CuGenDBv2

CcUC01G010880 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC01G010880
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Pollen Ole e 1 allergen and extensin family protein
Genome location: CicolChr01:16782690..16784473
RNA-Seq Expression: CcUC01G010880
Synteny: CcUC01G010880
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
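
The genome location field follows a `sequence:start..end` pattern. A minimal sketch of parsing it and measuring the span is given below. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("CicolChr01:16782690..16784473")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans 1784 bp on CicolChr01.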


Homology
GenBank top hits (e value, % identity, alignment)
KAG6583745.1 hypothetical protein SDJN03_19677, partial [Cucurbita argyrosperma subsp. sororia]; e value: 6.8e-86; % identity: 81.87
Query:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH
        M+ RV SFH+S +LFWL  FFFF LFGHGFPM VE AE E P+SDLL RDDWRQIAGYGEERLSTVLVTGSVLCEACLHGD  QVHAWPI+GAMVGVNC 
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH

Query:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQA
        N GKNSKS +WV+GVTDEFGDF+IDIPSHLHA +SFE  CSIKILRTPKN  C+PAHLAG +QLQLSSFGGGIRTYTSG+LRLQHQTS PLQA
Subjt:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQA

XP_004139527.1 uncharacterized protein LOC101215830 [Cucumis sativus]; e value: 5.2e-102; % identity: 88.46
Query:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH
        M+NRVWSFHSSVNLFWLCI FFF+L GHGFPM +ESAEDE PVSDLLSRD WR+IAGYGEERLSTVLVTGSVLCEACLHGD PQVHAWPIKGAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH

Query:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG
        N GKNSKSSDWVHGVTDEFGDF+IDIPSHLHA +SFENVCSIKILRTPKN HC+PAHLAGRK LQLSSFGGGIRTYTSGVLRLQHQTS PLQA  NEGR 
Subjt:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG

Query:  SGGHQISW
         GG Q SW
Subjt:  SGGHQISW

XP_008463598.1 PREDICTED: uncharacterized protein LOC103501709 [Cucumis melo]; e value: 4.9e-100; % identity: 87.02
Query:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH
        MMNRVWSFH SVNLFWLCI FFF+L GHGFPM  ESAEDE PVSDLL+RD WR+IAGYGEERLSTVLVTGSVLCE+CLHGD PQVHAWPI+GAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH

Query:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG
        N GKNSKSSDWVHGVTDEFGDF+IDIPS LHA QSFENVCSIKILRTPKN HC+PAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS PLQA  NEGR 
Subjt:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG

Query:  SGGHQISW
          G Q SW
Subjt:  SGGHQISW

XP_022142515.1 uncharacterized protein LOC111012615 [Momordica charantia]; e value: 9.5e-88; % identity: 77.51
Query:  MMNRVWSFHSSVNLFWLCIFFFFF-LFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNC
        MM R   FH SV  FW   FFFFF L GHGFP+ VESAE + PVSDLLSRDDWRQ+AGYGEERLSTVLVTGSVLCEACLHGD PQ+H+WPI GAMVGV+C
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFF-LFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNC

Query:  HNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGR
        HNNG+NSKSSDW HGVTDEFGDFIIDIPSHLHA +SFE VCSIKIL+TPKNA C+PAH AGR+QLQLSSFGGGIRTYTSG L+LQH+TS PLQ   N  +
Subjt:  HNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGR

Query:  GSGGHQISW
        GSG  Q  W
Subjt:  GSGGHQISW

XP_038895694.1 uncharacterized protein LOC120083866 [Benincasa hispida]; e value: 1.3e-105; % identity: 90.34
Query:  MNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHN
        MNRVWS HSSVNLFW+CIFFFFFLFGHGFP+GVE+AE+E PVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGD PQVHAWPIKGAMVGVNCHN
Subjt:  MNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHN

Query:  NGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRGS
        NGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA +SFENVCSIKIL+TPKN HC+PAHLAG KQLQLSSFGGGIRTYTSGVLRLQHQTS PLQA TNEGRG 
Subjt:  NGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRGS

Query:  GGHQISW
           Q SW
Subjt:  GGHQISW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LT20 Uncharacterized protein; e value: 2.5e-102; % identity: 88.46
Query:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH
        M+NRVWSFHSSVNLFWLCI FFF+L GHGFPM +ESAEDE PVSDLLSRD WR+IAGYGEERLSTVLVTGSVLCEACLHGD PQVHAWPIKGAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH

Query:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG
        N GKNSKSSDWVHGVTDEFGDF+IDIPSHLHA +SFENVCSIKILRTPKN HC+PAHLAGRK LQLSSFGGGIRTYTSGVLRLQHQTS PLQA  NEGR 
Subjt:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG

Query:  SGGHQISW
         GG Q SW
Subjt:  SGGHQISW

A0A1S3CJM3 uncharacterized protein LOC103501709; e value: 2.3e-100; % identity: 87.02
Query:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH
        MMNRVWSFH SVNLFWLCI FFF+L GHGFPM  ESAEDE PVSDLL+RD WR+IAGYGEERLSTVLVTGSVLCE+CLHGD PQVHAWPI+GAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH

Query:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG
        N GKNSKSSDWVHGVTDEFGDF+IDIPS LHA QSFENVCSIKILRTPKN HC+PAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS PLQA  NEGR 
Subjt:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG

Query:  SGGHQISW
          G Q SW
Subjt:  SGGHQISW

A0A5D3E5G2 Pollen_Ole_e_I domain-containing protein; e value: 2.3e-100; % identity: 87.02
Query:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH
        MMNRVWSFH SVNLFWLCI FFF+L GHGFPM  ESAEDE PVSDLL+RD WR+IAGYGEERLSTVLVTGSVLCE+CLHGD PQVHAWPI+GAMVGVNCH
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH

Query:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG
        N GKNSKSSDWVHGVTDEFGDF+IDIPS LHA QSFENVCSIKILRTPKN HC+PAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS PLQA  NEGR 
Subjt:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRG

Query:  SGGHQISW
          G Q SW
Subjt:  SGGHQISW

A0A6J1CL55 uncharacterized protein LOC111012615; e value: 4.6e-88; % identity: 77.51
Query:  MMNRVWSFHSSVNLFWLCIFFFFF-LFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNC
        MM R   FH SV  FW   FFFFF L GHGFP+ VESAE + PVSDLLSRDDWRQ+AGYGEERLSTVLVTGSVLCEACLHGD PQ+H+WPI GAMVGV+C
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFF-LFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNC

Query:  HNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGR
        HNNG+NSKSSDW HGVTDEFGDFIIDIPSHLHA +SFE VCSIKIL+TPKNA C+PAH AGR+QLQLSSFGGGIRTYTSG L+LQH+TS PLQ   N  +
Subjt:  HNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGR

Query:  GSGGHQISW
        GSG  Q  W
Subjt:  GSGGHQISW

A0A6J1EGW2 uncharacterized protein LOC111434195; e value: 4.3e-86; % identity: 81.87
Query:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH
        M+ RV SFH+SV+LFWL  FFFF LFGHGFPM VE AE E P+SDLL RDDWRQIAGYGEERLSTVLVTGSVLCEACLHGD  QVHAWPI+GAMVGVNC 
Subjt:  MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCH

Query:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHAQ-SFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQA
        N GKNSKS +WV+GVTDEFGDF+IDIPSHLHA  SFE  CSIKILRTPKN  C+PAHLAG +QLQLSS GGGIRTYTSG+LRLQHQTS PLQA
Subjt:  NNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHAQ-SFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G40113.1 Pollen Ole e 1 allergen and extensin family protein; e value: 3.7e-21; % identity: 34.78
Query:  CIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTD
        C    FF F   F          A  S L S  +   +AGYGE +LS+V++TGS+LC              P+ GA V + CH   K  + S W+  VT+
Subjt:  CIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTD

Query:  EFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAH-CQPAHLAG--RKQLQLSSFGGGIRTYTSGVLRL----QHQTSGPLQAS
        +FG+F+I +PSHLHA    E  C +K +  PK+ H C          K ++L S   G R YTSG ++L      +TS P +A+
Subjt:  EFGDFIIDIPSHLHA-QSFENVCSIKILRTPKNAH-CQPAHLAG--RKQLQLSSFGGGIRTYTSGVLRL----QHQTSGPLQAS

AT4G17215.1 Pollen Ole e 1 allergen and extensin family protein; e value: 4.3e-22; % identity: 40.28
Query:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK
        SD  SRD+  ++AGYGE++LS+VL+T S+L  +          + PI GA +G  CH    + + S W+  VT+E G F+ID+PSHLHA    +  C IK
Subjt:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK

Query:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQT
         L  PK   C      G   +QL S   G R YT+G + LQ  T
Subjt:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQT

AT5G15780.1 Pollen Ole e 1 allergen and extensin family protein; e value: 7.4e-06; % identity: 28.57
Query:  STVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTDEFGDFIIDIPSHL--HAQSFENVCSIKILRT--PKNAHCQPAHLAG
        S+ +V G+V C+ C +G   +     I GA+V V C +  +NSK S      TD+ G+F + +P  +  H +  +  CS+K+L +  P  +    A  + 
Subjt:  STVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTDEFGDFIIDIPSHL--HAQSFENVCSIKILRT--PKNAHCQPAHLAG

Query:  RKQLQLSSFGGGIRTYTSG
         K+L+ +  G   R +++G
Subjt:  RKQLQLSSFGGGIRTYTSG

AT5G47635.1 Pollen Ole e 1 allergen and extensin family protein; e value: 2.9e-26; % identity: 41.38
Query:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK
        ++L +R +  ++AGYGE++LS+V++TGS+LC+       P +H+ PI GA V + CH   K  + S W+  VTDE G+F ID+PS LHA    EN C IK
Subjt:  SDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSDWVHGVTDEFGDFIIDIPSHLHA-QSFENVCSIK

Query:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS
         +  P+   C        K ++L S   G R YTSG +RLQ  +S
Subjt:  ILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTS


Sequences
CDS sequence
ATGATGAACAGAGTTTGGAGCTTCCACAGTTCTGTCAACCTATTTTGGCTTTGTATCTTCTTCTTCTTCTTCCTTTTTGGCCATGGATTTCCGATGGGAGTCGAATCTGC
TGAGGACGAGGCCCCAGTATCCGACCTTTTGAGTCGAGATGATTGGAGGCAGATAGCTGGATATGGTGAGGAGAGATTGTCCACAGTTTTGGTCACAGGCTCTGTTCTTT
GTGAAGCTTGTTTGCATGGTGATGGACCTCAAGTTCATGCATGGCCTATTAAAGGTGCCATGGTGGGTGTGAATTGCCACAACAATGGAAAGAACAGCAAATCTTCTGAT
TGGGTACATGGAGTCACTGATGAATTTGGAGACTTTATTATTGATATTCCATCCCATCTTCATGCCCAAAGCTTCGAAAATGTCTGTTCCATCAAGATTCTTCGGACACC
AAAAAACGCACACTGCCAACCTGCTCATTTAGCTGGCAGGAAGCAGCTGCAATTATCGTCGTTCGGAGGCGGCATCCGTACATATACTTCCGGCGTCCTCAGGCTGCAGC
ACCAAACATCTGGACCTCTGCAAGCTAGTACAAATGAGGGGAGGGGGAGCGGTGGCCATCAGATCTCATGGTAG
mRNA sequence
ATTTTTTTTGTAATTATTTTTAAAAAGATAAGGAGGGTCAATTTCCTCTCATATCAAGAAAGCAGCCATGTGGTCAAATTCACATAAAATACTTTTAAAAACAAAGCATC
TTTTTCATTTGGTAAAAGGCCAACTTAAGAGGGAAGATGGCAAAAATCCAACATAGAGAGAAACTGCCAAGTCAATGTCAAACCCAGAAAGGGTATTTCTGTAAATTCGA
CGTGTAGAGAAACTGCAAGTAATCTCTTTTGGTATACATTATATAAACCAATGAATATGTATCTAGAAATGGTGTCAGGTTGATTGTGTGATCATATTGATAAAGTTTCA
GTTTCCTAAAACAAAAGCAAGAACATGATGAACAGAGTTTGGAGCTTCCACAGTTCTGTCAACCTATTTTGGCTTTGTATCTTCTTCTTCTTCTTCCTTTTTGGCCATGG
ATTTCCGATGGGAGTCGAATCTGCTGAGGACGAGGCCCCAGTATCCGACCTTTTGAGTCGAGATGATTGGAGGCAGATAGCTGGATATGGTGAGGAGAGATTGTCCACAG
TTTTGGTCACAGGCTCTGTTCTTTGTGAAGCTTGTTTGCATGGTGATGGACCTCAAGTTCATGCATGGCCTATTAAAGGTGCCATGGTGGGTGTGAATTGCCACAACAAT
GGAAAGAACAGCAAATCTTCTGATTGGGTACATGGAGTCACTGATGAATTTGGAGACTTTATTATTGATATTCCATCCCATCTTCATGCCCAAAGCTTCGAAAATGTCTG
TTCCATCAAGATTCTTCGGACACCAAAAAACGCACACTGCCAACCTGCTCATTTAGCTGGCAGGAAGCAGCTGCAATTATCGTCGTTCGGAGGCGGCATCCGTACATATA
CTTCCGGCGTCCTCAGGCTGCAGCACCAAACATCTGGACCTCTGCAAGCTAGTACAAATGAGGGGAGGGGGAGCGGTGGCCATCAGATCTCATGGTAGACTGAAGCCTAG
TAAAGTGAATGAATTGGTTGCAGACCTTAATATATGGCTCATGTAGATGGCGCTGAAGTCACAACGATGAGTTATTTGTTCGTAGATAGCACGATAAATCAAGTTTGCAT
TGATATGTATTGTTTGTATAAAAACATCTAGATTATGTAATACACCAATCTGTTTCATACCATACATGACAGATACATCGACTGCATCAGTTGAATTTACTCGCAAGGTT
GTGTAATTGTACTCATCAAAAATCAATTGGGAGCCAAAGAGATTATGTGTTCTCTAACAATCATTGCATACCTGATGGATGATCAAATCCCACACAAAAAATGAACTAAC
ATAGAAAGATCAGGCTATATAGTGAACGAGGAAACCTTAGTGACGAATGGGAGTTGCACAATGATTAAAGACAAAAAAACTGAAGTTGCTCTCAGAGTTTCTCCTCTTTC
TTTTCCTCTACTTGAGAAGTTATGTATCGTGCTACATCTGCACAGCACGTAAGCTTATCTGCCTGTTCATCTGGGATTTCAATGGAGAATTCTTGTTCAAAAGCCATAAC
AAGCTCCACCCTGTCCAAGCTGTCCAGGCTTAAGTCCTTCTGGAAATCAGCCGTTTCAGTAACCTGCTCAGCTCAACACGACATTAAATGTTAAGAAATATATTGTATTG
TATTTAATAGAATCCGATAATTAGGAGGGAATCCACAG
Protein sequence
MMNRVWSFHSSVNLFWLCIFFFFFLFGHGFPMGVESAEDEAPVSDLLSRDDWRQIAGYGEERLSTVLVTGSVLCEACLHGDGPQVHAWPIKGAMVGVNCHNNGKNSKSSD
WVHGVTDEFGDFIIDIPSHLHAQSFENVCSIKILRTPKNAHCQPAHLAGRKQLQLSSFGGGIRTYTSGVLRLQHQTSGPLQASTNEGRGSGGHQISW
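
The CDS and protein sequences listed here can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and an in-frame CDS that begins at the start codon. `translate` is a hypothetical helper written for illustration, not the method CuGenDB used to derive the protein. The example translates only the first 30 nt of the CDS; pasting in the full CDS works the same way, since whitespace and line breaks are removed before translation.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS sequence listed above.
print(translate("ATGATGAACAGAGTTTGGAGCTTCCACAGT"))  # MMNRVWSFHS
```

The printed residues match the first ten residues of the protein sequence, MMNRVWSFHS.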