; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000939 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000939
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionphotosystem II core complex proteins psbY, chloroplastic
Genome locationscaffold36:375008..375767
RNA-Seq ExpressionMS000939
SyntenyMS000939
Gene Ontology termsGO:0015979 - photosynthesis (biological process)
GO:0034219 - carbohydrate transmembrane transport (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009523 - photosystem II (cellular component)
GO:0009534 - chloroplast thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030145 - manganese ion binding (molecular function)
GO:0051119 - sugar transmembrane transporter activity (molecular function)
InterPro domainsIPR009388 - Photosystem II PsbY
IPR038760 - Photosystem II PsbY, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029231.1 Photosystem II core complex proteins psbY, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]7.5e-8788.61Show/hide
Query:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL
        MAATMATMAVLSVKCTSINS++NH TPK I  PISLLSLQNLPK LISSK++ + NLSTFLSSTAIAGAVFSA SSSDPAFAAQQIA+IAA+GDNRG+AL
Subjt:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGL AS FMYAPDASA+E+AMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

XP_004142959.3 photosystem II core complex proteins psbY, chloroplastic [Cucumis sativus]2.8e-9781.96Show/hide
Query:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI
        W +L L+SP     LI+ ++ LPKPNL  FILHL  TCQ P   ST TSKSTSMAATMATMAVLSVKCTSINS+K H T K I  PISLLSLQNLPK LI
Subjt:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI

Query:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM
        SSK++QN NLSTFLSSTAIAGAVF+   SSDPAFAAQQIAEIAA+GDNRGLALLLPLIPA+AWVLFNILQPALNQINRMRSDKGVIIGLGLGGL AS FM
Subjt:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM

Query:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE
        YAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMRSE
Subjt:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE

XP_008444395.1 PREDICTED: photosystem II core complex proteins psbY, chloroplastic [Cucumis melo]5.2e-9681.18Show/hide
Query:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI
        W +L L SP     LI+ ++ LPKPN   FILHL  TCQ P   STETSKSTSMAATMATMAVLSVKCTSINS+K H T K I  PISLLSLQNLPK LI
Subjt:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI

Query:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM
        SSK+++N NLSTFLSSTAIAGAVF+   +SDPAFAAQQIAEIAA+GDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGL AS FM
Subjt:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM

Query:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE
        YAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMRS+
Subjt:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE

XP_022131700.1 photosystem II core complex proteins psbY, chloroplastic [Momordica charantia]2.5e-9899.5Show/hide
Query:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL
        MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASA EIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

XP_038886078.1 LOW QUALITY PROTEIN: photosystem II core complex proteins psbY, chloroplastic-like [Benincasa hispida]7.5e-10384.38Show/hide
Query:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP----STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKEL
        W +L L SP     L SL   LPKPNLS FILH+ PTCQRP    STETSKSTSMAATMATMAVLSVKCTSINS+KNH TPKPI  PISLLSLQNLPK L
Subjt:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP----STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKEL

Query:  ISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEF
        +SSK++QN NLST LSSTAIAGAVFS  SSSDPAFAAQQIAEIAA+GDNRGLALLLPLIPAIAWVLFNILQPALNQIN+MRSDKGVIIGLGLGGL AS F
Subjt:  ISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEF

Query:  MYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE
        MYAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMRSE
Subjt:  MYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE

TrEMBL top hitse value%identityAlignment
A0A0A0LKA1 Uncharacterized protein1.3e-9781.96Show/hide
Query:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI
        W +L L+SP     LI+ ++ LPKPNL  FILHL  TCQ P   ST TSKSTSMAATMATMAVLSVKCTSINS+K H T K I  PISLLSLQNLPK LI
Subjt:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI

Query:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM
        SSK++QN NLSTFLSSTAIAGAVF+   SSDPAFAAQQIAEIAA+GDNRGLALLLPLIPA+AWVLFNILQPALNQINRMRSDKGVIIGLGLGGL AS FM
Subjt:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM

Query:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE
        YAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMRSE
Subjt:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE

A0A1S3B9R1 photosystem II core complex proteins psbY, chloroplastic2.5e-9681.18Show/hide
Query:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI
        W +L L SP     LI+ ++ LPKPN   FILHL  TCQ P   STETSKSTSMAATMATMAVLSVKCTSINS+K H T K I  PISLLSLQNLPK LI
Subjt:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI

Query:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM
        SSK+++N NLSTFLSSTAIAGAVF+   +SDPAFAAQQIAEIAA+GDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGL AS FM
Subjt:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM

Query:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE
        YAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMRS+
Subjt:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE

A0A5A7UYJ2 Photosystem II core complex proteins psbY2.5e-9681.18Show/hide
Query:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI
        W +L L SP     LI+ ++ LPKPN   FILHL  TCQ P   STETSKSTSMAATMATMAVLSVKCTSINS+K H T K I  PISLLSLQNLPK LI
Subjt:  WHFLDLASPLFFHLLISLHLNLPKPNLSLFILHL-PTCQRP---STETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELI

Query:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM
        SSK+++N NLSTFLSSTAIAGAVF+   +SDPAFAAQQIAEIAA+GDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGL AS FM
Subjt:  SSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFM

Query:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE
        YAPDASA+EIAMIADASSSD+RGQLLLFV++PAILWVLYNILQPALNQLNRMRS+
Subjt:  YAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMRSE

A0A6J1BQF3 photosystem II core complex proteins psbY, chloroplastic1.2e-9899.5Show/hide
Query:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL
        MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR
        LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASA EIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

F6GVJ9 Uncharacterized protein8.7e-7377.23Show/hide
Query:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL
        MAAT+ATMA+L+ KC SINS KN    KP   PISLLS+QNLPK L + K+S+N NLST L+ TAIAGA+FS  SS DPA AAQQIAEI A+GDNRGLAL
Subjt:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLAL

Query:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR
        LLP+IPAIAWVLFNILQPALNQ+N+MRS KGVIIGLGLGGLAAS FM  P ASA+EIA +ADA+SSD RGQLLLFV++PAILWVLYNILQPALNQLNRMR
Subjt:  LLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLNRMR

Query:  SE
        SE
Subjt:  SE

SwissProt top hitse value%identityAlignment
O49347 Photosystem II core complex proteins psbY, chloroplastic8.7e-4654.9Show/hide
Query:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIA---AEGDNRG
        MAA MAT    + KC S+N S     PK        L  Q   K  IS      PN+S  ++STA+AGAVFS+ S S+PA A QQIA++A   A  DNRG
Subjt:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIA---AEGDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLN
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+GG  A+  +  P   A   A  A A+SSD+RGQLLL V++PA+LWVLYNILQPALNQ+N
Subjt:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLN

Query:  RMRS
        +MRS
Subjt:  RMRS

P80470 Photosystem II core complex proteins psbY, chloroplastic1.0e-4658.37Show/hide
Query:  MAATMA-TMAVLSVKCTSINSSK-NHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAE---GDN
        MAATMA TMAVL+ KC ++N++K   T+PKP + PISL      P  L +SK      LS  +++ AIAGAVF+   S DPAFA QQ+A+IAAE    DN
Subjt:  MAATMA-TMAVLSVKCTSINSSK-NHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAE---GDN

Query:  RGLALLLPLIPAIAWVLFNILQPALNQINRMRSD-KGVIIGLGLGGLAASEFMYA-PDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPAL
        RGLALLLP+IPA+ WVLFNILQPALNQIN+MR++ K  I+GLGL GLA S  + A P+A A    +   A  SD RG LLL V+ PAI WVL+NILQPAL
Subjt:  RGLALLLPLIPAIAWVLFNILQPALNQINRMRSD-KGVIIGLGLGGLAASEFMYA-PDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPAL

Query:  NQLNRMRSE
        NQLN+MRS+
Subjt:  NQLNRMRSE

Arabidopsis top hitse value%identityAlignment
AT1G67740.1 photosystem II BY6.2e-4754.9Show/hide
Query:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIA---AEGDNRG
        MAA MAT    + KC S+N S     PK        L  Q   K  IS      PN+S  ++STA+AGAVFS+ S S+PA A QQIA++A   A  DNRG
Subjt:  MAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLSTFLSSTAIAGAVFSAFSSSDPAFAAQQIAEIA---AEGDNRG

Query:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLN
        LALLLP++PAIAWVL+NILQPA+NQ+N+MR  KG+++GLG+GG  A+  +  P   A   A  A A+SSD+RGQLLL V++PA+LWVLYNILQPALNQ+N
Subjt:  LALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDARGQLLLFVISPAILWVLYNILQPALNQLN

Query:  RMRS
        +MRS
Subjt:  RMRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTGGTTGGCATTTCCTTGACTTGGCTTCACCTCTGTTCTTCCATCTTCTCATCTCATTACACCTAAATTTGCCAAAACCCAACCTTTCCCTCTTCATTCTGCATCTCCC
CACCTGCCAAAGGCCATCAACAGAAACAAGCAAGTCAACTTCAATGGCAGCCACCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCAGCAAAA
ACCACACCACCCCAAAGCCTATCACCAACCCCATCTCTCTCCTCTCTCTTCAGAACCTTCCTAAAGAACTAATCTCGTCAAAAGCTAGTCAAAATCCCAACTTATCAACT
TTCCTCTCCAGCACAGCCATTGCTGGAGCTGTCTTCTCAGCCTTCAGCTCATCGGATCCTGCCTTCGCAGCCCAGCAAATTGCGGAGATTGCTGCCGAAGGCGACAACCG
CGGCTTAGCCCTTTTGCTACCCCTTATTCCGGCCATAGCATGGGTTTTGTTCAACATACTACAGCCAGCACTCAACCAGATCAACAGAATGCGCAGTGACAAGGGTGTGA
TAATTGGGTTGGGATTAGGGGGGCTGGCTGCATCAGAGTTTATGTACGCACCTGATGCTTCTGCCACTGAGATCGCCATGATTGCTGATGCTTCCTCAAGTGATGCCAGG
GGGCAGCTTCTGCTGTTTGTCATATCACCCGCCATTCTTTGGGTGCTGTACAACATTCTCCAGCCAGCTTTAAATCAACTCAACAGGATGAGGTCCGAG
mRNA sequenceShow/hide mRNA sequence
ATTGGTTGGCATTTCCTTGACTTGGCTTCACCTCTGTTCTTCCATCTTCTCATCTCATTACACCTAAATTTGCCAAAACCCAACCTTTCCCTCTTCATTCTGCATCTCCC
CACCTGCCAAAGGCCATCAACAGAAACAAGCAAGTCAACTTCAATGGCAGCCACCATGGCTACAATGGCAGTGCTCAGTGTCAAGTGCACAAGCATCAACTCCAGCAAAA
ACCACACCACCCCAAAGCCTATCACCAACCCCATCTCTCTCCTCTCTCTTCAGAACCTTCCTAAAGAACTAATCTCGTCAAAAGCTAGTCAAAATCCCAACTTATCAACT
TTCCTCTCCAGCACAGCCATTGCTGGAGCTGTCTTCTCAGCCTTCAGCTCATCGGATCCTGCCTTCGCAGCCCAGCAAATTGCGGAGATTGCTGCCGAAGGCGACAACCG
CGGCTTAGCCCTTTTGCTACCCCTTATTCCGGCCATAGCATGGGTTTTGTTCAACATACTACAGCCAGCACTCAACCAGATCAACAGAATGCGCAGTGACAAGGGTGTGA
TAATTGGGTTGGGATTAGGGGGGCTGGCTGCATCAGAGTTTATGTACGCACCTGATGCTTCTGCCACTGAGATCGCCATGATTGCTGATGCTTCCTCAAGTGATGCCAGG
GGGCAGCTTCTGCTGTTTGTCATATCACCCGCCATTCTTTGGGTGCTGTACAACATTCTCCAGCCAGCTTTAAATCAACTCAACAGGATGAGGTCCGAG
Protein sequenceShow/hide protein sequence
IGWHFLDLASPLFFHLLISLHLNLPKPNLSLFILHLPTCQRPSTETSKSTSMAATMATMAVLSVKCTSINSSKNHTTPKPITNPISLLSLQNLPKELISSKASQNPNLST
FLSSTAIAGAVFSAFSSSDPAFAAQQIAEIAAEGDNRGLALLLPLIPAIAWVLFNILQPALNQINRMRSDKGVIIGLGLGGLAASEFMYAPDASATEIAMIADASSSDAR
GQLLLFVISPAILWVLYNILQPALNQLNRMRSE