CuGenDBv2

Cmc04g0104111 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0104111
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: pentatricopeptide repeat-containing protein At2g15820, chloroplastic
Genome location: CMiso1.1chr04:21510270..21518914
RNA-Seq Expression: Cmc04g0104111
Synteny: Cmc04g0104111
Gene Ontology terms:
GO:0000373 - Group II intron splicing (biological process)
GO:0045292 - mRNA cis splicing, via spliceosome (biological process)
GO:0048564 - photosystem I assembly (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domains: NA
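
The genome location in the summary above follows a `seqid:start..end` pattern. The following is a minimal Python sketch for splitting such a string into its parts. The names `Location` and `parse_location` are hypothetical, and the span length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip()
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("CMiso1.1chr04:21510270..21518914")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 8645 bp on CMiso1.1chr04.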


Homology
GenBank top hits (e value, % identity, alignment)
XP_004152074.2 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis sativus] (e value: 9.9e-89; % identity: 92.82)
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG
        MVFSMSIPTSAFSTVT LRSLTLSLSPYHHYFH PNHIIPTLF+ +YSVKV RQLPRIRAFASGSFVKQLVYD DSPSESEEHLSS +SNGGDGFHFENG
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG

Query:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        FASVDLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPAQKPGTVIRLLNAQ+KWMGQDDATYL VHCLRIRENETAFRV
Subjt:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

XP_008465080.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucumis melo] (e value: 7.5e-97; % identity: 100)
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
        MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF

Query:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
Subjt:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

XP_022158727.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Momordica charantia] (e value: 7.6e-65; % identity: 77.11)
Query:  TLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVR-QLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALE
        TL RSLT SL  +H +F   N+I+ TLFI ++S K R +LPRI AFAS S V QL+YDRDSPS+SEEH  SPYSNG DGFHFEN FAS DLKHLG PALE
Subjt:  TLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVR-QLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALE

Query:  VKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        VKELDELPEQWRRSKLAWLCKELPA KPGT++RLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Subjt:  VKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

XP_022949171.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata] (e value: 7.1e-63; % identity: 75.71)
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAF+TVTLLRSLTL  S  HH+F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYS G +      GFAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

XP_038887990.1 pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Benincasa hispida] (e value: 2.3e-69; % identity: 80.79)
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAFS+VTLLRS +LSLSPYHHYF  PNHI+ T+FI  YSVK  +QLPRI +FAS S V+QLVYDRDS  ESEEHLSSPYSNG D      GFAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        DLKHL  PALEVKELDELP+QWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDATYLTVHCLRIRENETAFRV
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBL0 LAGLIDADG_2 domain-containing protein (e value: 4.8e-89; % identity: 92.82)
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG
        MVFSMSIPTSAFSTVT LRSLTLSLSPYHHYFH PNHIIPTLF+ +YSVKV RQLPRIRAFASGSFVKQLVYD DSPSESEEHLSS +SNGGDGFHFENG
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKV-RQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENG

Query:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        FASVDLKHLGTP LEVKELDELPEQWRRSK+AWLCKELPAQKPGTVIRLLNAQ+KWMGQDDATYL VHCLRIRENETAFRV
Subjt:  FASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

A0A1S3CPK0 pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 3.6e-97; % identity: 100)
Query:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
        MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF
Subjt:  MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGF

Query:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
Subjt:  ASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

A0A6J1DXY9 pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like (e value: 3.7e-65; % identity: 77.11)
Query:  TLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVR-QLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALE
        TL RSLT SL  +H +F   N+I+ TLFI ++S K R +LPRI AFAS S V QL+YDRDSPS+SEEH  SPYSNG DGFHFEN FAS DLKHLG PALE
Subjt:  TLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVR-QLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALE

Query:  VKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        VKELDELPEQWRRSKLAWLCKELPA KPGT++RLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Subjt:  VKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

A0A6J1GB98 pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 3.4e-63; % identity: 75.71)
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAF+TVTLLRSLTL  S  HH+F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYS G +      GFAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPAQKPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

A0A6J1KB64 pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 2.5e-61; % identity: 74.58)
Query:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV
        MSI TSAF+TVTLLRSLTL  S  H++F   N++I +L I +YS K  RQLPRI AFAS S V+ LVYDRDSP+ESEE L SPYSNG +       FAS 
Subjt:  MSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVK-VRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASV

Query:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
        DLKHLG PALEVKELDELPEQWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA YL VHCLRIRENETAFRV
Subjt:  DLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

SwissProt top hits (e value, % identity, alignment)
Q6ZHJ5 Pentatricopeptide repeat-containing protein OTP51, chloroplastic (e value: 2.1e-25; % identity: 50.79)
Query:  PRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRK
        P I A AS   ++ L+ D D   E E+            F  E   A+ + + + +P L V EL+ELPEQWRRS++AWLCKELPA K  T  R+LNAQRK
Subjt:  PRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRK

Query:  WMGQDDATYLTVHCLRIRENETAFRV
        W+ QDDATY+ VHCLRIR N+ AFRV
Subjt:  WMGQDDATYLTVHCLRIRENETAFRV

Q9XIL5 Pentatricopeptide repeat-containing protein At2g15820, chloroplastic (e value: 2.2e-27; % identity: 44.39)
Query:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG
        S P    S+ TL RSL+ SL  +   +          +  H   T F S  S +   L    + A  SG+FV+ L       +ESEE +S   +NG GD 
Subjt:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG

Query:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
            N   +V  + + T   EV+EL+ELPE+WRRSKLAWLCKE+P  K  T++RLLNAQ+KW+ Q+DATY++VHC+RIRENET FRV
Subjt:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV

Arabidopsis top hits (e value, % identity, alignment)
AT2G15820.1 endonucleases (e value: 1.6e-28; % identity: 44.39)
Query:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG
        S P    S+ TL RSL+ SL  +   +          +  H   T F S  S +   L    + A  SG+FV+ L       +ESEE +S   +NG GD 
Subjt:  SIPTSAFSTVTLLRSLTLSLSPYHHYFH---------YPNHIIPTLFISSYSVKVRQLPRIRAFA--SGSFVKQLVYDRDSPSESEEHLSSPYSNG-GDG

Query:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
            N   +V  + + T   EV+EL+ELPE+WRRSKLAWLCKE+P  K  T++RLLNAQ+KW+ Q+DATY++VHC+RIRENET FRV
Subjt:  FHFENGFASVDLKHLGTPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRV
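
In each alignment block above, the unlabeled row between the Query and Subjt rows is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following Python sketch counts these for one segment. The function name `midline_stats` is hypothetical, and the figures it returns describe only the segment passed in, so they are not expected to equal the whole-hit % identity reported for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment segment.

    The query row supplies the segment length, because trailing spaces in
    the match line are easily lost when the page is copied as text.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    pct_identity = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": pct_identity,
    }


# Last segment of the Q6ZHJ5 (OTP51) alignment, copied from the rows above.
query_row = "WMGQDDATYLTVHCLRIRENETAFRV"
match_row = "W+ QDDATY+ VHCLRIR N+ AFRV"
stats = midline_stats(query_row, match_row)
print(stats)
```

For this 26-residue segment the sketch reports 19 identities and 22 positives.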


Sequences
CDS sequence
ATGGTTTTCTCCATGTCCATTCCTACCTCTGCATTTTCCACTGTGACCCTTCTCCGTTCTCTCACTCTTTCCCTCTCTCCGTACCATCACTACTTTCATTATCCCAATCA
TATAATCCCTACTCTCTTTATTTCTTCATATTCTGTTAAAGTGCGACAACTTCCCAGAATTCGTGCCTTTGCTTCCGGTTCTTTTGTTAAACAGCTGGTGTATGACCGGG
ATTCCCCGTCCGAATCGGAGGAGCACTTATCATCTCCATACAGTAATGGGGGTGATGGTTTTCATTTTGAAAATGGTTTTGCATCAGTGGATTTGAAACATTTGGGAACG
CCTGCGCTCGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGCCAGCACAAAAGCCGGGAACAGTGATACG
ACTGCTTAATGCACAGAGAAAATGGATGGGGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATCCGTGAAAACGAGACAGCATTTAGGGTAACTTTTTAG
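
The CDS above can be checked against the protein sequence listed on this page by translating it codon by codon. The following Python sketch does so under the assumption that the standard genetic code applies to this nuclear gene. The function name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGTTTTCTCCATGTCCATTCCTACCTCT"  # first 30 nt of the CDS
cds_end = "GTAACTTTTTAG"                      # last 12 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MVFSMSIPTS` and `VTF`, which agree with the first ten and last three residues of the listed protein sequence.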
mRNA sequence
GCGCACCCATAGAAGAACCCTAAGCCCAATTTAAAAATAAAAAGTAAAATTAGTTGGAAAGCGCTAAAACTTCTCCCTTTCCCTCAAGCCTAGTCACGCCTCACGCCTCC
TCCCTCTGCAGACACTTCTCCTCCCGACGCCGTCCGCCGCAACAATTCAGTTGCTTGCTGGCTGATGCTTTGAAGTGGCTCCCAGATTCGGGTTAGTCACTCCAAACTCT
GCGTTTTCTTTCTAAGCGTAATCCTCCTATGGTTTTCTCCATGTCCATTCCTACCTCTGCATTTTCCACTGTGACCCTTCTCCGTTCTCTCACTCTTTCCCTCTCTCCGT
ACCATCACTACTTTCATTATCCCAATCATATAATCCCTACTCTCTTTATTTCTTCATATTCTGTTAAAGTGCGACAACTTCCCAGAATTCGTGCCTTTGCTTCCGGTTCT
TTTGTTAAACAGCTGGTGTATGACCGGGATTCCCCGTCCGAATCGGAGGAGCACTTATCATCTCCATACAGTAATGGGGGTGATGGTTTTCATTTTGAAAATGGTTTTGC
ATCAGTGGATTTGAAACATTTGGGAACGCCTGCGCTCGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGAAGATCCAAATTGGCTTGGCTTTGTAAGGAATTGC
CAGCACAAAAGCCGGGAACAGTGATACGACTGCTTAATGCACAGAGAAAATGGATGGGGCAGGATGATGCGACCTATCTCACCGTGCATTGTTTGCGTATCCGTGAAAAC
GAGACAGCATTTAGGGTAACTTTTTAGTGGTGGTTCTTACTGGATTCGTCGGAGGAAAGGGAACTGCTCAGTTTTTGTAATAATTTAAATTATTATTATTTTTAATAACT
ATTTTGTTCCCTCCGCTGGTCCAAAATTAAGTTCCCCTCAAGAACCTCATATGGATAATGGATGTACCAAAAATGATTTAAATTTTCCTGTAGTATCTTTGCGTATAGGA
GCTCTAAATGCACAGCCGACTTCAACACTCTCAGTAGAGACACCTCAGTCGGCTTCTCTCTCCCTCGGGTAGCAGTCTTTGTTTTCATGATGGGAAATCCCAGATCATCT
CTTCTGCCTACTGCCACCATCCCATTGTGGCCGGAGTGTGGTTCTTTACGGCCATTTTTGGTATGTCTGCTTCCCCAAAGTATTGACGATTGTTTTGCTGAAGTTCTTTT
GTTGGTCGGTTGGTGACTGAAAGCAAGCCTGAAAACTTTTAGACATTTGCAGGTCAAGCTGTTCTTTGGCTTATTTGGTTAGAATGAAACGAAAAGATCTTTTTTTCTAA
TCTTTGTGGCCATAATTTATTTTAATGTCGTATGGTAAATGGCCCTTGTAGACTGTACCTTAGATTTTATTTTTTATGACGATACTACGATGGTGTTATTTTTAATCTTG
ATCCCTGGAGTAACAAATCTTTTTTGGTGGGGGTTCACTCTACCAAGCTCGTAAGTGTGTGGGCGCTTGTTCGTTTGCGTGTGACCATGATCGTCCTTTCTCTATCTCTG
TCGTTGGATCTGCTGCAACCCCGAGGGTATAACATATTTTTCTGTTGTTGGCATGTTCCTCACTGTCTTCTGCAACCCCTCCATCTTTCAATTTCTAGTTGTTGAGCTGT
CAGGGATTAATTTTTTTTGTTTCACTTTTTTTCTTCCATTCTACCAATTCGTAAATATGGTCAAATTGCAAATGGGTCATATTTAATCTAAAGTCATGCTTAATTAAAAC
CTAGTTGTGAAAGGGATTTGTTCGAAGGGTAACCGAGTGTTTAGCTTCACACTTTCTTTCCTTCCTTTTATTTCCTCTCCTTTTTGCCTGATAAAGCAACTGCTAGACCA
AAAGAGATGTCATTGGGGAGGTTCTCAAAGATAAAGGGATTTGACCTTCTTACGTACCCCAATCCCAAATTAGGCCGATTTTAATAGGTGTGCCTTTAACTCTTTTATAA
GAACCTTTCCTCATTTTTGACAAAAATTACATTGCCAAGCAAAATTCTCTCCACTGCCCAATCTTTCATGTTTTGTGACATATTGCTATAATGGTTTTGTTTCATCTTCA
ATAGTCATTCTTGAATTCATACTGTCCATTGTTTGGCCACAAAGCATGAAAAATAAATAAATAAATAAATAAAAGGAGAGACACAAAAAGAAACTGATAGTGAGATGGCA
TCCTTACTTTGGGGAAACTGGACTAAAGCCAACGACAAGAGGGATGGGCGGCGACCGACGAGAGGGTTGGCGGTGACCCGTGAGCAAGTTAGGGCAGAGAGGGTTGGCAA
TGATAAATTGGAGAAATCGGCATAGGGGAGAAATGCTTGAAAAAAATATTTGAAAAGATAAAATAAAAAAAACCTAATAATTGTTGGGTCGGTCTAATGAGCTTTTCTAA
CATTTTCTGTGTTCTTTTCCTTGGAGTTATGCCTTGGTTGTGTTGGGGCCATGTCCGAAATAAAAAAATAAAAAAAATAATCTACATGCCAGCTGGCATAGGAATGTATC
TGATACTAACACCTAGCCATCTTAGAAGTGTTCGTGCTTCTTAGGTTCATGTTATCTGTATCTTGCGCACCTTCCCAGAAAGAATCACGAATCAACTTATCCAAAACTTT
GATAATAACGAAAGGAACGTGATGGAGAGATGACGACTCCCTTTCAAAATATATGCATATTTCCAATTATAAAGCTTATATTGCATTCTCTCTATAATGGAATGCTAGAA
AGCATTGGAAGTAGAATTCCACCTAAAAGCAAGCTAAGGTAGGCAGAAGGTCAATTAACGTGTTTGCAACCAAAAGAAGAAGCCAAGGAAGTCATAACAAAAGCATTTAT
TTTAAGATGCTCACTCTTGAAAAGATTAATAGACAATCTAGAAGCAACTTCAAAGATACGAATCACCTAAAAAAGATGCAAAAGGCAGAAGGAGTCATAGAAGAAAAGAT
CAATGTATCATCAACGAACTGTAAATGGCTCGAAACAAAATTAGAATCACCAATAAAGTGGACAACCTTGGATAAACTATGGATCCATAAGACAACTACGACAATCAACA
ACCAAATTAAATAAAAATGGGGATAGGGTTGCGTTGCTTGATACCAACCGAAGAGATATTCTTACCCCTAGCCCGACCATTAATAATGATGGAGATATTTGTACTAGAGA
TCCAACCACGAAACCAAGATTGCCACAACTGACCTAAGCCTTCTATCTAAAAAATAGCATCGAGAAAATCCTAGTCCACCGTGTCAAAAGCTTTCTTCAAATTGAGTTTC
ATAATTTTATATGTTGTCATTGTAGCCAAATCGTACAGCAATATGCAGAGCAAAGAAAGGGGGTTTCAAGGATACTCCTGCGGAGGATCTTCTTGCATCTGTTTTAAAGG
TAGAGGAGTCCTGACAAATTGGTATCAGAGCTGCCAAAGATCCTGAGAGGATACGAACAATAGCACATAAATAATTAAAGGCAAGGATGGAGATGAGTGAGAGAGAGATC
ATGGGTTTGAAACAAATGATACTCGGTCTAACTAAGAGTGGAAAAACTGTCCGACAAAGTGAAAGAAAGCAGTGTGACCAAGTGACCAGAGAGAAGAATTGTGTGCGTCG
GATGGGTTTGGGTTGAAACTAAAAGGTAAGGTGGAGGAAGTTGATGCGACTTCTAGCCTCGTTAAGGGTCCTCCTAATAGAAGCAAGTATAAGAAGTTGGAAATGCCCGT
ATTTGCCGGTGTAAACTCAAAATCATGGATTTATAAGGCAGAACATTATTTTGAGATCAATGAGCTAATTGACACGGAGGTGTAGGTGGCTGTCGTTAGTTTCGCCCAAG
ACGAAGTGGATTCATTTCGATGGAGCAACAATTGGAAGAAAATCACGTCGTGGGAAGACCTAAAGGGGAGGATGTTTGAACACTTTAAGGTCCCTAGAGAAGGAAGCCTG
AGCGCTTGCCTCATATGCTAGCAAGATGGAATGTATACAAACTATGTGAAGAGATTTTTGAACTACTCCACACTTTTGCTGAAGATGGCAAAGAGTGTTTGGATAAATGC
TTTCGTAACCGATTTAGAACCAGTGCTTCAAGTAGAGGTGAAGAGCCGCTATCCCATAACTATGAGAGAGGCCCAATTAGTGAAGGATAGAAATTTGGCTCTCAAGATGG
CCCTAAATGAGTTGGGTGGCAGTGGACCGAGTATTTCAGAGGCTCAAACCCAAACTATGAAAGATGGAAGAACAAATACGAAAAAGAAAGGGGGAAGACAAACTGAGTAC
CCTATGAGGCAAATTTCGATTCTAGTCAAGGGAAGTTATACAAGGGGTGAGCTGCCAGTAAGATTTTTGTCAGATAATGAGTTCAAGGAAAGATTGGACAAGGGGTTATG
CTTTTGTTGTAATGATAAGTACTCCCATGGGCACATATGCAAGATCAAGGAGAATCGTTACACTACAACAATTAAGAATTATTATTCTTGACGGTTTTAAAATCATCATT
GAAGCCAATGTTAAGAAAGTCAAAGTTCATGACGGTTAATAAACGTCAAGAATAATACGTTGACAGTTTATAAACACATTGTTTATTATTTTCTTAGATCCCTCTCCTCC
TAAGGAGTCGATTTTTGTGTGGTACGTCTTACATCTTGGTATTTCGACTAAATCAATTCGGCTATAGCCCAACTCTTAGCTTGTTTATGATGTTTTCCTAAGATCTTTTG
ACATGTACGAGTACTAACATACTTAGAAATTTTAAAC
Protein sequence
MVFSMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVRQLPRIRAFASGSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGT
PALEVKELDELPEQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVTF