; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014661 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014661
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPAP_fibrillin domain-containing protein
Genome locationtig00000892:678179..685947
RNA-Seq ExpressionSgr014661
SyntenySgr014661
Gene Ontology termsNA
InterPro domainsIPR006843 - Plastid lipid-associated protein/fibrillin conserved domain
IPR039633 - Plastid-lipid-associated protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570366.1 putative plastid-lipid-associated protein 12, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.4e-11490.43Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  VEEAIDKLIS+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKS D+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EPKQ
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

XP_022153755.1 probable plastid-lipid-associated protein 12, chloroplastic [Momordica charantia]7.5e-12196.09Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTG GVEEAI+KLISK
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
        +QNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLGL FSM GTFV+SEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EPKQ
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

XP_022943649.1 probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucurbita moschata]8.0e-11590.43Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  V+EAIDKLIS+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKSED+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EPKQ
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

XP_022943650.1 probable plastid-lipid-associated protein 12, chloroplastic isoform X2 [Cucurbita moschata]8.0e-11590.43Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  V+EAIDKLIS+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKSED+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EPKQ
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

XP_022987045.1 probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucurbita maxima]6.8e-11489.57Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  VEEAIDKL+S+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKS D+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EP+Q
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

TrEMBL top hitse value%identityAlignment
A0A6J1DHQ4 probable plastid-lipid-associated protein 12, chloroplastic3.6e-12196.09Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTG GVEEAI+KLISK
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
        +QNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLGL FSM GTFV+SEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EPKQ
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

A0A6J1FUZ3 probable plastid-lipid-associated protein 12, chloroplastic isoform X13.9e-11590.43Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  V+EAIDKLIS+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKSED+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EPKQ
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

A0A6J1FY29 probable plastid-lipid-associated protein 12, chloroplastic isoform X23.9e-11590.43Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  V+EAIDKLIS+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKSED+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EPKQ
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

A0A6J1JD06 probable plastid-lipid-associated protein 12, chloroplastic isoform X23.3e-11489.57Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  VEEAIDKL+S+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKS D+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EP+Q
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

A0A6J1JFQ6 probable plastid-lipid-associated protein 12, chloroplastic isoform X13.3e-11489.57Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A ASVKDGKRILFQFD+AAFS KFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+TEARQNLL+AIS G  VEEAIDKL+S+
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL
         +N+NKF++ELLEGDW MLWSSQMETDSWIENAANGLMGMQIIKNGQMKF+VDMLLG+RFSM GTFVKS D+TYDV+MDDAAIIGGPFGYPVEMESRFKL
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFKL

Query:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ
        QLLYNDGKIRITRGYNNILFVHLRV EP+Q
Subjt:  QLLYNDGKIRITRGYNNILFVHLRVHEPKQ

SwissProt top hitse value%identityAlignment
Q8LAP6 Probable plastid-lipid-associated protein 12, chloroplastic4.3e-8769.33Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A AS+KDGKR+LF+FDRAAF LKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+T  RQ LL  IS   GV EAID+ ++ 
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQII-KNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFK
        + N  +   ELLEG W+M+WSSQM TDSWIENAANGLMG QII K+G++KF+V+++   RFSM G F+KSE +TYD+ MDDAAIIGG FGYPV++ +  +
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQII-KNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFK

Query:  LQLLYNDGKIRITRGYNNILFVHLR
        L++LY D K+RI+RG++NI+FVH+R
Subjt:  LQLLYNDGKIRITRGYNNILFVHLR

Arabidopsis top hitse value%identityAlignment
AT1G51110.1 Plastid-lipid associated protein PAP / fibrillin family protein3.1e-8869.33Show/hide
Query:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK
        A AS+KDGKR+LF+FDRAAF LKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+T  RQ LL  IS   GV EAID+ ++ 
Subjt:  ATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISK

Query:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQII-KNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFK
        + N  +   ELLEG W+M+WSSQM TDSWIENAANGLMG QII K+G++KF+V+++   RFSM G F+KSE +TYD+ MDDAAIIGG FGYPV++ +  +
Subjt:  SQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQII-KNGQMKFQVDMLLGLRFSMIGTFVKSEDNTYDVTMDDAAIIGGPFGYPVEMESRFK

Query:  LQLLYNDGKIRITRGYNNILFVHLR
        L++LY D K+RI+RG++NI+FVH+R
Subjt:  LQLLYNDGKIRITRGYNNILFVHLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTACTTCCAAGCTATCTGCAGAATTATCCCCTGCTAAAGAACAGAGAGCAATGGATAAAATTCCCTACTCTAATATAATGGGAATTTGGAAGTTTGATGTACTACA
TGATTATTCTACAACTGATCTGGCATATGCTGCAAGTTTGGCAACTGCATCAGTCAAAGATGGCAAACGCATCCTTTTCCAATTCGACAGAGCAGCATTTTCTTTGAAGT
TTTTGCCTTTTAAGGTTCCATATCCTGTTCCATTTAGACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCTTCTGGAAACCTCCGCATTTCA
AGAGGAAACAAGGGCACCACATTTGTGTTGCAAAAGAAAACGGAAGCAAGGCAAAACTTGCTGGTAGCAATTTCTACAGGTATAGGAGTTGAAGAGGCAATTGATAAGCT
TATCTCTAAAAGTCAAAATGACAATAAATTTGAAAAGGAGCTTCTTGAGGGAGACTGGAAGATGTTATGGAGTTCACAGATGGAAACAGATAGTTGGATAGAGAATGCTG
CAAATGGTCTCATGGGGATGCAGATTATCAAGAATGGACAAATGAAGTTTCAAGTGGATATGTTGCTTGGACTGAGATTTTCCATGATTGGTACTTTTGTGAAATCTGAA
GACAACACATACGATGTTACCATGGATGATGCTGCAATTATTGGAGGCCCATTTGGATATCCCGTGGAAATGGAGAGTCGGTTTAAGCTCCAGCTTCTATACAATGATGG
GAAGATAAGAATCACACGAGGATACAATAACATCTTATTTGTTCATCTACGGGTCCACGAACCAAAGCAGTCTTTCCAATCCAGCAACTACATCAGAAGTATACTTACAG
CTACAGCTCATTTCCCAGACAGCAAATGCATCTTGCACGTAAATCTTGTATCTGATACCTTCCAACCATTCTTTCTAAGTGACCTGTTCGGCCACTGGCAACCAAGTTTT
CAAGAACATAGGAACGAATCCTCTTTTGATTCTCCACAGCCAGAATCCTTTACATCTGCAACTAGGTTTACAGGTACAACTCTATTCCGCTCTCCTTCATCATTGACCAT
GGCTTCCTTTCCAACCCCTGAATCCGCATGTAAAGTATTACCCAAATTTTCATCCATATCCTCGCATTCATTTACCTTTTCCAATACAAGTTCACCCTTCATTTCTTCAT
CTTTTGCAACATCGTCTTCTGGAAGCTGTTCATCTGATTCCTTCACCATTTCATCCTTCACCATGTATTTCTCTTCAACCACTCTCCCTCTCGTAAAGTCTTCCTCAACA
CATTTATCCCCACCCTCAATCTCTGACACTTCTTCTTCGTCCTCCTTAGAATCAACATCAATAACTTGGTTTGCTCTAATCTCAGATACATACTCCATCTCCAACCCTCC
ATCACTTTCATTCTCAGATACATTCTCCACTACCTCCTCATTTACATCATCAATAGCTGGGTTTTGCCCTTCAAGAAGTTGCAGCTTCCTCCTCAATTATTATACTCTAC
TCCATAGTTCTGGTATATCATTTCAAATCGGCTCAAGTCATGGTCCAATCTTCGTTCTACCTCCAGATGCTCCTCCACGTCACTATCACTCGACCCACCACCACCTTCCT
CTTCTTCCTCATCATGCACTTCCGCCTTATTCTCAGATCCTCGGAGCCTTATCAACCCTTTCATTGATCCATAACCATCAACGCCCCACTCCTCCTTGGCCATATCACCC
AACGCCTCCCTCGATGGGGAGAACATTGGATAAAATTTTCGAGCTTTTATACGATTTTGAGACTGCAATTCATGAAGACTTGGAGGTTGATATCCTCCTACGTTATTTGG
ACCACTATGGAACCCGGCTTCGAGAAGTTGCGATACTTCTTGTTAGGATCCAAGGGGGGCTGCGGTCGTGCCCATACCTTATAGTTGCCACTCATCATAGGCGGCTGAGA
CTGAGTCATGATCAACTGGGACTGCTGCATAGCCTGAGGTTGCGGCAAAAGATTTGGCTGATTGATGGACTGCGACTGAGACATTATCTGAGGATGGTTCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTACTTCCAAGCTATCTGCAGAATTATCCCCTGCTAAAGAACAGAGAGCAATGGATAAAATTCCCTACTCTAATATAATGGGAATTTGGAAGTTTGATGTACTACA
TGATTATTCTACAACTGATCTGGCATATGCTGCAAGTTTGGCAACTGCATCAGTCAAAGATGGCAAACGCATCCTTTTCCAATTCGACAGAGCAGCATTTTCTTTGAAGT
TTTTGCCTTTTAAGGTTCCATATCCTGTTCCATTTAGACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCTTCTGGAAACCTCCGCATTTCA
AGAGGAAACAAGGGCACCACATTTGTGTTGCAAAAGAAAACGGAAGCAAGGCAAAACTTGCTGGTAGCAATTTCTACAGGTATAGGAGTTGAAGAGGCAATTGATAAGCT
TATCTCTAAAAGTCAAAATGACAATAAATTTGAAAAGGAGCTTCTTGAGGGAGACTGGAAGATGTTATGGAGTTCACAGATGGAAACAGATAGTTGGATAGAGAATGCTG
CAAATGGTCTCATGGGGATGCAGATTATCAAGAATGGACAAATGAAGTTTCAAGTGGATATGTTGCTTGGACTGAGATTTTCCATGATTGGTACTTTTGTGAAATCTGAA
GACAACACATACGATGTTACCATGGATGATGCTGCAATTATTGGAGGCCCATTTGGATATCCCGTGGAAATGGAGAGTCGGTTTAAGCTCCAGCTTCTATACAATGATGG
GAAGATAAGAATCACACGAGGATACAATAACATCTTATTTGTTCATCTACGGGTCCACGAACCAAAGCAGTCTTTCCAATCCAGCAACTACATCAGAAGTATACTTACAG
CTACAGCTCATTTCCCAGACAGCAAATGCATCTTGCACGTAAATCTTGTATCTGATACCTTCCAACCATTCTTTCTAAGTGACCTGTTCGGCCACTGGCAACCAAGTTTT
CAAGAACATAGGAACGAATCCTCTTTTGATTCTCCACAGCCAGAATCCTTTACATCTGCAACTAGGTTTACAGGTACAACTCTATTCCGCTCTCCTTCATCATTGACCAT
GGCTTCCTTTCCAACCCCTGAATCCGCATGTAAAGTATTACCCAAATTTTCATCCATATCCTCGCATTCATTTACCTTTTCCAATACAAGTTCACCCTTCATTTCTTCAT
CTTTTGCAACATCGTCTTCTGGAAGCTGTTCATCTGATTCCTTCACCATTTCATCCTTCACCATGTATTTCTCTTCAACCACTCTCCCTCTCGTAAAGTCTTCCTCAACA
CATTTATCCCCACCCTCAATCTCTGACACTTCTTCTTCGTCCTCCTTAGAATCAACATCAATAACTTGGTTTGCTCTAATCTCAGATACATACTCCATCTCCAACCCTCC
ATCACTTTCATTCTCAGATACATTCTCCACTACCTCCTCATTTACATCATCAATAGCTGGGTTTTGCCCTTCAAGAAGTTGCAGCTTCCTCCTCAATTATTATACTCTAC
TCCATAGTTCTGGTATATCATTTCAAATCGGCTCAAGTCATGGTCCAATCTTCGTTCTACCTCCAGATGCTCCTCCACGTCACTATCACTCGACCCACCACCACCTTCCT
CTTCTTCCTCATCATGCACTTCCGCCTTATTCTCAGATCCTCGGAGCCTTATCAACCCTTTCATTGATCCATAACCATCAACGCCCCACTCCTCCTTGGCCATATCACCC
AACGCCTCCCTCGATGGGGAGAACATTGGATAAAATTTTCGAGCTTTTATACGATTTTGAGACTGCAATTCATGAAGACTTGGAGGTTGATATCCTCCTACGTTATTTGG
ACCACTATGGAACCCGGCTTCGAGAAGTTGCGATACTTCTTGTTAGGATCCAAGGGGGGCTGCGGTCGTGCCCATACCTTATAGTTGCCACTCATCATAGGCGGCTGAGA
CTGAGTCATGATCAACTGGGACTGCTGCATAGCCTGAGGTTGCGGCAAAAGATTTGGCTGATTGATGGACTGCGACTGAGACATTATCTGAGGATGGTTCATTAG
Protein sequenceShow/hide protein sequence
MFTSKLSAELSPAKEQRAMDKIPYSNIMGIWKFDVLHDYSTTDLAYAASLATASVKDGKRILFQFDRAAFSLKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRIS
RGNKGTTFVLQKKTEARQNLLVAISTGIGVEEAIDKLISKSQNDNKFEKELLEGDWKMLWSSQMETDSWIENAANGLMGMQIIKNGQMKFQVDMLLGLRFSMIGTFVKSE
DNTYDVTMDDAAIIGGPFGYPVEMESRFKLQLLYNDGKIRITRGYNNILFVHLRVHEPKQSFQSSNYIRSILTATAHFPDSKCILHVNLVSDTFQPFFLSDLFGHWQPSF
QEHRNESSFDSPQPESFTSATRFTGTTLFRSPSSLTMASFPTPESACKVLPKFSSISSHSFTFSNTSSPFISSSFATSSSGSCSSDSFTISSFTMYFSSTTLPLVKSSST
HLSPPSISDTSSSSSLESTSITWFALISDTYSISNPPSLSFSDTFSTTSSFTSSIAGFCPSRSCSFLLNYYTLLHSSGISFQIGSSHGPIFVLPPDAPPRHYHSTHHHLP
LLPHHALPPYSQILGALSTLSLIHNHQRPTPPWPYHPTPPSMGRTLDKIFELLYDFETAIHEDLEVDILLRYLDHYGTRLREVAILLVRIQGGLRSCPYLIVATHHRRLR
LSHDQLGLLHSLRLRQKIWLIDGLRLRHYLRMVH