CuGenDBv2

CcUC02G024590 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G024590
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages;
Genome location: CicolChr02:8099204..8112066
RNA-Seq Expression: CcUC02G024590
Synteny: CcUC02G024590
Gene Ontology terms:
GO:0015979 - photosynthesis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009522 - photosystem I (cellular component)
InterPro domains: IPR008796 - Photosystem I reaction centre subunit N, chloroplastic


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136934.1 uncharacterized protein LOC101214221 [Cucumis sativus]; e value: 1.5e-41; % identity: 79.41
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY
        MSSIGQ ILMALAVTLNKFASSNVQSVQRN+      ATATAT  S  GRR LLLS +A  +AA    +DSRTELLK  YLKKSEENKEKN+KERLESYY
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        KRNYKDYFEFVEGSVKNKNELSEAEKGI+EWLKR+K
Subjt:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_008455049.1 PREDICTED: uncharacterized protein LOC103495319 [Cucumis melo]; e value: 3.7e-40; % identity: 79.56
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIA-AVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESY
        MSSIGQ ILMALAVTLNKFASSNVQSVQRN+        ATAT  S  GRR LLLS +A A  AA   A+DSRTELLK  YLKKSEENKEKN+KERLESY
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIA-AVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESY

Query:  YKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        YKRNYKDYFEFVEGSVKNKNELSEAEKGI+EWLKRNK
Subjt:  YKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_022927848.1 uncharacterized protein LOC111434615 [Cucurbita moschata]; e value: 3.5e-38; % identity: 76.43
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANG----RRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERL
        MSSIGQ ILMALA+TLN+FASSNVQSVQRN+   P T TAT +A ++      RRGLLLS  AAVAA    A+DSRTELLK  YLKKSEENKEKN+KERL
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANG----RRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERL

Query:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_022989008.1 uncharacterized protein LOC111486201 [Cucurbita maxima]; e value: 2.7e-38; % identity: 79.41
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY
        MSSIGQ ILMALA+TLN+FASSNVQSVQRN+   P T TAT +A S   RRGLLLS  AAVAA    A+DSRTELLK  YLKKSEENKEKN+KERLES+Y
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        KRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

XP_038887440.1 uncharacterized protein LOC120077574 [Benincasa hispida]; e value: 6.1e-51; % identity: 90.44
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY
        MSSIGQ ILMALAVTLNKFASSNVQSVQRNQANKPR  TATAT GS  GRRGLLLSA+AA AA PEEA+DSRTELLK  YLKKSEENKEKN+KERLESYY
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
Subjt:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C176 uncharacterized protein LOC103495319; e value: 1.8e-40; % identity: 79.56
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIA-AVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESY
        MSSIGQ ILMALAVTLNKFASSNVQSVQRN+        ATAT  S  GRR LLLS +A A  AA   A+DSRTELLK  YLKKSEENKEKN+KERLESY
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIA-AVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESY

Query:  YKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        YKRNYKDYFEFVEGSVKNKNELSEAEKGI+EWLKRNK
Subjt:  YKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

A0A2I4HLI8 uncharacterized protein LOC109019213; e value: 3.4e-31; % identity: 67.88
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY
        MSSIGQ ILMAL VT+N+FASSNVQ+V R +   P + T T T  S  GRR LLLS +    AAP+ A DSRT+LLK  YLKKSEENK KN+KERL+SYY
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVK-NKNELSEAEKGIIEWLKRNK
        KRNYKDYFEFVEG+ K N+ +LSEAEKGII+WL+RNK
Subjt:  KRNYKDYFEFVEGSVK-NKNELSEAEKGIIEWLKRNK

A0A6J1D574 uncharacterized protein LOC111017388 isoform X1; e value: 6.0e-36; % identity: 74.26
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY
        MSSIGQ ILMALAVT+NKFASSNVQSV RNQ        + A A S  GRRGLL SA+AA A AP   +DSRTELLK  YLKKSE+NKEKN+KERL+SYY
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        KRNYKDYFEFVEGSV+NK+ELSE EK IIEWL+RNK
Subjt:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

A0A6J1EM63 uncharacterized protein LOC111434615; e value: 1.7e-38; % identity: 76.43
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANG----RRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERL
        MSSIGQ ILMALA+TLN+FASSNVQSVQRN+   P T TAT +A ++      RRGLLLS  AAVAA    A+DSRTELLK  YLKKSEENKEKN+KERL
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANG----RRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERL

Query:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        ES+YKRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  ESYYKRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

A0A6J1JNZ7 uncharacterized protein LOC111486201; e value: 1.3e-38; % identity: 79.41
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY
        MSSIGQ ILMALA+TLN+FASSNVQSVQRN+   P T TAT +A S   RRGLLLS  AAVAA    A+DSRTELLK  YLKKSEENKEKN+KERLES+Y
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYY

Query:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK
        KRNYKDYFEFVEGS+KNK+ELSEAEKGIIEWLKRNK
Subjt:  KRNYKDYFEFVEGSVKNKNELSEAEKGIIEWLKRNK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G49975.1 INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).; e value: 2.0e-28; % identity: 59.71
Query:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSAN-GRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESY
        MSSI Q ILMAL VT+NK+ASSNVQ+V+RN      T   + TA  A+ GRR +L S+ + +AA    A+ S  +LL+  YLKK+EENK KN+KERL+S+
Subjt:  MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSAN-GRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESY

Query:  YKRNYKDYFEFVEGSVKNKN--ELSEAEKGIIEWLKRNK
        YKRNYKDYFEFVEGS+K K   ELSE+EK I+EWLK NK
Subjt:  YKRNYKDYFEFVEGSVKNKN--ELSEAEKGIIEWLKRNK


Sequences
CDS sequence
ATGAGTTCCATCGGCCAAATCATTCTGATGGCTCTCGCCGTCACTCTCAACAAATTCGCTTCCTCCAACGTTCAATCCGTTCAGAGAAACCAAGCCAACAAGCCTCGCAC
CGCCACCGCCACCGCCACTGCCGGTTCTGCAAATGGAAGAAGAGGCCTCCTCTTGTCTGCCATTGCCGCCGTCGCTGCCGCTCCCGAAGAAGCCATGGACTCCAGAACCG
AGCTGCTAAAAAGTGGGTACCTCAAGAAGTCTGAAGAAAACAAAGAAAAGAATGAGAAGGAGAGATTGGAGAGTTACTACAAGAGAAATTACAAAGATTATTTTGAGTTT
GTTGAAGGATCAGTGAAGAATAAGAACGAACTTTCAGAAGCTGAAAAAGGTATTATTGAGTGGCTTAAACGAAATAAATGA
mRNA sequence
GTATATGGACCAAGAAAGCCTCATCTATTCTTCAACCTCATCCTCAAACTCAAAACTATCACTCCGATGAGTTCCATCGGCCAAATCATTCTGATGGCTCTCGCCGTCAC
TCTCAACAAATTCGCTTCCTCCAACGTTCAATCCGTTCAGAGAAACCAAGCCAACAAGCCTCGCACCGCCACCGCCACCGCCACTGCCGGTTCTGCAAATGGAAGAAGAG
GCCTCCTCTTGTCTGCCATTGCCGCCGTCGCTGCCGCTCCCGAAGAAGCCATGGACTCCAGAACCGAGCTGCTAAAAAGTGGGTACCTCAAGAAGTCTGAAGAAAACAAA
GAAAAGAATGAGAAGGAGAGATTGGAGAGTTACTACAAGAGAAATTACAAAGATTATTTTGAGTTTGTTGAAGGATCAGTGAAGAATAAGAACGAACTTTCAGAAGCTGA
AAAAGGTATTATTGAGTGGCTTAAACGAAATAAATGAAAGATACTTTTCGTTTTTATTGGAATTAATTAATTCATAAGTTCCATCCTTCACAACAATTATAACAATAGTT
CTAGTTTAGTTTAGGCCTCCAATTTTATTTATAGGTTTATGGTTTTAAAAACAAAGCTTATTTGATGATAATTTATTTTTTCAAAAAGAAAGGTATGTAAAAACTACTCA
TAAGTATAGAGTTGTAAGTGCTCAATAAATAATATAGGGTTTCATGGTAAAGGTTTTGGAATTTTGAAATAAAATAATTGCTTAAATAATAAAATTTAATAATAAAATTT
AAGGGTGTTAAAAATAACAACTATTGTTCTCATTTTAGTTCTCCTTGTTTAGTTAATTTTCACCAAGTAGCTAAATTGAAGAGTCAAAAGCAATTTTTTGAAATGTAATG
TAACGACCCGACTTATACAAACTAATCAGATCATTACTCAATGCATGCAAAACTTTAAAGAACGACACTACGCTGAACAGTGAAGAAGGATCAAATCTTTTCATGAAAAT
TTCTACTTATATTAACTCAAGAAAATAACACATTAATAAATAGAAAGGAAATTTTGGTTCGGGGTACCCCTAATAAAATCAATGAATAAAAAAAATCCACATAATTAAAA
AAATAGACAAGTCTCAAAACAAAATTAGACAGTTGGACAACAATTTTAGTAAAACATGCAAGACTACGCGGAAGCTACTACGGGTCCCGTGATACCATCAAAGATTCTTC
TTGTCATTCGCCGACACATTTCTTTCTTTACCTGAAAAACATATAAGAAGAAAAGATGAGTAACTATGTTACTCAGTAAGTGACCCCACTACCGGAGTCGGGTTAGACAC
CTAAGTCCTTTAAGTGCGTCAAACATGAGACATGCATATCTATCATACTTCTATAGGAAAATGGTTATACCCGCCCACCGTTGGTTATGGATGCCCACAAACCTCTAGAG
CCTCGAAGGAAACTCTTACCTTTAGTGATCCCGAAGGAACACAAACCTCTAGTGCACTCGAAAGAGCACAACCTCTGGTGAACCCGAAGGAACACAACCTCTAGTGAACC
CGAAGGAACACATCCTCTAGTGATCCCGAAGGAACACAACCTCTAGTGAACCGGAGGAACACATCTTCTAGTAATCCCAAAGGAACACAACCTCTTATGGACATCCAACT
GACGTAGGGGAGTTACCACTACCCATCATACTGATCAATCAAATTCACACTCCTTCCATTCCATGAATTCATACATTAACAGTACTGTTCATAAACATCTCATAGTATAA
GCTTATACACTATGAGGCACATACAACTTTAAGACATGGAAATTTCATACAAGAATCTTATATCATACGTGGCTTTGGAAATCTACTACATCATGGTAACTAGACATATG
GCCCGAAGGCGGTTCAGGTAGTAAGATCACTTACCTGAGGGTTGATCCATAAAGCTAACCACACTGCAACTCTTGTAAAATTCCTGAACCAACTAGGAATAAAGTTCCCC
TAACATAAGTTATTCATCAAGCCCAACCTATCATAAGCAAACTCAAAATTTGTTCGAGAAAGTCCTTACCCGAACGTGACCAAACTCGCAGTTTCCTCCGACCCTATGGA
CTAAAGCAATCGCTGACCACCTATTAAAATTTACCAATTAATTCGCAATCCCCAAATATGCCACTAACCTACTCAGGTCTGAGATACTTACTTAGGGAGTACTGGTCTGG
CGACTGACGGTCAGGCGAGCAGTTAACTGCCGTCGAGAACACAATTCTGAGCGAGAGAGAGAGCCGAGCGAGAGTCGAGCGAGAGAGCGAGAGTTCGAGAGTCAGCGAGA
GCGAGAGAAAGAGAGAGCAAGAGTTCGAGAGTCAACGAGAGCTAGCGAGAGAGCGAGAGCTCGAAAGCGTGACCCTGGCGTAGAGAGTGTGATTTTTGAGTGTGAGAGTG
GGGGAGCGTGAGTCAGCGAGAGAGCGAGAGAGCCGCGGGATATAGTGATACCCTGGCTTGGAGAGCGTGAATTTCGGGTGTGAAACCTCCGGGAACGAGCAGCAATAGAG
AGAGAGAGGTGGCGGGAGCAGAGAGAGACCGAAAACGAAGGAAGGGCGGTGGCGGCGACGAAAACCGAAGAAGGGAGGAGAGAGAGGCGGTGGAGAGGGAGAGAGATGGC
AGGGCAGAGGGAGAGAAATGGCGGGGAAAAGCTTTTCTTTTCTCTTTTTTTTTTTACTACACGCAAATCGAGGAGGGTTTAAAATAAACCCTAATTCCTTCCCCTCCTTA
ACCCATTAATTCCACTTTTCCCTTTTTGTTTCCCAACCAAATAAAATAAAATCTCCTTTATTCACACTAAAATCCCACCAAATCTATTTATTTTATTTTATTTGCCCCAA
AGTAAGCTTTCCTTTGCCAATTCCCAATAACTTCCTTAAATCCACCTAAGACTCACCAACCACTATAAATGCTAATTTTAAAACTTTCCTCCCTAAACTAAATAACTACG
TAAATACATATAAATAAATATAATTTTCTTTTGATGCATAATGATTTAAATTAACTTAATAAGAGGATTTACCTTAATGATTTGGGTTGTCACAATCTTCCCCCCTTAAG
AAACTTTCGTCCTCAAAAGTTTTACTGTTGGAACAACTATGGATGCTTTGACCTCATTTCATCCTCTCGTTCCCACGTTGCCTCCTCACGCTGGTGGTTCTGCCAGAGGA
CTTTCACCAGTGCTATCTCCCGGTTGCGCAGAACCTTTACTTCCCTTGCGAGAATTTGCACCGGCTTTTCCTCATAACTTAAATCTTTGCTCAACTGTAAAGGCTCAAAA
TCGATCACGTGGGATGGATCCGCCATATACTTTGGAAGCATGGAGACATGGAACACGTTGTGAACTGTGGAGAGCGACGGTGGCAAGGCTAAACGATAGGCCACGGGGCC
AACTTGCTCCAATATCTCAAACGGTCCAATGAATCGGGGGCTTAACTTCCCCTTTTTCCCGAACCTCAAGACACCCTTCATAGGTTCCACTTTCAAGAATACCTTATCAC
CTACCTCGAATTTCAGGTCTCGCCGTCTTACATCAGCGGAACTTTTCTGTCTACTCTAAGCTGTCGCATCCTTGCCCTAATCTTCTGTATAGCCTCGTTAGTGAGTTGTA
CTAACTCTGGTCCCAATAGTCTTCGCTCCCCAACTTCATCCCAGCAAACTGGAGATTTGCAACTCTTACCATATAAGGCTTTGAATGGCGCCGTGCCAATGGTAGCTTGA
TAACTGTTATTGTAGGCAAACTCCATCAAGTGCAAATGGGAGTCCCAACTTCCTGGAAGGTCTAGGGCGCAGGCACGCAACATGTCCTCTAGTACCTGATTCAAACGTTC
CATTTGTCCATCCGTTTGGGGGTGAAAAGCCGTACTAAAATCCAATCGAGTGCCCAATGCAGTTTAGAGGCTCTTACAAAAGTTAGATGTGAAACGAGGGTCTCTGTCAG
ACACTATAGAAACCGGTACTCCGTGTAGCCTTACCACTTCCTTTATATAAATTTGTGTCCACTCTGGTACCGACAGGGGCTGCAACAACCCTGCCTGCTTCTGCCTCGGA
GCCTTCACCTACTGGCATACAAGACACTAGCTAACAAATTCAGCCACTTCCCTCTTCATATTGGACCACCAGTAATGATGTTTCAGGTCCTGATACATCTTGGTACTACC
CAGATGCATCGTAAAAGGGGAGTTATGAGCTTCCGACAACAGTTCGTCCTTAACCACATTGCCTGCTGGAACACAAAAGCGTCCGTGGTATGAGAGGCCACTATCAAAGG
ATACCGAGAACTCACTGATCTGCCCCGACTCCATTTGGCGAAGTTTCTTCCTAAGGTATGGATCTCCCTGCTGGGCGTCTATGATTCTTTGCCTCAAAGTGGGTTGCACC
GATAGTTGAGCTAACTGGGAAGTGACTTCTCCTAGCGCCACTACAATGCCTGCTCGCTTAAGGTCCCTACACAGTGGGGTCTGGTCAGTAATCAGAGCTGCTGAATGAAC
TGCCTTCTGCTCAAGGCATCTGCTACTACATTTGCTTTTCCGGGATGGTACAGGATTTCTACATCATAATCCTTTACTAACTCCAACCACTTGCACTGTCTCATGTTCAA
CTCTTTCTGAGTGAAGAAGTACTTTAAGCTCTTGTGGTCTGTGAAGATTTGGATTCTTTCCTCGTACAAGTAGTGCCTTCATATCTTCAGAGCGAAAACAACTGCTGCCA
ACTCTAAATCGTGGGTCGGGTAATTCTGTTCATGGCTTTTTAACTGCCGAGATGCATAAGCAACCACCTTGTCGTGCTGCATCAACATGCAACCCAACCCCTTTTTGGAA
GCATCACTATAAACGAAAAAACTCCCTGACCCATCTGGAACGGTAAGTACCGGTGCAGACACTAGCCTCTGCTTAAGGTCTTGGAAGCTCACCTCACAGGCCTTACTCCA
AACAAAGGCAGTACCTTTCCTGGTCAACTAGGTCAGTGGCATGGCTACACGAGAGAAATCTTTCACAAACCGACGGTAGTAGCCAGCTAAACCTAGAAAACTGCGTACCT
CACTAACTGTCGTAGGGCGAGGCCAGCTCGTAACCGCTTCTATCTTGGCAGGATCTACCGAGACACCATCTTTTGACACCACATGCCCTAAAAAGGATACCTGCTTCAAC
CAGAATTCGCACTTAGAAAACTTAGCATACAACTTATGTGCCCTTAAGGTTTCGAGAACCTTTCACAGATGCTCCTCGTGCTCTACCTCCGTCTTGGAATATATCAAAAT
ATCATCGATGAATACGATGACAAAAGTGTCAAGAAAGTCTTTGAACACCCGGTTCATTAGATCCATGAATACCGCAGGGGCATTCGTCAGACCAAAGGACATAACGATGA
ATTCGTAGTGTCCATATCGGGAACGAAAAGCGGTTTTAGGGACGTCACTATCCTTTATCCTCAACTGATGGTATCCTGAACGAAGATCAATCTTGGAGAAAATTGTGGCT
TCCTGCAACTGATCAAATAGGTCGTCAATTATGGGGAGAGGGTATTTGTTTTTTATCGTCACCTTGTTTAGCTCTCTGTAGTCAATGCAAAGGCGTAGCGAACCATCCTT
CTTCTTCACGAACAACAATGGTGCACCCAAGGTGACACACTCGGTCGGATAAAGCCCTTGTCAAGCAACTCTTGTAGCTGCAACTTAAGCTCTCTTAACTCTGCTGGAGC
CATACGATAAGGAGCCTTGGATATGGGAACCGTACCCAGCTCCAACTCTATGGCGAAGTCAATCTCCCGAACTGGTGGTAAGCCTGGAAGGTCCTCTGGAAAAACGTCCA
GATAATCTCGTACCACAGGTTCTGAAGTCAGGGTGGCCTCATCCTTTCTAACGTCTACCACATTGACCAAAATACCCCAGGTACCCTGGTTAAGCAATCTGCGGACCTTT
AGAGCTGAAATAACCTTGGGCAGGACCACCGTCCCTGCTCCCTTAAATTTGAAGCTAGGCCCTGTAGGAGGAGTGAAAATAACCTCCTTTCTTGCACAATCTATACTAGC
ATGGTTAGCAGCCAACCAATCCATGCCTACTATGACGTCGAAGTCACGCCTGGCCAAGACGATTAGGGTAACCTTTAATTCTCGATTTGCTATTTCTACCCGACAGGCCT
TTATCATTTCTTTTGCTAACAAGATCTCCCCAGATGGGATAGAGACCGATAACGCAAAAGGTAAGGGTTCTAGCTCTAACACAGCATGTTTAACGAATACACCAGAAATA
AACGAATGCGACGAGCTTGAGTCAAACAACATTAGAGCACGATGCCCCAAAATAGGGAGCATACCTGTCACCACTGTGTCTGATTTATCAGCTTCCTGCCGAGTGGTAGC
ATAGACCCGACCATGCTGTTGCTACACCCTACCAGAGTCCTCGAGGGAGGCGCTTGCATGCTGCCCCTTTCATCTGCTGCTCCCTTCTTTGGACACTTGCTAGACATGTC
CCCTCTTGGCCACACCGAAAACAAACTCCTGAGCCTGCTAAACATTGCCCTCAGTGACTTTTTCCACAGGAACCACATATTGGCTTTTCTCTCTCTACTAAGCCCAAGGC
CGAAGGCTGTTGTCTGAAGTGCTGTATGAGCCTGTTTGCCCGGTTCTTTGGAGGAAACTCTTGGATTTTCTGCTCTAACTTCCTTTTCTGGCCTAAAGAGGCTTCTGATG
TCATAGCTCTGACATACTCTTCTCCGGCACGGGAGTCAATCCTCACTGCTGCTCGAAATGCTGCTGCGTAACTGCTGGGTTCCAAGGCTTGCACGAGACCCCTTAACCCA
TCCTTCAGACCTTGCACAAAGCACTCTGTCTTATCTTCCTCAATAGCTACCATCTTGAGGGAAAAACGGGATAACTTGGTGAACTCTCTCTCGTACTCCTTAACGGACAT
GATCCCTTGCTTCAGCTTCATGAACTCCACCTGCTTATTGTACCTCGTATTGGCGGAGAAATATTTCTCATTGAATCGCTCCTTGAACTGAGCCTAGGTAACCGATCCTC
CTATGGTAACCACTGACCTCTCTGCCGACTGCCACCAGATCTCGGCGTCATCTGTCAAAACAAATACTGCACACTAAAGCTTCTGGTCCTCTGGGCACTTCATATATCGA
AAGAAAGTCTCGATGGACGATAGCCACATCTCTGCCTTGGTAGGGTCCTTCAATGACCCGTCAAAAGCGCGCGGATTGTATTTCCTGAAGTCCCTCAGATGTTTTGCTTC
TGGGGATAAGTCCTGGTAATGCTGATGCTGGGCCGGCACCTGCGGGGGTGGAATCCGTTGGGCTAATTGTTCTGCCACAGAGGTGCGAGTTAGCTCCTGAATCTTGCCTA
ACAAAGTGGCAGTAACAGTCTCCCCTATCTGAGCTACCACTGCAGCGAAATCGGTTTGGGCCACTGGTGGGTCTTGTGTCTGGGACGGCTCAGGCACGTTACAAGGCCTT
CCTCTTGCCCTGGTTCCATGCTCCTCTTGCCTGACGGGATCTTGTGGCGGTGGAGGTCCTTGTGGTGCAGGGGTGTCTTGCTCCATAGGGTCCTGGACTTGAGGGTTTAG
TAATCCTTTTGCCCTACCCCTCCTGCCTCTACCTCTAACTCTACCTCCACGAGCCAATCCTAATTTACGGACCACAGGTACCATTAATTTCCTTGCTAATGGTGTTATGT
CAAAGTATCAATGCAAGGGTAAACTAGTACGTTGTAGGCAGAATATTCTAAAGACATGTAATTGGAAATGCGTACCTGGCGATGACGAAGAATCCTTTAATGGTTATGGG
AACCCCTCGACTTACTTAGTAAGCAAGTCTACATAACCCAAAACTCTGGGCTCTGATACCAACTGTAACGACCCGACTTATACAAACTAATATGGACCATTACTCAATGC
ATGCAAAACTTCAATGAACGACACTACGCTGAACAATGAAGAAGGATCAAAGCTTTTCATGAAAATTTCTACTTATATTAACTCAAGAAAATAACACATTAATAAATAGA
AAGGAAATTTTGGTTCGGGGTACCCCTAATAAAATAAATGAATAAAAAAAAATCCACATAATTAAAAAAATAGACAAGTCTCAAAACAAAATTAGACAGTTGGACAACAA
TTTTAGTAAAACATGCAAGACTACGTGGAAGCTACTATGGGTCCCGTGATACCATAAAGGATTCTTCTTGTCATTCGCCGACACATTCCTTCCTTTACCTGAAGAAAGGA
TGAGTAATTATATTACTCAGTAAGTGACCTCACTACCTAAGTCGGGCTAGGCACCTAAGTCCTTTAAGTGCGTCAAACATGAGACATGCATATCTATCATACTTCTATAG
GGAAATGGTTATACCCGCCCACTGTTGGTTATGGATGCCCATTAACCTCTAGAGCCTCGAAGGAAACTCTTACCTCTGGTGATCCCGAAGGAACACAAACCTCTAGTGCA
CTCGAAAGAGCACAACCTCTGGTGAACCTGAAGGAATACAACCTCTAGTGAACCCGAAGGAACACATCCTCTAGTGAACCCGAAGGAACACATCCTCTAGTGATCCCAAA
GGAACACAACCTTCTGTGGACATCCAACCGACGTAGGGGAGTTACCACTACCCATCATACATGATCAATCAAATTCACACTCCTTCCATTCCATGAATTCATATATTAAC
AGTACTGTTCATAAACGTCTCATAGTATAAGCTTATACACATCCAACCGACGTAGGGGAGTTACCACTACCCATCATACATGATCAATCAAATTCACACACCTTCCATTC
CATGAATTCATATATTAACAGTACTGTTCATAAACGTCTCATAGTATAAGCTTATACACTATGAGGCACATACAACTTTAAGACATGGAAATTTCATACAAGCATCTTAT
ATCATACGTGGCTTTGGAAATCTACTACATCATGGTAACTAGACATATGGACCGAAGGCGATTCAGGTAGTAAGATCACTTACCTGAGAGTTGATCCATAAAGCTAACCA
CACTGCAACTCTTGCAAAATTCCTGAACCAACTGGGAATAAAGTTCCCCTAATGCAAGTTATTCATCAAGCCCAACCTATCATAAGCAAATTCAAAATTTGTTCGAGAGA
GTCCTTACCCGAACGTGACCAAACTCGCAGTTTCCTCCGACCCTACGGACTAAAGCAATCCCCGACCACCTATTAAGATTTACCAATTAATTCGTAATCCCCAAATATGC
CACTAACCTACTCAGGTCTGAGATACTTACTTAGGGAGTACTGGTCTGACGATGGACGGTCAGACGAGCAGTTAACTACCGTCGAGAACACAGTTCTGAGCGAGAGTGAG
AGCCGAGCGAGAGTCCAGCCAGAGAGCATGAGAGTGTGACCCTGGTGTGGAGAGCGTGATTTCCGAGTGTGAGAGTGGGGGAGCGTGAGTCAGCGAAAGAGCGAGAGAGC
TGCAGAAGACAAGATACCCTGGCGTGGAGAGCGTGAATTTCGAGCGTGAAACCTCCGGGAACGAGCGGAAATAGAGAGAGAGAGGTGGCGAGAACAAGAGAGACCAGAAA
CGAAGGAAGGGCGGCGGCGGTGACGAAAACTGTGGAGGAAGGGAGGAGAGAGAGGCAGTGGAGAGGGAGAGAGATGGCAGGGCAGAGGGAGAGAGATGGGGGGAAAAGCT
TTTCTTTTCTCTTTTTTTTTTGACTACACGCAAATAGAGGAGGGTTTAAAATAAACCCTAATTCCTTTCCCTCCTTAACCCATTAATTCCACTTTTCCCTTTTTGTTTCC
CAACCAAATAAAATAAAATCTCCTTTATTCACACTCAAATCCAACCAAATCTATTTATTTTATTTTATTTGCCCCAAAGTAAACTATCCTTTGCCAATTCCCAACAACTT
CCTTAAATCCACCTAAGACTCACCAACCACTATAAATGCTAATTTTAAAACTTTCTCCCTTAAACTAAATAACTACATAAATACATATAAATAAATATAATTTTCTTTTG
ATGCATAATGACTTAAATTAACTTAATAAGAGGATTTACCTTAATGATTTGGGTTGTCACATGTAATCCTTCTCAACTTTAGTTTTAAAATTTTATCAATTTTGAAAATA
TTTTAAAGGTAATTTATCCTACGCAAATATAATTTTTAAAAATAGAAACCAAAAAGGAAAATAGTTATCAAATGATTTATTTTCCGTGGAATCTAATGAGTTTCTATTGT
AATATAATGGTTATAATACTATTTTTTGTCCCTACACTTTGAAGTTGGTTCAATTTCGAGTCTTTATACTTTCAATTGTTCAATCGTAGTCCATCTATTTTCATTAAATC
TTAAATTTAGTTTCCAATGCTAGTTTATTATTGACTTTTTCAAACCTTTTTATTATCTATTAGCATTTTTACTATAAATTTTAAAAACTTATTCACATATTCTATCTATT
TGTATGAAAATTATTATTATTATTTAACCAATTCGATAAAAATTAATTTTGAGGGACTCAATTCAAGATTTATGAAATTAGGACTAAAAGTTGACAAACTTCAAAGTATA
TGACCAAAATAATTGTTTAACCTAATATAATTGAATAAGAAGTTGTTGGAAATTAAAATGTCCCTTGTTGTTTCTTTTTTTTTAATTTAAATCAATTTATATCCATAAAC
TTTAGAATTGTATTAATTTAAATTCTAAACTTTTGTAAGTGTATCAATTTACACCCCCTATTATGTTATGTTTGAAGAAACTTCGTACG
Protein sequence
MSSIGQIILMALAVTLNKFASSNVQSVQRNQANKPRTATATATAGSANGRRGLLLSAIAAVAAAPEEAMDSRTELLKSGYLKKSEENKEKNEKERLESYYKRNYKDYFEF
VEGSVKNKNELSEAEKGIIEWLKRNK
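
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch of that check, not something provided by the database: the `translate` helper and the `CDS_START` / `CDS_END` constants are hypothetical names introduced here, and only the first 60 and last 21 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical constants: first 60 and last 21 bases of the CDS listed above.
CDS_START = "ATGAGTTCCATCGGCCAAATCATTCTGATGGCTCTCGCCGTCACTCTCAACAAATTCGCT"
CDS_END = "TGGCTTAAACGAAATAAATGA"

print(translate(CDS_START))  # compare with the start of the protein sequence
print(translate(CDS_END))    # compare with the end of the protein sequence
```

Passing the full 411-base CDS to `translate` should, under the same assumption of the standard genetic code, reproduce the whole protein sequence.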