CuGenDBv2

MC00g0509 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC00g0509
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: kinetochore protein SPC24 homolog
Genome location: scaffold196:527618..531968
RNA-Seq Expression: MC00g0509
Synteny: MC00g0509
Gene Ontology terms: GO:0051983 - regulation of chromosome segregation (biological process)
InterPro domains: IPR044951 - Kinetochore protein SPC24-like


Homology
GenBank top hits    e value    % identity    Alignment
XP_022144074.1 uncharacterized protein LOC111013852 isoform X1 [Momordica charantia]    9.41e-131    97.56
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

XP_022144075.1 uncharacterized protein LOC111013852 isoform X2 [Momordica charantia]    3.56e-125    95.12
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVH+    YEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

XP_022931907.1 uncharacterized protein LOC111438184 [Cucurbita moschata]    1.94e-107    82.44
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGK+NIE+LLSYG+DLVALLKDQ DVQTLNQCL+H  ALQS   DD  NVHSSVQDYEKKIE CR KTEEAKARTVAD EMD+LE+E+EEE+R+EH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LV+++INELD QRISVQE+KQA KKLEQQELRAQRKLSMYASVTDIIPNMDD SKISGHIVDRNKRVVQKFELDPTK S+FDICN IW+
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

XP_022966574.1 uncharacterized protein LOC111466216 [Cucurbita maxima]    1.69e-110    85.37
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGK+NIEELLSYGNDLV LLKDQ DVQTLNQCL+H  ALQS C DDFSNVH  VQDYEKKIE CR KTEEAKARTVAD EMD+LE+E+EEELR+EH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LVT++INELD QRISVQERKQA KKLEQQELRAQRKLSMYASVTDIIPNMDD SKISGHIVDRNKRVVQKFELDPTKTS+FDICN IW+
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

XP_023518318.1 uncharacterized protein LOC111781837 [Cucurbita pepo subsp. pepo]    8.27e-109    83.41
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGK+NIE+LLSYG+DLVALLKDQ DVQTLNQCL+   ALQS C DD  NVHSSVQDYEKKIE CR KTEEAKARTVAD EMD+LE+E+EEE+R+EH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LVT++INELD QRISVQERKQA KKLEQQELRAQRKLSMYASVTDIIPNMDD SKISGHIVDRNKRVVQKFELDPTK S+FDICN IW+
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

TrEMBL top hits    e value    % identity    Alignment
A0A6J1CQL9 uncharacterized protein LOC111013852 isoform X2    1.73e-125    95.12
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVH+    YEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

A0A6J1CR83 uncharacterized protein LOC111013852 isoform X1    4.55e-131    97.56
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

A0A6J1F034 uncharacterized protein LOC111438184    9.41e-108    82.44
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGK+NIE+LLSYG+DLVALLKDQ DVQTLNQCL+H  ALQS   DD  NVHSSVQDYEKKIE CR KTEEAKARTVAD EMD+LE+E+EEE+R+EH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LV+++INELD QRISVQE+KQA KKLEQQELRAQRKLSMYASVTDIIPNMDD SKISGHIVDRNKRVVQKFELDPTK S+FDICN IW+
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

A0A6J1HU69 uncharacterized protein LOC111466216    8.16e-111    85.37
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MGDFSGK+NIEELLSYGNDLV LLKDQ DVQTLNQCL+H  ALQS C DDFSNVH  VQDYEKKIE CR KTEEAKARTVAD EMD+LE+E+EEELR+EH
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     LVT++INELD QRISVQERKQA KKLEQQELRAQRKLSMYASVTDIIPNMDD SKISGHIVDRNKRVVQKFELDPTKTS+FDICN IW+
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

A0A6J1IPN8 uncharacterized protein LOC111478320    8.99e-106    81.46
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        M DF+GK NIEE+LSYG+DLVALL DQ DVQTLNQCLEH K LQS C DDFSNV SSVQDYEKKIEACRQKTEEAKA TVAD E+D+LE+EL EEL K +
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
        LLMEEI     ++TSEIN+LD QRISVQERKQAMKKLEQQELR QRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQ+FE DPT  S+FDICN IW+
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MINSP
        MINSP
Subjt:  MINSP

SwissProt top hits    e value    % identity    Alignment
Q67XT3 Kinetochore protein SPC24 homolog    3.6e-42    43.35
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MG+ S   +IE+L+SYG+DL+ LL  +     ++Q  E  KAL   C +DF+ +  S++D + K+ AC++KTEEA +   A+ E++ L++EL+EE+ +E 
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
         L +E+     LV  E+ +L+ Q  S+ E KQ+ K+  + +LRA++KLSMYASVT++IP++DD SKISG++VDR KR+++KF+ +  K +A++ CNSIW 
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MIN
        +IN
Subjt:  MIN

Arabidopsis top hits    e value    % identity    Alignment
AT3G08880.1 unknown protein    2.6e-43    43.35
Query:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH
        MG+ S   +IE+L+SYG+DL+ LL  +     ++Q  E  KAL   C +DF+ +  S++D + K+ AC++KTEEA +   A+ E++ L++EL+EE+ +E 
Subjt:  MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEH

Query:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD
         L +E+     LV  E+ +L+ Q  S+ E KQ+ K+  + +LRA++KLSMYASVT++IP++DD SKISG++VDR KR+++KF+ +  K +A++ CNSIW 
Subjt:  LLMEEIFFINTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWD

Query:  MIN
        +IN
Subjt:  MIN

AT5G01570.1 unknown protein    1.7e-31    38.62
Query:  NIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEHLLMEEIFF
        N E ++S+G++L+ +L D+K    L Q LE ++A+   CD+DF  +H S+QD +KK++ C++KT+EA +    + E++ L++EL+EEL  E  L EE+ +
Subjt:  NIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEHLLMEEIFF

Query:  INTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTK-TSAFDICN
                           +E + A+K+ ++ +LR + KL MYASVT +IPN+DD  K SG++V R+KR++ KFE D  K TS ++ CN
Subjt:  INTLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTK-TSAFDICN
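
The % identity values listed with each hit can be related to the alignment midlines. The sketch below is only an illustration, not the database's own method: it assumes the usual BLAST-style convention in which an identical column repeats the residue letter in the midline, while "+" (similar residue) and a blank (mismatch or gap) do not count as identities. The function name `percent_identity` is hypothetical. The full-length figure assumes the isoform X1 alignment spans all 205 query residues with five unmatched columns, as its midline suggests.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Assumption: identical columns carry the residue letter, while '+'
    (similar residue) and ' ' (mismatch or gap) are not identities.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Short excerpt of the isoform X1 alignment: query residues and their midline.
query_excerpt = "LLMEEIFFINTLVTSE"
midline_excerpt = "LLMEEI     LVTSE"
excerpt_pct = percent_identity(midline_excerpt, len(query_excerpt))

# Stand-in midline for the whole hit: 200 identical columns and 5 blanks
# over 205 aligned columns (an assumption read off the listing above).
full_pct = percent_identity("A" * 200 + " " * 5, 205)

print(round(excerpt_pct, 2), round(full_pct, 2))
```

Under these assumptions the full-length value rounds to the 97.56 reported for isoform X1; the excerpt scores lower only because it is centred on the unmatched stretch.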


Sequences
CDS sequence
ATGGGAGATTTCTCAGGAAAGATCAACATTGAGGAGTTGTTATCCTATGGCAATGATCTGGTTGCACTCTTGAAGGACCAAAAGGACGTTCAAACCTTGAATCAATGTCT
CGAACATGTCAAGGCTCTTCAATCTTTTTGCGATGATGATTTCAGTAATGTTCACAGTTCAGTTCAAGATTATGAGAAGAAAATCGAGGCATGCCGGCAGAAAACGGAGG
AGGCGAAAGCGAGAACTGTTGCAGATGCTGAAATGGATGTTCTCGAAGAGGAGCTCGAAGAGGAACTTAGGAAAGAACATTTGCTCATGGAGGAGATTTTTTTCATCAAC
ACACTTGTCACCAGTGAGATTAATGAACTAGACTGCCAAAGGATTTCTGTTCAAGAAAGGAAGCAGGCGATGAAGAAACTTGAGCAACAAGAACTTAGAGCTCAGAGGAA
GCTTTCAATGTATGCTTCGGTCACTGATATAATTCCAAACATGGATGATCAGTCTAAAATTTCTGGCCATATTGTGGATAGAAACAAAAGGGTGGTGCAGAAGTTCGAAT
TGGACCCGACTAAGACATCTGCTTTTGATATATGCAATAGCATTTGGGATATGATTAATTCCCCTTAG
mRNA sequence
GCCCTAACTAAATAATTGAGATTTCCCCCCAAAAAGGGCTCTCCACTCTCACGCCGGCAGCAATACCGACGCCCCCCACCCCGGCGGCGTGCAGATCCGGTGCAGCACCT
GCGACTCGAAGCCACGAACGACCCACGACGATCGGAGGTGAGATCTGTAGTTGATTCGTTGTTCGATTAGAGGAATGGGAGATTTCTCAGGAAAGATCAACATTGAGGAG
TTGTTATCCTATGGCAATGATCTGGTTGCACTCTTGAAGGACCAAAAGGACGTTCAAACCTTGAATCAATGTCTCGAACATGTCAAGGCTCTTCAATCTTTTTGCGATGA
TGATTTCAGTAATGTTCACAGTTCAGTTCAAGATTATGAGAAGAAAATCGAGGCATGCCGGCAGAAAACGGAGGAGGCGAAAGCGAGAACTGTTGCAGATGCTGAAATGG
ATGTTCTCGAAGAGGAGCTCGAAGAGGAACTTAGGAAAGAACATTTGCTCATGGAGGAGATTTTTTTCATCAACACACTTGTCACCAGTGAGATTAATGAACTAGACTGC
CAAAGGATTTCTGTTCAAGAAAGGAAGCAGGCGATGAAGAAACTTGAGCAACAAGAACTTAGAGCTCAGAGGAAGCTTTCAATGTATGCTTCGGTCACTGATATAATTCC
AAACATGGATGATCAGTCTAAAATTTCTGGCCATATTGTGGATAGAAACAAAAGGGTGGTGCAGAAGTTCGAATTGGACCCGACTAAGACATCTGCTTTTGATATATGCA
ATAGCATTTGGGATATGATTAATTCCCCTTAGTACCAGACATCAATTACATTGTGAATTAGCTTGTCATAGCCTTCTTCTTCTTTCTTCATTTCATGCTATTAAAATAGG
ATGTTTGTTTTGTTGTATATGTATGTTTAAAGAACTAATCTCTGAAGTACTGTGTCGTCTATGATATCTTATTGTAAATCACATCTCCTTGCTAATTTTTGGATCAAAGA
ACCCACCTTCTAGCAGTTGTTTTGTTTGTATGCATCTTCTGTACGAAACAAGACACCATGACTTACTGAGTGAGTTAACTTTTACAAACTTTGCCCATTAGTTTGGCTTT
TATTCTATTTCTATATTGGAAGTAAGCAATTCACTCTAACCCATAAGTTCTTTATGTTACTTGATTAACATAAGTTGCTGGATGAGTTACTCTCTAGTCTGTATTGTTGG
CTTGACCTGCAGGCCAAGGGTACAGCATTCCCATGGTAAGGCTTGCAAATATAGGTTGCAAACATTTTATAAGGCTGCTTTCTTCACTCCTCACATCCATTAATGGATAT
GTATGGGATAGACCAAAACAACCGGAAATGTTTCAAAAAAAGTCAGGCATACGGTGTACAAAAAGTTCACGCCCTTCTTTATCACAAAACCCTTATCACTTAGTTTTCGA
CTTTTCAGAAATATATTAGCCAATAAATTAGTAGTAGATAAGCAATTTCGCCGGCATACCGTTGTGTAAGCTGCAGTTTATGTTGAAGTTTATGATTTGTGAGCCATAAA
CGAAATATCTAGTTGCATTGTCTTTCTAGTTGAATTGTCTTTATGTGATGCATATATTTACTTCAGTTGTTTGCATCTCTTTACATTATATTGCTTCTAACAAGGAGGCA
CATACGGTTGCTATTGGAGATAAGGTGTCACATGCTTTCTTCAACCTTTTAAATTGAAGTGTTAATAGTTCCTCTTAAGCTTTCTTGCAAAATTCTATGTTCCATGAATT
TCAGCCCTTTGTAGTTTTAAATCTTTTGTATGACTGATATGAAGAGTATGAGGCATGTTTTGATTAAGTCAAGAGTGCCCATTATGACAGCTTTAGTTTTTTGATACAAC
AATTTATATCTAATCCTAATGGTTGTTAAAAATGATTAGATCATATAATCATGGGTTGTAAAAATACATTACTATGATGGGATGTCGTTGTATCGAACTCTTGTTCTAAT
TGATGATATTGTATGTTATTGATAGTAGTGATTGGTGCCATAAATGATACTCATATGTAGAACTAGTAGCGAAGGAACAAGTATTGTTCAGAATATAATTTGGACAGAGA
AAATCAAAAAAATTCACCACTGAGCAGGGCAGACAAGATCCAGCCATCTATCAGATTCGGTTTCCACAAGTTTCATAAACCCTTCGGAATATCAGGTTCTATTCTTTCCC
TTGTGCTATTATAGATCTTTATAAGTTGAATTTGCCAATGAAACTTATAACTCAAGGACACAATTATACAACCTATAGAACAGATCCTGTTCAAGTAAGGTCGTGAATTT
GCCTCTGAAACTTAAACCCAAATTTTATTCCAATCTCTGGTATGTAAAATAGTGTAGATTTGTTTTGTAGATAGAGTTATACTTGTGTTACTTTTCGTGTGAAAGGACAA
GATGAGACAGTGAGAGTTTTAACTAAGTTCCAAACTTTATGCTGTCTGAGCAAAGCCGATATATCATCAACACAAAAACATTAGCACTACCTCATTGTAACTCTGCAACT
GAAATTGATACGTTCTGAAATTATCATCTACTACTGTCATTTTGTAAATGGAACTTCTCCATTGACTGTAGCATCTAATTTTCTGGCTTGTCTGATCTTGAGATCTCTCC
TAGCCAGTATTTAAGATGTCGAGTTTCGCGTTGCAGTTGCATAACAGACTTGACTTCCACCATCGACATTCTCCAGGTACTGGCGGTCGTCGAGCAATATAGCCCTGCAA
GCATCTGGTGCAGGACTGCCTTTGTAAGCCTTGCAGATGTAGGCCATGAAGTTTAGGTAGTCCTGGTTCAACAAAAAAAAAAAAAACAGCTTCATTTTAGAGAATCACAG
GCAAAGAACAAGGAAGCCAAACTGGCTCGAATTGAAGCAGACCGACCTCTTGGAGAGGGTGATCATTGACAATGACCCATGGAACAAATCGGTGAGGTGGGTTGAGGCGA
GTGGTTTCAGAAGCATAGTTCTGTTCAAGCTGCGTTTCAATGGCTTTCCATATCAGACTCAGTGGCTCAGCAAAGGAAGAAAGAAACAGAGAGAAGATGGTGATGAAAAA
GAAAGAAATATTACCACTTTTCCATGGCCATTTCTGTAGCAATCTATTGGCACAGTACTCAACTTGGCAATGTCAAAACACTTGGTCCATTCATTGTGTCTATTTTGCAC
TGTAAATCCTTCAACACAATGGATGAATCTGAAATGCTTTTCCTTCGCAGGCCAAAAAACACATATTGGCAGACTTTTCAATTCACATAAATGTGAGTCTGAACATATGA
ATCAATATATTGATGGAAGCCACAATTATTCATAGTAATGAAAATGTCAATTGAAATGATAACAAGTTGGACTAAAACTTTACTCCATGAGAATTAACATTATCAATAAG
CATTGTTCTAATTTATTTTACAAGAGTACAAATGGAGGTGAGACTAGGGTAGGTGTGACAAATCAGAGACTTAGATCAAATCTTAAATTAATTAGACTTGTGAGTT
Protein sequence
MGDFSGKINIEELLSYGNDLVALLKDQKDVQTLNQCLEHVKALQSFCDDDFSNVHSSVQDYEKKIEACRQKTEEAKARTVADAEMDVLEEELEEELRKEHLLMEEIFFIN
TLVTSEINELDCQRISVQERKQAMKKLEQQELRAQRKLSMYASVTDIIPNMDDQSKISGHIVDRNKRVVQKFELDPTKTSAFDICNSIWDMINSP
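
As a consistency check, the CDS sequence can be translated and compared with the protein sequence. The following is a minimal sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 12 codons and last 7 codons of the CDS listed above.
cds_start = "ATGGGAGATTTCTCAGGAAAGATCAACATTGAGGAG"
cds_end = "GATATGATTAATTCCCCTTAG"

print(translate(cds_start))  # expected to match the protein's first residues
print(translate(cds_end))    # expected to match the protein's last residues
```

Passing the full CDS, with its line breaks left in, to `translate` should give a string that can be compared directly against the protein sequence.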