; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G006060 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G006060
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAspartate racemase
Genome locationCmo_Chr10:2740672..2746475
RNA-Seq ExpressionCmoCh10G006060
SyntenyCmoCh10G006060
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0036361 - racemase activity, acting on amino acids and derivatives (molecular function)
InterPro domainsIPR001920 - Asp/Glu racemase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022961059.1 uncharacterized protein LOC111461681 [Cucurbita moschata]5.8e-13390.51Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
                               GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM++
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM

XP_022987789.1 uncharacterized protein LOC111485232 isoform X1 [Cucurbita maxima]2.8e-12787.59Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSF ALNCPAGVRRNANES+FRFRRRLN YLSVQIFSVIQTDENDNLAASKKMSSSGKSL KTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSES KLPFLHVGDCVA ELNEAKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
                               GFDVVVPD+ATMEHIVI AVEAFHKRDHEGARNLLRIAVHVLLTRAVNM++
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM

XP_023516195.1 uncharacterized protein LOC111780131 isoform X1 [Cucurbita pepo subsp. pepo]3.9e-13773.53Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSF ALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELN+AKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM----------------------------
                               GFDVVVPDEATMEHIVI AVEAFHKRDHEGARNLLRIAVHVLLTRAVNM                            
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM----------------------------

Query:  --------------------------------------------MMKFVYEAMEATLAFPLRLKVLRRKSKAGK
                                                    MMKFVYEAMEATLAFPLRLKVLRRKSKAGK
Subjt:  --------------------------------------------MMKFVYEAMEATLAFPLRLKVLRRKSKAGK

XP_023516200.1 uncharacterized protein LOC111780131 isoform X2 [Cucurbita pepo subsp. pepo]1.2e-13090.07Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSF ALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELN+AKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM
                               GFDVVVPDEATMEHIVI AVEAFHKRDHEGARNLLRIAVHVLLTRAVNM
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM

XP_023516201.1 uncharacterized protein LOC111780131 isoform X3 [Cucurbita pepo subsp. pepo]1.2e-13089.42Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSF ALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELN+AKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
                               GFDVVVPDEATMEHIVI AVEAFHKRDHEGARNLLRIAVHVLLTRAVNM++
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM

TrEMBL top hitse value%identityAlignment
A0A1S3CMU2 uncharacterized protein LOC103502741 isoform X15.8e-10270.59Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        ML GGMAM F A  CPA VRRNANE I RFRRR+NLY SVQI SV+QTD NDNL  SKK+SS GKSLSKTRT KPLLVQPNTVGVIGGVSVFSTLLF+EK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIP----------------HSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVAR
        LVWWS KDG+ESIPFVVCS+P L KGIP                HSDA IIENL +K AFLE SGARCLITPCHL+HRWL DTSESCKLPFLHVGDCVAR
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIP----------------HSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVAR

Query:  ELNEAKLKQLEAGSN-----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
        EL EA LK LE G N                       GFDV++PDEATM+HIVI AVEA +KRD EGARNLLRIAVHVLL RAVNM++
Subjt:  ELNEAKLKQLEAGSN-----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM

A0A5A7TXR8 Amino-acid racemase isoform 25.8e-10270.59Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        ML GGMAM F A  CPA VRRNANE I RFRRR+NLY SVQI SV+QTD NDNL  SKK+SS GKSLSKTRT KPLLVQPNTVGVIGGVSVFSTLLF+EK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIP----------------HSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVAR
        LVWWS KDG+ESIPFVVCS+P L KGIP                HSDA IIENL +K AFLE SGARCLITPCHL+HRWL DTSESCKLPFLHVGDCVAR
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIP----------------HSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVAR

Query:  ELNEAKLKQLEAGSN-----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
        EL EA LK LE G N                       GFDV++PDEATM+HIVI AVEA +KRD EGARNLLRIAVHVLL RAVNM++
Subjt:  ELNEAKLKQLEAGSN-----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM

A0A6J1HCW6 uncharacterized protein LOC1114616812.8e-13390.51Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
                               GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM++
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM

A0A6J1JHU3 uncharacterized protein LOC111485232 isoform X21.4e-12788.24Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSF ALNCPAGVRRNANES+FRFRRRLN YLSVQIFSVIQTDENDNLAASKKMSSSGKSL KTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSES KLPFLHVGDCVA ELNEAKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM
                               GFDVVVPD+ATMEHIVI AVEAFHKRDHEGARNLLRIAVHVLLTRAVNM
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNM

A0A6J1JJU3 uncharacterized protein LOC111485232 isoform X11.4e-12787.59Show/hide
Query:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
        MLDGGMAMSF ALNCPAGVRRNANES+FRFRRRLN YLSVQIFSVIQTDENDNLAASKKMSSSGKSL KTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK
Subjt:  MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEK

Query:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-
        LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSES KLPFLHVGDCVA ELNEAKLKQLEAGSN 
Subjt:  LVWWSRKDGEESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSN-

Query:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
                               GFDVVVPD+ATMEHIVI AVEAFHKRDHEGARNLLRIAVHVLLTRAVNM++
Subjt:  -----------------------GFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15410.1 aspartate-glutamate racemase family5.0e-5041.73Show/hide
Query:  RRRLNLYLSVQIFSV-IQTDENDNLAASKK---MSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEKLVWWSRKDGEESIPFVVCSDPVLDKG
        R RL+  L++   SV +  DE+++L   KK   +S   ++   +     LL Q NTVG+IGGVS  STL F++KLV WS  DG+ S+PFV+CSDP L+K 
Subjt:  RRRLNLYLSVQIFSV-IQTDENDNLAASKK---MSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEKLVWWSRKDGEESIPFVVCSDPVLDKG

Query:  I------------------PHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAG-----------
        +                  P    +I+ENL+ K  +LE+ GA+ ++ PCH++H W  +  E   +P LH+G+C+A+EL EAK+K LEAG           
Subjt:  I------------------PHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAG-----------

Query:  -------------SNGFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM
                     SNGF+ V+PD+ATMEH VI ++EA  ++D EGARNLLRIA+ VLL +AVN++M
Subjt:  -------------SNGFDVVVPDEATMEHIVISAVEAFHKRDHEGARNLLRIAVHVLLTRAVNMMM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGATGGTGGTATGGCTATGTCATTTTGTGCATTGAATTGCCCAGCAGGTGTTCGACGAAATGCCAATGAGAGCATTTTTCGGTTTAGAAGGAGATTAAACCTGTA
TTTATCTGTACAGATCTTCTCTGTTATTCAAACTGATGAGAATGACAACTTAGCAGCATCCAAGAAGATGTCGAGTTCAGGAAAATCTCTGTCGAAAACTCGAACTCTTA
AGCCCCTTCTTGTCCAGCCAAATACTGTGGGTGTCATAGGGGGAGTGTCGGTTTTTTCCACTCTACTTTTCATGGAAAAGCTCGTTTGGTGGAGTAGGAAGGATGGAGAA
GAGAGCATACCTTTTGTTGTCTGCAGCGATCCAGTATTAGACAAGGGAATTCCTCATAGTGATGCCATGATCATCGAGAATTTGAAGCAGAAAACGGCGTTTCTCGAGCA
GTCGGGAGCTCGATGCCTAATTACACCTTGCCATCTTTCACATAGGTGGCTTGGTGACACATCTGAGAGCTGCAAATTACCTTTCCTTCACGTCGGAGATTGTGTAGCTA
GGGAGCTTAATGAGGCTAAGCTTAAGCAACTTGAAGCAGGTAGCAATGGCTTCGATGTCGTCGTGCCGGACGAGGCAACCATGGAGCATATAGTAATTTCTGCAGTGGAA
GCTTTCCATAAAAGGGATCACGAAGGAGCAAGAAATCTGTTGAGAATTGCTGTTCATGTTCTATTGACAAGGGCTGTGAATATGATGATGAAGTTTGTTTATGAGGCGAT
GGAGGCGACTCTCGCTTTTCCACTGAGATTGAAGGTGTTGCGACGAAAATCGAAAGCGGGAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGATGGTGGTATGGCTATGTCATTTTGTGCATTGAATTGCCCAGCAGGTGTTCGACGAAATGCCAATGAGAGCATTTTTCGGTTTAGAAGGAGATTAAACCTGTA
TTTATCTGTACAGATCTTCTCTGTTATTCAAACTGATGAGAATGACAACTTAGCAGCATCCAAGAAGATGTCGAGTTCAGGAAAATCTCTGTCGAAAACTCGAACTCTTA
AGCCCCTTCTTGTCCAGCCAAATACTGTGGGTGTCATAGGGGGAGTGTCGGTTTTTTCCACTCTACTTTTCATGGAAAAGCTCGTTTGGTGGAGTAGGAAGGATGGAGAA
GAGAGCATACCTTTTGTTGTCTGCAGCGATCCAGTATTAGACAAGGGAATTCCTCATAGTGATGCCATGATCATCGAGAATTTGAAGCAGAAAACGGCGTTTCTCGAGCA
GTCGGGAGCTCGATGCCTAATTACACCTTGCCATCTTTCACATAGGTGGCTTGGTGACACATCTGAGAGCTGCAAATTACCTTTCCTTCACGTCGGAGATTGTGTAGCTA
GGGAGCTTAATGAGGCTAAGCTTAAGCAACTTGAAGCAGGTAGCAATGGCTTCGATGTCGTCGTGCCGGACGAGGCAACCATGGAGCATATAGTAATTTCTGCAGTGGAA
GCTTTCCATAAAAGGGATCACGAAGGAGCAAGAAATCTGTTGAGAATTGCTGTTCATGTTCTATTGACAAGGGCTGTGAATATGATGATGAAGTTTGTTTATGAGGCGAT
GGAGGCGACTCTCGCTTTTCCACTGAGATTGAAGGTGTTGCGACGAAAATCGAAAGCGGGAAAGTAATCGTGATCTTTTGAGTAAGGAATTTGAAATTCTGTCGCTGAAA
GTAATGCAGGGCAAACCCAGTCTCCAGATTTCCATCCAGGTGGGATGCTTGCATCAGATCCATACTGATCGGAACCCATCATGTAGGCGCCATAATCACCAGGATAGACG
GACTTAAATGCACCACATCTAAAGCAGTTCGGGCGGCTGGCGTAATTGTGAGCTCCGCAGCTGACGGCAGTGCAGTACCAGTCTCCGGCCAACACCTCCGTCTTGTTGTA
ACTAAAACGTTTGGCCCGACGGGCAACCCGGTTGGTAAGAGCTTGGAGTCTCTTAGGCATACCGGCTTAGATATCCCAAATTTGAATCTTCAGGTGAGCTTAATACTAAA
TTTTTTTTTATGTTTCTTTTGTTCGGGGCTTGCGTTGGGCATAGGTGCCCCCTATTATCTATAATATACATATATATCTTATTGAAAACTTCACATGGTAGATTAATTTC
TTTAAGCAATACAAATCTATTCAGAAAGGTACATTTCATTCCTAGAAAAATAGACATCTCGTGCAATCATATGAGTATTTTAAATCATTATTCCAGTTCTTGTGCTTTTG
TTTGTAGGGTAATAACTTTTCTCAAAAGTATTCCGCGTTTACTCTAG
Protein sequenceShow/hide protein sequence
MLDGGMAMSFCALNCPAGVRRNANESIFRFRRRLNLYLSVQIFSVIQTDENDNLAASKKMSSSGKSLSKTRTLKPLLVQPNTVGVIGGVSVFSTLLFMEKLVWWSRKDGE
ESIPFVVCSDPVLDKGIPHSDAMIIENLKQKTAFLEQSGARCLITPCHLSHRWLGDTSESCKLPFLHVGDCVARELNEAKLKQLEAGSNGFDVVVPDEATMEHIVISAVE
AFHKRDHEGARNLLRIAVHVLLTRAVNMMMKFVYEAMEATLAFPLRLKVLRRKSKAGK