CuGenDBv2

MC01g0377 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0377
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: 8-amino-7-oxononanoate synthase
Genome location: MC01:10487526..10492165
RNA-Seq Expression: MC01g0377
Synteny: MC01g0377
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
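
The genome location field follows a `SEQID:START..END` pattern. As a minimal sketch, the Python below splits such a string and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive (not stated on the page).
    return end - start + 1


seqid, start, end = parse_location("MC01:10487526..10492165")
print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.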


Homology
GenBank top hits (e value, %identity, alignment)
XP_004147924.1 uncharacterized protein LOC101218084 [Cucumis sativus] | e value: 1.16e-132 | %identity: 83.66
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSP--EGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGE
        MI  KPIQTSFTV+ +TFLYT KLP SK+   CLC SNTSDS APS+P  EGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEADAEFEKIGE
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSP--EGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGE

Query:  DALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAF
        DAL+GLDEASARIM NIESQMQ FEESV+LNRQEIEKNDDMLA+FEG+IE +RNEGL F+NLR  KP DK KAKVEMEKI +LTKENAGSKTRRYIYLAF
Subjt:  DALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAF

Query:  IGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK
        IG+LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ+++SEIEKTEIKEQ+EEK
Subjt:  IGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK

XP_022151554.1 uncharacterized protein LOC111019467 [Momordica charantia] | e value: 3.55e-166 | %identity: 100
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA
        MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA

Query:  LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG
        LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG
Subjt:  LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG

Query:  VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE
        VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE
Subjt:  VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE

XP_022931938.1 uncharacterized protein LOC111438212 [Cucurbita moschata] | e value: 2.47e-126 | %identity: 80.47
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED
        +I  KPIQTSFTVH HTFLYT KL NSK+ S L  C SNTSDS+     EGDPQKQEILARIAQLQTQKLRLT+FLDEKSA LTQFAEEADAEFEKIGED
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED

Query:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI
        A K L++ASARIMENIESQMQ FEESV+LNRQEIEKNDDMLA+FEGRIE +RNEGL FKNLRQ KPVDK  AK+EMEKI+ELT E AGSKTRRYIYLAFI
Subjt:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI

Query:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK
        G+LVIAIAESFLSSPDW+KVAVLG +L+A++ QFSYEQ++SSE+EKT+IKEQ EEK
Subjt:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK

XP_023518588.1 uncharacterized protein LOC111782049 [Cucurbita pepo subsp. pepo] | e value: 7.44e-128 | %identity: 80.86
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED
        +I  KPIQTSFTVH HTFLYT KLPNSK+ S L  C SNTSDS+     EGDPQKQEILARIAQLQTQKLRLT+FLDEKSA LTQFAEEADAEFEKIGED
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED

Query:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI
        A K +++ASARIMENIESQMQ FEESV+LNRQEIEKNDDMLA+FEGRIE +RNEGL FKNLRQ KPVDK  AKVEMEKI+ELT E AGSKTRRYIYLAFI
Subjt:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI

Query:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK
        G+LVIAIAESFLSSPDW+KVAVLG +L+A++ QFSYEQ++SSE+EKT+IKEQ EEK
Subjt:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK

XP_038882548.1 uncharacterized protein LOC120073781 [Benincasa hispida] | e value: 2.41e-136 | %identity: 84.38
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA
        MI  KPIQTSF VH +TFLYT +LP SK+ SLC C SNTSDS+A + PEGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEA+AEFEKIGEDA
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA

Query:  LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG
        L+GLDEASARIMENIESQMQ FEES +LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNLRQ KP DK KAKVEMEKI+ELTKENAGSKTRRYIYLAFIG
Subjt:  LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG

Query:  VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE
        +LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIK+Q EEKE
Subjt:  VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE

TrEMBL top hits (e value, %identity, alignment)
A0A0A0L2K0 Uncharacterized protein | e value: 5.61e-133 | %identity: 83.66
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSP--EGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGE
        MI  KPIQTSFTV+ +TFLYT KLP SK+   CLC SNTSDS APS+P  EGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEADAEFEKIGE
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSP--EGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGE

Query:  DALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAF
        DAL+GLDEASARIM NIESQMQ FEESV+LNRQEIEKNDDMLA+FEG+IE +RNEGL F+NLR  KP DK KAKVEMEKI +LTKENAGSKTRRYIYLAF
Subjt:  DALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAF

Query:  IGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK
        IG+LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ+++SEIEKTEIKEQ+EEK
Subjt:  IGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK

A0A5D3BB95 8-amino-7-oxononanoate synthase | e value: 1.72e-124 | %identity: 72.97
Query:  MIALKPIQTSFTVHSHTFL-YTTKLPNSKSFSLCLCHSNTSDSTAPSSP--EGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIG
        MI   PIQTSFTVH +TFL +T KLP S++   CLC SNTSDST PS+P  EGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEADAEFEKIG
Subjt:  MIALKPIQTSFTVHSHTFL-YTTKLPNSKSFSLCLCHSNTSDSTAPSSP--EGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIG

Query:  EDALKGLDEASAR-------------------------------------IMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNL
        EDAL+GLDEASAR                                     IMENIESQMQ FEES++LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNL
Subjt:  EDALKGLDEASAR-------------------------------------IMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNL

Query:  RQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE
        R  KP D  KAKVEMEKI ELTKENAGSKTRRYIYLAFIG+LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIKEQ+EEKE
Subjt:  RQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE

A0A6J1DDE0 uncharacterized protein LOC111019467 | e value: 1.72e-166 | %identity: 100
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA
        MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGEDA

Query:  LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG
        LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG
Subjt:  LKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFIG

Query:  VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE
        VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE
Subjt:  VLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE

A0A6J1F063 uncharacterized protein LOC111438212 | e value: 1.20e-126 | %identity: 80.47
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED
        +I  KPIQTSFTVH HTFLYT KL NSK+ S L  C SNTSDS+     EGDPQKQEILARIAQLQTQKLRLT+FLDEKSA LTQFAEEADAEFEKIGED
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED

Query:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI
        A K L++ASARIMENIESQMQ FEESV+LNRQEIEKNDDMLA+FEGRIE +RNEGL FKNLRQ KPVDK  AK+EMEKI+ELT E AGSKTRRYIYLAFI
Subjt:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI

Query:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK
        G+LVIAIAESFLSSPDW+KVAVLG +L+A++ QFSYEQ++SSE+EKT+IKEQ EEK
Subjt:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK

A0A6J1HS02 uncharacterized protein LOC111466213 | e value: 6.89e-126 | %identity: 79.3
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED
        +I  KPIQTSFTVH HTFLYT KLPNSK+ S L  C SNTSDS+     EGDPQKQE+LARIAQLQTQKLRLT+FLDEKSA LTQFAEEADAEFEKIGED
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFS-LCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEKIGED

Query:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI
        A K L++ASARIMENIESQMQ FEESV+LNRQEIEKNDD+LA+FEGRIE +RNEGL FKNLRQ KPVDK  AK+EMEKI+ELT E AGSKTRRYIYLAFI
Subjt:  ALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIYLAFI

Query:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK
        G+LVI IAESFLSSPDW+KVAV G +L+A++ QFSYEQ++SSE+EKT+IKEQ EEK
Subjt:  GVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEK

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT3G09050.1 unknown protein | e value: 1.9e-71 | %identity: 59.6
Query:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHS--NTSDSTA---PSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEK
        M  L   QT+ T + H     ++   S+   LCL  S   TSDS +   P  PEGD ++QE+LARIA +QT K+RLTDFLDE+S +LT+FAEEA+AEF+K
Subjt:  MIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHS--NTSDSTA---PSSPEGDPQKQEILARIAQLQTQKLRLTDFLDEKSAHLTQFAEEADAEFEK

Query:  IGEDALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIY
        +GEDA+K LDEAS RI+ENIES+MQ FEES  LNR EIE+ND+ LAEFE +I+ DRNEGL FK+LR  KPVD+ +A+ E EKI+E+TKE+AGSK+RR IY
Subjt:  IGEDALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKENAGSKTRRYIY

Query:  LAFIGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEK
        L  IG++V+AIA+SF+SSPDW+KVA+LGA+LV L++QF YEQ + SE +K
Subjt:  LAFIGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEK


Sequences
CDS sequence
ATGAGAGGTCCAGAAGTCCATCTAAGCTTCTCCCCCTTTCTCCTCCTCCCTCCTTCCACTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAAGAAAAACAAGAGAAAT
GATAGCGCTCAAACCCATTCAAACCTCTTTCACAGTCCACAGCCATACCTTCTTATACACAACAAAACTTCCCAACTCTAAGAGCTTCTCCTTATGCCTTTGCCACTCCA
ACACTTCCGACTCCACTGCTCCCTCTTCACCTGAAGGAGATCCTCAAAAGCAGGAGATACTGGCCAGAATTGCACAACTTCAGACCCAAAAACTCCGACTCACCGACTTT
TTGGACGAAAAATCCGCTCATCTTACTCAGTTTGCTGAAGAGGCCGATGCCGAGTTTGAGAAGATTGGAGAAGACGCCCTCAAAGGGCTTGATGAAGCCAGTGCACGGAT
TATGGAAAACATCGAGAGCCAGATGCAGGAGTTTGAGGAGTCTGTAGATTTGAACAGGCAGGAGATTGAGAAGAATGATGATATGTTGGCAGAGTTTGAGGGTCGGATCG
AAAACGATAGAAATGAAGGTCTTCTCTTTAAAAACCTGAGGCAGAGTAAGCCCGTAGACAAAGCGAAAGCTAAAGTGGAAATGGAGAAGATTAGAGAGCTTACAAAAGAA
AATGCTGGATCAAAGACGAGGCGATACATATATCTTGCATTCATTGGCGTGCTCGTCATAGCGATTGCCGAATCGTTCCTTTCTTCACCTGATTGGCAGAAAGTTGCAGT
TCTTGGAGCAATGCTTGTTGCTTTGATTTCCCAATTTTCTTATGAGCAAAAGGTGTCATCTGAAATAGAAAAAACAGAAATCAAAGAGCAAACTGAAGAAAAAGAGTGA
mRNA sequence
ATCGATTTTCGCAAGAATTCGAAAGTTTGGATCTCCATCCATTGTATTACGACCTAAAAACATAACAACTTAAAGAAAGGATAAGAAAACGAGTCAAACTCCAATGAGAA
TCAATTATTAATATCTCATATTTCATATATGAAACTGATAAAGTACATAAAATGAGAGGTCCAGAAGTCCATCTAAGCTTCTCCCCCTTTCTCCTCCTCCCTCCTTCCAC
TTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAAGAAAAACAAGAGAAATGATAGCGCTCAAACCCATTCAAACCTCTTTCACAGTCCACAGCCATACCTTCTTATACA
CAACAAAACTTCCCAACTCTAAGAGCTTCTCCTTATGCCTTTGCCACTCCAACACTTCCGACTCCACTGCTCCCTCTTCACCTGAAGGAGATCCTCAAAAGCAGGAGATA
CTGGCCAGAATTGCACAACTTCAGACCCAAAAACTCCGACTCACCGACTTTTTGGACGAAAAATCCGCTCATCTTACTCAGTTTGCTGAAGAGGCCGATGCCGAGTTTGA
GAAGATTGGAGAAGACGCCCTCAAAGGGCTTGATGAAGCCAGTGCACGGATTATGGAAAACATCGAGAGCCAGATGCAGGAGTTTGAGGAGTCTGTAGATTTGAACAGGC
AGGAGATTGAGAAGAATGATGATATGTTGGCAGAGTTTGAGGGTCGGATCGAAAACGATAGAAATGAAGGTCTTCTCTTTAAAAACCTGAGGCAGAGTAAGCCCGTAGAC
AAAGCGAAAGCTAAAGTGGAAATGGAGAAGATTAGAGAGCTTACAAAAGAAAATGCTGGATCAAAGACGAGGCGATACATATATCTTGCATTCATTGGCGTGCTCGTCAT
AGCGATTGCCGAATCGTTCCTTTCTTCACCTGATTGGCAGAAAGTTGCAGTTCTTGGAGCAATGCTTGTTGCTTTGATTTCCCAATTTTCTTATGAGCAAAAGGTGTCAT
CTGAAATAGAAAAAACAGAAATCAAAGAGCAAACTGAAGAAAAAGAGTGAAGCCGTGTTACTGCAGTAAGTATGAGCAAAAGATGTAGAATAATAACTATAGCATGATAC
ATAAACCCAAGTGAATGCTCTCAGCTCGGGTCTGATACGTCAAGCTACTTTGCTTTTGCATCGATAACATTCGATCTTGGTGATTATGACTTCAATCACCCTGACCTCTG
TAATGAATTACAACATTTGATGCTTGGAATTTTCACGATGGTCTTTCAGAAAGATCTTGTACTAATATCTTCAGCATAATTACATGGGCCTAATTTTGCAGTATCTTTTT
TTTTTATGATACAAATCTGTCTGTATGTCTATAACATATTTGCACGAGGAAAAATCAAGTGCAAATTGCAGAGCAAATTAATAAAACAGGAAGCAGAAGCTGCAGAACAT
ATATTTTTCCTGCCATCTATCCAAAGTTAAAAACACCATGACATTGGAGCCTGATTGGATAGCTTTCAAATTCTAATCTAAACAGAAACTCTTTAGGAAAACATGATTAT
TAACCATGGGCTCAATAAGATGTATCGCTATACTAAAATAAAAGTGAAAATTTTAGAAAACTTTGAACTTCACTGGCCAGCCATCCAAACTAAAAATGGATGAGGCACGG
CATTCCTTAACTGATGATCCTACCTCTTTGCCGGACAGCTTTTAAGTCTCTTCAAGAGCAATAACACATAATTAACAAATAAGACCTGCAAGAATTGCCGGAAAGAAATT
GATACTGGAAAGAGCAAGGACCTAAAAGCCAGTAATAGCTGATCAAACAGTTGGGTCAACTTGTTTTCCGAAGGAAGTGCAGCTCACTTATTCGAAGCATCTGCACTTAA
AGATCTGGTATCCTTGTAGGAGTTATGAGGAATGCAAGTGCTAGCACATCTAGCTGAAGTTTGCTGTTGGGATTTGAATATTTTTCAAAGAAATGTTAAAAAGAGCACTT
GCATTCTTAAGGCAGCTTAATTACCTAGCAAGTGATAAACAGTAGAAGGAATCAGTAAGATCCATTAGAAGGTGTTTACTATGTAAGCAGCAGATTCAGTAGAAGAAATT
GGGATTGTGGGATGAGAGAGAGGAGACAGTCTTACCCTTTGAATCGCTTAAATCGAAAAGATGATGTTGAAGATCCATAATCAATAATTGCTCTAGCAAGAACGTTCCAT
GTTATCGCAGTTGGAAGAAGTCTTTGATTCATAGCATCAGATGAGAATCAAATTACACCCGAGACTCTGCAGTAAGAGCATATTCCATTAAAGGCCTGTCATCAGCAACT
GCTCTTACAAGAATGTTCCATGTTGTGGCAGTGGGAAGGATTCCATGATTCAAAGCATCATATAGGAACCCAATGGCATCTGAAACTCTAGCACAAGAGCAAAGTCCCTT
GAAAGTAATGTTATAAGAAATAATATCTGGATGAAGACCCTCTTCCAAGATGCGGTTCCAAATCTTTAAAGCCTCTGCACAGTCTCCGGCTTTATGAAGACCTTCCATGA
TGGTGTTGTGTGTTACAAGATCCGGAACACAGTTCACCTGTGCCATTTGAGTAAAGATCTGCAGGGCAACGTCAACTTTTCGGGCGGTACAAAGACCATGAATTATTATA
TTGTGTATGGTTACATCGGGCTTAAAACCCTTATCGATGCATTGGTGCCACAAGTTGAGCGCCACGTCAAGCTTCTCACCTCGACACAGGCCACCAATCAACAAACTATA
AGTAATCATATCAGGCTTCAACCCCTCTTCCAGCATCTCCTTCAGAAAAAGGTATGCATCGCTAAATCTTTCTACCTTGCACAAACCATTGATTAGAGTGTTGTAGGAAA
CAACAGTAGGAGCACAGTTTTTCTTGCTCATTTCCCTTAGAAGAAAAATAGCCTCTTCTAGTTTAGAAGCTCGGACAAATCCATTAATCAGTGAATTGTAGACATGAGAA
TTCAGTTTATGTTCATGTTGGTTCATCTGGTTAGACAACTCTAAAGCTTCATCCAGCCTCCCTTTCTTGCATAACCCATCAATCATTGAGGAGTATGAATAAGTATCCAA
ATCAGCTCCCTCATTTTCTGCTTCTTTCAGTATCCTTAAAGCCTTACTCAAGTATCCATTTTTACACAGCCCGTGAATCAACACTCCGTAGGTTGTTGAATCTGCCTTCA
AGCCTCTCTCACGTAAGAGCTGCCAGTAACAAATAGCTTCTTCCACCTTCTTGTTGTCAAACAATCCTTGAATTAATATGTTATAACTAACAATATTGCAAAAATTATTC
TTAACCATCACCTCCCACAACTCAAAGCACTTACATAGCTTTCTGGCTCGAAATAGACCGTTAAGCATTGTATTATATGTTGTCACATCAGCGGATAACCCACTGTCAAC
CATCTCCTGAAAAATTCTCTCAGCAGCATCGAAGTTTTCTGCTTTGATCAAGCCATGTATCATAGAACTAAAAGTAAATAAATCAAGTGACCTTTTGTTCTCCTTCATCC
TATTCCATATCTCCATACTCTCATTGAACTTCCCGAGCTTGCATAAACCATTAATCATAATGTTATACGTCGCCACACTCGGATAAACTGAAGATTCCCTCAGTAATCTC
TCCCAAAACTCATTAGCCTTCACAAAATCACCTTTCCTGAAAAACCCATCAATCAGAATATTATAACACATAACATCAGGGTCCACCCGTCTCTCAGACATTTGATCGAA
CACCTCCACGGCATCAGATAAGTTACCACTCTTCGCAAGTGCGTTAATTAAAGTACCATAGCTGAAAACATCAGGGTTCAGACCCTTTTCCGACATCCAATTCAACAGTT
TCTTCGCCTTTTCAAACTGCTTCTTCTTGCAAGATATCTTGATCAATATATTATACGTTTGCAGATTGGGCGACATGCCCACTGTCTGAAAATATGCGAAAAACAGTTCA
GCCCGACTCCACTGATTAGACTCAATGAAGGCATTAAGCATAGAGTTATATGACCTAATTCCCGGTCTACACCCAAAAATGTCGACCATCCCTTGAAACAAATACAGAGC
TTGATCGGGCATAGAACACTTGGCATACGCTTTGATAGCCGTCAGCGCAACATCTTCGGAGCAGATGCATCTTTGAGCTCGTATCAGGTCGACGATCCGACCAACGTGAA
CGACGAGCTTCGGGTCGACTAGTCGCCGGAGTATGTGGTGGAATACGAATGGTGAGTGAGCATAACCCGGGTGCTGACACGCCGAGTCGAACAGAGCCAGTGCCGAATTT
GGGTTTTTCTCTGCTTTGAGGAGCTTCAGAACCAGTGTCGGGGATAGGATTTTGGGGAGCTCGACCATGGCAGAGGCGCAAATAATTAGGGTTTCCGTCTTGTCCACGAT
ATTCAATGATACCGGTACTTACTGATCGAAGTTGCAATCGGCGTCGAGTTTCCGGCGAGGGTAGGGGTTCCTTAACACATGATCGATCTTTTTGGTTCTCCCTTTTCCCC
CCTTTCCTTTTGGGCACCATACATTATAC
Protein sequence
MRGPEVHLSFSPFLLLPPSTSLSLSLSLSLSRKTREMIALKPIQTSFTVHSHTFLYTTKLPNSKSFSLCLCHSNTSDSTAPSSPEGDPQKQEILARIAQLQTQKLRLTDF
LDEKSAHLTQFAEEADAEFEKIGEDALKGLDEASARIMENIESQMQEFEESVDLNRQEIEKNDDMLAEFEGRIENDRNEGLLFKNLRQSKPVDKAKAKVEMEKIRELTKE
NAGSKTRRYIYLAFIGVLVIAIAESFLSSPDWQKVAVLGAMLVALISQFSYEQKVSSEIEKTEIKEQTEEKE