; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi03G001663 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi03G001663
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
Description8-amino-7-oxononanoate synthase
Genome locationchr3:45535659..45539977
RNA-Seq ExpressionBhi03G001663
SyntenyBhi03G001663
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045058.1 8-amino-7-oxononanoate synthase [Cucumis melo var. makuwa]4.9e-11080.68Show/hide
Query:  MIVFKPIQTSFAVHKNTF-LYTARLPISKNS-LCFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF PIQTSF VHKNTF L+T +LPIS+NS  C CQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFKPIQTSFAVHKNTF-LYTARLPISKNS-LCFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
        DALRGLDEASAR                                     IMENIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
Subjt:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR

Query:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
         RKPAD VKAKVEMEKI ELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK+Q EEKE
Subjt:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

KAG6603599.1 hypothetical protein SDJN03_04208, partial [Cucurbita argyrosperma subsp. sororia]2.3e-10787.6Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKNS-LCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MIVFKP+QT F VH+NTF YTAR P SK+S LC CQ NTSDSS+STPPEGD QKQEILARIAQLQTQKLRLT FLDEKSADLTQFAE+ANAEFEKIGEDA
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKNS-LCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI
        LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKN+RQ+KPAD    KVEMEKI+ELTKEN AGSKTRRYIYL  I
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK
        GLLVVAI ESF+SSPDWRKVAVLGAML AL+S+FSYEQRMSSEIE+TEIK
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK

XP_004147924.1 uncharacterized protein LOC101218084 [Cucumis sativus]2.0e-11691.83Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKNSL-CFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED
        MIVFKPIQTSF V+KNTFLYT +LPISKNS  C CQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGED
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKNSL-CFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED

Query:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI
        ALRGLDEASARIM NIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFF+NLR RKPADKVKAKVEMEKI +LTKENAGSKTRRYIYLAFI
Subjt:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
        GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRM+SEIEKTEIK+Q EEK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

XP_022151554.1 uncharacterized protein LOC111019467 [Momordica charantia]1.0e-10784.38Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKN-SLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MI  KPIQTSF VH +TFLYT +LP SK+ SLC C SNTSDS+A + PEGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEA+AEFEKIGEDA
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKN-SLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG
        L+GLDEASARIMENIESQMQ FEES +LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNLRQ KP DK KAKVEMEKI+ELTKENAGSKTRRYIYLAFIG
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG

Query:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
        +LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIK+Q EEKE
Subjt:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

XP_038882548.1 uncharacterized protein LOC120073781 [Benincasa hispida]1.6e-129100Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKNSLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDAL
        MIVFKPIQTSFAVHKNTFLYTARLPISKNSLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDAL
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKNSLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDAL

Query:  RGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGL
        RGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGL
Subjt:  RGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGL

Query:  LVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
        LVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
Subjt:  LVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

TrEMBL top hitse value%identityAlignment
A0A0A0L2K0 Uncharacterized protein9.9e-11791.83Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKNSL-CFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED
        MIVFKPIQTSF V+KNTFLYT +LPISKNS  C CQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGED
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKNSL-CFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED

Query:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI
        ALRGLDEASARIM NIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFF+NLR RKPADKVKAKVEMEKI +LTKENAGSKTRRYIYLAFI
Subjt:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
        GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRM+SEIEKTEIK+Q EEK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

A0A5D3BB95 8-amino-7-oxononanoate synthase2.4e-11080.68Show/hide
Query:  MIVFKPIQTSFAVHKNTF-LYTARLPISKNS-LCFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF PIQTSF VHKNTF L+T +LPIS+NS  C CQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFKPIQTSFAVHKNTF-LYTARLPISKNS-LCFCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
        DALRGLDEASAR                                     IMENIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
Subjt:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR

Query:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
         RKPAD VKAKVEMEKI ELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK+Q EEKE
Subjt:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

A0A6J1DDE0 uncharacterized protein LOC1110194674.9e-10884.38Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKN-SLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MI  KPIQTSF VH +TFLYT +LP SK+ SLC C SNTSDS+A + PEGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEA+AEFEKIGEDA
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKN-SLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG
        L+GLDEASARIMENIESQMQ FEES +LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNLRQ KP DK KAKVEMEKI+ELTKENAGSKTRRYIYLAFIG
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG

Query:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
        +LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIK+Q EEKE
Subjt:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

A0A6J1F063 uncharacterized protein LOC1114382126.7e-10583.66Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKNS--LCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED
        +IVFKPIQTSF VHK+TFLYT +L  SK S  L FCQSNTSDSS     EGDPQKQEILARIAQLQTQKLRLT FLDEKSADLTQFAEEA+AEFEKIGED
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKNS--LCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED

Query:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI
        A + L++ASARIMENIESQMQVFEES ELNRQEIEKNDDMLAKFEG+IEEERNEGLFFKNLRQRKP DK  AK+EMEKIKELT E AGSKTRRYIYLAFI
Subjt:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE
        GLLV+AIAESFLSSPDWRKVAVLG +LIA++ QFSYEQR+SSE+EKT+IK+QPEEK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE

A0A6J1GFQ8 uncharacterized protein LOC1114537649.3e-10787.2Show/hide
Query:  MIVFKPIQTSFAVHKNTFLYTARLPISKNS-LCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MIVFKP+QT F VH+NTF YTAR P SK+S LC CQ NTSDSS+STPPEGD QKQEILARIAQLQTQKLRLT FLDEKSADLTQFAE+ANAEFEKIGEDA
Subjt:  MIVFKPIQTSFAVHKNTFLYTARLPISKNS-LCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI
        LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKN+RQ+KPAD    KVEMEKI+ELTKEN AGSKTRRYI L  I
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK
        GLLVVAI ESF+SSPDWRKVAVLGAML AL+S+FSYEQRMSSEIE+TEIK
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09050.1 unknown protein1.3e-6857.43Show/hide
Query:  QTSFAVHKNTFLYTARLPISK-NSLCFCQS----NTSDSSASTP-PEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDALR
        QT+   + +     +R  +S+   LC  +S    + SDS    P PEGD ++QE+LARIA +QT K+RLT FLDE+S  LT+FAEEANAEF+K+GEDA++
Subjt:  QTSFAVHKNTFLYTARLPISK-NSLCFCQS----NTSDSSASTP-PEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDALR

Query:  GLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLL
         LDEAS RI+ENIES+MQ FEESA LNR EIE+ND+ LA+FE +I+ +RNEGLFFK+LR +KP D+ +A+ E EKI+E+TKE+AGSK+RR IYL  IG++
Subjt:  GLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLL

Query:  VVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQ
        V+AIA+SF+SSPDWRKVA+LGA+L+ L++QF YEQ + SE +K +  K+
Subjt:  VVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGTTTTCAAACCAATTCAAACCTCTTTCGCAGTCCACAAGAATACCTTCTTATACACAGCAAGACTTCCCATTTCAAAGAACTCCTTGTGCTTTTGCCAGTCTAA
CACTTCTGATTCAAGTGCTTCCACACCACCTGAAGGAGATCCCCAAAAGCAAGAGATATTAGCCAGAATAGCACAACTTCAAACCCAAAAACTCCGTCTCACCGGCTTCT
TAGATGAAAAGTCTGCTGATCTCACTCAGTTTGCTGAAGAGGCCAATGCAGAGTTTGAGAAGATTGGAGAAGACGCCCTCAGAGGGCTCGACGAAGCTAGTGCAAGGATT
ATGGAGAACATTGAAAGCCAGATGCAGGTCTTTGAGGAGTCTGCAGAATTGAACAGGCAAGAAATAGAGAAAAACGATGATATGTTGGCAAAATTTGAAGGCCAAATTGA
AGAAGAACGAAATGAAGGTCTTTTCTTTAAGAACCTGAGGCAGAGAAAGCCTGCTGACAAAGTGAAGGCTAAAGTGGAAATGGAGAAGATTAAAGAGCTTACTAAAGAAA
ATGCTGGTTCGAAGACAAGGCGATATATCTATCTTGCATTCATTGGTCTGCTAGTCGTAGCAATTGCCGAATCATTCCTTTCTTCACCTGATTGGCGGAAAGTTGCAGTT
CTTGGGGCAATGCTTATTGCTTTGATTTCTCAGTTTTCTTATGAGCAAAGGATGTCATCTGAAATAGAAAAAACAGAAATCAAAAAGCAACCTGAGGAAAAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
CATATTCTTAATATAATAAAGTGATAAAAATAACATAAAATGAGAGGTCCAAAAGAATTCCAAGCTCTCTCCTTTTTCTTCCTCCCTCCCTCAGCTCTCTCTCTCTCTAA
AAGATAAACAAAAGATGATAGTTTTCAAACCAATTCAAACCTCTTTCGCAGTCCACAAGAATACCTTCTTATACACAGCAAGACTTCCCATTTCAAAGAACTCCTTGTGC
TTTTGCCAGTCTAACACTTCTGATTCAAGTGCTTCCACACCACCTGAAGGAGATCCCCAAAAGCAAGAGATATTAGCCAGAATAGCACAACTTCAAACCCAAAAACTCCG
TCTCACCGGCTTCTTAGATGAAAAGTCTGCTGATCTCACTCAGTTTGCTGAAGAGGCCAATGCAGAGTTTGAGAAGATTGGAGAAGACGCCCTCAGAGGGCTCGACGAAG
CTAGTGCAAGGATTATGGAGAACATTGAAAGCCAGATGCAGGTCTTTGAGGAGTCTGCAGAATTGAACAGGCAAGAAATAGAGAAAAACGATGATATGTTGGCAAAATTT
GAAGGCCAAATTGAAGAAGAACGAAATGAAGGTCTTTTCTTTAAGAACCTGAGGCAGAGAAAGCCTGCTGACAAAGTGAAGGCTAAAGTGGAAATGGAGAAGATTAAAGA
GCTTACTAAAGAAAATGCTGGTTCGAAGACAAGGCGATATATCTATCTTGCATTCATTGGTCTGCTAGTCGTAGCAATTGCCGAATCATTCCTTTCTTCACCTGATTGGC
GGAAAGTTGCAGTTCTTGGGGCAATGCTTATTGCTTTGATTTCTCAGTTTTCTTATGAGCAAAGGATGTCATCTGAAATAGAAAAAACAGAAATCAAAAAGCAACCTGAG
GAAAAAGAGTGAAGAAGTCTTTGTTATTGCTGAAAGCATGAGGAAAAGTTAGAAAATAATAACTATAGCATGAGATATAAACCCCAAGTGAATCCTCTCTGCTCGGGTCT
GATACATCAAGCTACTTCACTTTTGCATCATTAACAATTGGTCTTGGTGATTATGCCTTCAATCACCAAGACCTCTGTAATGAATTACAACATTTGATGCTTGGAATCTT
CATGACGATTATTTGGAAAGATCTTGTACCAATAGCTTCAACATAATTATTAATTACATGACCTATTTTTGCAGTGGTTTGTTTAGTGCATAAAAATCCATTTTTGTATT
ATTTCTTTTCCACTGCAGAGGAAAATCAAGTGCAAGTTACGGAGCAGTTAAATAAAATAGCAAGCCGATCATATTTTCCTGCTATCTACCCATCAATCCAAAATTTATAA
TAATAGTTATAGGAAAATAGAAAAAGTAAGACGTCGGTGCCTAATTGAAAAGCTTTTAAATTTGAAGCAAAATTGTTTAAACTCTTTTGAAAACATAATTATTAATAATG
GCTCTATAAAATATATCACAATAAAAATGAAAATTTTGAAAACCTTGAACATCATTGGCCAGCCATCGAAACTAAAAGTTAAAATTGAGTTTCTTTGAATGATGGTCCTA
TCTCTAAGCCGAAGAGCTTGCAAATTATTCAAAGTACTCGGCATGCCAAGAAAGTGCAGTCTACTAGTTCAAAGCATATGTACTAGCAGATCTAGCACAATCACCAAGCA
AGAGAGAAAACAGTCAAAGGAATCAGTATAAATCATTAGAAGATATCAAGTTGAAAAAAACAAAAGTGATTTGGAGGAGGAGGGGACTTACTTTTTAATTCATTTCAATC
AAGAAGATCAAGGTGAAGATCTATTGTCAATAATTTCCATAGCAAGAAGGTTGAAGATCCTCGTGGATTCAGAGCATCACACAAGAATCATATCATGTCGGAGACTCTGT
AATAAGAACATATTCCGTTAAAGGTCTGTCATCAACTACTGCTCTTACAAGAATGTTCCATGTTGGGGCATTTGGAAGAATTCCATGCTGCAGAGCATCATATAGGAACC
CAATGGCATCTGAAACTCGAGTGCAAGAGCAGAGCCCCTTAAAAGCAATGTTATAAGAAATAATATCTGGCTGAAGATGCGCTTCCAAGATGCGGTCCCAAATCTTTAAA
GCCTCTGCGCAGTCTCCAGCTTTGTAAAGACCTTCCATGATGCTGTTGTGTGTTACAAGATCTGGAACACAGTTGACCTGTGCCATTTGATTAAAGATGTCCAGGGCAAG
ATCAACTTTTTGGGCAGTACAAAGACCATGAATTATTATGTTGTGCATGGTTACATCGGGCTTAAGACCCTTATTAATACAATGATGCCATAATTTGAGTGCCATGTCAA
GCTTTTCTCCTCGACACAGACCATCAATCAACAAGGTATAAGTAATCACATCAGGCTTCAAACCCTCTTCCAGCATTTCCTTAAGAGAAAGATATGCATCGCTAAATCTT
TCTGCCTTGCACAAACCATTGATAATAGTGTTGTAGGAAACTACAGTAGGAGCACAATCTTTGTTTTTCATTTCCCTTAGAAGAAGAATAGCCTCTTCAAGTTTAGAAGC
TCGGACATATCCATTAATCAGTGAATTGAAGACATGAGAATTCAGTTTATGCTTATGTTTGTTCATCTGATGAACCAGTTCTACTGCTTGATCCAACCTCCCTTTTTTGC
ATAACCCATGAATCATTGATGAGTACGCAAAAGTATCCAAATCAGCTCCCTCATTTTCTGCTTCTTTTAATATTCTTAAAGCCTTACTCAAGTATCCATTTTTACACAAC
CCGTGAATCAACAGTCCGTATGTTGTTGAATCTGCCTTTAAGCCCCTCTCGTGTAAGAGCTGCCAATAACAAATCGCTTTTTCCACTTTCTTGTTGTCAAGCAACCCTTG
AATCAATATGTTATAACTAACAATATTGCAACAGTTATTCTTACCCATTACTTCCCACAACTCAAAGCATTTACCTAGTTTCCCGGCTCGAAATAGACCACTGAGCATTG
CATTATATGTTGTCACATCAGGGGATAACCCGTTGTCAATCATCTCCTGGAAAATTTTCTCAGCAGCATCGATGTTTCCTGCTTTGCTCAAGCCGTGAATCATAGAACTA
AAAGTAAATAAATCGAATGACCTTTCGTTGTTCTTCATTCTAGTCCATATCTCCATACTCATCTCGAACTTCCCGAGCTTACATAAACCATTTATCATAATGTTATACGT
TGCCACACTCGGATAAACTGAAGATTCTCTCAGTAATCTCTCCCAAATCTCATTGGCCTTCACAAGATCTCCTTTTCTGAAAAACCCATCAATCAGAATATTATAACACA
TAACATCAGGGTTCACCCCTCTCTCAGACATTTCATCAAACACCTCCAGTGCATCCGATAAGTTACCACTCTTCGCAAGTGCATTAATTAAAGTACCATAGCTTAAAACA
TCAGGGTTCAAACCCTTTTCCGACATCCATTTCAACAATCCCTTAGCCTTTTCAAACTGCTTCTTCTTGCACGATATCTTGATCAAAATATTATAAGTCTGCAGATTGGG
CGACATGCCCACTGTCTGAAAGTATGTGAAAAACAGTTCAGCTCGGCTCCATTGATTAGACTCAATGAACGCATTAAGCATAGAGTTATATGACCTAATTCCCGGTTTAC
ACCCAAAAATGTCAACCATGTTTTGAAACAAATAAAGCGCTTGATCGGGCATTGAACACTTCGCATAGGCCTTGATAGCCGTCAGCGCAACATCTTCGGAGAAAGTGCAT
CTTTGAGTTCGCATCAGGTCCACGATCCGACCAACGTGAACAACGAGCTTCGGGTCGATAAGTCGCCGGAGAATATGGTGGAATACGAATGGTGAGTGAGCATAACCAGG
ATGCTGACAGGCCGAGTCGAATAGAGCGAGTGCCGCATTGGGGTTTTTCTCTGCTTTGAGGAGCTTTAGAAGCAGTGCAGGGGATAGGGCTTTGGGGAGCTCGACCATGA
CAAAAGCGTTAATAATTAGGGTTTCTGTATCGATTCAGGACATGCTATGATGATGATGCCGATACTTCCAGAACGGAAAATGCAATCGGAGACGAGTTTCCGGCGGGAAG
TATTGGTTCATTGACGTGTGGCCGATACTTCCTTCTCATTCTTCTCTTCCCGTATTCCTCTTCCCTTTCGGTTCGCCACGCCAGCCATAACCATATAGTTCATAACACAG
ATTTTTGGAATTTTTATTTTCCTAACGTGATATTTATAGTTTGATTTCAAG
Protein sequenceShow/hide protein sequence
MIVFKPIQTSFAVHKNTFLYTARLPISKNSLCFCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDALRGLDEASARI
MENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAV
LGAMLIALISQFSYEQRMSSEIEKTEIKKQPEEKE