CuGenDBv2

Pay0001249 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0001249
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: 8-amino-7-oxononanoate synthase
Genome location: chr08:30472607..30477410
RNA-Seq Expression: Pay0001249
Synteny: Pay0001249
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0045058.1 8-amino-7-oxononanoate synthase [Cucumis melo var. makuwa]  e-value: 3.4e-127  % identity: 87.12
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
        DALRGLDEASAR                                     IMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
Subjt:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR

Query:  PRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        PRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
Subjt:  PRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

XP_004147924.1 uncharacterized protein LOC101218084 [Cucumis sativus]  e-value: 1.3e-123  % identity: 94.19
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF PIQTSFTV+KNTF L+TPKLPIS+NSF CLCQSNTSDS PSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAF
        DALRGLDEASARIM NIESQMQVFEES+ELNRQEIEKNDDMLAKFEGQIEEERNEGLFF+NLRPRKPAD VKAKVEMEKIN+LTKENAGSKTRRYIYLAF
Subjt:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAF

Query:  IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRM+SEIEKTEIKEQSEEK+
Subjt:  IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

XP_022151554.1 uncharacterized protein LOC111019467 [Momordica charantia]  e-value: 8.2e-105  % identity: 83.01
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDST-PSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG
        MI   PIQTSFTVH +TF L+T KLP S++   CLC SNTSDST PS+  PEGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEA+AEFEKIG
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDST-PSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG

Query:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA
        EDAL+GLDEASARIMENIESQMQ FEES++LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNLR  KP D  KAKVEMEKI ELTKENAGSKTRRYIYLA
Subjt:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA

Query:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        FIG+LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIKEQ+EEKE
Subjt:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

XP_023518588.1 uncharacterized protein LOC111782049 [Cucurbita pepo subsp. pepo]  e-value: 1.0e-102  % identity: 81.47
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISR-NSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG
        +IVF PIQTSFTVHK+TF L+TPKLP S+ +SF   CQSNTSDS+      EGDPQKQEILARIAQLQTQKLRLT FLDEKSADLTQFAEEA+AEFEKIG
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISR-NSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG

Query:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA
        EDA + +++ASARIMENIESQMQVFEES+ELNRQEIEKNDDMLAKFEG+IEEERNEGLFFKNLR RKP D   AKVEMEKI ELT E AGSKTRRYIYLA
Subjt:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA

Query:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        FIGLLV+AIAESFLSSPDWRKVAVLG +LIA++ QFSYEQR+SSE+EKT+IKEQ EEK+
Subjt:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

XP_038882548.1 uncharacterized protein LOC120073781 [Benincasa hispida]  e-value: 5.4e-117  % identity: 92.64
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF PIQTSF VHKNTF L+T +LPIS+NS  C CQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAF
        DALRGLDEASARIMENIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR RKPAD VKAKVEMEKI ELTKENAGSKTRRYIYLAF
Subjt:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAF

Query:  IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK+Q EEKE
Subjt:  IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L2K0 Uncharacterized protein  e-value: 6.5e-124  % identity: 94.19
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF PIQTSFTV+KNTF L+TPKLPIS+NSF CLCQSNTSDS PSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAF
        DALRGLDEASARIM NIESQMQVFEES+ELNRQEIEKNDDMLAKFEGQIEEERNEGLFF+NLRPRKPAD VKAKVEMEKIN+LTKENAGSKTRRYIYLAF
Subjt:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAF

Query:  IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRM+SEIEKTEIKEQSEEK+
Subjt:  IGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

A0A5D3BB95 8-amino-7-oxononanoate synthase  e-value: 1.6e-127  % identity: 87.12
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
        DALRGLDEASAR                                     IMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
Subjt:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR

Query:  PRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        PRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
Subjt:  PRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

A0A6J1DDE0 uncharacterized protein LOC111019467  e-value: 3.9e-105  % identity: 83.01
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDST-PSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG
        MI   PIQTSFTVH +TF L+T KLP S++   CLC SNTSDST PS+  PEGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEA+AEFEKIG
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDST-PSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG

Query:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA
        EDAL+GLDEASARIMENIESQMQ FEES++LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNLR  KP D  KAKVEMEKI ELTKENAGSKTRRYIYLA
Subjt:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA

Query:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        FIG+LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIKEQ+EEKE
Subjt:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

A0A6J1F063 uncharacterized protein LOC111438212  e-value: 7.0e-102  % identity: 81.08
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISR-NSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG
        +IVF PIQTSFTVHK+TF L+TPKL  S+ +SF   CQSNTSDS+      EGDPQKQEILARIAQLQTQKLRLT FLDEKSADLTQFAEEA+AEFEKIG
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISR-NSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIG

Query:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA
        EDA + L++ASARIMENIESQMQVFEES+ELNRQEIEKNDDMLAKFEG+IEEERNEGLFFKNLR RKP D   AK+EMEKI ELT E AGSKTRRYIYLA
Subjt:  EDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLA

Query:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
        FIGLLV+AIAESFLSSPDWRKVAVLG +LIA++ QFSYEQR+SSE+EKT+IKEQ EEK+
Subjt:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE

A0A6J1GFQ8 uncharacterized protein LOC111453764  e-value: 2.0e-101  % identity: 81.4
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF P+QT F VH+NTF  +T + P S++SF C CQ NTSDS+ ST PPEGD QKQEILARIAQLQTQKLRLT FLDEKSADLTQFAE+ANAEFEKIGE
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKEN-AGSKTRRYIYLA
        DALRGLDEASARIMENIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKN+R +KPAD    KVEMEKI ELTKEN AGSKTRRYI L 
Subjt:  DALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKEN-AGSKTRRYIYLA

Query:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEK
         IGLLVVAI ESF+SSPDWRKVAVLGAML AL+S+FSYEQRMSSEIE+TEIK+   ++
Subjt:  FIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEK

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G09050.1 unknown protein  e-value: 1.3e-71  % identity: 57.31
Query:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQS----NTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFE
        M   +  QT+ T + +  L +  +  +SR  F CL +S    + SDS P  P PEGD ++QE+LARIA +QT K+RLT FLDE+S  LT+FAEEANAEF+
Subjt:  MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQS----NTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFE

Query:  KIGEDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYI
        K+GEDA++ LDEAS RI+ENIES+MQ FEES  LNR EIE+ND+ LA+FE +I+ +RNEGLFFK+LR +KP D  +A+ E EKI E+TKE+AGSK+RR I
Subjt:  KIGEDALRGLDEASARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYI

Query:  YLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEE
        YL  IG++V+AIA+SF+SSPDWRKVA+LGA+L+ L++QF YEQ + SE +K   KE  +E
Subjt:  YLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEE


Sequences
CDS sequence
ATGATAGTTTTCAACCCAATTCAAACCTCTTTCACAGTCCACAAAAATACCTTCTTATTACACACACCAAAACTCCCCATTTCAAGGAACTCCTTCTTCTGCCTTTGTCA
GTCCAACACTTCTGATTCAACTCCTTCCACACCACCACCTGAAGGAGATCCCCAAAAGCAAGAGATACTTGCTAGAATTGCACAACTTCAAACTCAAAAACTCCGTCTCA
CCGGCTTCTTAGACGAAAAATCTGCTGATCTTACTCAGTTTGCTGAAGAGGCCAATGCAGAGTTTGAGAAGATTGGTGAAGATGCCCTCAGAGGGCTAGACGAAGCCAGT
GCACGGATTATGGAGAACATTGAGAGTCAGATGCAGGTCTTTGAGGAATCTATAGAGTTGAACAGACAGGAAATAGAGAAAAATGATGATATGTTGGCAAAATTTGAAGG
CCAAATTGAAGAAGAACGAAATGAAGGTCTTTTCTTTAAGAATCTTAGGCCCAGAAAGCCTGCAGACATAGTGAAGGCTAAAGTGGAGATGGAGAAGATTAATGAGCTTA
CGAAAGAAAATGCTGGTTCGAAGACGAGGCGATATATCTATCTTGCATTCATTGGCCTGCTAGTCGTAGCAATTGCCGAATCATTCCTTTCTTCACCTGATTGGCGAAAA
GTCGCTGTTCTTGGAGCAATGCTTATTGCTTTGATTTCTCAATTTTCTTATGAGCAAAGGATGTCATCTGAAATAGAAAAAACAGAAATCAAAGAGCAATCTGAGGAAAA
AGAGTGA
mRNA sequence
GGGTCAATTAATTAAGGGAAACTGTAAAAATTAACAAAAAAATTTCAAGATTTTATTAAAGATTATCAATAGGCTTTTTTAGAAAAAAGAAGAAAAAAGAATTTATTTTT
TAATTTGGCCAACTGAATTAGACCACCGGAAACTATCGTAACTTTGGAATGGATAAGAAAACGAAGCAAACCCCAATATGAATCAATGATCAATAACATAAAATGAGAGG
TCCAGAAGAATTTCAAGCTTTCTCCATTTTCTTCCTCCCTCTCCCAGTTCTCCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTAAAGAAAGATAAACAAAGATG
ATAGTTTTCAACCCAATTCAAACCTCTTTCACAGTCCACAAAAATACCTTCTTATTACACACACCAAAACTCCCCATTTCAAGGAACTCCTTCTTCTGCCTTTGTCAGTC
CAACACTTCTGATTCAACTCCTTCCACACCACCACCTGAAGGAGATCCCCAAAAGCAAGAGATACTTGCTAGAATTGCACAACTTCAAACTCAAAAACTCCGTCTCACCG
GCTTCTTAGACGAAAAATCTGCTGATCTTACTCAGTTTGCTGAAGAGGCCAATGCAGAGTTTGAGAAGATTGGTGAAGATGCCCTCAGAGGGCTAGACGAAGCCAGTGCA
CGGATTATGGAGAACATTGAGAGTCAGATGCAGGTCTTTGAGGAATCTATAGAGTTGAACAGACAGGAAATAGAGAAAAATGATGATATGTTGGCAAAATTTGAAGGCCA
AATTGAAGAAGAACGAAATGAAGGTCTTTTCTTTAAGAATCTTAGGCCCAGAAAGCCTGCAGACATAGTGAAGGCTAAAGTGGAGATGGAGAAGATTAATGAGCTTACGA
AAGAAAATGCTGGTTCGAAGACGAGGCGATATATCTATCTTGCATTCATTGGCCTGCTAGTCGTAGCAATTGCCGAATCATTCCTTTCTTCACCTGATTGGCGAAAAGTC
GCTGTTCTTGGAGCAATGCTTATTGCTTTGATTTCTCAATTTTCTTATGAGCAAAGGATGTCATCTGAAATAGAAAAAACAGAAATCAAAGAGCAATCTGAGGAAAAAGA
GTGAAGAAGTTTGTCTTGTTATTGCTGAAAGCATGAGGAAAAGATAGAAAATAATAACTATAGCATGAGACATAAACTCCAAGCGAATCCTCTCTGCTCGGGTCTGATAC
ATCAAGCTACTTCACTTTTGCATCGATAACATTTGGTCTTGGTGATTATGACTTCAATCACCACGACCTTTGTAATGAACTACAAAAATTGATGCTTGGAATCGACATGA
CTAGTATTTGAAAAGATCTTGTATCAATATCTTAAACATAATTACATGAACTGTTTTTGCAGTAGTTTGTTTAGTGCATAAAAACCCCTTTTTGTATGTTTATGAGTTAT
TTCACTGCAGAGGAAAATCAAGTGCAAATCAAGTGAAAGTTCCAAAGGAGCTAAATAAAATAGAAAGCAGATCATATTTTACCTGCCATCAACCCATCAATTCAAAATCA
ATAATAGTAAGAAAATAGAAAAAGTAGGACGTTGGTGCCTAATTGAAGAGCTTTTAAATTTGAATCAGAATCGTTTTTCTTCTAGTGACATAATTATTAATAATGAATGG
CTCTATGAAATATATCACAATGGAAATGAAAATTTTGAAATCCTTGAACATCATTGGCCAGCCATCTTCGAAGTACTCGGTATGCCAAGAAAGTGCGGTCTACTAGTTAA
AAGCATATGTACTAGCAGATCTGGCACAGCTACCCAGCAAGAGAGCAAACAGCCAAAGGAATCAGTATAAATCATTAGAAGATATCTAATTGAAAAAACTATCATTTGGA
GGGAGGGACACTTACTTTATAATTCATTTCAACGAAGATCAAGTTGAAGACATATCGTCAATAATTGTTAAAATGTTGAGAATCCCATGGAGAAATACCTAGGGGATTTG
CACTCTCTCCTTGCCAATTAGTTTTGTGATGGAATCTCATATTATCAAATAATAATTTCCTAGCAAGAACAACCAAGATCTGCATCACATGAGAATCACATCACGTCAGA
GACTCTGTCATAAGAGCATATTCCGTTAAAGGGTTGTCATCAACAACTGCTCTTACAAGAATGTTCCATGTTGGGGCATTTGGAAGAATTCCACGATCCAGAGCATCGTA
TAGGAATTCAATGGCATCTGAAACTCTAGCGCAAGAGCAGAGTCCCTTACAAGTAATGTTATAAGAAATAATATCAGGCTGAAGACCCGCTTCCAAAATGCTGTCCCAAA
TTTTTAAAGCCTCAGCGCAATCTCCAGCTTTGTAAAGACCTTCCATGATGGTGTTGTGTGTTACAAGATCTGGAACACAGTTGACCTGCCCCATTCGAGTAAAAATCTCC
AGGGCCACATCAACTTTTTGGGCAGTACAAAGACCGTGAATTATTATATTGTGCATTTTTACATCGGGCTTAAGACGCTTGTTAATACATTGATTCCATAAATTGAGTGC
CATGTCAACCTTTTCTCCTCGACATAGACCATCAATCAACAAGCTATAAGTAATAATATCAGGCTTCAAACCCTCTTCCAGCATCTCCTGCAGTGAAAGATTTGCATCGC
TAAATCTTTCTGCCTTACACAATCCATTAATAATAGTGTTGTAGGAGACTACAGTAGGGGCACAGTCTTTGTTTTTCATTTCCCTAAGAACAGAAATAGCCTCTTCAAGT
TTAAAAGCTCGGACATATCCATTAATCAGTGAATTGAAAACATGAGAATTCAGTTTACGTTTATTTTTGTTCATCTGATGAATCAGTTCTACGGCTTGTTCGAGCCTCCC
TTTTTTGCATAACCCATGAATCATTGATGAATACGCATAAGTATCCAAATCAGCTCCCTCATTTTCCGCTTCTTCTAATATCCTTAAAGCCTTATTCAAGTATCCATTTT
TACACAGCCCGTTGATCAACAGTCCGTATGTTGTTGAATCTGCCTTTAAGCCCCTCTCGTGTAAGAACTGCCAATAACAAATCGCTTGTTCCACTTTCTTGTTGTCAAGC
AACCCTTGAATCAATATGTTATAACTAACGATATTGCAACAGTTATTTTTACTCATTACGTCCCACAACTCAAAGCATTTACTTAGTTTCCCAGCTCGAAATAGACCACT
GAGCATTGCATTATATGTTCTCACATCAGGGGATAATCCACTTTCAATCATCTCCTGAAAAACTTTCTCGGCTGCGTCGAAGTTTCCTGCTTTGTTCAAGCCGTGAATCA
TGGAACTAAAAGTAAATAAATCGAGTGACCTTTCATTCTTCTTCATTCTATTCCACATCTCCATACTTTCAGCGAACTTCCCGAGCTTACATAAACCATTTATCATAATG
TTATATGTTTCCACACTCGGATAAACTGAAGACTCTCTAAGTAATCTCTTCCAAATCTCATTAGCCTTCAGAAAATCTCCTTTTCTGAAAAATCCATCAATCAGAATATT
ATAACACATAACATCAGGATTCACTCCTCTCTCAGACATTTCATCGAACACCTCCACGGCATCCAATATGTTACCACTCTTCGCAAGTGCATTAATTAATGTACCATAGC
TTAAAACATCAGGGTCCAAACCATTCTCGAACATCCATGTCAACAATCCCTTAGCCTTCTCAAATTGTCTCTTCTTGCACGAAATCTTGATCAGAATATTATAAGTCTGA
AGATTGGGCGACATGCCCACCGTCCGAAAGTACGTGAAAAAGAGTTCAGCTCGGCGCCACTGATTAGATTCAACGAACGCATTAAGCATAGAATTAAATGACCTAATTCC
CGGTTCACACCCAAAAATGTCAACCATGTTCTGAAACAAATTCAGCGCTTGATCGGGCATTGAACATTTCGCATACGCCTTGATAGCCGTCAGTGCAACATCTTCGGAGC
AGGTGCATCTTTGAGCTCGCATCAGGTCCACGATCCGACCAACGTGAACAACGAGCTTCGGGTCGATAAGTCGCCGTAGAATATAGTGGAATACGAATGGTGAGTGAGCA
TAACCAGGATGCCGACAGGCCGAGTCGAATATAGCGAGTGCCGCATTGGGGTTTTTCTCTGCTTTGAGAAGCTTCAGAACCAGTGCAGGGGAAAGAACTTTTGGAAGCTC
AACCATGGCAAAGGTGCTAATAATTGGAGTTTCTGTATCGATTCAGGACATGGTATGATGATTAGGTGATACTCCGGCGGGAAGCATTGGTTCCTTGACTTACGATCGAT
ACTTCCTTCAAGATCTTCTCTTCTAGTTGGCCTTGGCCTTGGCCATGCCAGCCACAACCTTAAGTTCCGAAAATGAATGTAAATTGGCGTCCAATGGTGGATGAATGATT
AATAAATTGGGGGGAAAATGTTAAAAAGGACCCGAGACGTCGTGCGACGTGATGTTGAATTCGACTAAAATTATTAATAAAATGCACTCCAAGCATTTGAAAAAAAGCCG
TCTCGCTCAAAATGAGTGTTGCTCTATGCTCAAGATGAGTGTTGCTCTCGCTGAAACTCTGCGTCCATTTTGATCAACGTTGCTCTCACTTAAATTTGTATGGGTCAGCT
CACTCTTCTTCAAACTTGCTCTACGCTCTCCCTCTTTCTCCCCCACTCTCTCCCTCCCCTTCACTCTCTCTACTTCC
Protein sequence
MIVFNPIQTSFTVHKNTFLLHTPKLPISRNSFFCLCQSNTSDSTPSTPPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDALRGLDEAS
ARIMENIESQMQVFEESIELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRPRKPADIVKAKVEMEKINELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRK
VAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQSEEKE
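
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the table construction are illustrative assumptions, and it assumes an unambiguous A/C/G/T sequence read in frame from the first base.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper) up to the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS above, followed by a stop codon.
print(translate("ATGATAGTTTTCAACCCAATTCAAACCTGA"))
```

Pasting the full CDS into `translate` should reproduce the listed protein sequence if the two records are consistent.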