; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G07580 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G07580
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description8-amino-7-oxononanoate synthase
Genome locationClcChr04:21324180..21325624
RNA-Seq ExpressionClc04G07580
SyntenyClc04G07580
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045058.1 8-amino-7-oxononanoate synthase [Cucumis melo var. makuwa]3.1e-11281.36Show/hide
Query:  MIVFKPIQTSFTVHKNTF-VYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF PIQTSFTVHKNTF ++T +LP S+NSF CLCQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFKPIQTSFTVHKNTF-VYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
        DALRGLDEASAR                                     IMENIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
Subjt:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR

Query:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE
         RKPAD VKAKVEMEKI ELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQ EEKE
Subjt:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE

KAG6603599.1 hypothetical protein SDJN03_04208, partial [Cucurbita argyrosperma subsp. sororia]1.9e-10987.65Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MIVFKP+QT F VH+NTF YTAR P SK+SFLC CQ NTSDSS+STPPEGD QKQEILARIAQLQTQKLRLT FLDEKSADLTQFAE+ANAEFEKIGEDA
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI
        LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKN+RQ+KPAD    KVEMEKI+ELTKEN AGSKTRRYIYL  I
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKE
        GLLVVAI ESF+SSPDWRKVAVLGAML AL+S+FSYEQRMSSEIE+TEIK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKE

XP_004147924.1 uncharacterized protein LOC101218084 [Cucumis sativus]2.2e-11892.61Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED
        MIVFKPIQTSFTV+KNTF+YT +LP SKNSF CLCQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGED
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED

Query:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI
        ALRGLDEASARIM NIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFF+NLR RKPADKVKAKVEMEKI +LTKENAGSKTRRYIYLAFI
Subjt:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE
        GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRM+SEIEKTEIKEQ EEK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE

XP_022151554.1 uncharacterized protein LOC111019467 [Momordica charantia]3.2e-10984.77Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MI  KPIQTSFTVH +TF+YT +LP SK+  LCLC SNTSDS+A + PEGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEA+AEFEKIGEDA
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG
        L+GLDEASARIMENIESQMQ FEES +LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNLRQ KP DK KAKVEMEKI+ELTKENAGSKTRRYIYLAFIG
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG

Query:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE
        +LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIKEQ EEKE
Subjt:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE

XP_038882548.1 uncharacterized protein LOC120073781 [Benincasa hispida]4.1e-12597.66Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MIVFKPIQTSF VHKNTF+YTARLP SKNS LC CQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG
        LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG

Query:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE
        LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIK+QPEEKE
Subjt:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE

TrEMBL top hitse value%identityAlignment
A0A0A0L2K0 Uncharacterized protein1.1e-11892.61Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED
        MIVFKPIQTSFTV+KNTF+YT +LP SKNSF CLCQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGED
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGED

Query:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI
        ALRGLDEASARIM NIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFF+NLR RKPADKVKAKVEMEKI +LTKENAGSKTRRYIYLAFI
Subjt:  ALRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE
        GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRM+SEIEKTEIKEQ EEK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE

A0A5D3BB95 8-amino-7-oxononanoate synthase1.5e-11281.36Show/hide
Query:  MIVFKPIQTSFTVHKNTF-VYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE
        MIVF PIQTSFTVHKNTF ++T +LP S+NSF CLCQSNTSDS+ ST PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEA+AEFEKIGE
Subjt:  MIVFKPIQTSFTVHKNTF-VYTARLPTSKNSFLCLCQSNTSDSSAST-PPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGE

Query:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
        DALRGLDEASAR                                     IMENIESQMQVFEES ELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR
Subjt:  DALRGLDEASAR-------------------------------------IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLR

Query:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE
         RKPAD VKAKVEMEKI ELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQ EEKE
Subjt:  QRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE

A0A6J1DDE0 uncharacterized protein LOC1110194671.5e-10984.77Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MI  KPIQTSFTVH +TF+YT +LP SK+  LCLC SNTSDS+A + PEGDPQKQEILARIAQLQTQKLRLT FLDEKSA LTQFAEEA+AEFEKIGEDA
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG
        L+GLDEASARIMENIESQMQ FEES +LNRQEIEKNDDMLA+FEG+IE +RNEGL FKNLRQ KP DK KAKVEMEKI+ELTKENAGSKTRRYIYLAFIG
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIG

Query:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE
        +LV+AIAESFLSSPDW+KVAVLGAML+ALISQFSYEQ++SSEIEKTEIKEQ EEKE
Subjt:  LLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE

A0A6J1GFQ8 uncharacterized protein LOC1114537647.6e-10987.25Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MIVFKP+QT F VH+NTF YTAR P+SK+SFLC CQ NTSDSS+STPPEGD QKQEILARIAQLQTQKLRLT FLDEKSADLTQFAE+ANAEFEKIGEDA
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI
        LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKN+RQ+KPAD    KVEMEKI+ELTKEN AGSKTRRYI L  I
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKE
        GLLVVAI ESF+SSPDWRKVAVLGAML AL+S+FSYEQRMSSEIE+TEIK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKE

A0A6J1IPL4 uncharacterized protein LOC1114783052.1e-10686.06Show/hide
Query:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA
        MIVFKP+QT FTVH+NTF YTAR P SK+SFL  CQ NTSDSS+ TPPEGD QKQEILARIAQLQTQKLRLT FLDEKSADLTQFAE+A+AEFEKIGEDA
Subjt:  MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDA

Query:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI
        LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEE RNEGLFFKN+RQ+KPAD    KVEMEKI+ELTKEN AGSKTRRYIYLA I
Subjt:  LRGLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKEN-AGSKTRRYIYLAFI

Query:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKE
        GLLVVAI ESF+SS DWRKVAVLGAML AL+S+FSYEQRMSS+IE+TEIK+
Subjt:  GLLVVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEKTEIKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09050.1 unknown protein4.8e-7159.67Show/hide
Query:  QTSFTVHKNTFVYTARLPTSKNSFLCLCQS----NTSDSSASTP-PEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDALR
        QT+ T + +     +R   S+  FLCL +S    + SDS    P PEGD ++QE+LARIA +QT K+RLT FLDE+S  LT+FAEEANAEF+K+GEDA++
Subjt:  QTSFTVHKNTFVYTARLPTSKNSFLCLCQS----NTSDSSASTP-PEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDALR

Query:  GLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLL
         LDEAS RI+ENIES+MQ FEESA LNR EIE+ND+ LA+FE +I+ +RNEGLFFK+LR +KP D+ +A+ E EKI+E+TKE+AGSK+RR IYL  IG++
Subjt:  GLDEASARIMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLL

Query:  VVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEK
        V+AIA+SF+SSPDWRKVA+LGA+L+ L++QF YEQ + SE +K
Subjt:  VVAIAESFLSSPDWRKVAVLGAMLIALISQFSYEQRMSSEIEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGTTTTCAAACCAATTCAAACCTCTTTCACAGTCCACAAAAATACCTTCGTATACACAGCAAGACTTCCCACTTCAAAGAATTCCTTCTTGTGCCTTTGCCAGTC
CAACACTTCTGACTCAAGTGCTTCCACACCACCTGAAGGAGATCCCCAAAAGCAAGAGATACTAGCTAGAATAGCACAACTTCAAACACAAAAACTCCGACTCACCGGCT
TCTTAGACGAAAAATCTGCTGATCTCACTCAATTTGCTGAAGAGGCCAATGCAGAGTTTGAGAAGATTGGAGAAGATGCCCTCAGAGGGCTCGACGAAGCCAGTGCACGG
ATTATGGAAAACATTGAGAGCCAGATGCAGGTCTTTGAGGAATCTGCAGAGTTGAACAGGCAGGAAATAGAGAAAAATGATGATATGTTGGCAAAGTTTGAAGGCCAAAT
TGAAGAAGAACGAAATGAAGGCCTTTTCTTTAAGAATCTGAGGCAGAGAAAGCCCGCAGACAAAGTGAAAGCTAAAGTGGAAATGGAGAAGATTAAAGAGCTTACAAAAG
AAAACGCCGGTTCGAAGACGAGGCGTTATATCTATCTTGCATTCATTGGTCTGCTAGTCGTAGCGATTGCCGAATCATTCCTTTCTTCACCTGATTGGCGGAAAGTTGCA
GTTCTTGGGGCAATGCTTATTGCATTGATTTCTCAATTTTCTTATGAGCAAAGGATGTCATCTGAGATAGAAAAAACAGAAATCAAAGAGCAACCTGAGGAAAAAGAGTG
A
mRNA sequenceShow/hide mRNA sequence
GAATCAATGATCGATATCTCATACTTTTAATATAAAACTGATAAAGAACATAAAATGAGAGGTCCAAGAGAATTCCAAGCTTCTCCTTTTTCTGTCCTTCCTCCCCCAGT
TCGTTCTCTCTCTCTCTCTCTCTCTAAAAGATAAACAAAGATGATAGTTTTCAAACCAATTCAAACCTCTTTCACAGTCCACAAAAATACCTTCGTATACACAGCAAGAC
TTCCCACTTCAAAGAATTCCTTCTTGTGCCTTTGCCAGTCCAACACTTCTGACTCAAGTGCTTCCACACCACCTGAAGGAGATCCCCAAAAGCAAGAGATACTAGCTAGA
ATAGCACAACTTCAAACACAAAAACTCCGACTCACCGGCTTCTTAGACGAAAAATCTGCTGATCTCACTCAATTTGCTGAAGAGGCCAATGCAGAGTTTGAGAAGATTGG
AGAAGATGCCCTCAGAGGGCTCGACGAAGCCAGTGCACGGATTATGGAAAACATTGAGAGCCAGATGCAGGTCTTTGAGGAATCTGCAGAGTTGAACAGGCAGGAAATAG
AGAAAAATGATGATATGTTGGCAAAGTTTGAAGGCCAAATTGAAGAAGAACGAAATGAAGGCCTTTTCTTTAAGAATCTGAGGCAGAGAAAGCCCGCAGACAAAGTGAAA
GCTAAAGTGGAAATGGAGAAGATTAAAGAGCTTACAAAAGAAAACGCCGGTTCGAAGACGAGGCGTTATATCTATCTTGCATTCATTGGTCTGCTAGTCGTAGCGATTGC
CGAATCATTCCTTTCTTCACCTGATTGGCGGAAAGTTGCAGTTCTTGGGGCAATGCTTATTGCATTGATTTCTCAATTTTCTTATGAGCAAAGGATGTCATCTGAGATAG
AAAAAACAGAAATCAAAGAGCAACCTGAGGAAAAAGAGTGAAGAGGTCTTGTTATTGCTGAAAGCATGAGGAAAAGATAGAAAATAATAACTATAGCATGAGACATAAAC
CCCAAGTGAATCCTCTCTACTCGGGTCTGATACATCAAGCTACTTCACTTTTGCATCGATAACATTCGGTCTTGGTGATTATGACTTCAATCACCAAGACCTCTGTAACG
AATTACAACATTTGATGCTTGGAGTCTTCATGACGATTATTTGAAAAGATCTTGTACCAGTATTTTCAACATAATTACATGACCTATTTTTGCAGTAGCTTGTTTAGTGC
ATAAAAATCCCTTTTTGTATGTTTATGAGTTCTTTTCATTGCAGAGGGAAAATCAAGTGCACAGAGCAGCTAAATAAAATAGCAAGCACAAGCAGATCATATTTTTCCTC
TCATCTACCCATCAATCCAAAATTTATTATAGTAATAACAAT
Protein sequenceShow/hide protein sequence
MIVFKPIQTSFTVHKNTFVYTARLPTSKNSFLCLCQSNTSDSSASTPPEGDPQKQEILARIAQLQTQKLRLTGFLDEKSADLTQFAEEANAEFEKIGEDALRGLDEASAR
IMENIESQMQVFEESAELNRQEIEKNDDMLAKFEGQIEEERNEGLFFKNLRQRKPADKVKAKVEMEKIKELTKENAGSKTRRYIYLAFIGLLVVAIAESFLSSPDWRKVA
VLGAMLIALISQFSYEQRMSSEIEKTEIKEQPEEKE