; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg009041 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg009041
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein MNN4-like
Genome locationscaffold8:16902761..16906366
RNA-Seq ExpressionSpg009041
SyntenySpg009041
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]3.0e-2033.6Show/hide
Query:  FAKRPRTRSMDTSPAVSPTVSPAKPKAKSPKAPSPKNSFPEVFKDVNFQERV-EIMRKKDFLNEKGF---SNRARALPEFVSRVISQYKWQEFCAHPQEA
        FAKRP + S    PA+       K  A  P + + + S    F D   ++R  E +  ++ + EKGF    +     P F+S VI    WQ FC HP + 
Subjt:  FAKRPRTRSMDTSPAVSPTVSPAKPKAKSPKAPSPKNSFPEVFKDVNFQERV-EIMRKKDFLNEKGF---SNRARALPEFVSRVISQYKWQEFCAHPQEA

Query:  VVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGND----IIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVW
        +VPLV+EFY  L+ +  +   +    + F+S  I+ V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     +   ++L+P + VW
Subjt:  VVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGND----IIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVW

Query:  LHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRK
         HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I ++I AC  K
Subjt:  LHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRK

EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.2e-2133.53Show/hide
Query:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKES
        P F++RVI Q+ W++FC HP   +VPLVREFY  L + +     ++   V F++  I+ ++ ++  ++    D     + +Q++  L  VA +G  W+ S
Subjt:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKES

Query:  QTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR
             + +  +LK  + +W HF+  R MP+TH  T++ DRV+LLY I+ G+ +N+  I  +EI AC   R
Subjt:  QTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]7.5e-2737.3Show/hide
Query:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE
        EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   +RG  V++S   I+ V+ +  P++   ++ I N +   +  
Subjt:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE

Query:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR
         L+ VA  G +W  S     + + + L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++
Subjt:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]5.2e-2837.84Show/hide
Query:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE
        EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   +RG  V++S   I+ V+ +  P++   ++ I+N + + +  
Subjt:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE

Query:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR
         L+ VA  G +W  S     + + + L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++
Subjt:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]7.0e-2532.24Show/hide
Query:  KAPSPKNSFPEVFKDVNFQERVEIMRKKDFLNEKGFSNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGI
        KA   ++   E+  + N Q R  +  +K+F+ +   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   I
Subjt:  KAPSPKNSFPEVFKDVNFQERVEIMRKKDFLNEKGFSNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGI

Query:  SRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGS
        + ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + + L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG 
Subjt:  SRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGS

Query:  IIREEILACGRKRA
        +I  EI AC  +++
Subjt:  IIREEILACGRKRA

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)3.6e-2737.3Show/hide
Query:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE
        EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   +RG  V++S   I+ V+ +  P++   ++ I N +   +  
Subjt:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE

Query:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR
         L+ VA  G +W  S     + + + L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++
Subjt:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR

A0A2P5BCG4 Uncharacterized protein (Fragment)2.5e-2837.84Show/hide
Query:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE
        EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   +RG  V++S   I+ V+ +  P++   ++ I+N + + +  
Subjt:  EKGF----SNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKE

Query:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR
         L+ VA  G +W  S     + + + L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++
Subjt:  ALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR

A0A2P5DAQ2 Uncharacterized protein3.4e-2532.24Show/hide
Query:  KAPSPKNSFPEVFKDVNFQERVEIMRKKDFLNEKGFSNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGI
        KA   ++   E+  + N Q R  +  +K+F+ +   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   I
Subjt:  KAPSPKNSFPEVFKDVNFQERVEIMRKKDFLNEKGFSNRARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGI

Query:  SRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGS
        + ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + + L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG 
Subjt:  SRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGS

Query:  IIREEILACGRKRA
        +I  EI AC  +++
Subjt:  IIREEILACGRKRA

W9QTD9 Uncharacterized protein6.0e-2233.53Show/hide
Query:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKES
        P F++RVI Q+ W++FC HP   +VPLVREFY  L + +     ++   V F++  I+ ++ ++  ++    D     + +Q++  L  VA +G  W+ S
Subjt:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKES

Query:  QTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR
             + +  +LK  + +W HF+  R MP+TH  T++ DRV+LLY I+ G+ +N+  I  +EI AC   R
Subjt:  QTKVKSLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKR

W9RBS1 Uncharacterized protein1.5e-2033.6Show/hide
Query:  FAKRPRTRSMDTSPAVSPTVSPAKPKAKSPKAPSPKNSFPEVFKDVNFQERV-EIMRKKDFLNEKGF---SNRARALPEFVSRVISQYKWQEFCAHPQEA
        FAKRP + S    PA+       K  A  P + + + S    F D   ++R  E +  ++ + EKGF    +     P F+S VI    WQ FC HP + 
Subjt:  FAKRPRTRSMDTSPAVSPTVSPAKPKAKSPKAPSPKNSFPEVFKDVNFQERV-EIMRKKDFLNEKGF---SNRARALPEFVSRVISQYKWQEFCAHPQEA

Query:  VVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGND----IIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVW
        +VPLV+EFY  L+ +  +   +    + F+S  I+ V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     +   ++L+P + VW
Subjt:  VVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGND----IIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVSNDLKPESAVW

Query:  LHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRK
         HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I ++I AC  K
Subjt:  LHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGAGAAACATTGGAGGATATACTTGGAGATGAAGAACCAGAAGGAAAAGGTGGGAAAAAGATTGAAGTTGAAGAAATCAAGCAACCCATGAAAAGACAAAGGAT
TAAACCATATTGGGGGAAAGGCTTCGAGGATGAGGAAGCCCATGTCTCCGTGATTGACCTGAGTGCTCGAACCGGCCACGAAGCTGAAGCAAGTGACCAACGGCAAAAGG
AGAACCCCGAAATACACATGCACGGCATGAAAAGGACGAGACCCACGGGATTCTCGCCGGCGATCGTGAACCAAGAACCGAACGCTCAAACTCCCTCTTCGTCGACAATG
CCGACCACGTCGAGGGATAATCCGAGTTCGTCTTCACTGAGAAGGTCCACACGCACCACTACCGTACGCCAAAGCCAAAAACCCGCCACTCAACAGTTTAGAAAACGCTC
GCAGGAATGGTTTGCAATGATCCGGGCGATGGGAGCTCAAAGACGTGCTGCCCTTAAAAAGGAAGCAAATAGGCGAGATGAAGAAGGAGCCACCAAGGCAGCTGGAAGCT
CTCGGCAAGAGGGAACTTCAATGGGTAAAAATTCTGAACCTTCAACTAACCCCTCTTCAGCTTGCAGGACCAAACCACTTGTTACCTATAGTGCAAGGAAGAGGAGCCCG
AAGAAGGTTGTGCCTGAAAAGCAGCTTGTTATTGAGCCTCTCAAAATCGCAAGAATGCCTCCAAACGTATTCGAAGGGATAATCCGCCAAGCTATGGCAAAGGCTCTTGC
TATTGCTGAAGGTTACAAGGCTGAACAAGAAGCTTTGAAGGAAATCGAAGCTGAGAGAGAGATAGAAAACCAACAAATGAGGGGAGAAGATGATTTTGCAAGAGAAAGAG
ATCTTGAAGAAGAAAAGAAAAGAGAAGAGGAAAGACAAGAGGCCGAGAGGGCCTTAGAAGCTGAAGAAGAAAGAAAGTTTAAGGAAAACCTCAGGAGGGCAACCATTGAT
TTGCAACTCCTTGAGGAAGAGAAAAAGATAAGAGAAGAATTGAAAGAAGATGAAAAAAGAAGGAAGGAAGCGGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGC
TCAAAGTGAAGCTGAATTGCTGCAAGGGAGGAATGCGACCGCATCTAGGTCGCATTCTGAAGAAGGCCTAGCCGAGGCCATCATTGATCAGCCAGTTGATGAGGTTTTCG
AACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAAGGGGATGAAGAAGAAGGAGAAAGTAAGGAGGCCGAGACCTCCACTGACTCAGAG
GCAGAGTCCGATTCAGAGATTAAGGAGCTGGATGATGACCAAGTTTCTATCTCTGCAGCGTTGAGGAGAAAGAGAAAAAGAGAGATAAAGGCTGAGAGGAGAACAAAGAA
CAAGAATGATCCAATCTTTGCCAAGAGGCCGAGGACGAGGTCCATGGACACCTCTCCTGCAGTTTCTCCTACCGTCTCACCCGCCAAACCCAAGGCCAAGTCACCGAAAG
CTCCATCTCCCAAAAATTCATTTCCTGAGGTATTCAAAGATGTTAATTTTCAGGAGAGGGTGGAGATAATGAGAAAAAAAGATTTCCTCAATGAGAAGGGATTCTCTAAC
AGAGCTAGAGCACTGCCAGAGTTCGTAAGCAGAGTTATCTCCCAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTAGTGCCTTTAGTTCGAGAGTTTTA
CGTCGGCCTGAGGGAGGAAAGTATTAGTATGGCGGTGCTGAGAGGCAAAATGGTCAACTTCTCTTCTGTAGGTATTAGCAGGGTGTACAGGATCAAGGCACCCTTGAATC
CAAGAGGGAACGACATTATTAGAAACCCCTCGGCCAAGCAGATGAAAGAAGCACTTAAACTTGTGGCGAACAAGGGTGTCCAATGGAAAGAATCTCAGACAAAAGTGAAG
TCTCTAGTGTCAAACGACTTAAAGCCAGAATCGGCAGTTTGGCTCCACTTCATCAAAAACCGTTTGATGCCAACCACCCACGATAGCACGATTTCAGTAGATAGAGTGAT
GCTACTCTACTGCATTATGAAGGGGTTGGAGATCAATGTGGGCAGCATAATAAGGGAGGAGATCCTAGCCTGTGGAAGAAAAAGAGCAGAACAGCATCCAAAGGAAAGAC
AAAGCCTCGACATCTCAGGCCACTCCACCTTCGGGTCGAGCATGGCTTCTCCATCCCAGCACACTTCTTTTACAGGGCTCTCACCGTCATCCGAAGCCCTAGCCATTGCC
TACCGCCAACTTGATCAAATCAGCATTGCTCCGGTCTTTCCAGATTTCCCTCAGTCGCTGCGGCCTCAAGAGAACAAGGATTCTGATGAAGAGGATGATGAAAATAATGA
TGAAGATGATGAAGAGAAAGACAGTTCCTCGAACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGAGAAACATTGGAGGATATACTTGGAGATGAAGAACCAGAAGGAAAAGGTGGGAAAAAGATTGAAGTTGAAGAAATCAAGCAACCCATGAAAAGACAAAGGAT
TAAACCATATTGGGGGAAAGGCTTCGAGGATGAGGAAGCCCATGTCTCCGTGATTGACCTGAGTGCTCGAACCGGCCACGAAGCTGAAGCAAGTGACCAACGGCAAAAGG
AGAACCCCGAAATACACATGCACGGCATGAAAAGGACGAGACCCACGGGATTCTCGCCGGCGATCGTGAACCAAGAACCGAACGCTCAAACTCCCTCTTCGTCGACAATG
CCGACCACGTCGAGGGATAATCCGAGTTCGTCTTCACTGAGAAGGTCCACACGCACCACTACCGTACGCCAAAGCCAAAAACCCGCCACTCAACAGTTTAGAAAACGCTC
GCAGGAATGGTTTGCAATGATCCGGGCGATGGGAGCTCAAAGACGTGCTGCCCTTAAAAAGGAAGCAAATAGGCGAGATGAAGAAGGAGCCACCAAGGCAGCTGGAAGCT
CTCGGCAAGAGGGAACTTCAATGGGTAAAAATTCTGAACCTTCAACTAACCCCTCTTCAGCTTGCAGGACCAAACCACTTGTTACCTATAGTGCAAGGAAGAGGAGCCCG
AAGAAGGTTGTGCCTGAAAAGCAGCTTGTTATTGAGCCTCTCAAAATCGCAAGAATGCCTCCAAACGTATTCGAAGGGATAATCCGCCAAGCTATGGCAAAGGCTCTTGC
TATTGCTGAAGGTTACAAGGCTGAACAAGAAGCTTTGAAGGAAATCGAAGCTGAGAGAGAGATAGAAAACCAACAAATGAGGGGAGAAGATGATTTTGCAAGAGAAAGAG
ATCTTGAAGAAGAAAAGAAAAGAGAAGAGGAAAGACAAGAGGCCGAGAGGGCCTTAGAAGCTGAAGAAGAAAGAAAGTTTAAGGAAAACCTCAGGAGGGCAACCATTGAT
TTGCAACTCCTTGAGGAAGAGAAAAAGATAAGAGAAGAATTGAAAGAAGATGAAAAAAGAAGGAAGGAAGCGGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGC
TCAAAGTGAAGCTGAATTGCTGCAAGGGAGGAATGCGACCGCATCTAGGTCGCATTCTGAAGAAGGCCTAGCCGAGGCCATCATTGATCAGCCAGTTGATGAGGTTTTCG
AACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAAGGGGATGAAGAAGAAGGAGAAAGTAAGGAGGCCGAGACCTCCACTGACTCAGAG
GCAGAGTCCGATTCAGAGATTAAGGAGCTGGATGATGACCAAGTTTCTATCTCTGCAGCGTTGAGGAGAAAGAGAAAAAGAGAGATAAAGGCTGAGAGGAGAACAAAGAA
CAAGAATGATCCAATCTTTGCCAAGAGGCCGAGGACGAGGTCCATGGACACCTCTCCTGCAGTTTCTCCTACCGTCTCACCCGCCAAACCCAAGGCCAAGTCACCGAAAG
CTCCATCTCCCAAAAATTCATTTCCTGAGGTATTCAAAGATGTTAATTTTCAGGAGAGGGTGGAGATAATGAGAAAAAAAGATTTCCTCAATGAGAAGGGATTCTCTAAC
AGAGCTAGAGCACTGCCAGAGTTCGTAAGCAGAGTTATCTCCCAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTAGTGCCTTTAGTTCGAGAGTTTTA
CGTCGGCCTGAGGGAGGAAAGTATTAGTATGGCGGTGCTGAGAGGCAAAATGGTCAACTTCTCTTCTGTAGGTATTAGCAGGGTGTACAGGATCAAGGCACCCTTGAATC
CAAGAGGGAACGACATTATTAGAAACCCCTCGGCCAAGCAGATGAAAGAAGCACTTAAACTTGTGGCGAACAAGGGTGTCCAATGGAAAGAATCTCAGACAAAAGTGAAG
TCTCTAGTGTCAAACGACTTAAAGCCAGAATCGGCAGTTTGGCTCCACTTCATCAAAAACCGTTTGATGCCAACCACCCACGATAGCACGATTTCAGTAGATAGAGTGAT
GCTACTCTACTGCATTATGAAGGGGTTGGAGATCAATGTGGGCAGCATAATAAGGGAGGAGATCCTAGCCTGTGGAAGAAAAAGAGCAGAACAGCATCCAAAGGAAAGAC
AAAGCCTCGACATCTCAGGCCACTCCACCTTCGGGTCGAGCATGGCTTCTCCATCCCAGCACACTTCTTTTACAGGGCTCTCACCGTCATCCGAAGCCCTAGCCATTGCC
TACCGCCAACTTGATCAAATCAGCATTGCTCCGGTCTTTCCAGATTTCCCTCAGTCGCTGCGGCCTCAAGAGAACAAGGATTCTGATGAAGAGGATGATGAAAATAATGA
TGAAGATGATGAAGAGAAAGACAGTTCCTCGAACGAGGACTAG
Protein sequenceShow/hide protein sequence
MKGETLEDILGDEEPEGKGGKKIEVEEIKQPMKRQRIKPYWGKGFEDEEAHVSVIDLSARTGHEAEASDQRQKENPEIHMHGMKRTRPTGFSPAIVNQEPNAQTPSSSTM
PTTSRDNPSSSSLRRSTRTTTVRQSQKPATQQFRKRSQEWFAMIRAMGAQRRAALKKEANRRDEEGATKAAGSSRQEGTSMGKNSEPSTNPSSACRTKPLVTYSARKRSP
KKVVPEKQLVIEPLKIARMPPNVFEGIIRQAMAKALAIAEGYKAEQEALKEIEAEREIENQQMRGEDDFARERDLEEEKKREEERQEAERALEAEEERKFKENLRRATID
LQLLEEEKKIREELKEDEKRRKEAEDFLAAFEPLHKAQSEAELLQGRNATASRSHSEEGLAEAIIDQPVDEVFEPLFKDDPPAADSTSSGEKGDEEEGESKEAETSTDSE
AESDSEIKELDDDQVSISAALRRKRKREIKAERRTKNKNDPIFAKRPRTRSMDTSPAVSPTVSPAKPKAKSPKAPSPKNSFPEVFKDVNFQERVEIMRKKDFLNEKGFSN
RARALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVGLREESISMAVLRGKMVNFSSVGISRVYRIKAPLNPRGNDIIRNPSAKQMKEALKLVANKGVQWKESQTKVK
SLVSNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIREEILACGRKRAEQHPKERQSLDISGHSTFGSSMASPSQHTSFTGLSPSSEALAIA
YRQLDQISIAPVFPDFPQSLRPQENKDSDEEDDENNDEDDEEKDSSSNED