; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032866 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032866
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionEukaryotic translation initiation factor 3 subunit I
Genome locationscaffold11:14244701..14246740
RNA-Seq ExpressionSpg032866
SyntenySpg032866
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]3.3e-2227.44Show/hide
Query:  PSPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVD
        P+P  P P  F D   +E  + ++ R    E GF         L   V  V++++KWQ+   HP      +V+EFY+ + E N    +VRG  + F+   
Subjt:  PSPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVD

Query:  INRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIR
        INR +K++        +  +    +  +  L+ +   G +W   Q K KT+    L P   +W HF+K+KLMPT+H++T+S +R++LL+SI+ G  ++I 
Subjt:  INRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIR

Query:  SIIREEIFACGRKKVGKLFFGPTIDLSLI-----------------------------------GKLQQNSVQR-----KDRASTSQSSEALAIAYQQLD
         II E    C +++   L F P +  +L                                    GK  + +  R       RAS++   +A+   +Q + 
Subjt:  SIIREEIFACGRKKVGKLFFGPTIDLSLI-----------------------------------GKLQQNSVQR-----KDRASTSQSSEALAIAYQQLD

Query:  QIRDNLRTYWAYAKERD
        Q+ D L  Y+AYAK RD
Subjt:  QIRDNLRTYWAYAKERD

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.1e-2935.04Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN
        ++ R    EKGF    S   G LP F+  VI+Q+ W++ CAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN

Query:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG
         +   +   L+ V   G +W  S     T + S L P + +W HFLK+ L+PTTH  T+S +R++LL+S++ G  +N+  +I  EI AC  +K G LFF 
Subjt:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG

Query:  PTIDLSLIGKLQQNSVQRKDRASTSQSSEALAIA
        P++   L    +   +  +++   +   +A+A+A
Subjt:  PTIDLSLIGKLQQNSVQRKDRASTSQSSEALAIA

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.0e-3130.77Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN
        ++ R    EKGF    S   G LP F+  VI+Q+ W++ CAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN

Query:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG
         + + +   L+ V   G +W  S     T + S L P + +W HFLK++L+PTTH  T+S +R++LL+S++ G  +N+  +I  EI AC  +K G LFF 
Subjt:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG

Query:  P------------------------TIDLSLIGKLQQ----NSVQR--KDRASTSQSSEALAIAYQQLDQIRDNL---------------------RTYW
                                  ID   + ++ Q     S Q+    R +T+ S+       QQL  +   L                     + +W
Subjt:  P------------------------TIDLSLIGKLQQ----NSVQR--KDRASTSQSSEALAIAYQQLDQIRDNL---------------------RTYW

Query:  AYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE
        AY+KERD A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  AYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]6.4e-2636.93Show/hide
Query:  PEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKES
        P F+  VI Q+ WQ  CAHP++ +VPLVREFY  +   +     +RG  V  S   IN ++ +  P+    +  +++ +  ++   L+ V   G +W  S
Subjt:  PEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKES

Query:  QTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFF
             T + S L P + +W HFLK++L+PTTH  T+S E V LLYS++ G  +N+  +I  EI AC  +K G LFF
Subjt:  QTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFF

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.9e-2230.58Show/hide
Query:  VPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N +  ++   L+ V   G +W  S     T + S L P + +W HFLK
Subjt:  VPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLK

Query:  NKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFGP----------------------TIDLSLIGKL---------QQNSV
        ++L+PTTH   +S +R++LL+S++ G  +N+  +I  EI AC  +K G LFF                         ID   + ++         QQ S 
Subjt:  NKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFGP----------------------TIDLSLIGKL---------QQNSV

Query:  QRKDRASTS-------QSSEALAIAYQQLDQIRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE
         R   AS+S       Q  +AL     Q +      + +WAY+KERD A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  QRKDRASTS-------QSSEALAIAYQQLDQIRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.0e-2935.04Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN
        ++ R    EKGF    S   G LP F+  VI+Q+ W++ CAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN

Query:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG
         +   +   L+ V   G +W  S     T + S L P + +W HFLK+ L+PTTH  T+S +R++LL+S++ G  +N+  +I  EI AC  +K G LFF 
Subjt:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG

Query:  PTIDLSLIGKLQQNSVQRKDRASTSQSSEALAIA
        P++   L    +   +  +++   +   +A+A+A
Subjt:  PTIDLSLIGKLQQNSVQRKDRASTSQSSEALAIA

A0A2P5BCG4 Uncharacterized protein (Fragment)4.9e-3230.77Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN
        ++ R    EKGF    S   G LP F+  VI+Q+ W++ CAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKN

Query:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG
         + + +   L+ V   G +W  S     T + S L P + +W HFLK++L+PTTH  T+S +R++LL+S++ G  +N+  +I  EI AC  +K G LFF 
Subjt:  PSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFG

Query:  P------------------------TIDLSLIGKLQQ----NSVQR--KDRASTSQSSEALAIAYQQLDQIRDNL---------------------RTYW
                                  ID   + ++ Q     S Q+    R +T+ S+       QQL  +   L                     + +W
Subjt:  P------------------------TIDLSLIGKLQQ----NSVQR--KDRASTSQSSEALAIAYQQLDQIRDNL---------------------RTYW

Query:  AYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE
        AY+KERD A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  AYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE

A0A2P5DAQ2 Uncharacterized protein3.1e-2636.93Show/hide
Query:  PEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKES
        P F+  VI Q+ WQ  CAHP++ +VPLVREFY  +   +     +RG  V  S   IN ++ +  P+    +  +++ +  ++   L+ V   G +W  S
Subjt:  PEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKES

Query:  QTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFF
             T + S L P + +W HFLK++L+PTTH  T+S E V LLYS++ G  +N+  +I  EI AC  +K G LFF
Subjt:  QTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFF

A0A2P5DXM3 Uncharacterized protein9.3e-2330.58Show/hide
Query:  VPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N +  ++   L+ V   G +W  S     T + S L P + +W HFLK
Subjt:  VPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLK

Query:  NKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFGP----------------------TIDLSLIGKL---------QQNSV
        ++L+PTTH   +S +R++LL+S++ G  +N+  +I  EI AC  +K G LFF                         ID   + ++         QQ S 
Subjt:  NKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFGP----------------------TIDLSLIGKL---------QQNSV

Query:  QRKDRASTS-------QSSEALAIAYQQLDQIRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE
         R   AS+S       Q  +AL     Q +      + +WAY+KERD A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  QRKDRASTS-------QSSEALAIAYQQLDQIRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEE

A0A6A2ZUE4 Uncharacterized protein1.6e-2227.44Show/hide
Query:  PSPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVD
        P+P  P P  F D   +E  + ++ R    E GF         L   V  V++++KWQ+   HP      +V+EFY+ + E N    +VRG  + F+   
Subjt:  PSPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVD

Query:  INRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIR
        INR +K++        +  +    +  +  L+ +   G +W   Q K KT+    L P   +W HF+K+KLMPT+H++T+S +R++LL+SI+ G  ++I 
Subjt:  INRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNKGVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIR

Query:  SIIREEIFACGRKKVGKLFFGPTIDLSLI-----------------------------------GKLQQNSVQR-----KDRASTSQSSEALAIAYQQLD
         II E    C +++   L F P +  +L                                    GK  + +  R       RAS++   +A+   +Q + 
Subjt:  SIIREEIFACGRKKVGKLFFGPTIDLSLI-----------------------------------GKLQQNSVQR-----KDRASTSQSSEALAIAYQQLD

Query:  QIRDNLRTYWAYAKERD
        Q+ D L  Y+AYAK RD
Subjt:  QIRDNLRTYWAYAKERD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCCGGATGTGTTCGAGAACATAATCCGCCAAGTTGTGGCAAAGGCTCTCGTGATTGCTGAAGGTTATAAGGCTGAACAAGAAGCCTTGAGGGAAATTGAGGCCGA
GAGGGAGCTTGAAAATCAAAGCATGAGGGAAGAGGATGAATTTGCGAGAAAAAGAGACCTTGAAGAAGAAAGAAAGAAGGAAGAGGAAAAGCAAGAGGCCGAGAGGGCCT
TAGAAACTGAAAAAGAAAGAAAAGTAGATGAAGACCTCAGGAGGGCAGCTGCTGAATTGCAACTCCTTGAGGAAGAAAAACAGAGAAGGGAAGACTTGAAAGAAGATGAG
AAAAGAAGGAAGGAAGCCGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACGAGGCTCAAAGTGAGGCTGAGATGCTGCAAAGAAGGGAAGAAGAGGCCCTTGAGAGGCC
AACTGAAGAAAATCAAGAAAAAGAAAAAGAAAAAGAAGAAGCGGATGAAGGCCATAATACGACCGCATCTGGGCCGCATTCTGAAGAAGGCCAAGAAAAGGCCACAGAAG
CACAACCAGCTGACGAGGCTTTGGATCCTCTGTTCGAGTATGATGTGAGAGGACCTCCACCAGCAGCTGATAGCACCTCTTCAAGAGAGAAGAGGGATGAAGAAGAGAAA
GAGAATAAGGAGGCCGAGACCTCGAGTGACTCAGAAACAGAATTCGATTCAGAGATTAAGGAATTGGATGATGACCAAGTTCTTATCTCTGCAACATTGAGAAAAAAGAG
AAGAAGAGAGATCAGGGCCGAGAGGAGGACCAAAAACAAGAATGATCCGATTTTTGCCAAGAGGCCGAGGACAATGTCCATGGACGCCTCTCTTGCAGCTCCTCCTACCG
TCTCACCTGCCAAACCAAAAGCCAAATCCCCGAAGGCTCCATCTCCTAAAAATCCATTCCCCGAAGTCTTCAGAGATGTAAATTTTCAGGAAAGGATGGAGATTATGAGA
AAAAGAGATTTCCTCAACGAGAAGGGATTCTCTAACAGAGCGGGAACACTGCCAGAGTTCGTAACCATAGTTATCTCACAGTACAAGTGGCAGGAGTTATGTGCTCACCC
TCAGGAGGCTGTAGTGCCTTTAGTTCGAGAATTTTACGCCGGACTGAGGGAGGAAAACATGAGTATGGCAGTAGTGAGAGGCAAGATGGTTAGCTTCTCTTCTGTTGACA
TCAACCGAGTGTACAAAATGAAGGCACCATTGCATCCAAGAGGGAACAATGCCATTAAGAACCCCTCAGCCAAACAAATGAAGGAAGCGCTGAAATTGGTGGACAACAAG
GGAGTTCAGTGGAAAGAGTCCCAAACGAAGGTGAAAACATTAGTGTCAAGCGATCTCAAGCCAGAATCGGCAATGTGGCTTCACTTTTTGAAGAACAAGTTGATGCCAAC
AACCCATGATAGCACAATTTCAGTAGAGAGAGTCATGCTCCTCTACAGTATCATGAAGGGGTTTGAATTAAACATAAGGAGCATTATCAGGGAGGAAATCTTTGCCTGTG
GAAGGAAGAAAGTAGGGAAGCTCTTCTTTGGGCCTACCATTGACCTGTCCTTGATCGGGAAGCTTCAACAGAACAGCGTCCAAAGAAAAGATAGAGCTTCCACATCTCAA
TCATCCGAAGCCCTAGCAATTGCCTACCAACAACTAGATCAAATAAGGGACAACCTGAGGACTTATTGGGCATATGCCAAGGAGAGAGATGAAGCCATTAGAGAGTTCTA
CCTCTCTATCGCCCCGAGTATTGCTCCTGTCTTTCCTGATTTCCCTCAATCGTTGCTGCCTCAAGAAGATAAGGATTCTGATGATGAAGAAGATGAAAATGAAGATGAAG
AGAAAGAGAGTTCCTCGGACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCCCCGGATGTGTTCGAGAACATAATCCGCCAAGTTGTGGCAAAGGCTCTCGTGATTGCTGAAGGTTATAAGGCTGAACAAGAAGCCTTGAGGGAAATTGAGGCCGA
GAGGGAGCTTGAAAATCAAAGCATGAGGGAAGAGGATGAATTTGCGAGAAAAAGAGACCTTGAAGAAGAAAGAAAGAAGGAAGAGGAAAAGCAAGAGGCCGAGAGGGCCT
TAGAAACTGAAAAAGAAAGAAAAGTAGATGAAGACCTCAGGAGGGCAGCTGCTGAATTGCAACTCCTTGAGGAAGAAAAACAGAGAAGGGAAGACTTGAAAGAAGATGAG
AAAAGAAGGAAGGAAGCCGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACGAGGCTCAAAGTGAGGCTGAGATGCTGCAAAGAAGGGAAGAAGAGGCCCTTGAGAGGCC
AACTGAAGAAAATCAAGAAAAAGAAAAAGAAAAAGAAGAAGCGGATGAAGGCCATAATACGACCGCATCTGGGCCGCATTCTGAAGAAGGCCAAGAAAAGGCCACAGAAG
CACAACCAGCTGACGAGGCTTTGGATCCTCTGTTCGAGTATGATGTGAGAGGACCTCCACCAGCAGCTGATAGCACCTCTTCAAGAGAGAAGAGGGATGAAGAAGAGAAA
GAGAATAAGGAGGCCGAGACCTCGAGTGACTCAGAAACAGAATTCGATTCAGAGATTAAGGAATTGGATGATGACCAAGTTCTTATCTCTGCAACATTGAGAAAAAAGAG
AAGAAGAGAGATCAGGGCCGAGAGGAGGACCAAAAACAAGAATGATCCGATTTTTGCCAAGAGGCCGAGGACAATGTCCATGGACGCCTCTCTTGCAGCTCCTCCTACCG
TCTCACCTGCCAAACCAAAAGCCAAATCCCCGAAGGCTCCATCTCCTAAAAATCCATTCCCCGAAGTCTTCAGAGATGTAAATTTTCAGGAAAGGATGGAGATTATGAGA
AAAAGAGATTTCCTCAACGAGAAGGGATTCTCTAACAGAGCGGGAACACTGCCAGAGTTCGTAACCATAGTTATCTCACAGTACAAGTGGCAGGAGTTATGTGCTCACCC
TCAGGAGGCTGTAGTGCCTTTAGTTCGAGAATTTTACGCCGGACTGAGGGAGGAAAACATGAGTATGGCAGTAGTGAGAGGCAAGATGGTTAGCTTCTCTTCTGTTGACA
TCAACCGAGTGTACAAAATGAAGGCACCATTGCATCCAAGAGGGAACAATGCCATTAAGAACCCCTCAGCCAAACAAATGAAGGAAGCGCTGAAATTGGTGGACAACAAG
GGAGTTCAGTGGAAAGAGTCCCAAACGAAGGTGAAAACATTAGTGTCAAGCGATCTCAAGCCAGAATCGGCAATGTGGCTTCACTTTTTGAAGAACAAGTTGATGCCAAC
AACCCATGATAGCACAATTTCAGTAGAGAGAGTCATGCTCCTCTACAGTATCATGAAGGGGTTTGAATTAAACATAAGGAGCATTATCAGGGAGGAAATCTTTGCCTGTG
GAAGGAAGAAAGTAGGGAAGCTCTTCTTTGGGCCTACCATTGACCTGTCCTTGATCGGGAAGCTTCAACAGAACAGCGTCCAAAGAAAAGATAGAGCTTCCACATCTCAA
TCATCCGAAGCCCTAGCAATTGCCTACCAACAACTAGATCAAATAAGGGACAACCTGAGGACTTATTGGGCATATGCCAAGGAGAGAGATGAAGCCATTAGAGAGTTCTA
CCTCTCTATCGCCCCGAGTATTGCTCCTGTCTTTCCTGATTTCCCTCAATCGTTGCTGCCTCAAGAAGATAAGGATTCTGATGATGAAGAAGATGAAAATGAAGATGAAG
AGAAAGAGAGTTCCTCGGACGAGGACTAG
Protein sequenceShow/hide protein sequence
MPPDVFENIIRQVVAKALVIAEGYKAEQEALREIEAERELENQSMREEDEFARKRDLEEERKKEEEKQEAERALETEKERKVDEDLRRAAAELQLLEEEKQRREDLKEDE
KRRKEAEDFLAAFEPLHEAQSEAEMLQRREEEALERPTEENQEKEKEKEEADEGHNTTASGPHSEEGQEKATEAQPADEALDPLFEYDVRGPPPAADSTSSREKRDEEEK
ENKEAETSSDSETEFDSEIKELDDDQVLISATLRKKRRREIRAERRTKNKNDPIFAKRPRTMSMDASLAAPPTVSPAKPKAKSPKAPSPKNPFPEVFRDVNFQERMEIMR
KRDFLNEKGFSNRAGTLPEFVTIVISQYKWQELCAHPQEAVVPLVREFYAGLREENMSMAVVRGKMVSFSSVDINRVYKMKAPLHPRGNNAIKNPSAKQMKEALKLVDNK
GVQWKESQTKVKTLVSSDLKPESAMWLHFLKNKLMPTTHDSTISVERVMLLYSIMKGFELNIRSIIREEIFACGRKKVGKLFFGPTIDLSLIGKLQQNSVQRKDRASTSQ
SSEALAIAYQQLDQIRDNLRTYWAYAKERDEAIREFYLSIAPSIAPVFPDFPQSLLPQEDKDSDDEEDENEDEEKESSSDED