CuGenDBv2

Spg019036 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019036
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein MNN4-like
Genome location: scaffold12:25569676..25571955
RNA-Seq Expression: Spg019036
Synteny: Spg019036
Gene Ontology terms: NA
InterPro domains: NA
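
The genome location above (scaffold12:25569676..25571955) follows a `scaffold:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into scaffold, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold12:25569676..25571955")
print(scaffold, span_length(start, end))
```

Under that assumption the gene spans 2280 bp on scaffold12; with 0-based half-open coordinates the result would be one base shorter.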


Homology
GenBank top hits
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis] (e value: 1.1e-26, % identity: 37.56)
Query:  PEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKES
        P F+TRVI Q+ W+ FC HP   +VPLV+EFYA L + +     V+   V F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S
Subjt:  PEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKES

Query:  QTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILAC--GRKRAGKLFFGSLITQLCQRVKIVPGKDE
             + +  +LK  + +W HF+  R MP+TH  T++ DRV+LLY ++ G+ VN   I   EI AC   RKR G L+F SLITQL  +  +   KDE
Subjt:  QTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILAC--GRKRAGKLFFGSLITQLCQRVKIVPGKDE

PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus] (e value: 1.2e-25, % identity: 32.3)
Query:  PKGKSPKAASPKNPFPEVFKDVSFQERMEI-MKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMV
        PK K  +  S  +     F   S +ER    +  +V + E+GF  +  A  E +   + + KW+ F A P+  V+PLV+EFYA   E      +VRG+ V
Subjt:  PKGKSPKAASPKNPFPEVFKDVSFQERMEI-MKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMV

Query:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM
         F SV IN +Y I     P   D   N   +    ++  + +   G QWK ++ +  S   + L   + +WL FI  R++PT H   ++ DR +LLYC+M
Subjt:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM

Query:  KGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID
         G   + G II D I+         L+F SLIT+LC R  +   + EE  F +  ID
Subjt:  KGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 4.6e-36, % identity: 38.43)
Query:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK
        EKGF    SE  G LP F+ +VI Q+ W+ FCAHP++ +VPLV+EFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N +   +  
Subjt:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK

Query:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ
         L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  +N G +I  EI AC  ++ G LFF SLIT+LC+
Subjt:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ

Query:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ
          +     +EE+      ID   + ++ Q
Subjt:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 3.2e-37, % identity: 38.86)
Query:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK
        EKGF    SE  G LP F+ +VI Q+ W+ FCAHP++ +VPLV+EFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N + + +  
Subjt:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK

Query:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ
         L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  +N G +I  EI AC  ++ G LFF SLIT+LC+
Subjt:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ

Query:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ
          +     +EE+      ID   + ++ Q
Subjt:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e value: 1.2e-31, % identity: 33.2)
Query:  KAASPKNPFPEVFKDVSFQER-MEIMKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVD
        KA   ++   E+  + + Q R + + K+ V+ N K   +     P F+  VI Q+ WQ FCAHP++ +VPLV+EFY  +         +RG  V  S   
Subjt:  KAASPKNPFPEVFKDVSFQER-MEIMKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  +N G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEG

Query:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

TrEMBL top hits
A0A2G9G807 Uncharacterized protein (e value: 6.0e-26, % identity: 32.3)
Query:  PKGKSPKAASPKNPFPEVFKDVSFQERMEI-MKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMV
        PK K  +  S  +     F   S +ER    +  +V + E+GF  +  A  E +   + + KW+ F A P+  V+PLV+EFYA   E      +VRG+ V
Subjt:  PKGKSPKAASPKNPFPEVFKDVSFQERMEI-MKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMV

Query:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM
         F SV IN +Y I     P   D   N   +    ++  + +   G QWK ++ +  S   + L   + +WL FI  R++PT H   ++ DR +LLYC+M
Subjt:  SFSSVDINRVYRIKAPLNPRGNDVIRN--PSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLM

Query:  KGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID
         G   + G II D I+         L+F SLIT+LC R  +   + EE  F +  ID
Subjt:  KGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTID

A0A2P5AGA5 Uncharacterized protein (Fragment) (e value: 2.2e-36, % identity: 38.43)
Query:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK
        EKGF    SE  G LP F+ +VI Q+ W+ FCAHP++ +VPLV+EFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I N +   +  
Subjt:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK

Query:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ
         L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  +N G +I  EI AC  ++ G LFF SLIT+LC+
Subjt:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ

Query:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ
          +     +EE+      ID   + ++ Q
Subjt:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 1.5e-37, % identity: 38.86)
Query:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK
        EKGF    SE  G LP F+ +VI Q+ W+ FCAHP++ +VPLV+EFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N + + +  
Subjt:  EKGF----SERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKK

Query:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ
         L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  +N G +I  EI AC  ++ G LFF SLIT+LC+
Subjt:  ALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQ

Query:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ
          +     +EE+      ID   + ++ Q
Subjt:  RVKIVPGKDEERHFFKPTIDLSLIGKLQQ

A0A2P5DAQ2 Uncharacterized protein (e value: 5.6e-32, % identity: 33.2)
Query:  KAASPKNPFPEVFKDVSFQER-MEIMKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVD
        KA   ++   E+  + + Q R + + K+ V+ N K   +     P F+  VI Q+ WQ FCAHP++ +VPLV+EFY  +         +RG  V  S   
Subjt:  KAASPKNPFPEVFKDVSFQER-MEIMKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  +N G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEG

Query:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

W9QTD9 Uncharacterized protein (e value: 5.4e-27, % identity: 37.56)
Query:  PEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKES
        P F+TRVI Q+ W+ FC HP   +VPLV+EFYA L + +     V+   V F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S
Subjt:  PEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKES

Query:  QTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILAC--GRKRAGKLFFGSLITQLCQRVKIVPGKDE
             + +  +LK  + +W HF+  R MP+TH  T++ DRV+LLY ++ G+ VN   I   EI AC   RKR G L+F SLITQL  +  +   KDE
Subjt:  QTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEVNEGSIIRDEILAC--GRKRAGKLFFGSLITQLCQRVKIVPGKDE

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAGAACAATCAAGGATCGTCATCATCACGCAAAGACACTCGATCTCAAAGTGCACAAGCAACCCACGAAGAAGAAGCAAGCGTGCGACGGCAAGAGGAGAACCCCGA
ATTGCCCATGCAAGGCACACGAAGGACGAGACCCACGGGATTCTTGCTGACGGTCATGAACCAAACGTCCAACGCTCCAACTCCATCTTCTTTGGCAATGCCGGCTAGTT
CGAGGGAGATGCCGAGTTCATCTATGCCAAGGAGGTTCACTTGCGCCACTGCTGTCCGCCAAACCCAAAAACCCGTCGCTCAACAGTTTAGAAAACGTTCGCGAGAGTGG
TTTGGAATGATCCGTGAGATGGGTGCCAAGAGACGAGCTGCCCTTGAAGAAGAAGGGAATCGACAAGATGAAGAAAAAGCCGCCAAGGCAGCCGAAAGCTCTCGGCAAGG
AGAAGCTTCAATGGGTAAGGTTTCTGAACCTTCATCTAACCCTTCTTTTTCTTGCAGGTCAAAACCCGTTGTTACTTATAGCGCAAGAAAGAGGAGCCCGAAAAAGGTTG
TGTCTGAAAGGCCATTAAAAATTGAGCCCCTCAAAACCGCAAGGATGCCTCCCATTGTATTCGAAGGAATAATTCGCCAAGTAGTGGCAAAGGCCCTTGAGATTGCTGAG
GGATACAAGGTTGAACAGGATGCTCTGAAAGATGTTGAAGCGGAGAGAGAAATGGAAAATCCGAAAATGGATGAGGAAGATGAGTTTGCAAAGGAAAGAGATGAGATAGA
AGAGAAAAGAAAAAGAGAAGAAGAGCAAGAGGCCAAGAGGGCCTTAGAAGTTGAGGAAGAAAGAAAGTATGAGGAAAACCTCAGGAGGGCAGCCATTGACTTAGCTCAAA
GTGAGGCTAATGTGCTACAAGGAAGGGTAGAAGAAAAGGCCCAACAGGGGCCAACAGAAGAAAATTTTGAAAAAGAAAAAGAAAGAGAAGTGGAGAATGAAGGCCAGAAT
GTGACCGTATCTGGGCCGCATTCTGAGGAAGGCCTAGACGAGGCCACCGTTAATCAGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGTTGA
TAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAGAAAAAAGAAGACGAGGAGGCCGAGACCTCCGGTGATTCTGACTCTGATACAGAATCTGATTCAGAGGTAAGGGAGC
TAGATGATGACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGGAAGAGAGAGATTAAGGCTGAGAGGAGGACGAAGAACAAGAATGACCCGATATTTGCTAAGAGG
CCGAGGACAAGGTCCATGGATGCCTCTCCTGTAGTTCCTCCGACTACCTCACCCGCCAAGCCTAAGGGCAAGTCACCGAAGGCCGCATCTCCTAAAAATCCATTCCCTGA
GGTATTTAAAGATGTTAGTTTTCAGGAAAGGATGGAGATCATGAAGAAAAGAGTTTTCCTCAACGAGAAGGGATTCTCTGAAAGAGCTGGAGCACTGCCTGAGTTCGTAA
CAAGAGTTATCTTCCAGTACAAGTGGCAGAACTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTCTAGTTCAAGAGTTTTACGCTGGCCTGAGGGAGGAGAGTATTAGC
ATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAACAGGGTGTACAGGATCAAAGCACCCCTGAATCCACGAGGGAATGACGTTATCAGGAACCC
TTCGGCCAAGCAGATGAAAAAAGCTCTTAAACTTGTGGCCAATAAGGGGGTTCAATGGAAAGAATCGCAGACAAAAGTGAAGTCTTTAGTGCCAAGCGACTTAAAGCCAG
AATCAGCAGTTTGGCTTCACTTCATCAAGAACCGTCTAATGCCAACCACCCACGACAGCACGATTTCAGTGGATAGAGTGATGCTACTCTATTGTCTTATGAAGGGGTTG
GAAGTCAATGAAGGGAGCATCATCAGGGATGAGATCCTAGCCTGTGGACGGAAAAGGGCAGGCAAGCTTTTCTTTGGCTCACTCATCACCCAGCTCTGTCAGAGGGTGAA
GATTGTGCCAGGCAAGGACGAGGAGCGCCACTTCTTTAAACCAACCATTGACTTGTCCTTGATAGGGAAGCTTCAGCAGAACTAA
mRNA sequence
ATGAAGAACAATCAAGGATCGTCATCATCACGCAAAGACACTCGATCTCAAAGTGCACAAGCAACCCACGAAGAAGAAGCAAGCGTGCGACGGCAAGAGGAGAACCCCGA
ATTGCCCATGCAAGGCACACGAAGGACGAGACCCACGGGATTCTTGCTGACGGTCATGAACCAAACGTCCAACGCTCCAACTCCATCTTCTTTGGCAATGCCGGCTAGTT
CGAGGGAGATGCCGAGTTCATCTATGCCAAGGAGGTTCACTTGCGCCACTGCTGTCCGCCAAACCCAAAAACCCGTCGCTCAACAGTTTAGAAAACGTTCGCGAGAGTGG
TTTGGAATGATCCGTGAGATGGGTGCCAAGAGACGAGCTGCCCTTGAAGAAGAAGGGAATCGACAAGATGAAGAAAAAGCCGCCAAGGCAGCCGAAAGCTCTCGGCAAGG
AGAAGCTTCAATGGGTAAGGTTTCTGAACCTTCATCTAACCCTTCTTTTTCTTGCAGGTCAAAACCCGTTGTTACTTATAGCGCAAGAAAGAGGAGCCCGAAAAAGGTTG
TGTCTGAAAGGCCATTAAAAATTGAGCCCCTCAAAACCGCAAGGATGCCTCCCATTGTATTCGAAGGAATAATTCGCCAAGTAGTGGCAAAGGCCCTTGAGATTGCTGAG
GGATACAAGGTTGAACAGGATGCTCTGAAAGATGTTGAAGCGGAGAGAGAAATGGAAAATCCGAAAATGGATGAGGAAGATGAGTTTGCAAAGGAAAGAGATGAGATAGA
AGAGAAAAGAAAAAGAGAAGAAGAGCAAGAGGCCAAGAGGGCCTTAGAAGTTGAGGAAGAAAGAAAGTATGAGGAAAACCTCAGGAGGGCAGCCATTGACTTAGCTCAAA
GTGAGGCTAATGTGCTACAAGGAAGGGTAGAAGAAAAGGCCCAACAGGGGCCAACAGAAGAAAATTTTGAAAAAGAAAAAGAAAGAGAAGTGGAGAATGAAGGCCAGAAT
GTGACCGTATCTGGGCCGCATTCTGAGGAAGGCCTAGACGAGGCCACCGTTAATCAGCCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGTTGA
TAGCACCTCTTCGGGAGAGAAGAGGGTTGAAGAGAAAAAAGAAGACGAGGAGGCCGAGACCTCCGGTGATTCTGACTCTGATACAGAATCTGATTCAGAGGTAAGGGAGC
TAGATGATGACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGGAAGAGAGAGATTAAGGCTGAGAGGAGGACGAAGAACAAGAATGACCCGATATTTGCTAAGAGG
CCGAGGACAAGGTCCATGGATGCCTCTCCTGTAGTTCCTCCGACTACCTCACCCGCCAAGCCTAAGGGCAAGTCACCGAAGGCCGCATCTCCTAAAAATCCATTCCCTGA
GGTATTTAAAGATGTTAGTTTTCAGGAAAGGATGGAGATCATGAAGAAAAGAGTTTTCCTCAACGAGAAGGGATTCTCTGAAAGAGCTGGAGCACTGCCTGAGTTCGTAA
CAAGAGTTATCTTCCAGTACAAGTGGCAGAACTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTCTAGTTCAAGAGTTTTACGCTGGCCTGAGGGAGGAGAGTATTAGC
ATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATTAACAGGGTGTACAGGATCAAAGCACCCCTGAATCCACGAGGGAATGACGTTATCAGGAACCC
TTCGGCCAAGCAGATGAAAAAAGCTCTTAAACTTGTGGCCAATAAGGGGGTTCAATGGAAAGAATCGCAGACAAAAGTGAAGTCTTTAGTGCCAAGCGACTTAAAGCCAG
AATCAGCAGTTTGGCTTCACTTCATCAAGAACCGTCTAATGCCAACCACCCACGACAGCACGATTTCAGTGGATAGAGTGATGCTACTCTATTGTCTTATGAAGGGGTTG
GAAGTCAATGAAGGGAGCATCATCAGGGATGAGATCCTAGCCTGTGGACGGAAAAGGGCAGGCAAGCTTTTCTTTGGCTCACTCATCACCCAGCTCTGTCAGAGGGTGAA
GATTGTGCCAGGCAAGGACGAGGAGCGCCACTTCTTTAAACCAACCATTGACTTGTCCTTGATAGGGAAGCTTCAGCAGAACTAA
Protein sequence
MKNNQGSSSSRKDTRSQSAQATHEEEASVRRQEENPELPMQGTRRTRPTGFLLTVMNQTSNAPTPSSLAMPASSREMPSSSMPRRFTCATAVRQTQKPVAQQFRKRSREW
FGMIREMGAKRRAALEEEGNRQDEEKAAKAAESSRQGEASMGKVSEPSSNPSFSCRSKPVVTYSARKRSPKKVVSERPLKIEPLKTARMPPIVFEGIIRQVVAKALEIAE
GYKVEQDALKDVEAEREMENPKMDEEDEFAKERDEIEEKRKREEEQEAKRALEVEEERKYEENLRRAAIDLAQSEANVLQGRVEEKAQQGPTEENFEKEKEREVENEGQN
VTVSGPHSEEGLDEATVNQPAEEVFEPLFTHDPPAVDSTSSGEKRVEEKKEDEEAETSGDSDSDTESDSEVRELDDDQVPISAALRRKRKREIKAERRTKNKNDPIFAKR
PRTRSMDASPVVPPTTSPAKPKGKSPKAASPKNPFPEVFKDVSFQERMEIMKKRVFLNEKGFSERAGALPEFVTRVIFQYKWQNFCAHPQEAVVPLVQEFYAGLREESIS
MAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKKALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGL
EVNEGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN
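
The CDS and protein sequences above can be cross-checked by translating codons. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used, and the names `CODON_TABLE` and `translate` are hypothetical. Only short fragments copied from the start and end of the CDS are translated here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# ASSUMPTION: the record does not say which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 21 and last 15 nucleotides of the CDS listed above.
print(translate("ATGAAGAACAATCAAGGATCG"))
print(translate("CTTCAGCAGAACTAA"))
```

The two fragments translate to MKNNQGS and LQQN, which agree with the first and last residues of the protein sequence listed above. Passing the full CDS (with line breaks) to `translate` would allow the same comparison over the whole sequence.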