; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028037 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028037
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold2:44155114..44157712
RNA-Seq ExpressionSpg028037
SyntenySpg028037
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]3.8e-2031.53Show/hide
Query:  EKGFSNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNPSAKQTKVKTLV
        E+GF  + EA  E +   + + KW+ F A  +  V+PLVREFY +  E      ++RG+   F  V IN +Y I         +   N    +   +TL 
Subjt:  EKGFSNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNPSAKQTKVKTLV

Query:  P-----------------SDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVP
        P                 + L   + IWL F+  R++PT H   ++ADR +LLYCIM G   ++G II D I+         L+F SLIT+LC R  V  
Subjt:  P-----------------SDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVP

Query:  GKDKE----RHSFKPTIDLSLI
         + +E    RH    T  L ++
Subjt:  GKDKE----RHSFKPTIDLSLI

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.0e-2532.56Show/hide
Query:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPL--------NPK
        ++ R    EKGF   ++ T     F+ +VI+Q+ W++FCAH ++ +VPLVREFY +L +   +   +RG   S+S   IN V+ +  P+        N  
Subjt:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPL--------NPK

Query:  GNDVIR------------NPSAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFG
         +D+I             N SA+     T + S L P + +W HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++   LFF 
Subjt:  GNDVIR------------NPSAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFG

Query:  SLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQATPQSGS
        SLIT+LC+        ++E+      ID   + ++ Q    +   + S+S+    S S
Subjt:  SLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQATPQSGS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.4e-2929.14Show/hide
Query:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNP
        ++ R    EKGF   ++ T     F+ +VI+Q+ W++FCAH ++ +VPLVREFY +L +   +   +RG   S+S   IN V+ +  P++ + ++ I+N 
Subjt:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNP

Query:  SAKQ--TKVKTL-----------------VPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGS
        + +   T ++T+                 + S L P + +W HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++   LFF S
Subjt:  SAKQ--TKVKTL-----------------VPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGS

Query:  LITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQ-ATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWA
        LIT+LC+        ++E+      ID   + ++ Q    +   + S+S+ AT  S        Q         S + +   +    L    +  + +WA
Subjt:  LITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQ-ATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWA

Query:  YAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD
        Y+KER  A+++   +      P FP FP+ +L   D + + + + DG ++
Subjt:  YAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.6e-2432.46Show/hide
Query:  ASPKNSFPKVFKDVNFQERMEIMRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVD
        AS    F     ++ ++E ++    R    EK F   +++    P F+  VI Q+ WQ FCAH ++ +VPLVREFY ++         +RG     S+  
Subjt:  ASPKNSFPKVFKDVNFQERMEIMRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVD

Query:  INRVYRIKAPLNPKGNDV--IRNP----------------SAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGS
        IN ++ +  P++     V  I  P                +       T + S L P + +W HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G 
Subjt:  INRVYRIKAPLNPKGNDV--IRNP----------------SAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGS

Query:  IIRDEILACWRKRAAKLFFGSLITQLCQ
        +I  EI AC  +++  LFF SLIT +C+
Subjt:  IIRDEILACWRKRAAKLFFGSLITQLCQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]3.5e-2129.43Show/hide
Query:  VPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLN-----------PKGNDVIRNPSAKQTK-------VKTLVPSDLKPESTIWLHFLKN
        +PLVREFY +L +   +   +RG   S+S   IN V+ +  P++           P+   V+   +A   +         T + S L P + +W HFLK+
Subjt:  VPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLN-----------PKGNDVIRNPSAKQTK-------VKTLVPSDLKPESTIWLHFLKN

Query:  RLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKLQHNSIQQKDKAST
        RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++   LFF SLIT+LC+    +  ++K  ++ +  ID   + ++     Q+    ST
Subjt:  RLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKLQHNSIQQKDKAST

Query:  SQATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLRTYWAYAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD
         Q  P S    A+ S  T         +AL     Q +   +  + +WAY+KER  A+++   +      P FP FP+ +L   D + + + + DG ++
Subjt:  SQATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLRTYWAYAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein1.8e-2031.53Show/hide
Query:  EKGFSNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNPSAKQTKVKTLV
        E+GF  + EA  E +   + + KW+ F A  +  V+PLVREFY +  E      ++RG+   F  V IN +Y I         +   N    +   +TL 
Subjt:  EKGFSNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNPSAKQTKVKTLV

Query:  P-----------------SDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVP
        P                 + L   + IWL F+  R++PT H   ++ADR +LLYCIM G   ++G II D I+         L+F SLIT+LC R  V  
Subjt:  P-----------------SDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVP

Query:  GKDKE----RHSFKPTIDLSLI
         + +E    RH    T  L ++
Subjt:  GKDKE----RHSFKPTIDLSLI

A0A2P5AGA5 Uncharacterized protein (Fragment)5.0e-2632.56Show/hide
Query:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPL--------NPK
        ++ R    EKGF   ++ T     F+ +VI+Q+ W++FCAH ++ +VPLVREFY +L +   +   +RG   S+S   IN V+ +  P+        N  
Subjt:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPL--------NPK

Query:  GNDVIR------------NPSAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFG
         +D+I             N SA+     T + S L P + +W HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++   LFF 
Subjt:  GNDVIR------------NPSAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFG

Query:  SLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQATPQSGS
        SLIT+LC+        ++E+      ID   + ++ Q    +   + S+S+    S S
Subjt:  SLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQATPQSGS

A0A2P5BCG4 Uncharacterized protein (Fragment)1.7e-2929.14Show/hide
Query:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNP
        ++ R    EKGF   ++ T     F+ +VI+Q+ W++FCAH ++ +VPLVREFY +L +   +   +RG   S+S   IN V+ +  P++ + ++ I+N 
Subjt:  MRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNP

Query:  SAKQ--TKVKTL-----------------VPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGS
        + +   T ++T+                 + S L P + +W HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++   LFF S
Subjt:  SAKQ--TKVKTL-----------------VPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGS

Query:  LITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQ-ATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWA
        LIT+LC+        ++E+      ID   + ++ Q    +   + S+S+ AT  S        Q         S + +   +    L    +  + +WA
Subjt:  LITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKL-QHNSIQQKDKASTSQ-ATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRENLRTYWA

Query:  YAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD
        Y+KER  A+++   +      P FP FP+ +L   D + + + + DG ++
Subjt:  YAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD

A0A2P5DAQ2 Uncharacterized protein1.2e-2432.46Show/hide
Query:  ASPKNSFPKVFKDVNFQERMEIMRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVD
        AS    F     ++ ++E ++    R    EK F   +++    P F+  VI Q+ WQ FCAH ++ +VPLVREFY ++         +RG     S+  
Subjt:  ASPKNSFPKVFKDVNFQERMEIMRKRDFLNEKGF---SNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVD

Query:  INRVYRIKAPLNPKGNDV--IRNP----------------SAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGS
        IN ++ +  P++     V  I  P                +       T + S L P + +W HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G 
Subjt:  INRVYRIKAPLNPKGNDV--IRNP----------------SAKQTKVKTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGS

Query:  IIRDEILACWRKRAAKLFFGSLITQLCQ
        +I  EI AC  +++  LFF SLIT +C+
Subjt:  IIRDEILACWRKRAAKLFFGSLITQLCQ

A0A2P5DXM3 Uncharacterized protein1.7e-2129.43Show/hide
Query:  VPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLN-----------PKGNDVIRNPSAKQTK-------VKTLVPSDLKPESTIWLHFLKN
        +PLVREFY +L +   +   +RG   S+S   IN V+ +  P++           P+   V+   +A   +         T + S L P + +W HFLK+
Subjt:  VPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLN-----------PKGNDVIRNPSAKQTK-------VKTLVPSDLKPESTIWLHFLKN

Query:  RLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKLQHNSIQQKDKAST
        RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++   LFF SLIT+LC+    +  ++K  ++ +  ID   + ++     Q+    ST
Subjt:  RLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKLQHNSIQQKDKAST

Query:  SQATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLRTYWAYAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD
         Q  P S    A+ S  T         +AL     Q +   +  + +WAY+KER  A+++   +      P FP FP+ +L   D + + + + DG ++
Subjt:  SQATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLRTYWAYAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACACGCAAAAACCCTCATCATCACACAAGATCACTCGATCTCAGAGTGCTCGAACTAGCCACGAAGCTAAAGCAAGTGACCAACGGCAAGATGAGCAACCCGA
AAGACCCATGCATGGCATGAGAAGAACGAGACCCACGGGATTCTCGCCGGCCATCGTGAACCCGGAGCCCAACGCTCAAACTCCATCTTCTTCGACAATTCCGACCACGT
CGAGGGAGAATCCGAGTTTGTCTTCAACGAGAAGGCCCACGGGCGCCAATGTCGTCCGCCAAACCCAAAAACCCGCCACTCAACAGTTTAGAAAACGCTCGCGGGAGTGG
ACCAAGCCAGTTGTTACCTACAGTGCAAGGAAGGGGAGCCCGAAGAAGGTTGTGCCTGAAAAGCCGCTTGTACTTGAGCCCCTCAAAACCGCGAGAATGCCTCCGGATGT
GTTTGAAGATATAATCCACCAAGCAGTGGCAAAGGCTCTCGTGATTGCTGAAGGGTACAAGGCTGAACAAGAGGCTTTGGAAGTTATTGAGGCTGAGAGAGAGATGGAAA
ATCAACACATGATGGAGGAAGATGATTTTGCGAAGAGAAGAGATGATGAAGATAGGGAAGAAAAGAGAAAGAGAGAAGAAGAGACGACAGCCATTGACTTGCAACTCCTT
GAGGAAGAGAAAAAGAGAAGAGAAGAATTGAAAGAAGACGAAAAAAGAAGAAAGGAAGTGGAAGACTTCCTTGCAGCATTTGAGCCACTTCACAAGGCTCAAAGTGAAGC
TGAATTGCTGCAAGGGAGGGTAGACGAAGAGGCCCAACCGGGGCCAAGAGAAGAAAAAGAAAAAGAAAGAGAAGTAGAGGATGAAGGCCAGAATGCGACCGCATCTGGGT
CGCATTCTGAAGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAGAAGTTTTTGAACCTCTATTCACGAATGACCCACCAGCAGCTGATAGCACCTCTTCGGGA
GAGAAGAGGGACGAAGAAGAGAAAGAAGACAAGGAGGCCAAGACCTCCACTGACTCTGAAACAGAATCAGACTCAGAGATCAAGGAGCTGGATGGCGACCGAGTTCCTAT
CTCTGCAGCGTTGAGGAGAAAGAGAAAGAGAGAGATAAAGGCTGAAAGGAGGACAAAGAACAAAAATGACCCAATTTTTGCAAAAAGGCCGAGGATTAGGTCAATGGACG
TCTCTCCTGCAGTTCCTCCTACCATCTCACCCGCCATGCCGAAGGGCAAATCACCTAAGGCTGCATCTCCTAAAAATTCATTCCCAAAGGTATTCAAAGATGTTAATTTT
CAAGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAATGAGAAGGGATTCTCTAATAGAACAGAAGCACTGCCAGAGTTCGTAACAAGAGTTATCTCCCAATACAA
GTGGCAGGAGTTTTGTGCTCACCATCAGGAGGCTGTAGTGCCTTTAGTTCGAGAGTTTTACCCCAGCCTGAAGGAGGAAAGCATTAGTATAGCGGTAATGAGAGGCAAAA
TCGCCAGCTTCTCTTTAGTGGATATTAACAGGGTGTACAGGATCAAGGCACCCCTGAATCCAAAAGGGAATGATGTTATTAGGAACCCCTCGGCCAAGCAGACGAAGGTG
AAGACTTTAGTGCCAAGCGACTTAAAGCCAGAATCGACAATTTGGCTTCACTTTCTGAAAAACCGCTTGATGCCAACAACCCACGACAGCACGATCTCAGCAGATAGAGT
GATGCTACTCTACTGTATTATGAAGGGGTTGGAGATCAACATCGGGAGCATAATCAGGGATGAGATTCTAGCCTGTTGGAGAAAAAGAGCAGCCAAGCTTTTCTTTGGAT
CACTCATCACCCAGCTTTGTCAGAGGGTGAATGTCGTTCCAGGCAAGGACAAGGAGCGTCATTCCTTCAAGCCGACCATTGATTTGTCCTTGATCGGGAAGCTCCAGCAT
AATAGCATCCAACAGAAAGACAAAGCCTCCACATCTCAGGCTACTCCTCAATCAGGGTCGAATGTAGCCTCTCCATCCCAGCACACTCCTTTCACAGGGCCTTCGCCATC
TTCTGAGGCCCTAGCCATTGCGTACCGCCAGCTTGATCAAATCAGGGAAAACCTGAGGACATATTGGGCATACGCAAAGGAAAGGGTTGAGGCCATTAGAGAGTTCTATC
TCTCTATTGCCCCGAGTATTGCTCCAGTCTTTCCAAATTTCCCCCGGTCGCTGCTGCCTAAAGAGGATGAGGATTCTGATGAAGATGATGAGAATGATGGTGAAGATGAT
GAAGAGAAAGAGAGTTCCTCGGACGAGGACCAGGGTAGTTTTCTGATCCCCTTTGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACACGCAAAAACCCTCATCATCACACAAGATCACTCGATCTCAGAGTGCTCGAACTAGCCACGAAGCTAAAGCAAGTGACCAACGGCAAGATGAGCAACCCGA
AAGACCCATGCATGGCATGAGAAGAACGAGACCCACGGGATTCTCGCCGGCCATCGTGAACCCGGAGCCCAACGCTCAAACTCCATCTTCTTCGACAATTCCGACCACGT
CGAGGGAGAATCCGAGTTTGTCTTCAACGAGAAGGCCCACGGGCGCCAATGTCGTCCGCCAAACCCAAAAACCCGCCACTCAACAGTTTAGAAAACGCTCGCGGGAGTGG
ACCAAGCCAGTTGTTACCTACAGTGCAAGGAAGGGGAGCCCGAAGAAGGTTGTGCCTGAAAAGCCGCTTGTACTTGAGCCCCTCAAAACCGCGAGAATGCCTCCGGATGT
GTTTGAAGATATAATCCACCAAGCAGTGGCAAAGGCTCTCGTGATTGCTGAAGGGTACAAGGCTGAACAAGAGGCTTTGGAAGTTATTGAGGCTGAGAGAGAGATGGAAA
ATCAACACATGATGGAGGAAGATGATTTTGCGAAGAGAAGAGATGATGAAGATAGGGAAGAAAAGAGAAAGAGAGAAGAAGAGACGACAGCCATTGACTTGCAACTCCTT
GAGGAAGAGAAAAAGAGAAGAGAAGAATTGAAAGAAGACGAAAAAAGAAGAAAGGAAGTGGAAGACTTCCTTGCAGCATTTGAGCCACTTCACAAGGCTCAAAGTGAAGC
TGAATTGCTGCAAGGGAGGGTAGACGAAGAGGCCCAACCGGGGCCAAGAGAAGAAAAAGAAAAAGAAAGAGAAGTAGAGGATGAAGGCCAGAATGCGACCGCATCTGGGT
CGCATTCTGAAGAAGGCCTAGCCGAGGCCACCATTGATCAGCCAGCTGAAGAAGTTTTTGAACCTCTATTCACGAATGACCCACCAGCAGCTGATAGCACCTCTTCGGGA
GAGAAGAGGGACGAAGAAGAGAAAGAAGACAAGGAGGCCAAGACCTCCACTGACTCTGAAACAGAATCAGACTCAGAGATCAAGGAGCTGGATGGCGACCGAGTTCCTAT
CTCTGCAGCGTTGAGGAGAAAGAGAAAGAGAGAGATAAAGGCTGAAAGGAGGACAAAGAACAAAAATGACCCAATTTTTGCAAAAAGGCCGAGGATTAGGTCAATGGACG
TCTCTCCTGCAGTTCCTCCTACCATCTCACCCGCCATGCCGAAGGGCAAATCACCTAAGGCTGCATCTCCTAAAAATTCATTCCCAAAGGTATTCAAAGATGTTAATTTT
CAAGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAATGAGAAGGGATTCTCTAATAGAACAGAAGCACTGCCAGAGTTCGTAACAAGAGTTATCTCCCAATACAA
GTGGCAGGAGTTTTGTGCTCACCATCAGGAGGCTGTAGTGCCTTTAGTTCGAGAGTTTTACCCCAGCCTGAAGGAGGAAAGCATTAGTATAGCGGTAATGAGAGGCAAAA
TCGCCAGCTTCTCTTTAGTGGATATTAACAGGGTGTACAGGATCAAGGCACCCCTGAATCCAAAAGGGAATGATGTTATTAGGAACCCCTCGGCCAAGCAGACGAAGGTG
AAGACTTTAGTGCCAAGCGACTTAAAGCCAGAATCGACAATTTGGCTTCACTTTCTGAAAAACCGCTTGATGCCAACAACCCACGACAGCACGATCTCAGCAGATAGAGT
GATGCTACTCTACTGTATTATGAAGGGGTTGGAGATCAACATCGGGAGCATAATCAGGGATGAGATTCTAGCCTGTTGGAGAAAAAGAGCAGCCAAGCTTTTCTTTGGAT
CACTCATCACCCAGCTTTGTCAGAGGGTGAATGTCGTTCCAGGCAAGGACAAGGAGCGTCATTCCTTCAAGCCGACCATTGATTTGTCCTTGATCGGGAAGCTCCAGCAT
AATAGCATCCAACAGAAAGACAAAGCCTCCACATCTCAGGCTACTCCTCAATCAGGGTCGAATGTAGCCTCTCCATCCCAGCACACTCCTTTCACAGGGCCTTCGCCATC
TTCTGAGGCCCTAGCCATTGCGTACCGCCAGCTTGATCAAATCAGGGAAAACCTGAGGACATATTGGGCATACGCAAAGGAAAGGGTTGAGGCCATTAGAGAGTTCTATC
TCTCTATTGCCCCGAGTATTGCTCCAGTCTTTCCAAATTTCCCCCGGTCGCTGCTGCCTAAAGAGGATGAGGATTCTGATGAAGATGATGAGAATGATGGTGAAGATGAT
GAAGAGAAAGAGAGTTCCTCGGACGAGGACCAGGGTAGTTTTCTGATCCCCTTTGACTAA
Protein sequenceShow/hide protein sequence
MKNTQKPSSSHKITRSQSARTSHEAKASDQRQDEQPERPMHGMRRTRPTGFSPAIVNPEPNAQTPSSSTIPTTSRENPSLSSTRRPTGANVVRQTQKPATQQFRKRSREW
TKPVVTYSARKGSPKKVVPEKPLVLEPLKTARMPPDVFEDIIHQAVAKALVIAEGYKAEQEALEVIEAEREMENQHMMEEDDFAKRRDDEDREEKRKREEETTAIDLQLL
EEEKKRREELKEDEKRRKEVEDFLAAFEPLHKAQSEAELLQGRVDEEAQPGPREEKEKEREVEDEGQNATASGSHSEEGLAEATIDQPAEEVFEPLFTNDPPAADSTSSG
EKRDEEEKEDKEAKTSTDSETESDSEIKELDGDRVPISAALRRKRKREIKAERRTKNKNDPIFAKRPRIRSMDVSPAVPPTISPAMPKGKSPKAASPKNSFPKVFKDVNF
QERMEIMRKRDFLNEKGFSNRTEALPEFVTRVISQYKWQEFCAHHQEAVVPLVREFYPSLKEESISIAVMRGKIASFSLVDINRVYRIKAPLNPKGNDVIRNPSAKQTKV
KTLVPSDLKPESTIWLHFLKNRLMPTTHDSTISADRVMLLYCIMKGLEINIGSIIRDEILACWRKRAAKLFFGSLITQLCQRVNVVPGKDKERHSFKPTIDLSLIGKLQH
NSIQQKDKASTSQATPQSGSNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRENLRTYWAYAKERVEAIREFYLSIAPSIAPVFPNFPRSLLPKEDEDSDEDDENDGEDD
EEKESSSDEDQGSFLIPFD