CuGenDBv2

Spg031151 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg031151
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold8:29579344..29586670
RNA-Seq Expression: Spg031151
Synteny: Spg031151
Gene Ontology terms: NA
InterPro domains: NA
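
The genome location field packs a sequence ID and a coordinate pair into one string. A minimal sketch of splitting it and computing the span length follows; the helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold8:29579344..29586670")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 7327 bases on scaffold8.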


Homology
GenBank top hits
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis] (e-value 5.8e-12, identity 26.25%)
Query:  DFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGND----VIRNP
        + + EKGF    +     P F+S VI    WQ FC HP + +VPLV+EFY  L+ +  +   V    + F+S  IN V  I     P  +D    +I + 
Subjt:  DFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGND----VIRNP

Query:  SAKQMKEALKLVANKGVQW----KESQT------------------------------------------KGLEINIGSIIREEILACGRKRACKLFFGS
          +Q+KE LK +A  G QW    K S T                                           G  IN+G +I ++I AC  K    L+F S
Subjt:  SAKQMKEALKLVANKGVQW----KESQT------------------------------------------KGLEINIGSIIREEILACGRKRACKLFFGS

Query:  LITQLCQRVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTGPSPSSEVL-----------VIAYRQLDQIRE
        LI++LC +  +     E R      + L  I ++     ++ +K    +  +   P+R S+S HT     + S E L              +  L Q +E
Subjt:  LITQLCQRVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTGPSPSSEVL-----------VIAYRQLDQIRE

Query:  NLSTYWAYAKERDEAIREFY
         L  +W Y+++RD A+++ +
Subjt:  NLSTYWAYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e-value 2.1e-14, identity 28.46%)
Query:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE
        EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  V +S   IN V+ +  P+    ++ I N +   +  
Subjt:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE

Query:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ
         L+ VA  G +W                                                S   G  IN+G +I  EI AC  ++   LFF SLIT+LC+
Subjt:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ

Query:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNR--TSSSQHT
          +     +EE+        L   G++   ++ R  +   +++TQ    +R  T+SS  T
Subjt:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNR--TSSSQHT

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e-value 2.9e-19, identity 27.17%)
Query:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE
        EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  V +S   IN V+ +  P+    ++ I+N + + +  
Subjt:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE

Query:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ
         L+ VA  G +W                                                S   G  IN+G +I  EI AC  ++   LFF SLIT+LC+
Subjt:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ

Query:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTG-PSPSSEVLVIAYRQ---LDQIRENLSTYWAYAKERDE
          +     +EE+      I    + ++ Q       +  +S     +  NRT+              S+  V  Y     L    +    +WAY+KERD 
Subjt:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTG-PSPSSEVLVIAYRQ---LDQIRENLSTYWAYAKERDE

Query:  AIREF----YLSIAPVFPDFPRSLL----FEEDKDSDENEENEENE
        A+++     +    P FP FP+ +L    +E + +SD++  NE  E
Subjt:  AIREF----YLSIAPVFPDFPRSLL----FEEDKDSDENEENEENE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e-value 1.7e-11, identity 27.41%)
Query:  PEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQW---
        P F++ VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P+    ++ + + +  ++   L+ VA  G +W   
Subjt:  PEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQW---

Query:  -------------------------------------KE------SQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQRVKIVLGKDEER
                                             KE      S   G  IN+G +I  EI AC  +++  LFF SLIT +C+  +     +EE+
Subjt:  -------------------------------------KE------SQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQRVKIVLGKDEER

TrEMBL top hits
A0A2P5AGA5 Uncharacterized protein (Fragment) (e-value 1.0e-14, identity 28.46%)
Query:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE
        EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  V +S   IN V+ +  P+    ++ I N +   +  
Subjt:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE

Query:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ
         L+ VA  G +W                                                S   G  IN+G +I  EI AC  ++   LFF SLIT+LC+
Subjt:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ

Query:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNR--TSSSQHT
          +     +EE+        L   G++   ++ R  +   +++TQ    +R  T+SS  T
Subjt:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNR--TSSSQHT

A0A2P5BCG4 Uncharacterized protein (Fragment) (e-value 1.4e-19, identity 27.17%)
Query:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE
        EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  V +S   IN V+ +  P+    ++ I+N + + +  
Subjt:  EKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKE

Query:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ
         L+ VA  G +W                                                S   G  IN+G +I  EI AC  ++   LFF SLIT+LC+
Subjt:  ALKLVANKGVQWK----------------------------------------------ESQTKGLEINIGSIIREEILACGRKRACKLFFGSLITQLCQ

Query:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTG-PSPSSEVLVIAYRQ---LDQIRENLSTYWAYAKERDE
          +     +EE+      I    + ++ Q       +  +S     +  NRT+              S+  V  Y     L    +    +WAY+KERD 
Subjt:  RVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTG-PSPSSEVLVIAYRQ---LDQIRENLSTYWAYAKERDE

Query:  AIREF----YLSIAPVFPDFPRSLL----FEEDKDSDENEENEENE
        A+++     +    P FP FP+ +L    +E + +SD++  NE  E
Subjt:  AIREF----YLSIAPVFPDFPRSLL----FEEDKDSDENEENEENE

W9RBS1 Uncharacterized protein (e-value 2.8e-12, identity 26.25%)
Query:  DFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGND----VIRNP
        + + EKGF    +     P F+S VI    WQ FC HP + +VPLV+EFY  L+ +  +   V    + F+S  IN V  I     P  +D    +I + 
Subjt:  DFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGND----VIRNP

Query:  SAKQMKEALKLVANKGVQW----KESQT------------------------------------------KGLEINIGSIIREEILACGRKRACKLFFGS
          +Q+KE LK +A  G QW    K S T                                           G  IN+G +I ++I AC  K    L+F S
Subjt:  SAKQMKEALKLVANKGVQW----KESQT------------------------------------------KGLEINIGSIIREEILACGRKRACKLFFGS

Query:  LITQLCQRVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTGPSPSSEVL-----------VIAYRQLDQIRE
        LI++LC +  +     E R      + L  I ++     ++ +K    +  +   P+R S+S HT     + S E L              +  L Q +E
Subjt:  LITQLCQRVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTGPSPSSEVL-----------VIAYRQLDQIRE

Query:  NLSTYWAYAKERDEAIREFY
         L  +W Y+++RD A+++ +
Subjt:  NLSTYWAYAKERDEAIREFY

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAAGAACACTCCAAAATCTTCATCATCTCGCAAGATCACTCGAGTCCAGAATGCTCAAACCGCCCAAGAAGCTGAAGCAAACGTTCAACGTCAAGAAAAGCAACCTGA
CGCCTCCATGCACAGCATGAGAAGGACGAGACCTTCGGGTTTCTCACCGGCTATCGTGAACCAAGGAACCAGTGCTCAAACTCCTTCTTCCTCGACAATGTCGGCCACCT
CGAGGGAGAATCCGACTCAAAGACATGCAACTCTTGAAGAAGAAGCGAATAGGCGAGATGAAGAAGAAGCCGCCAAAGCAGCAGAAAGCTCTCGGCAAAGGGAGGCCTCC
ACGGGTAAAAATTTCGAACCTTCAACTAACCACTCTTCATCTTGCAGGAACAAACCATTCGTCACTTACAGTGCAAGGAAGAGGAGTCCCAAGAAAGTTGCACCCGAAAA
GCTGCTTGTCATCGAGCCTTTGAAAACCGCAAGAATGCCCCCGGATGTGTTTGAGGACATAATTCACCAAGTTGTGGCAAAGGCTCTAGTGATTGTCGAAGGCTACAAGG
CTGAACAAGAAGCCTTGAGGGAAATTGAGGCTGAAAGAGAACTTGAAAATCAGCACATGAGGGAAGAAGATGAGGTTGCAAGAGAAAGAGATCTTGAAGAAGAAAAGAAG
AAAGAAGAGGAAAGGAGGGCAGCAGTTGAATTGAAACTCCTTGAGGAAGAAAAACGAAGAAGGGAAGAATTAAAAGAAGATGAGAAAAGAAGGAAGGAAGCTGAAGACTT
CCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCTCAGATGCTGCAAGGAAGGAAAGAAGAGGTCCTTCAGGGGCCAAGCCACAAAGAGGCCACTGAAGCTC
AGCCAGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCACCTCTTCGGGAGATAAGAGGGATGAAGAAGAGAAAGAAAGCAAGGAGGCT
GAGACCTCTAGTGATTCAGAGACAGAGTCCGACTCAGAGAGTAAGGAGCAAGATGACAACCAAGTTCCTATCTCTGCAGCATTGAGGAGAAAGAGAAGAAGAGAGATTAA
AGCTGAAAGGAGGACCAAAAACAAGAATGACCCCATATTTGCCAAGAGGCCGAGGATTAGGTCCATGGACGCCTCTCCTGCAGCTCCTCCTACCGTCTCACCCGCCAAGC
AGCAACAGAGACAGCGTCGAGACGCTGTCTCCTTAGCGTCTCGACGCTATCGACAGATTTGCTATTTAATGTTCTTTTCATGCTACGGCCAAGGACTTCAAACCCCAAAG
GAAGTTAGTGAGGTGGGATCCACTTCCATAGGATGTCTAGTGAGTCTTGTGGCCGATGATTCGTTGGCACTCATTCATGAGCTTAATTCTTTCGATCTTGCTAAGGATGA
GCAGTTAGAGGTGAGTTGCATTAATAGTGGGGAAGAATTGGAGAGTTGTAGCACCTATCAAGAACATGTTTGTGAGGAAGAAAAAGAGATTGAGCTTGAAGAAGTGACAG
AGGAAGTTCAGGAGGTAGAGGTTGAGGTTGAAAAGCCTTCGTCTGTTTTATCTTCTCCATCCTTTATCAGCGTCGGGACGCTGCTCTTGGAGCGTCTCGACGCTGCCTTT
CCTTTTCTGAATCAGGCAGCAACAGAGACAGTGTCGAGACGCTGTCTCCTTAGCGTCTCGACGCTATCGACAGATTTGCTATTTAATGTTCTTTTCATGCTACGGCCAAG
GAAAATTCCACATCAGAAAGCCAAATCTCCTAAGGTTGCGTCTCCTAAAAATCCATTCCCCAAAGTATTCAGAGATGTTAATTTTCAGGAACGGATGGAGATAATGAGAA
AAGGAGATTTTCTCAACGAGAAGGGATTCTCTAACAGAGCAGGAGCACTACCAGAGTTCGTAAGCAGAGTTATCTCGCAGTACAAGTGGCAGGAGTTCTGTGCTCACCCT
CAAGAGGTCGTGGTGCCTTTAGTGAGAGAGTTTTACGTCGGCCTGAGAGAGGAAAGCATCAGTATGGCAGTAGTGAGAGGCAAAATGGTCATCTTCTCTTCAGTGGACAT
CAACAGGGTGTACAAAATCAAGGCACCCTTGCATCCAAGAGGGAATGATGTTATTAGGAACCCCTCGGCCAAGCAGATGAAAGAAGCACTGAAATTAGTGGCCAACAAGG
GAGTTCAGTGGAAAGAATCCCAAACGAAGGGGTTGGAGATCAATATAGGGAGCATAATCAGGGAGGAGATTCTTGCCTGTGGAAGGAAAAGAGCATGTAAACTTTTCTTT
GGATCACTTATCACCCAGCTTTGTCAGAGGGTGAAGATCGTTCTGGGAAAGGATGAGGAGCGTCATTTCTTCAAGCCTACCATTTACCTATCCTTGATTGGGAAGCTTCA
ACAGAATAGCCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGCCACTCAACCATCAGGGCCGAATAGGACTTCTTCATCCCAACACACTCCTTTTACAGGGCCCTCAC
CATCATCTGAAGTCCTAGTCATTGCCTATCGTCAGCTTGATCAAATCAGGGAAAACCTGAGCACTTATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTC
TATCTCTCTATTGCCCCGGTTTTTCCCGATTTCCCTCGATCGCTGCTGTTTGAAGAAGACAAGGATTCTGATGAAAATGAAGAAAATGAAGAAAATGAAGAGAAAGAGAG
TTCCTCAGACGAGGACTAG
mRNA sequence
ATGAAGAACACTCCAAAATCTTCATCATCTCGCAAGATCACTCGAGTCCAGAATGCTCAAACCGCCCAAGAAGCTGAAGCAAACGTTCAACGTCAAGAAAAGCAACCTGA
CGCCTCCATGCACAGCATGAGAAGGACGAGACCTTCGGGTTTCTCACCGGCTATCGTGAACCAAGGAACCAGTGCTCAAACTCCTTCTTCCTCGACAATGTCGGCCACCT
CGAGGGAGAATCCGACTCAAAGACATGCAACTCTTGAAGAAGAAGCGAATAGGCGAGATGAAGAAGAAGCCGCCAAAGCAGCAGAAAGCTCTCGGCAAAGGGAGGCCTCC
ACGGGTAAAAATTTCGAACCTTCAACTAACCACTCTTCATCTTGCAGGAACAAACCATTCGTCACTTACAGTGCAAGGAAGAGGAGTCCCAAGAAAGTTGCACCCGAAAA
GCTGCTTGTCATCGAGCCTTTGAAAACCGCAAGAATGCCCCCGGATGTGTTTGAGGACATAATTCACCAAGTTGTGGCAAAGGCTCTAGTGATTGTCGAAGGCTACAAGG
CTGAACAAGAAGCCTTGAGGGAAATTGAGGCTGAAAGAGAACTTGAAAATCAGCACATGAGGGAAGAAGATGAGGTTGCAAGAGAAAGAGATCTTGAAGAAGAAAAGAAG
AAAGAAGAGGAAAGGAGGGCAGCAGTTGAATTGAAACTCCTTGAGGAAGAAAAACGAAGAAGGGAAGAATTAAAAGAAGATGAGAAAAGAAGGAAGGAAGCTGAAGACTT
CCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCTCAGATGCTGCAAGGAAGGAAAGAAGAGGTCCTTCAGGGGCCAAGCCACAAAGAGGCCACTGAAGCTC
AGCCAGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCACCTCTTCGGGAGATAAGAGGGATGAAGAAGAGAAAGAAAGCAAGGAGGCT
GAGACCTCTAGTGATTCAGAGACAGAGTCCGACTCAGAGAGTAAGGAGCAAGATGACAACCAAGTTCCTATCTCTGCAGCATTGAGGAGAAAGAGAAGAAGAGAGATTAA
AGCTGAAAGGAGGACCAAAAACAAGAATGACCCCATATTTGCCAAGAGGCCGAGGATTAGGTCCATGGACGCCTCTCCTGCAGCTCCTCCTACCGTCTCACCCGCCAAGC
AGCAACAGAGACAGCGTCGAGACGCTGTCTCCTTAGCGTCTCGACGCTATCGACAGATTTGCTATTTAATGTTCTTTTCATGCTACGGCCAAGGACTTCAAACCCCAAAG
GAAGTTAGTGAGGTGGGATCCACTTCCATAGGATGTCTAGTGAGTCTTGTGGCCGATGATTCGTTGGCACTCATTCATGAGCTTAATTCTTTCGATCTTGCTAAGGATGA
GCAGTTAGAGGTGAGTTGCATTAATAGTGGGGAAGAATTGGAGAGTTGTAGCACCTATCAAGAACATGTTTGTGAGGAAGAAAAAGAGATTGAGCTTGAAGAAGTGACAG
AGGAAGTTCAGGAGGTAGAGGTTGAGGTTGAAAAGCCTTCGTCTGTTTTATCTTCTCCATCCTTTATCAGCGTCGGGACGCTGCTCTTGGAGCGTCTCGACGCTGCCTTT
CCTTTTCTGAATCAGGCAGCAACAGAGACAGTGTCGAGACGCTGTCTCCTTAGCGTCTCGACGCTATCGACAGATTTGCTATTTAATGTTCTTTTCATGCTACGGCCAAG
GAAAATTCCACATCAGAAAGCCAAATCTCCTAAGGTTGCGTCTCCTAAAAATCCATTCCCCAAAGTATTCAGAGATGTTAATTTTCAGGAACGGATGGAGATAATGAGAA
AAGGAGATTTTCTCAACGAGAAGGGATTCTCTAACAGAGCAGGAGCACTACCAGAGTTCGTAAGCAGAGTTATCTCGCAGTACAAGTGGCAGGAGTTCTGTGCTCACCCT
CAAGAGGTCGTGGTGCCTTTAGTGAGAGAGTTTTACGTCGGCCTGAGAGAGGAAAGCATCAGTATGGCAGTAGTGAGAGGCAAAATGGTCATCTTCTCTTCAGTGGACAT
CAACAGGGTGTACAAAATCAAGGCACCCTTGCATCCAAGAGGGAATGATGTTATTAGGAACCCCTCGGCCAAGCAGATGAAAGAAGCACTGAAATTAGTGGCCAACAAGG
GAGTTCAGTGGAAAGAATCCCAAACGAAGGGGTTGGAGATCAATATAGGGAGCATAATCAGGGAGGAGATTCTTGCCTGTGGAAGGAAAAGAGCATGTAAACTTTTCTTT
GGATCACTTATCACCCAGCTTTGTCAGAGGGTGAAGATCGTTCTGGGAAAGGATGAGGAGCGTCATTTCTTCAAGCCTACCATTTACCTATCCTTGATTGGGAAGCTTCA
ACAGAATAGCCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGCCACTCAACCATCAGGGCCGAATAGGACTTCTTCATCCCAACACACTCCTTTTACAGGGCCCTCAC
CATCATCTGAAGTCCTAGTCATTGCCTATCGTCAGCTTGATCAAATCAGGGAAAACCTGAGCACTTATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTC
TATCTCTCTATTGCCCCGGTTTTTCCCGATTTCCCTCGATCGCTGCTGTTTGAAGAAGACAAGGATTCTGATGAAAATGAAGAAAATGAAGAAAATGAAGAGAAAGAGAG
TTCCTCAGACGAGGACTAG
Protein sequence
MKNTPKSSSSRKITRVQNAQTAQEAEANVQRQEKQPDASMHSMRRTRPSGFSPAIVNQGTSAQTPSSSTMSATSRENPTQRHATLEEEANRRDEEEAAKAAESSRQREAS
TGKNFEPSTNHSSSCRNKPFVTYSARKRSPKKVAPEKLLVIEPLKTARMPPDVFEDIIHQVVAKALVIVEGYKAEQEALREIEAERELENQHMREEDEVARERDLEEEKK
KEEERRAAVELKLLEEEKRRREELKEDEKRRKEAEDFLAAFEPLHKAQSEAQMLQGRKEEVLQGPSHKEATEAQPADEVFEPLFKDDPPAADSTSSGDKRDEEEKESKEA
ETSSDSETESDSESKEQDDNQVPISAALRRKRRREIKAERRTKNKNDPIFAKRPRIRSMDASPAAPPTVSPAKQQQRQRRDAVSLASRRYRQICYLMFFSCYGQGLQTPK
EVSEVGSTSIGCLVSLVADDSLALIHELNSFDLAKDEQLEVSCINSGEELESCSTYQEHVCEEEKEIELEEVTEEVQEVEVEVEKPSSVLSSPSFISVGTLLLERLDAAF
PFLNQAATETVSRRCLLSVSTLSTDLLFNVLFMLRPRKIPHQKAKSPKVASPKNPFPKVFRDVNFQERMEIMRKGDFLNEKGFSNRAGALPEFVSRVISQYKWQEFCAHP
QEVVVPLVREFYVGLREESISMAVVRGKMVIFSSVDINRVYKIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKGLEINIGSIIREEILACGRKRACKLFF
GSLITQLCQRVKIVLGKDEERHFFKPTIYLSLIGKLQQNSLQRKDKASTSQATQPSGPNRTSSSQHTPFTGPSPSSEVLVIAYRQLDQIRENLSTYWAYAKERDEAIREF
YLSIAPVFPDFPRSLLFEEDKDSDENEENEENEEKESSSDED
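
The protein sequence above can be cross-checked against the CDS by translating codons. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not name. The `translate` helper is hypothetical and is applied only to the first 42 and last 12 bases of the CDS as listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each base position.
# ASSUMPTION: the record does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases and last 12 bases of the CDS shown above.
print(translate("ATGAAGAACACTCCAAAATCTTCATCATCTCGCAAGATCACT"))
print(translate("GACGAGGACTAG"))
```

The two fragments translate to `MKNTPKSSSSRKIT` and `DED`, which agree with the start and end of the listed protein sequence.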