; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027056 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027056
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold8:8209312..8211939
RNA-Seq ExpressionSpg027056
SyntenySpg027056
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]2.1e-2931.68Show/hide
Query:  RDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIR
        R+ + EKGF    S  +G  P F++ VI    WQ FC HP + +VPLV EFYA L+++  +   V    ++F+S  IN V  I     P  +D    +I 
Subjt:  RDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIR

Query:  NPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFF
        +   +Q+KE LK +A  G QW  S     +    +L+P + VW HF+ + L+ +TH  TIS +R +LLY ++ G  IN G +I D+I AC  K  G ++F
Subjt:  NPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFF

Query:  GSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALA-----------IAYRQLDQI
         SLI++LC +  +     E R      +DL  I ++     ++ +K    +   Q  P+  S S HT     + S E L              +  L Q 
Subjt:  GSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALA-----------IAYRQLDQI

Query:  RDNLKTYWAYAKERDEAIREFY
        ++ L  +W Y+++RD A+++ +
Subjt:  RDNLKTYWAYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.5e-3535.98Show/hide
Query:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S  +G LP F+ +VI Q+ W+ FCAHP++ +VPLV EFYA L D   +   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  IN G +I  EI AC  ++ G +FF 
Subjt:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG

Query:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.9e-3932.49Show/hide
Query:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S  +G LP F+ +VI Q+ W+ FCAHP++ +VPLV EFYA L D   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  IN G +I  EI AC  ++ G +FF 
Subjt:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG

Query:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRDNLKTYW
        SLIT+LC+  +     +EE+      ID   + ++ Q   +   +  +S+  AT  S        Q         S + +   +    L       + +W
Subjt:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRDNLKTYW

Query:  AYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEENDDEEKE
        AY+KERD A+++   +      P FP FPQ +L   D + + E  ++  +E  E
Subjt:  AYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEENDDEEKE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.5e-3032.78Show/hide
Query:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVD
        KA   ++   E+  + N Q R + + K+  + N K         P F+  VI Q+ WQ FCAHP++ +VPLV EFY  + +       +RG  V  S   
Subjt:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S + V LLY ++ G  IN G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGG

Query:  NIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEER
         +I  EI AC  +++G +FF SLIT +C+  +     +EE+
Subjt:  NIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.8e-2831.8Show/hide
Query:  VPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIK
        +PLV EFYA L D   +   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + VW HF+K
Subjt:  VPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIK

Query:  NCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS
        + L+PTTH   +S DR++LL+ ++ G  IN G +I  EI AC  ++ G +FF SLIT+LC+    +   +EE+      ID   + ++ Q     +    
Subjt:  NCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYR--QLDQIRDNLKTYWAYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEEND
        ++Q    S P  AS S+    T      +  A+  R  Q +      + +WAY+KERD A+++   +      P FP FPQ +L   D + + E  ++  
Subjt:  TSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYR--QLDQIRDNLKTYWAYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEEND

Query:  DEEKE
        +E  E
Subjt:  DEEKE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)7.3e-3635.98Show/hide
Query:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S  +G LP F+ +VI Q+ W+ FCAHP++ +VPLV EFYA L D   +   VRG  VS+S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG
         +   +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  IN G +I  EI AC  ++ G +FF 
Subjt:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG

Query:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment)1.9e-3932.49Show/hide
Query:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S  +G LP F+ +VI Q+ W+ FCAHP++ +VPLV EFYA L D   +   VRG  VS+S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG
         + + +   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  IN G +I  EI AC  ++ G +FF 
Subjt:  PSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFG

Query:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRDNLKTYW
        SLIT+LC+  +     +EE+      ID   + ++ Q   +   +  +S+  AT  S        Q         S + +   +    L       + +W
Subjt:  SLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYRQ--LDQIRDNLKTYW

Query:  AYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEENDDEEKE
        AY+KERD A+++   +      P FP FPQ +L   D + + E  ++  +E  E
Subjt:  AYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEENDDEEKE

A0A2P5DAQ2 Uncharacterized protein1.2e-3032.78Show/hide
Query:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVD
        KA   ++   E+  + N Q R + + K+  + N K         P F+  VI Q+ WQ FCAHP++ +VPLV EFY  + +       +RG  V  S   
Subjt:  KAASPKNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S + V LLY ++ G  IN G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGG

Query:  NIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEER
         +I  EI AC  +++G +FF SLIT +C+  +     +EE+
Subjt:  NIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEER

A0A2P5DXM3 Uncharacterized protein8.7e-2931.8Show/hide
Query:  VPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIK
        +PLV EFYA L D   +   VRG  VS+S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     + + S L P + VW HF+K
Subjt:  VPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIK

Query:  NCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS
        + L+PTTH   +S DR++LL+ ++ G  IN G +I  EI AC  ++ G +FF SLIT+LC+    +   +EE+      ID   + ++ Q     +    
Subjt:  NCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFFGSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYR--QLDQIRDNLKTYWAYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEEND
        ++Q    S P  AS S+    T      +  A+  R  Q +      + +WAY+KERD A+++   +      P FP FPQ +L   D + + E  ++  
Subjt:  TSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYR--QLDQIRDNLKTYWAYAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEEND

Query:  DEEKE
        +E  E
Subjt:  DEEKE

W9RBS1 Uncharacterized protein1.0e-2931.68Show/hide
Query:  RDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIR
        R+ + EKGF    S  +G  P F++ VI    WQ FC HP + +VPLV EFYA L+++  +   V    ++F+S  IN V  I     P  +D    +I 
Subjt:  RDFLNEKGF----SNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIR

Query:  NPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFF
        +   +Q+KE LK +A  G QW  S     +    +L+P + VW HF+ + L+ +TH  TIS +R +LLY ++ G  IN G +I D+I AC  K  G ++F
Subjt:  NPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEILACGRKRAGKIFF

Query:  GSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALA-----------IAYRQLDQI
         SLI++LC +  +     E R      +DL  I ++     ++ +K    +   Q  P+  S S HT     + S E L              +  L Q 
Subjt:  GSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALA-----------IAYRQLDQI

Query:  RDNLKTYWAYAKERDEAIREFY
        ++ L  +W Y+++RD A+++ +
Subjt:  RDNLKTYWAYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACACTCCAAAACCCTCATCATCATGCAAGAACACTCGATCTCAGAGTGCTCAAGTGACCCACGAAGCTGAAGCTAGCGTGCGACGACAAGAGAAGAACCCCGA
AACGCCCATGCACGACACAAGAAGAACGAGACCCACAGGATTCTCACCGGCGGTCGTGAACCAAGCGTCCAACGCTCCAACTCCATCTTCTTCGGCAATGCCGGCCAGTT
CGAGGGAGATGCCGAGTTCATCTACACCATGGAGGTTCAAGCGCGCTGCCGCCGTCCATCAAACCCAAAAACCCACCGATCAAAAGTTCAAGAAACGTTCGCGGGAATGG
TTTGCAATGATTCGGGAGATGGGAGCTCAGAGACGTGCTGCCCTTGAAGAGGAAGGGAATCGGCAAGATGAAAAAGAAGCTGCCAAGGCAGCTGGAAGCTCTCGGCAAGG
AGGAGCTTCAATGGGTAAGCATTCTGAACCTTCAACTAACCCCTCTTTATCTTGCAGGACCAAACCATTTGTTACCTACAGCGCAAGAAAGAGGAGCTCGAAGAAGGCTG
TGCCTGAAAGGCCGCTTGAGGTCAAGCCCCTAAAAACCGCAAGGATGCCTCCCGACGTATTCGAAGGAATAATTCGTCAAGACGTGGCAAAGGCCCTTGAGATTTCTAAG
GGATATAAGGCTGAACAGGATGCTTTGAAAGAGATTGAGGATGAGAGAGAGATGGAAAATCAGAAAATGGTTGATGAAGACGAGTTTGCAAAGGAAAGAGATCGTGCAGA
AGAGAAAAGAAGAAGAGAAGAAGAGAAGGAAGCTAAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAGCACTGCAAGGAAAGGTAGAAG
AAAAGGCCCAACAGGGCCCAACTGAAGAAAATTTGGAAAAAGAAAAAGGAAGAGAAGTAGAGAAAGAAGGACAGAATGCGACCGCATCTGGGCCGCATTTTGAAGAAGGC
CTAGCTGAGGCCACCATTGATCAGCCAGTTGAAGAGGTTTTTGAGCCTCTATTCACGAATGACCCACCAGCTTCTGATAGCACCTCTTCGGGAGAGAAGAGGGACGAAGA
GGAAAAGGAAGATGAGGAGGTCGAGACCTCCACTAACTCTGACACAGAATCTGATTCAGAGATTAGGGAACTAGATGGCGACCAAGTTTCTATCTCTGCAGCGTTGAGAA
GAAAGAGAAAGAGAGAGATTAAGGCTGAGAGGAGGACAAAGAACAAGAATGACCCAATATTTTCCAAGAGACTGAGGACGAGGTCCATGGACGCCTCTCCTGCAGTTCCT
CCTACCATATCACCCGCCAAGCCGAAGGGCAAGTCACCCAAGGCTGCATCTCCCAAGAATCCATTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGGATGGAGAT
CATGAAGAAGAGAGATTTCCTCAACGAGAAGGGATTCTCTAACAGAGTTGGAGCACTGCCAGAATTCGTAACAAGAGTTATCTTCCAGTACAAGTGGCAGGACTTCTGTG
CTCACCCTCAGGAAGCTGTTGTGCCTTTAGTTCATGAATTTTACGCCGGCCTGAGGGATGAAAGTATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCA
GTCGACATTAACAGGGTGTACAGGATCAAGGCACCCTTGAACCCAAGAGGGAACGACGTTATCAGGAACCCTTCGGCCAAACAAATGAAGGAAGCTCTGAAACTTGTGGC
CAATAAGGGGGTTCAGTGGAAAGAATCACATACGAAAGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAGAATCGGCAGTTTGGCTTCACTTCATCAAAAACTGTTTGA
TGCCAACCACCCACGACAGCACGATTTCAGTGGATAGAGTAATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGGGGGGAACATTATCAGGGATGAGATTTTA
GCCTGTGGGAGAAAACGAGCAGGCAAGATTTTCTTTGGATCACTTATCACCCAGCTCTGCCAAAGGGTGAAGATCATTTCGGGAAAGGACGAAGAGCGTCACTTCTTCAA
GCCGACTATCGACCTGTCCTTGATTGGAAAGCTCCAACAGAATAGTATCCAGAGGAAAGACAAAGCCTCGACATCTCAGGCTACTCCTCAATCAGGGCCAAATGTAGCTT
CTCCATCCCAACACACTCCTTTTACAGGGCCCTCACCATCATCGGAAGCCCTAGCTATTGCCTACCGCCAGCTAGATCAAATCAGGGACAACCTGAAGACATATTGGGCG
TATGCAAAGGAGCGGGATGAAGCCATCAGAGAGTTTTACCTCTCTATTGCCCCAAGTATTGCTCCGATCTTTCCAGATTTCCCTCAGTCGCTGCTGCCTCAGGAAGACAA
GGATTCTGATGAAGAGGAAGGTGAAGAGAATGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACACTCCAAAACCCTCATCATCATGCAAGAACACTCGATCTCAGAGTGCTCAAGTGACCCACGAAGCTGAAGCTAGCGTGCGACGACAAGAGAAGAACCCCGA
AACGCCCATGCACGACACAAGAAGAACGAGACCCACAGGATTCTCACCGGCGGTCGTGAACCAAGCGTCCAACGCTCCAACTCCATCTTCTTCGGCAATGCCGGCCAGTT
CGAGGGAGATGCCGAGTTCATCTACACCATGGAGGTTCAAGCGCGCTGCCGCCGTCCATCAAACCCAAAAACCCACCGATCAAAAGTTCAAGAAACGTTCGCGGGAATGG
TTTGCAATGATTCGGGAGATGGGAGCTCAGAGACGTGCTGCCCTTGAAGAGGAAGGGAATCGGCAAGATGAAAAAGAAGCTGCCAAGGCAGCTGGAAGCTCTCGGCAAGG
AGGAGCTTCAATGGGTAAGCATTCTGAACCTTCAACTAACCCCTCTTTATCTTGCAGGACCAAACCATTTGTTACCTACAGCGCAAGAAAGAGGAGCTCGAAGAAGGCTG
TGCCTGAAAGGCCGCTTGAGGTCAAGCCCCTAAAAACCGCAAGGATGCCTCCCGACGTATTCGAAGGAATAATTCGTCAAGACGTGGCAAAGGCCCTTGAGATTTCTAAG
GGATATAAGGCTGAACAGGATGCTTTGAAAGAGATTGAGGATGAGAGAGAGATGGAAAATCAGAAAATGGTTGATGAAGACGAGTTTGCAAAGGAAAGAGATCGTGCAGA
AGAGAAAAGAAGAAGAGAAGAAGAGAAGGAAGCTAAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAGCACTGCAAGGAAAGGTAGAAG
AAAAGGCCCAACAGGGCCCAACTGAAGAAAATTTGGAAAAAGAAAAAGGAAGAGAAGTAGAGAAAGAAGGACAGAATGCGACCGCATCTGGGCCGCATTTTGAAGAAGGC
CTAGCTGAGGCCACCATTGATCAGCCAGTTGAAGAGGTTTTTGAGCCTCTATTCACGAATGACCCACCAGCTTCTGATAGCACCTCTTCGGGAGAGAAGAGGGACGAAGA
GGAAAAGGAAGATGAGGAGGTCGAGACCTCCACTAACTCTGACACAGAATCTGATTCAGAGATTAGGGAACTAGATGGCGACCAAGTTTCTATCTCTGCAGCGTTGAGAA
GAAAGAGAAAGAGAGAGATTAAGGCTGAGAGGAGGACAAAGAACAAGAATGACCCAATATTTTCCAAGAGACTGAGGACGAGGTCCATGGACGCCTCTCCTGCAGTTCCT
CCTACCATATCACCCGCCAAGCCGAAGGGCAAGTCACCCAAGGCTGCATCTCCCAAGAATCCATTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGGATGGAGAT
CATGAAGAAGAGAGATTTCCTCAACGAGAAGGGATTCTCTAACAGAGTTGGAGCACTGCCAGAATTCGTAACAAGAGTTATCTTCCAGTACAAGTGGCAGGACTTCTGTG
CTCACCCTCAGGAAGCTGTTGTGCCTTTAGTTCATGAATTTTACGCCGGCCTGAGGGATGAAAGTATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCA
GTCGACATTAACAGGGTGTACAGGATCAAGGCACCCTTGAACCCAAGAGGGAACGACGTTATCAGGAACCCTTCGGCCAAACAAATGAAGGAAGCTCTGAAACTTGTGGC
CAATAAGGGGGTTCAGTGGAAAGAATCACATACGAAAGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAGAATCGGCAGTTTGGCTTCACTTCATCAAAAACTGTTTGA
TGCCAACCACCCACGACAGCACGATTTCAGTGGATAGAGTAATGCTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGGGGGGAACATTATCAGGGATGAGATTTTA
GCCTGTGGGAGAAAACGAGCAGGCAAGATTTTCTTTGGATCACTTATCACCCAGCTCTGCCAAAGGGTGAAGATCATTTCGGGAAAGGACGAAGAGCGTCACTTCTTCAA
GCCGACTATCGACCTGTCCTTGATTGGAAAGCTCCAACAGAATAGTATCCAGAGGAAAGACAAAGCCTCGACATCTCAGGCTACTCCTCAATCAGGGCCAAATGTAGCTT
CTCCATCCCAACACACTCCTTTTACAGGGCCCTCACCATCATCGGAAGCCCTAGCTATTGCCTACCGCCAGCTAGATCAAATCAGGGACAACCTGAAGACATATTGGGCG
TATGCAAAGGAGCGGGATGAAGCCATCAGAGAGTTTTACCTCTCTATTGCCCCAAGTATTGCTCCGATCTTTCCAGATTTCCCTCAGTCGCTGCTGCCTCAGGAAGACAA
GGATTCTGATGAAGAGGAAGGTGAAGAGAATGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGAATAG
Protein sequenceShow/hide protein sequence
MKNTPKPSSSCKNTRSQSAQVTHEAEASVRRQEKNPETPMHDTRRTRPTGFSPAVVNQASNAPTPSSSAMPASSREMPSSSTPWRFKRAAAVHQTQKPTDQKFKKRSREW
FAMIREMGAQRRAALEEEGNRQDEKEAAKAAGSSRQGGASMGKHSEPSTNPSLSCRTKPFVTYSARKRSSKKAVPERPLEVKPLKTARMPPDVFEGIIRQDVAKALEISK
GYKAEQDALKEIEDEREMENQKMVDEDEFAKERDRAEEKRRREEEKEAKDFLAAFEPLHKAQSEAEALQGKVEEKAQQGPTEENLEKEKGREVEKEGQNATASGPHFEEG
LAEATIDQPVEEVFEPLFTNDPPASDSTSSGEKRDEEEKEDEEVETSTNSDTESDSEIRELDGDQVSISAALRRKRKREIKAERRTKNKNDPIFSKRLRTRSMDASPAVP
PTISPAKPKGKSPKAASPKNPFPEVFRDVNFQERMEIMKKRDFLNEKGFSNRVGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVHEFYAGLRDESISMAVVRGKMVSFSS
VDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESHTKVKSLVPSDLKPESAVWLHFIKNCLMPTTHDSTISVDRVMLLYCLMKGLEINGGNIIRDEIL
ACGRKRAGKIFFGSLITQLCQRVKIISGKDEERHFFKPTIDLSLIGKLQQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSPSSEALAIAYRQLDQIRDNLKTYWA
YAKERDEAIREFYLSIAPSIAPIFPDFPQSLLPQEDKDSDEEEGEENDDEEKESSSDEE