; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025496 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025496
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold13:30582977..30593281
RNA-Seq ExpressionSpg025496
SyntenySpg025496
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]2.1e-3030.62Show/hide
Query:  FAKRPRTR-----SMDASSAVPPT------------VSPPSQRERMEIMKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVR
        FAKRP +      ++D ++A  P+            V   +++   E +  R+   EKGF    S   G  P F++ VI    WQ FC HP + +VPLV+
Subjt:  FAKRPRTR-----SMDASSAVPPT------------VSPPSQRERMEIMKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVR

Query:  EFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMP
        EFYA L+ +  +   V    ++F+S  IN V  I    D    ++I +   +Q+KE LK +A  G QW  S     +   ++L+P + VW HF+ +RL+ 
Subjt:  EFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMP

Query:  TTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQAT
        +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E        +D+  I ++     ++ +K    +  
Subjt:  TTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQAT

Query:  PQSGPNVASPSQHTPFTGPSLASEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY
         Q  P+  S S HT     + + E L             +   L Q +E L  +WVY+++RD A+++ +
Subjt:  PQSGPNVASPSQHTPFTGPSLASEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY

KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]7.4e-2828.35Show/hide
Query:  RSMDASSAVPPTVSPPSQRERMEIMKKRDFFNEKG--FSDRAGA-LPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSS
        RS  A    PP     + +E  + +K R  F E G  FS+   A L   V  V+ ++KWQ F  HP      +V+EFY+ + + +    +VRG  + F+ 
Subjt:  RSMDASSAVPPTVSPPSQRERMEIMKKRDFFNEKG--FSDRAGA-LPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSS

Query:  VDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN
          INR ++++   D       +    +  +  L+ +   G +W   Q K K++  + L P   +W HF+K++LMPT+H++T+S  R++LL+ ++ G  I+
Subjt:  VDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKL--QQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLAS
        +G II +    C +++A  L F +LIT LC++ K+     +E       ++ + I  L   + +  +K +A+TS+    S P+V + S            
Subjt:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKL--QQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLAS

Query:  EALGMVHRQLDQIRENLKTYWVYAKERD
        +A+   H+ + Q+ + L  Y+ YAK RD
Subjt:  EALGMVHRQLDQIRENLKTYWVYAKERD

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.6e-3536.74Show/hide
Query:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN
        ++ R    EKGF    S+  G LP F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L     +   VRG  VS+S   IN V+ +  P+D   ++ I N
Subjt:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN

Query:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG
         +   +   L+ VA  G +W  S     + + + L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQATPQSGPNVASPSQ
        SLIT+LC+  +     +EE       ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQATPQSGPNVASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.1e-3632.11Show/hide
Query:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN
        ++ R    EKGF    S+  G LP F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L     +   VRG  VS+S   IN V+ +  P+D   ++ I+N
Subjt:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN

Query:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     + + + L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLASEALGMVHRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE       ID   + ++ Q   +   +  +S+  AT  S        Q        L+ + +   H    L    +  + +W
Subjt:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLASEALGMVHRQ--LDQIRENLKTYW

Query:  VYAKERDEAIREFYLSIAPTLYCPEYRSGHRSLLPQEEEDSDEKEDEENDDEEKE
         Y+KERD A+++  L    T   P + +  + +L   + + + + D++  +E  E
Subjt:  VYAKERDEAIREFYLSIAPTLYCPEYRSGHRSLLPQEEEDSDEKEDEENDDEEKE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]5.0e-3238.3Show/hide
Query:  PEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKES
        P F+  VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P+D   ++ + + +  ++   L+ VA  G +W  S
Subjt:  PEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKES

Query:  QTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK
             + + + L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  QTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)8.0e-3636.74Show/hide
Query:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN
        ++ R    EKGF    S+  G LP F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L     +   VRG  VS+S   IN V+ +  P+D   ++ I N
Subjt:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN

Query:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG
         +   +   L+ VA  G +W  S     + + + L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQATPQSGPNVASPSQ
        SLIT+LC+  +     +EE       ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQATPQSGPNVASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment)5.5e-3732.11Show/hide
Query:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN
        ++ R    EKGF    S+  G LP F+ +VI Q+ W+ FCAHP++ +VPLVREFYA L     +   VRG  VS+S   IN V+ +  P+D   ++ I+N
Subjt:  MKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRN

Query:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     + + + L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF 
Subjt:  PSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLASEALGMVHRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE       ID   + ++ Q   +   +  +S+  AT  S        Q        L+ + +   H    L    +  + +W
Subjt:  SLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQN--SIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLASEALGMVHRQ--LDQIRENLKTYW

Query:  VYAKERDEAIREFYLSIAPTLYCPEYRSGHRSLLPQEEEDSDEKEDEENDDEEKE
         Y+KERD A+++  L    T   P + +  + +L   + + + + D++  +E  E
Subjt:  VYAKERDEAIREFYLSIAPTLYCPEYRSGHRSLLPQEEEDSDEKEDEENDDEEKE

A0A2P5DAQ2 Uncharacterized protein2.4e-3238.3Show/hide
Query:  PEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKES
        P F+  VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ +  P+D   ++ + + +  ++   L+ VA  G +W  S
Subjt:  PEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKES

Query:  QTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK
             + + + L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  QTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVK

A0A6A2ZUE4 Uncharacterized protein3.6e-2828.35Show/hide
Query:  RSMDASSAVPPTVSPPSQRERMEIMKKRDFFNEKG--FSDRAGA-LPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSS
        RS  A    PP     + +E  + +K R  F E G  FS+   A L   V  V+ ++KWQ F  HP      +V+EFY+ + + +    +VRG  + F+ 
Subjt:  RSMDASSAVPPTVSPPSQRERMEIMKKRDFFNEKG--FSDRAGA-LPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSS

Query:  VDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN
          INR ++++   D       +    +  +  L+ +   G +W   Q K K++  + L P   +W HF+K++LMPT+H++T+S  R++LL+ ++ G  I+
Subjt:  VDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKL--QQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLAS
        +G II +    C +++A  L F +LIT LC++ K+     +E       ++ + I  L   + +  +K +A+TS+    S P+V + S            
Subjt:  VGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKL--QQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLAS

Query:  EALGMVHRQLDQIRENLKTYWVYAKERD
        +A+   H+ + Q+ + L  Y+ YAK RD
Subjt:  EALGMVHRQLDQIRENLKTYWVYAKERD

W9RBS1 Uncharacterized protein1.0e-3030.62Show/hide
Query:  FAKRPRTR-----SMDASSAVPPT------------VSPPSQRERMEIMKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVR
        FAKRP +      ++D ++A  P+            V   +++   E +  R+   EKGF    S   G  P F++ VI    WQ FC HP + +VPLV+
Subjt:  FAKRPRTR-----SMDASSAVPPT------------VSPPSQRERMEIMKKRDFFNEKGF----SDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVR

Query:  EFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMP
        EFYA L+ +  +   V    ++F+S  IN V  I    D    ++I +   +Q+KE LK +A  G QW  S     +   ++L+P + VW HF+ +RL+ 
Subjt:  EFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANKGVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMP

Query:  TTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQAT
        +TH  TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E        +D+  I ++     ++ +K    +  
Subjt:  TTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPTIDMSLIRKLQQNSIQRKDKASTSQAT

Query:  PQSGPNVASPSQHTPFTGPSLASEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY
         Q  P+  S S HT     + + E L             +   L Q +E L  +WVY+++RD A+++ +
Subjt:  PQSGPNVASPSQHTPFTGPSLASEALGM-----------VHRQLDQIRENLKTYWVYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGGAGAGAGCAGCGTAGACAGCAAAATCAAATGGCTGACGTGCC
GCGTCTACCGCAGGGTCTAGAAGATCCAGTTGATCCCCAGCAGAATCGAAAAAAACACGAGCAAGAAAAAAATGTTGAGACTGTTGAGACTGAGATGGAACTGAGTGAAG
AGGTTGAGCTTAATGAAAGATTTCATTGGGAACGGTTGTACGCAAAGCCGGAGCCCACTGTTGATCAACTAGAATGGGAGTTCTACGCCAACATCGATGAAAATGAAGGA
TTCTTGGTTATTATTTGTGGAATTGTTCTCGACTGGAGCCATGTAGTGATTAATTCTCTGTTTAATTTGCAAGACTTTCCCCCATGCTGTTTTCGATGCAATATTGGTTG
CTTCCTCAAACGAGCAACTAAATGCCCAGTGGAGCTAAAGCAAGGCCAAGGAGCGGGAATCAAGCGAGAAGACGTGGAGATCGTGACGGGGACGTGTCGCCTAATTGGTG
ATGAGTTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAGAACAAAAATATAAACCCCTTA
AAAATGTGTTTTAATATGTCTGATAATAGAGCTAGGTTGTGGCAAGTTCTTAGAATTGAGTCAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTCTGCTGCAGC
AGAGCTTGGTTTTGCAGAATGCTCAGAATATGTTGCTGGGCGATTTGAGGGAGCAAACTCTGTGCTGGAGCAAAGCTGGGAGCAAAAACTGCCACATTGTAACGAACCCA
ACAATGAGTTTGCAAAGGAAAGAGATGAGAAAGAAAAGAAAAGAAGAAGAGAAGAAGAACAAGAGGCCGAGAGGGCCTTAGAAGCTGAGGAAGAGAGAAAGTATGAGGAA
AACCTCAGGAGGGCAGCTATGGATTTGCAGCTCCTTAAGGAAGAGAAAAAGAGACGGGAAGAAATAAAAGAAAATGAAAAGAGAAGGAAGAAAGCTGAAGACTTCCTTGC
AGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAACTGCTGCAAGGAAGGGTAGAAGAAAAGGCTCAACAGGGGCCAAGTGAAGACATTTTTGAAACAGAAAGAG
AAGTAAAGAATGAAGGCCAAAATGCGACCGCATCTGGGCCGCATTCTGAGGAAGGCCTAGTCGAGGCCACTGTTGATCAGCCAGCTGAAGAGGTTCTTGAACCTCTATTC
ACACATGACCCACCAGCTGCCAATAGCACCTCTTCGGGAAAGAAGAGGGTTGAAGAGGAAAAAGAAGACGAGGAGGCCGAGACCTCCAGTGATTCTAATTCTGACACAGA
ATCTGATTCAGAGATAAGGGAGCTGGATGATGACCAAGTTCCTATCTCTGCAGCAGTGAGAAGAAAGAGGAAGAGAGAGATTAAGGCCGAGAGGAGGACAAAAAACAAGA
ATGACCCAATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTTCTGCAGTTCCTCCGACCGTCTCCCCGCCAAGCCAAAGGGAACGAATGGAGATCATGAAG
AAAAGAGATTTCTTCAACGAGAAGGGATTCTCTGATAGAGCTGGAGCACTGCCTGAGTTCGTAACAAGAGTTATCTTCCAGTACAAGTGGCAGGACTTCTGTGCTCACCC
TCAGGAGGCTGTTGTGCCTTTAGTTCGAGAGTTTTACGCTGGCTTGAGGAAGGAGAGTATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACA
TTAACAGGGTGTACAGGATCAAGGCGCCCTTGGACCCAAGAGGGAACGACGTTATCAGGAACCCTTCGATCAAGCAGATGAAAGAATCTCTTAAACTTGTGGCCAACAAG
GGGGTTCAATGGAAAGAATCACAGACGAAAGTGAAGTCTTTAGTGCCAAACGACTTAAAGCCAGAATCGGCAGTTTGGCTTCACTTCATCAAGAACCGCTTGATGCCAAC
CACCCACGACAGCACAATTTCAGTGGATAGAGTGATGTTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTAGGGAGCATTATCAGGGATGAGATCTTAGCCTGTG
GGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGCTCACTCATCACCCAGCTCTGTCAGAGGGTGAAGATTGTGCCAGGCAAGGACGAGGAGTGCCATTTCTTTAAACCGACT
ATTGACATGTCCTTGATCAGGAAGCTACAACAGAACAGTATCCAGAGGAAAGACAAAGCCTCGACATCTCAGGCCACTCCTCAATCAGGGCCAAATGTAGCTTCTCCATC
CCAGCATACTCCTTTCACAGGGCCTTCACTAGCATCGGAAGCCCTAGGTATGGTCCACCGCCAGTTAGATCAAATCAGGGAAAACCTGAAGACGTACTGGGTATATGCAA
AGGAGAGGGATGAAGCTATTAGAGAGTTCTATCTCTCTATTGCCCCGACTCTCTATTGCCCCGAGTATCGCTCCGGTCATCGGTCGCTGCTGCCCCAAGAAGAAGAGGAT
TCTGATGAAAAGGAAGATGAAGAGAATGATGATGAAGAGAAGGAGAGTTCCTCAGACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAATCCGCCTGGGGTGAGGTTCGAGCTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGGAGAGAGCAGCGTAGACAGCAAAATCAAATGGCTGACGTGCC
GCGTCTACCGCAGGGTCTAGAAGATCCAGTTGATCCCCAGCAGAATCGAAAAAAACACGAGCAAGAAAAAAATGTTGAGACTGTTGAGACTGAGATGGAACTGAGTGAAG
AGGTTGAGCTTAATGAAAGATTTCATTGGGAACGGTTGTACGCAAAGCCGGAGCCCACTGTTGATCAACTAGAATGGGAGTTCTACGCCAACATCGATGAAAATGAAGGA
TTCTTGGTTATTATTTGTGGAATTGTTCTCGACTGGAGCCATGTAGTGATTAATTCTCTGTTTAATTTGCAAGACTTTCCCCCATGCTGTTTTCGATGCAATATTGGTTG
CTTCCTCAAACGAGCAACTAAATGCCCAGTGGAGCTAAAGCAAGGCCAAGGAGCGGGAATCAAGCGAGAAGACGTGGAGATCGTGACGGGGACGTGTCGCCTAATTGGTG
ATGAGTTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAGAACAAAAATATAAACCCCTTA
AAAATGTGTTTTAATATGTCTGATAATAGAGCTAGGTTGTGGCAAGTTCTTAGAATTGAGTCAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTCTGCTGCAGC
AGAGCTTGGTTTTGCAGAATGCTCAGAATATGTTGCTGGGCGATTTGAGGGAGCAAACTCTGTGCTGGAGCAAAGCTGGGAGCAAAAACTGCCACATTGTAACGAACCCA
ACAATGAGTTTGCAAAGGAAAGAGATGAGAAAGAAAAGAAAAGAAGAAGAGAAGAAGAACAAGAGGCCGAGAGGGCCTTAGAAGCTGAGGAAGAGAGAAAGTATGAGGAA
AACCTCAGGAGGGCAGCTATGGATTTGCAGCTCCTTAAGGAAGAGAAAAAGAGACGGGAAGAAATAAAAGAAAATGAAAAGAGAAGGAAGAAAGCTGAAGACTTCCTTGC
AGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAACTGCTGCAAGGAAGGGTAGAAGAAAAGGCTCAACAGGGGCCAAGTGAAGACATTTTTGAAACAGAAAGAG
AAGTAAAGAATGAAGGCCAAAATGCGACCGCATCTGGGCCGCATTCTGAGGAAGGCCTAGTCGAGGCCACTGTTGATCAGCCAGCTGAAGAGGTTCTTGAACCTCTATTC
ACACATGACCCACCAGCTGCCAATAGCACCTCTTCGGGAAAGAAGAGGGTTGAAGAGGAAAAAGAAGACGAGGAGGCCGAGACCTCCAGTGATTCTAATTCTGACACAGA
ATCTGATTCAGAGATAAGGGAGCTGGATGATGACCAAGTTCCTATCTCTGCAGCAGTGAGAAGAAAGAGGAAGAGAGAGATTAAGGCCGAGAGGAGGACAAAAAACAAGA
ATGACCCAATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTTCTGCAGTTCCTCCGACCGTCTCCCCGCCAAGCCAAAGGGAACGAATGGAGATCATGAAG
AAAAGAGATTTCTTCAACGAGAAGGGATTCTCTGATAGAGCTGGAGCACTGCCTGAGTTCGTAACAAGAGTTATCTTCCAGTACAAGTGGCAGGACTTCTGTGCTCACCC
TCAGGAGGCTGTTGTGCCTTTAGTTCGAGAGTTTTACGCTGGCTTGAGGAAGGAGAGTATTAGCATGGCGGTGGTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACA
TTAACAGGGTGTACAGGATCAAGGCGCCCTTGGACCCAAGAGGGAACGACGTTATCAGGAACCCTTCGATCAAGCAGATGAAAGAATCTCTTAAACTTGTGGCCAACAAG
GGGGTTCAATGGAAAGAATCACAGACGAAAGTGAAGTCTTTAGTGCCAAACGACTTAAAGCCAGAATCGGCAGTTTGGCTTCACTTCATCAAGAACCGCTTGATGCCAAC
CACCCACGACAGCACAATTTCAGTGGATAGAGTGATGTTACTCTATTGCCTTATGAAGGGGTTGGAGATCAACGTAGGGAGCATTATCAGGGATGAGATCTTAGCCTGTG
GGAGAAAGCGAGCAGGCAAGCTTTTCTTTGGCTCACTCATCACCCAGCTCTGTCAGAGGGTGAAGATTGTGCCAGGCAAGGACGAGGAGTGCCATTTCTTTAAACCGACT
ATTGACATGTCCTTGATCAGGAAGCTACAACAGAACAGTATCCAGAGGAAAGACAAAGCCTCGACATCTCAGGCCACTCCTCAATCAGGGCCAAATGTAGCTTCTCCATC
CCAGCATACTCCTTTCACAGGGCCTTCACTAGCATCGGAAGCCCTAGGTATGGTCCACCGCCAGTTAGATCAAATCAGGGAAAACCTGAAGACGTACTGGGTATATGCAA
AGGAGAGGGATGAAGCTATTAGAGAGTTCTATCTCTCTATTGCCCCGACTCTCTATTGCCCCGAGTATCGCTCCGGTCATCGGTCGCTGCTGCCCCAAGAAGAAGAGGAT
TCTGATGAAAAGGAAGATGAAGAGAATGATGATGAAGAGAAGGAGAGTTCCTCAGACGAGGACTAG
Protein sequenceShow/hide protein sequence
MNNPPGVRFELDPEIERTFRIRRREQRRQQNQMADVPRLPQGLEDPVDPQQNRKKHEQEKNVETVETEMELSEEVELNERFHWERLYAKPEPTVDQLEWEFYANIDENEG
FLVIICGIVLDWSHVVINSLFNLQDFPPCCFRCNIGCFLKRATKCPVELKQGQGAGIKREDVEIVTGTCRLIGDEFEARVYCTIKWVIPCLRAYDCRAALSLKNKNINPL
KMCFNMSDNRARLWQVLRIESKVVIICPCRKNYSAAAELGFAECSEYVAGRFEGANSVLEQSWEQKLPHCNEPNNEFAKERDEKEKKRRREEEQEAERALEAEEERKYEE
NLRRAAMDLQLLKEEKKRREEIKENEKRRKKAEDFLAAFEPLHKAQSEAELLQGRVEEKAQQGPSEDIFETEREVKNEGQNATASGPHSEEGLVEATVDQPAEEVLEPLF
THDPPAANSTSSGKKRVEEEKEDEEAETSSDSNSDTESDSEIRELDDDQVPISAAVRRKRKREIKAERRTKNKNDPIFAKRPRTRSMDASSAVPPTVSPPSQRERMEIMK
KRDFFNEKGFSDRAGALPEFVTRVIFQYKWQDFCAHPQEAVVPLVREFYAGLRKESISMAVVRGKMVSFSSVDINRVYRIKAPLDPRGNDVIRNPSIKQMKESLKLVANK
GVQWKESQTKVKSLVPNDLKPESAVWLHFIKNRLMPTTHDSTISVDRVMLLYCLMKGLEINVGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEECHFFKPT
IDMSLIRKLQQNSIQRKDKASTSQATPQSGPNVASPSQHTPFTGPSLASEALGMVHRQLDQIRENLKTYWVYAKERDEAIREFYLSIAPTLYCPEYRSGHRSLLPQEEED
SDEKEDEENDDEEKESSSDED