CuGenDBv2

Spg004657 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg004657
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Nucleolar protein 58-like
Genome location: scaffold5:20713246..20717361
RNA-Seq Expression: Spg004657
Synteny: Spg004657
Gene Ontology terms: NA
InterPro domains: NA
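
The genome location above follows a seqid:start..end pattern. A minimal sketch of splitting it into parts and computing the genomic span is given here. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function name parse_location is hypothetical.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


seqid, start, end = parse_location("scaffold5:20713246..20717361")
# Under the inclusive-coordinate assumption the span is end - start + 1.
span_bp = end - start + 1
print(seqid, start, end, span_bp)
```

Under that assumption the gene spans 4116 bp on scaffold5.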


Homology
GenBank top hits (e value, % identity, alignment)
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis] (e value 1.8e-24, identity 32.34%)
Query:  FAKRPRTRSMDVSPTVPPTISPAKPKGKSPKAASPKNPFPEVFKDVNFQERM---EIMKKKDF-LNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEA
        FAKRP + S    P +    + A     S +  S    F +   +  ++E +    ++K+K F L+D     + G    F+S VI    WQ FC HP + 
Subjt:  FAKRPRTRSMDVSPTVPPTISPAKPKGKSPKAASPKNPFPEVFKDVNFQERM---EIMKKKDF-LNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEA

Query:  VVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVW
        +VPLV EFYA L+ +  +   V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     +    +L+P + VW
Subjt:  VVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVW

Query:  LHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQNNIQR
         HF+  RL+  T   TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++
Subjt:  LHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQNNIQR

Query:  KDK
         +K
Subjt:  KDK

EXB53755.1 hypothetical protein L484_022412 [Morus notabilis] (e value 3.1e-24, identity 34.54%)
Query:  FVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQT
        F++RVI Q+ W++FC HP   +VPLV EFYA L + +     V+   V F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S  
Subjt:  FVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQT

Query:  KVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILAC-GWKRAGKLFFSSLITQLCQRVKIVPGKDE
           + +  +LK  + +W HF+  R MP T   T++ DRV+LLY ++ G+ +N+  I   EI AC   ++ G L+F SLITQL  +  +   KDE
Subjt:  KVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILAC-GWKRAGKLFFSSLITQLCQRVKIVPGKDE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value 8.1e-33, identity 35.38%)
Query:  KSPKAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSS
        K+ KA   +    E   + N Q R  +  +K F+ D   S+  G  L F+++VI Q+ W++FCAHP++ +VPLV EFYA L +   +   VRG  VS+S 
Subjt:  KSPKAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSS

Query:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEIN
          IN V+ +  P++   ++ I N +   +   L+ VA  G +W  S     + + S L P + VW HF+K  L+P T   T+S DR++LL+ ++ G  IN
Subjt:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQ
        VG +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      ID   + ++ Q
Subjt:  VGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value 4.3e-34, identity 36.84%)
Query:  DKGF---SDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEA
        +KGF   +      L F+++VI Q+ W++FCAHP++ +VPLV EFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N + + +   
Subjt:  DKGF---SDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEA

Query:  LKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQR
        L+ VA  G +W  S     + + S L P + VW HF+K RL+P T   T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+ 
Subjt:  LKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQR

Query:  VKIVPGKDEERHFVKPTIDLSLIGKLQQ
         +     +EE+      ID   + ++ Q
Subjt:  VKIVPGKDEERHFVKPTIDLSLIGKLQQ

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e value 5.4e-29, identity 32.92%)
Query:  KAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDI
        KA   ++   E+  + N Q R  +  +K+F+ D   + +      F++ VI Q+ WQ FCAHP++ +VPLV EFY  +         +RG  V  S   I
Subjt:  KAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDI

Query:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGS
        N ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K RL+P T   T+S + V LLY ++ G  INVG 
Subjt:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGS

Query:  IIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEER
        +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  IIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEER

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e value 3.9e-33, identity 35.38%)
Query:  KSPKAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSS
        K+ KA   +    E   + N Q R  +  +K F+ D   S+  G  L F+++VI Q+ W++FCAHP++ +VPLV EFYA L +   +   VRG  VS+S 
Subjt:  KSPKAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSS

Query:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEIN
          IN V+ +  P++   ++ I N +   +   L+ VA  G +W  S     + + S L P + VW HF+K  L+P T   T+S DR++LL+ ++ G  IN
Subjt:  VDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEIN

Query:  VGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQ
        VG +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      ID   + ++ Q
Subjt:  VGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQ

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value 2.1e-34, identity 36.84%)
Query:  DKGF---SDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEA
        +KGF   +      L F+++VI Q+ W++FCAHP++ +VPLV EFYA L +   +   VRG  VS+S   IN V+ +  P++   ++ I+N + + +   
Subjt:  DKGF---SDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEA

Query:  LKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQR
        L+ VA  G +W  S     + + S L P + VW HF+K RL+P T   T+S DR++LL+ ++ G  INVG +I  EI AC  ++ G LFF SLIT+LC+ 
Subjt:  LKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQR

Query:  VKIVPGKDEERHFVKPTIDLSLIGKLQQ
         +     +EE+      ID   + ++ Q
Subjt:  VKIVPGKDEERHFVKPTIDLSLIGKLQQ

A0A2P5DAQ2 Uncharacterized protein (e value 2.6e-29, identity 32.92%)
Query:  KAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDI
        KA   ++   E+  + N Q R  +  +K+F+ D   + +      F++ VI Q+ WQ FCAHP++ +VPLV EFY  +         +RG  V  S   I
Subjt:  KAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDI

Query:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGS
        N ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     + + S L P + VW HF+K RL+P T   T+S + V LLY ++ G  INVG 
Subjt:  NRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGS

Query:  IIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEER
        +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  IIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEER

W9QTD9 Uncharacterized protein (e value 1.5e-24, identity 34.54%)
Query:  FVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQT
        F++RVI Q+ W++FC HP   +VPLV EFYA L + +     V+   V F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S  
Subjt:  FVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQT

Query:  KVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILAC-GWKRAGKLFFSSLITQLCQRVKIVPGKDE
           + +  +LK  + +W HF+  R MP T   T++ DRV+LLY ++ G+ +N+  I   EI AC   ++ G L+F SLITQL  +  +   KDE
Subjt:  KVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILAC-GWKRAGKLFFSSLITQLCQRVKIVPGKDE

W9RBS1 Uncharacterized protein (e value 8.8e-25, identity 32.34%)
Query:  FAKRPRTRSMDVSPTVPPTISPAKPKGKSPKAASPKNPFPEVFKDVNFQERM---EIMKKKDF-LNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEA
        FAKRP + S    P +    + A     S +  S    F +   +  ++E +    ++K+K F L+D     + G    F+S VI    WQ FC HP + 
Subjt:  FAKRPRTRSMDVSPTVPPTISPAKPKGKSPKAASPKNPFPEVFKDVNFQERM---EIMKKKDF-LNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEA

Query:  VVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVW
        +VPLV EFYA L+ +  +   V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     +    +L+P + VW
Subjt:  VVPLVCEFYAGLREESISMAVVRGKMVSFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVW

Query:  LHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQNNIQR
         HF+  RL+  T   TIS +R +LLY ++ G  INVG +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++
Subjt:  LHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEINVGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQNNIQR

Query:  KDK
         +K
Subjt:  KDK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
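
In the alignments above, the unlabeled row between each Query and Subjt line is the match line: a residue letter marks an identical position, "+" marks a conservative substitution, and a space marks a mismatch or gap. A rough sketch of recovering percent identity from those rows follows. The function name and the choice to count every aligned column (gaps included) in the denominator are assumptions, not something this page documents.

```python
def midline_identity(blocks):
    """Percent identity from (query_row, midline_row) pairs.

    Assumption: identity = identical columns / all aligned columns,
    where aligned columns include gap characters in the query row.
    """
    aligned_columns = sum(len(query) for query, _ in blocks)
    # Letters in the match line mark identical residues; '+' and spaces do not.
    identical = sum(ch.isalpha() for _, midline in blocks for ch in midline)
    if aligned_columns == 0:
        raise ValueError("no aligned columns")
    return 100.0 * identical / aligned_columns


# Illustration only: the first seven columns of the EXB49850.1 alignment,
# not the full alignment, so the value differs from the reported 32.34.
example = [("FAKRPRT", "FAKRP +")]
print(round(midline_identity(example), 2))
```

Passing every Query and match-line pair of one hit to the function gives a figure that can be compared with the reported % identity.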

Sequences
CDS sequence
ATGCAAGGCACGCGAAGGACAAGACCCACGGGATTCTCGCCAGCGGTCATGAACCGAGTTCCCAACGCTCCAGCTCCATCCTCTTCGACAATGTCGGCTAGTTCGAGGGA
GATGCCGAGCTCATCTACACCAAGACGGTTCACACGCGCCGCTGCTGTCCACCAAACCCAAAAACCTCCCACTCAACAATTCAGAAAACATTCACGGGAGTGGTTTGAGA
TGATCCGAGAGATGGGTGCCAAGAGAAGAGCTGCCCTTGAAGAAGAAGGGAATCGGCAAAACGAAGAAAAGGCTGCCAAGGCAACGGAAAGCTCTCGACAAGGAGAAGCT
TCAATAGGTATGGTTTCCGAACCTTCAACTAACAGCTCTCTATCTTGCAGGACCAAACCCATGGTTACTTACAGCGCAAGGAAGAAGAGCCCAAAGAAAAATGTGTCTGA
AAAGTCGCTTGAAGTTCAACCCCTGGAAATCGCAAAGATGCCGCCTGATGTATTCGAAGGAATAATCCGCCAAGCAGTGGCAAAGGCACTTGAGATTGCAGAGGGGTACA
AGGCTGAACAGGACGCTTTGAAAGAGGTTGAAGCGGAGAAAGAGATGGAAAATCAAAAGATGGTTGAAGAGGATGGGTTTGCAAAGGAGAGAGATGAGAAAGATGAAAGA
AGAAAAACAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGCTGAGGAAGAGAGAAAATATGAGGAAAACCTCAGGAGGGCAGCTATGGATTTGAAGCTCCTTGAGGA
AGAGAAAAAGAGAAGGAAAGAAATAAAAGAAGACGAAAAGAGAAGAAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAGAGCTCAAAGTGAGGCTAATG
TGCTGCGAGGAAGGGTAGAAGAAGAGGCCCAACAGGGGCCAAGAGAAGAAAATTTAGAAAAAGAAGAAGAAACAGAAAAAAAAAAAGAAGAAGGCCAGAATGCGACCGCA
TCTGGGCTGCATTCTGAAGAAGGCATAGTCGAGGCCAATAAAGAGCAGTCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGCTGATAGCACCTC
TTCGGGGGAGAAAAGGGTTGAAGAGGAAAAAGAAGACGGGGAGGCCGAGACCTCCAGTGATTCTGACTCCGATACAGAGTCTGATTCAGAGATTAGGGAGCTAGATGAAG
ACCAAGTCCCTATCTCTGCAGCATTAAGAAGAAAAAGAAAGAGAGAGATTAAGGTTGAGAGGAGGACGAAGAACAAGAATGACCCAATCTTTGCCAAAAGGCCGAGGACG
AGATCCATGGACGTCTCTCCTACAGTTCCTCCAACCATCTCACCCGCCAAACCAAAGGGCAAATCACCCAAGGCCGCATCTCCTAAAAATCCGTTCCCTGAGGTATTTAA
AGATGTTAATTTTCAGGAAAGGATGGAGATCATGAAGAAAAAAGATTTCCTCAACGATAAGGGATTCTCTGACCGAGCAGGAGCACTGCTAGAGTTCGTAAGCAGAGTTA
TCTTCCAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCCCTAGTTTGTGAATTCTACGCCGGCCTGAGGGAGGAGAGTATTAGCATGGCGGTG
GTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATCAACCGGGTGTACAGGATCAAGGCACCCCTGAACCCAAGAGGGAATGATGTGATAAGGAACCCTTCGGCCAA
ACAGATGAAGGAAGCATTGAAACTCGTGGCCAACAAGGGGGTCCAATGGAAAGAATCGCAAACAAAAGTGAAGTCTCTAGTGCCAAGCGACCTAAAGCCAGAATCGGCAG
TGTGGCTTCATTTCATCAAGAAACGTTTGATGCCAATCACCCGCGACAACACGATTTCAGTAGATAGAGTGATGCTACTCTATTGCCTAATGAAGGGGTTGGAGATCAAT
GTAGGGAGCATTATTAGGGATGAAATCTTAGCCTGTGGATGGAAAAGGGCAGGCAAGCTTTTCTTTAGCTCACTCATCACCCAACTCTGTCAGAGGGTGAAGATTGTGCC
AGGCAAGGACGAGGAGCGCCATTTCGTTAAACCAACCATTGACTTGTCCTTGATAGGGAAGCTCCAGCAGAACAACATCCAGAGGAAGGATAAAGCCTCCACGTCATAG
mRNA sequence
ATGCAAGGCACGCGAAGGACAAGACCCACGGGATTCTCGCCAGCGGTCATGAACCGAGTTCCCAACGCTCCAGCTCCATCCTCTTCGACAATGTCGGCTAGTTCGAGGGA
GATGCCGAGCTCATCTACACCAAGACGGTTCACACGCGCCGCTGCTGTCCACCAAACCCAAAAACCTCCCACTCAACAATTCAGAAAACATTCACGGGAGTGGTTTGAGA
TGATCCGAGAGATGGGTGCCAAGAGAAGAGCTGCCCTTGAAGAAGAAGGGAATCGGCAAAACGAAGAAAAGGCTGCCAAGGCAACGGAAAGCTCTCGACAAGGAGAAGCT
TCAATAGGTATGGTTTCCGAACCTTCAACTAACAGCTCTCTATCTTGCAGGACCAAACCCATGGTTACTTACAGCGCAAGGAAGAAGAGCCCAAAGAAAAATGTGTCTGA
AAAGTCGCTTGAAGTTCAACCCCTGGAAATCGCAAAGATGCCGCCTGATGTATTCGAAGGAATAATCCGCCAAGCAGTGGCAAAGGCACTTGAGATTGCAGAGGGGTACA
AGGCTGAACAGGACGCTTTGAAAGAGGTTGAAGCGGAGAAAGAGATGGAAAATCAAAAGATGGTTGAAGAGGATGGGTTTGCAAAGGAGAGAGATGAGAAAGATGAAAGA
AGAAAAACAGAAGAAGAGCAAGAGGCCGAGAGGGCCTTAGAAGCTGAGGAAGAGAGAAAATATGAGGAAAACCTCAGGAGGGCAGCTATGGATTTGAAGCTCCTTGAGGA
AGAGAAAAAGAGAAGGAAAGAAATAAAAGAAGACGAAAAGAGAAGAAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAGAGCTCAAAGTGAGGCTAATG
TGCTGCGAGGAAGGGTAGAAGAAGAGGCCCAACAGGGGCCAAGAGAAGAAAATTTAGAAAAAGAAGAAGAAACAGAAAAAAAAAAAGAAGAAGGCCAGAATGCGACCGCA
TCTGGGCTGCATTCTGAAGAAGGCATAGTCGAGGCCAATAAAGAGCAGTCAGCTGAAGAGGTTTTTGAGCCTCTATTCACACATGACCCACCAGCTGCTGATAGCACCTC
TTCGGGGGAGAAAAGGGTTGAAGAGGAAAAAGAAGACGGGGAGGCCGAGACCTCCAGTGATTCTGACTCCGATACAGAGTCTGATTCAGAGATTAGGGAGCTAGATGAAG
ACCAAGTCCCTATCTCTGCAGCATTAAGAAGAAAAAGAAAGAGAGAGATTAAGGTTGAGAGGAGGACGAAGAACAAGAATGACCCAATCTTTGCCAAAAGGCCGAGGACG
AGATCCATGGACGTCTCTCCTACAGTTCCTCCAACCATCTCACCCGCCAAACCAAAGGGCAAATCACCCAAGGCCGCATCTCCTAAAAATCCGTTCCCTGAGGTATTTAA
AGATGTTAATTTTCAGGAAAGGATGGAGATCATGAAGAAAAAAGATTTCCTCAACGATAAGGGATTCTCTGACCGAGCAGGAGCACTGCTAGAGTTCGTAAGCAGAGTTA
TCTTCCAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCCCTAGTTTGTGAATTCTACGCCGGCCTGAGGGAGGAGAGTATTAGCATGGCGGTG
GTGAGGGGGAAGATGGTCAGTTTCTCCTCAGTCGACATCAACCGGGTGTACAGGATCAAGGCACCCCTGAACCCAAGAGGGAATGATGTGATAAGGAACCCTTCGGCCAA
ACAGATGAAGGAAGCATTGAAACTCGTGGCCAACAAGGGGGTCCAATGGAAAGAATCGCAAACAAAAGTGAAGTCTCTAGTGCCAAGCGACCTAAAGCCAGAATCGGCAG
TGTGGCTTCATTTCATCAAGAAACGTTTGATGCCAATCACCCGCGACAACACGATTTCAGTAGATAGAGTGATGCTACTCTATTGCCTAATGAAGGGGTTGGAGATCAAT
GTAGGGAGCATTATTAGGGATGAAATCTTAGCCTGTGGATGGAAAAGGGCAGGCAAGCTTTTCTTTAGCTCACTCATCACCCAACTCTGTCAGAGGGTGAAGATTGTGCC
AGGCAAGGACGAGGAGCGCCATTTCGTTAAACCAACCATTGACTTGTCCTTGATAGGGAAGCTCCAGCAGAACAACATCCAGAGGAAGGATAAAGCCTCCACGTCATAG
Protein sequence
MQGTRRTRPTGFSPAVMNRVPNAPAPSSSTMSASSREMPSSSTPRRFTRAAAVHQTQKPPTQQFRKHSREWFEMIREMGAKRRAALEEEGNRQNEEKAAKATESSRQGEA
SIGMVSEPSTNSSLSCRTKPMVTYSARKKSPKKNVSEKSLEVQPLEIAKMPPDVFEGIIRQAVAKALEIAEGYKAEQDALKEVEAEKEMENQKMVEEDGFAKERDEKDER
RKTEEEQEAERALEAEEERKYEENLRRAAMDLKLLEEEKKRRKEIKEDEKRRKEAEDFLAAFEPLHRAQSEANVLRGRVEEEAQQGPREENLEKEEETEKKKEEGQNATA
SGLHSEEGIVEANKEQSAEEVFEPLFTHDPPAADSTSSGEKRVEEEKEDGEAETSSDSDSDTESDSEIRELDEDQVPISAALRRKRKREIKVERRTKNKNDPIFAKRPRT
RSMDVSPTVPPTISPAKPKGKSPKAASPKNPFPEVFKDVNFQERMEIMKKKDFLNDKGFSDRAGALLEFVSRVIFQYKWQEFCAHPQEAVVPLVCEFYAGLREESISMAV
VRGKMVSFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKSLVPSDLKPESAVWLHFIKKRLMPITRDNTISVDRVMLLYCLMKGLEIN
VGSIIRDEILACGWKRAGKLFFSSLITQLCQRVKIVPGKDEERHFVKPTIDLSLIGKLQQNNIQRKDKASTS
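
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal sketch follows, assuming the standard nuclear genetic code (the page does not name a translation table). The function name translate is hypothetical, and only short fragments of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed, see above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last five codons of the CDS listed above.
print(translate("ATGCAAGGCACGCGAAGG"))  # MQGTRR
print(translate("GCCTCCACGTCATAG"))     # ASTS
```

Both fragments agree with the start (MQGTRR) and end (ASTS) of the protein sequence listed above.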