CuGenDBv2

Spg032521 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg032521
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DNA-directed RNA polymerase V subunit 5C-like
Genome location: scaffold2:31082220..31086861
RNA-Seq Expression: Spg032521
Synteny: Spg032521
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits: e value, % identity, alignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]; e value 1.8e-16; identity 30.38%
Query:  GIANHGWERFSYNELAVAPSNEQLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKII
        GI N   E   + EL      EQL + ++ + I GAQW LS     T     L+  A  W  F+  RLL +TH  T+SR R +L +A+L    I+VG++I
Subjt:  GIANHGWERFSYNELAVAPSNEQLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKII

Query:  ADEISGCWKKKVGKLFFPNTITMLCKRAGV--------LENEGDVILFDKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEFAER-----
         D+I  C +K  G L+FP+ I+ LC ++ V        L N G + L     I +    + ++ +E  +       +T   + A +A  QE+ E+     
Subjt:  ADEISGCWKKKVGKLFFPNTITMLCKRAGV--------LENEGDVILFDKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEFAER-----

Query:  ----------------QALTFWNYVRTRDANLKKALQ
                        Q   FW Y R RD  LKK+ Q
Subjt:  ----------------QALTFWNYVRTRDANLKKALQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]; e value 4.7e-25; identity 32.94%
Query:  SNEQLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPN
        + + L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+
Subjt:  SNEQLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPN

Query:  TITMLCK--RAGVLENEGDVILFDKRIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-------EFAERQALTF
         IT LC+  RA  L NE    L +   ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q       +   +Q   F
Subjt:  TITMLCK--RAGVLENEGDVILFDKRIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-------EFAERQALTF

Query:  WNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED
        W Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  +
Subjt:  WNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]; e value 9.8e-23; identity 34.74%
Query:  ANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCK--RAGVLENE------GDVILF--------
        A  W  F+K RLLPTTH  TVS++R+LL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+  RA  L NE      G++           
Subjt:  ANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCK--RAGVLENE------GDVILF--------

Query:  -DKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEF--------AERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIP
            +   P+ +R      +R  G +      LEQ       Q++          +Q   FW Y + RD  LKKALQ NF++P P  P FP++LL     
Subjt:  -DKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEF--------AERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIP

Query:  PPLVEREGDGEED
            E + DG  +
Subjt:  PPLVEREGDGEED

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]; e value 3.9e-27; identity 36.4%
Query:  QLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTIT
        +L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT
Subjt:  QLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTIT

Query:  MLCKRAGVLENEGDVILFDKRIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRTRDANLKK
         LC+ A  L NE    L +   ID   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   FW Y + RD  LKK
Subjt:  MLCKRAGVLENEGDVILFDKRIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRTRDANLKK

Query:  ALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED
        ALQ NF++P P  PAFP+++L         E + DG  +
Subjt:  ALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED

XP_024971944.1 uncharacterized protein LOC112510826 [Cynara cardunculus var. scolymus]; e value 4.6e-20; identity 31.98%
Query:  YNELAVAPSNEQLSDAVREVGIEGAQWRL-SKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKK
        +  L+ + S  +L +  RE+G  G +W   S    RT++++ LK  AN W+ FI+  L PTTHDS++S E+++L + ++   +I+VGK++   I  C K+
Subjt:  YNELAVAPSNEQLSDAVREVGIEGAQWRL-SKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKK

Query:  KVGKLFFPNTITMLCKRAGVLENEGDVIL---FDKRIIDTPNLARL-QRTQEARQ-GGLVYGINTILEQLALSASRQEFAERQALTFWNYVRTRDANLKK
        + GKLFFP+ I  L  +AGV E   D+++    +K  ID   +++L +R++  R+  G+   +  +LEQ   S S  +F   Q       ++T  A+L  
Subjt:  KVGKLFFPNTITMLCKRAGVLENEGDVIL---FDKRIIDTPNLARL-QRTQEARQ-GGLVYGINTILEQLALSASRQEFAERQALTFWNYVRTRDANLKK

Query:  ALQENFSKPFPALPAFPEDLLN
         L+ +  K         EDL N
Subjt:  ALQENFSKPFPALPAFPEDLLN

TrEMBL top hits: e value, % identity, alignment
A0A2G9G807 Uncharacterized protein; e value 4.3e-16; identity 30.28%
Query:  GAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVLENE
        GAQW+++K E  +F+S  L + A  W+ FI  R+LPT H   V+ +R LL + I+   + DVGKII+D I          L+FP+ IT LC RAGV  +E
Subjt:  GAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVLENE

Query:  GDVILFDKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEF---AERQA---LTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNP
         + ++F +  ID   + R+        GG    +   +  L    S QE     ER+    + +   +      L + +  +       +P F     +P
Subjt:  GDVILFDKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEF---AERQA---LTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNP

Query:  WIPPPLVEREGDGEEDPE
          PP       DG ED E
Subjt:  WIPPPLVEREGDGEEDPE

A0A2G9GQI5 Uncharacterized protein; e value 7.4e-16; identity 30.23%
Query:  GAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVLENE
        GAQW+++K E  +F+S  L + A  W+ FI  ++LPT+H   V+ ++ LL + I+   + DVGKII++ I          L+FP+ IT LC RAGV  +E
Subjt:  GAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVLENE

Query:  GDVILFDKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQE---FAERQALTFWNYVRTRD---ANLKKALQENFSKPFPALPAFPEDLLNP
         + ++F +  ID   + R+        GG    +   +  L    S QE     ER+     +YV         L + +  +       +P F  D  +P
Subjt:  GDVILFDKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQE---FAERQALTFWNYVRTRD---ANLKKALQENFSKPFPALPAFPEDLLNP

Query:  WIPPPLVEREGDGEE
          PPP    E + EE
Subjt:  WIPPPLVEREGDGEE

A0A2P5BCG4 Uncharacterized protein (Fragment); e value 2.3e-25; identity 32.94%
Query:  SNEQLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPN
        + + L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+
Subjt:  SNEQLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPN

Query:  TITMLCK--RAGVLENEGDVILFDKRIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-------EFAERQALTF
         IT LC+  RA  L NE    L +   ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q       +   +Q   F
Subjt:  TITMLCK--RAGVLENEGDVILFDKRIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-------EFAERQALTF

Query:  WNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED
        W Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  +
Subjt:  WNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED

A0A2P5CEY2 Uncharacterized protein; e value 4.8e-23; identity 34.74%
Query:  ANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCK--RAGVLENE------GDVILF--------
        A  W  F+K RLLPTTH  TVS++R+LL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+  RA  L NE      G++           
Subjt:  ANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCK--RAGVLENE------GDVILF--------

Query:  -DKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEF--------AERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIP
            +   P+ +R      +R  G +      LEQ       Q++          +Q   FW Y + RD  LKKALQ NF++P P  P FP++LL     
Subjt:  -DKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEF--------AERQALTFWNYVRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIP

Query:  PPLVEREGDGEED
            E + DG  +
Subjt:  PPLVEREGDGEED

A0A2P5DXM3 Uncharacterized protein; e value 1.9e-27; identity 36.4%
Query:  QLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTIT
        +L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT
Subjt:  QLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTIT

Query:  MLCKRAGVLENEGDVILFDKRIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRTRDANLKK
         LC+ A  L NE    L +   ID   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   FW Y + RD  LKK
Subjt:  MLCKRAGVLENEGDVILFDKRIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRTRDANLKK

Query:  ALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED
        ALQ NF++P P  PAFP+++L         E + DG  +
Subjt:  ALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEED

SwissProt top hits: e value, % identity, alignment
No hits found

Arabidopsis top hits: e value, % identity, alignment
No hits found

Sequences
CDS sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAAAGGAGGTACCCGTGACCCCCGAAGCACCGCAAGTAAAGGCAAAGAAGAAGAAGACACCAAAAGAAAA
AGAAGCTAAAAGAAGAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGCAGAGGATGTTATTGCAGAAGAAGATCCCAAAGAACCAGAGGGACAGA
ATCAAGAGCAGTCTGAGCCAGGAGTTGCAGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGGAGTTCGAGAAGGAAATACAGAGGAAGTTCGAGAAGACAATACAGAG
GAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAAGAACAGGTAGAGGTTGCGCCTGAAGAAGTTAGTGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATTATGCCGGA
AGTGCCCAAGCGTCGCCGTATAAAGCGCAAAGCGGGCCGTGTTAAGGTAGTCCGAGCTGATACCCCCTCGCCTCCAGCTACTGATTCTGAAAGAGAGAATGTTGAGAAAG
AAGAGCGTGAGAAGAAGGAGGCTGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCGCAAGCAAAGGGCAGACAGGGGAAGAGTAGCAAAA
TATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAGAGGGGATTTAGTGGTGATCTTCAACATTTTCTGAGGACCGGTATTGCAAACCACGGTTGGGAACGGTTTTCATA
TAATGAGTTGGCTGTAGCGCCATCCAATGAGCAGCTGAGTGACGCTGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCGGCTTTCGAAAATAGAGAAGAGGACGTTCC
AGTCAGCCTATTTGAAGAGGGAAGCAAATACTTGGATGGGATTTATCAAACAAAGGCTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCT
TTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATCTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTAC
CATGCTTTGCAAGCGAGCAGGGGTTCTAGAGAATGAAGGAGATGTGATATTATTTGACAAGAGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGG
CACGTCAGGGTGGACTGGTCTACGGCATAAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTTTGGAACTAT
GTTAGAACTCGTGATGCCAATCTGAAGAAGGCGTTGCAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCC
ACCGCTTGTCGAGAGAGAAGGAGATGGAGAAGAAGATCCTGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGTTTGTCT
TCGCGTCAAAAGATATGTATGATAATAGAGCTAGGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGAAGCAGAG
CTTGGTTTTGCAGAATGCTCAGGAACCCAACACTCACTCATCACCCAACTCTGTCAGAGGGTGAAGATTGTGCCAGTCAAGGACGAGGAGCGTCATTTCTTTAAACCAAC
CATTGATTTGTCCTTGATAGGAAAGCTTCAGCAAAACAACATCCAGAGGAAGGATAAAGCCTCCACATCACAGGCCACTCCTCAATCAGGGTCGAATGTTGCCTCTCCAT
CCCAACACACTCCTTTTACAGGGCCTTCACCAGCATCGGAAGTCCTAGGTATGGTCCACCGCCAGCTTGATCAAATCAGGGAGAACCTGAAGACATATTGGACATATGCC
AAGGAGAGGGATGAAGCTATTAGAGAGTTTTATCTCTCGATTGCCCCGAGCATTGCTCCAATCTTTCCAAATTTCCCTCGGTCGCTGCTGCCCCAAGAAAAAGAGGATTC
TGATGAAGAGGAAGATGAAGAGAATAAAGAGAAAGAGAGTTCCTCGGATGAGGAATCGGGGAGTTTTCTGATCCCCTTTGATTGA
mRNA sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAAAGGAGGTACCCGTGACCCCCGAAGCACCGCAAGTAAAGGCAAAGAAGAAGAAGACACCAAAAGAAAA
AGAAGCTAAAAGAAGAAGAAAACAGCAGAGGACTGAAGATCAAGAAGTTGCTCAGAAAGCGGCAGAGGATGTTATTGCAGAAGAAGATCCCAAAGAACCAGAGGGACAGA
ATCAAGAGCAGTCTGAGCCAGGAGTTGCAGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGGAGTTCGAGAAGGAAATACAGAGGAAGTTCGAGAAGACAATACAGAG
GAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAAGAACAGGTAGAGGTTGCGCCTGAAGAAGTTAGTGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATTATGCCGGA
AGTGCCCAAGCGTCGCCGTATAAAGCGCAAAGCGGGCCGTGTTAAGGTAGTCCGAGCTGATACCCCCTCGCCTCCAGCTACTGATTCTGAAAGAGAGAATGTTGAGAAAG
AAGAGCGTGAGAAGAAGGAGGCTGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCGCAAGCAAAGGGCAGACAGGGGAAGAGTAGCAAAA
TATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAGAGGGGATTTAGTGGTGATCTTCAACATTTTCTGAGGACCGGTATTGCAAACCACGGTTGGGAACGGTTTTCATA
TAATGAGTTGGCTGTAGCGCCATCCAATGAGCAGCTGAGTGACGCTGTGAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCGGCTTTCGAAAATAGAGAAGAGGACGTTCC
AGTCAGCCTATTTGAAGAGGGAAGCAAATACTTGGATGGGATTTATCAAACAAAGGCTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCT
TTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATCTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTAC
CATGCTTTGCAAGCGAGCAGGGGTTCTAGAGAATGAAGGAGATGTGATATTATTTGACAAGAGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGG
CACGTCAGGGTGGACTGGTCTACGGCATAAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTTTGGAACTAT
GTTAGAACTCGTGATGCCAATCTGAAGAAGGCGTTGCAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCC
ACCGCTTGTCGAGAGAGAAGGAGATGGAGAAGAAGATCCTGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGTTTGTCT
TCGCGTCAAAAGATATGTATGATAATAGAGCTAGGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGAAGCAGAG
CTTGGTTTTGCAGAATGCTCAGGAACCCAACACTCACTCATCACCCAACTCTGTCAGAGGGTGAAGATTGTGCCAGTCAAGGACGAGGAGCGTCATTTCTTTAAACCAAC
CATTGATTTGTCCTTGATAGGAAAGCTTCAGCAAAACAACATCCAGAGGAAGGATAAAGCCTCCACATCACAGGCCACTCCTCAATCAGGGTCGAATGTTGCCTCTCCAT
CCCAACACACTCCTTTTACAGGGCCTTCACCAGCATCGGAAGTCCTAGGTATGGTCCACCGCCAGCTTGATCAAATCAGGGAGAACCTGAAGACATATTGGACATATGCC
AAGGAGAGGGATGAAGCTATTAGAGAGTTTTATCTCTCGATTGCCCCGAGCATTGCTCCAATCTTTCCAAATTTCCCTCGGTCGCTGCTGCCCCAAGAAAAAGAGGATTC
TGATGAAGAGGAAGATGAAGAGAATAAAGAGAAAGAGAGTTCCTCGGATGAGGAATCGGGGAGTTTTCTGATCCCCTTTGATTGA
Protein sequence
MAKTRARKERDNEEKEVPVTPEAPQVKAKKKKTPKEKEAKRRRKQQRTEDQEVAQKAAEDVIAEEDPKEPEGQNQEQSEPGVADTEEVREENTEGVREGNTEEVREDNTE
EVQEKQAEDVQEEQVEVAPEEVSEQEQEARVEVIMPEVPKRRRIKRKAGRVKVVRADTPSPPATDSERENVEKEEREKKEAEDKAREEAEKKAEEERLRKQRADRGRVAK
YAELLKRDFLFERGFSGDLQHFLRTGIANHGWERFSYNELAVAPSNEQLSDAVREVGIEGAQWRLSKIEKRTFQSAYLKREANTWMGFIKQRLLPTTHDSTVSRERVLLA
FAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVLENEGDVILFDKRIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNY
VRTRDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPLVEREGDGEEDPETFCLSIFSGLVVAAAKKILEFVFASKDMYDNRARLWQVLRIELKVVIICPCRKNYFEAE
LGFAECSGTQHSLITQLCQRVKIVPVKDEERHFFKPTIDLSLIGKLQQNNIQRKDKASTSQATPQSGSNVASPSQHTPFTGPSPASEVLGMVHRQLDQIRENLKTYWTYA
KERDEAIREFYLSIAPSIAPIFPNFPRSLLPQEKEDSDEEEDEENKEKESSSDEESGSFLIPFD