; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033407 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033407
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold5:3000688..3003756
RNA-Seq ExpressionSpg033407
SyntenySpg033407
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.2e-2935.81Show/hide
Query:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    LVREFYAN+         V+ ++V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC-WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILF
                   LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ +I   EI  C   +K G L+FP+ IT L  KA VP  + + I+ 
Subjt:  KTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC-WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILF

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]2.1e-2934.38Show/hide
Query:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNF
        RFV+  A  +Y + L+ +  + ERGF   GE    H   T +    W+ F + PES    LVREFYAN    +    +VRG EV +    IN LY +   
Subjt:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNF

Query:  PHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC
           A+       +     +  R +   GAQW+++K E   F+S  L + A  W+ FI  RMLPT H   V+ +R LL + I+   + DVGKII+D I+  
Subjt:  PHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC

Query:  WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGG
               L+FP+ IT LC +AGV  DE + ++F +  ID   + R+        GG
Subjt:  WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGG

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.7e-3134.81Show/hide
Query:  ASEEHDEIE-EQQLPDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLPDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRG

Query:  IEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI
        ++V WS  AINA++ L + P   ++E     +   L   +  V + GA+W +S         + L   A  W  F++  +LPTTH  TVS++R+LL  ++
Subjt:  IEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI

Query:  LRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQE
        L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+  TQE
Subjt:  LRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.9e-3931.42Show/hide
Query:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKL
        +F    A  +Y   +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AINA++ L
Subjt:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKL

Query:  QNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEI
         + P   ++E     + + L   +  V   GA+W +S         + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG++I  EI
Subjt:  QNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEI

Query:  LGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQR---TQEARQGGLVFGINTILEQLALSAKRQEFAERQRTTLVYTTLYYYL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ +   T+  +Q                S+ R   A   RT           
Subjt:  LGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQR---TQEARQGGLVFGINTILEQLALSAKRQEFAERQRTTLVYTTLYYYL

Query:  TRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALTFWSYVRNRDANLKKALQENFSKPYPALPAFPEDL
                I  Q+K           A  QR  +             ++Q  + +  Q    +Q   FW+Y + RD  LKKALQ NF++P P  PAFP+++
Subjt:  TRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALTFWSYVRNRDANLKKALQENFSKPYPALPAFPEDL

Query:  L
        L
Subjt:  L

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]3.7e-3131.64Show/hide
Query:  LVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQR
        LVREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S         + L   A  W  F++ R
Subjt:  LVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGGLVFG
        +LPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+  TQE        G
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGGLVFG

Query:  INTILEQLALSAKRQEFAERQRTTLVYTTLYYYLTRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALT
             +Q                                                 P+ +R      +R  G V      +EQ      +QE   +Q   
Subjt:  INTILEQLALSAKRQEFAERQRTTLVYTTLYYYLTRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALT

Query:  FWSYVRNRDANLKKALQENFSKPYPALPAFPEDLL
        FW+Y + RD  LKKALQ NF++P P  PAFP+++L
Subjt:  FWSYVRNRDANLKKALQENFSKPYPALPAFPEDLL

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein9.9e-3034.38Show/hide
Query:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNF
        RFV+  A  +Y + L+ +  + ERGF   GE    H   T +    W+ F + PES    LVREFYAN    +    +VRG EV +    IN LY +   
Subjt:  RFVNNFARAKY-AELLKRDFLFERGF--SGE--LPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNF

Query:  PHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC
           A+       +     +  R +   GAQW+++K E   F+S  L + A  W+ FI  RMLPT H   V+ +R LL + I+   + DVGKII+D I+  
Subjt:  PHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC

Query:  WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGG
               L+FP+ IT LC +AGV  DE + ++F +  ID   + R+        GG
Subjt:  WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGG

A0A2P5AGA5 Uncharacterized protein (Fragment)1.8e-3134.81Show/hide
Query:  ASEEHDEIE-EQQLPDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRG
        A + H  ++ E +  + R+ NN        +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+         VRG
Subjt:  ASEEHDEIE-EQQLPDDRFVNNFARAKYAELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRG

Query:  IEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI
        ++V WS  AINA++ L + P   ++E     +   L   +  V + GA+W +S         + L   A  W  F++  +LPTTH  TVS++R+LL  ++
Subjt:  IEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAI

Query:  LRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQE
        L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+  TQE
Subjt:  LRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.4e-3931.42Show/hide
Query:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKL
        +F    A  +Y   +  R    E+GF        G+LP F+   I  H W++FC+ PE     LVREFYAN+   E     VRG++V WS  AINA++ L
Subjt:  RFVNNFARAKYA-ELLKRDFLFERGF-------SGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKL

Query:  QNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEI
         + P   ++E     + + L   +  V   GA+W +S         + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG++I  EI
Subjt:  QNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEI

Query:  LGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQR---TQEARQGGLVFGINTILEQLALSAKRQEFAERQRTTLVYTTLYYYL
          C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+ +   T+  +Q                S+ R   A   RT           
Subjt:  LGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQR---TQEARQGGLVFGINTILEQLALSAKRQEFAERQRTTLVYTTLYYYL

Query:  TRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALTFWSYVRNRDANLKKALQENFSKPYPALPAFPEDL
                I  Q+K           A  QR  +             ++Q  + +  Q    +Q   FW+Y + RD  LKKALQ NF++P P  PAFP+++
Subjt:  TRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALTFWSYVRNRDANLKKALQENFSKPYPALPAFPEDL

Query:  L
        L
Subjt:  L

A0A2P5DXM3 Uncharacterized protein1.8e-3131.64Show/hide
Query:  LVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQR
        LVREFYAN+   E     VRG++V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S         + L   A  W  F++ R
Subjt:  LVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTEKRKFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGGLVFG
        +LPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+  TQE        G
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLARLQRTQEARQGGLVFG

Query:  INTILEQLALSAKRQEFAERQRTTLVYTTLYYYLTRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALT
             +Q                                                 P+ +R      +R  G V      +EQ      +QE   +Q   
Subjt:  INTILEQLALSAKRQEFAERQRTTLVYTTLYYYLTRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFAERQALT

Query:  FWSYVRNRDANLKKALQENFSKPYPALPAFPEDLL
        FW+Y + RD  LKKALQ NF++P P  PAFP+++L
Subjt:  FWSYVRNRDANLKKALQENFSKPYPALPAFPEDLL

W9QTD9 Uncharacterized protein5.8e-3035.81Show/hide
Query:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P +    LVREFYAN+         V+ ++V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC-WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILF
                   LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ +I   EI  C   +K G L+FP+ IT L  KA VP  + + I+ 
Subjt:  KTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGC-WKKKVGKLFFPNTITMLCRKAGVPEDEGDVILF

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAAAACGAGAGCGAGAAAAGAGAGAGATAATGAGGAAGAGGAGATACCCGTGACCCCTGAAGAACAGAAAGCAAAAACGAAAAAGAAGAAGACGCCAGAGGAGAA
AGAAGCGAAAAGGAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTATTCAGAAGGCGGCGGAAGATGTTGCTGCTCCGATAGTGGAGGAAGATCCGAAAGAACCTG
AAGTGCAGAACCCGGAACGGCCTGAGCCGAGAATTGCGGATACAGTGGAAAATACTGAAGAAGTTCAAGAAGGAAATACTGAGGAGACTCGAGTGGAGGTGATCATGCCG
GAGGTGCCCAAACGTCGCCGAATTAAGAGAAAAGCAGGCCGTGTAAAGGTAGTCCGAACTGACACCCCTTCGCCTCCAACGACTGATTCTGAAAGGGAGAATACAGAAAG
GGAAGAGCAAGAGAAGAAAGAGGCCGAGGAAAGAGCAAGAAAAGAAGCAGAGGAAAAAGCTGTGGAAGAGCGGTTGCTCAAGCGAAGGGTGGACAAGGGCAAAAATGTTG
CTGGAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACCGGATGATCGCTTTGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTTCTGAAAAGAGACTTC
CTATTTGAGAGGGGATTTAGTGGTGAGCTTCCACATTTTCTGAGGACCGGTATTGCGAATCACGGTTGGGAACGATTCTGTTCGAAACCCGAATCTGTGAACGCGCAGTT
AGTACGCGAATTCTATGCAAATATCGAGAGAGAAGAAGGTTTCCTAGCAATTGTTCGAGGTATTGAGGTCGACTGGAGTCCGAGTGCTATCAACGCACTGTATAAACTTC
AGAACTTCCCCCATGTGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAATTAAGTGATGCTGTGCGGGAGGTAGGTATTGAAGGGGCACAGTGGCAGCTGTCC
AAGACGGAGAAGAGGAAATTCCAGTCGGCTTATTTGAAAAGGGAAGCAAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGACGGTTTC
TAGGGAACGGGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATTGGGTTGTTGGAAAAAGAAGGTGGGGAAGC
TGTTTTTTCCGAATACCATTACAATGCTGTGCAGAAAAGCAGGGGTTCCAGAGGATGAAGGAGATGTGATTCTGTTTGACAAGGGAATCATCGACACGCCTAACTTGGCG
CGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTTCGGTATCAACACGATTCTAGAACAACTCGCACTTTCGGCCAAAAGGCAGGAGTTTGCCGAGAGGCA
AAGGACGACACTCGTTTATACGACACTTTATTATTACTTGACCCGTGCGCTTGCGGTAAGGAAAATTCCACATCAAGTCAAGTTTGACAAGGGAATCATCGACACGCCTA
ACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTTCGGTATCAACACGATTATAGAACAACTCGCACTTTCGGCCAAAAGGCAGGAGTTTGCC
GAGAGGCAAGCTTTAACCTTTTGGAGCTATGTTAGGAATCGTGATGCCAATCTGAAGAAGGCGCTTCAAGAGAATTTTTCCAAGCCATATCCAGCCCTTCCTGCATTCCC
TGAGGATTTGTTGAACCCCTGGATTCCACCCCCACCAGTTGAAAGAGGAGAAGAGGATGATGAAAATGAGCCAGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAAAACGAGAGCGAGAAAAGAGAGAGATAATGAGGAAGAGGAGATACCCGTGACCCCTGAAGAACAGAAAGCAAAAACGAAAAAGAAGAAGACGCCAGAGGAGAA
AGAAGCGAAAAGGAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTATTCAGAAGGCGGCGGAAGATGTTGCTGCTCCGATAGTGGAGGAAGATCCGAAAGAACCTG
AAGTGCAGAACCCGGAACGGCCTGAGCCGAGAATTGCGGATACAGTGGAAAATACTGAAGAAGTTCAAGAAGGAAATACTGAGGAGACTCGAGTGGAGGTGATCATGCCG
GAGGTGCCCAAACGTCGCCGAATTAAGAGAAAAGCAGGCCGTGTAAAGGTAGTCCGAACTGACACCCCTTCGCCTCCAACGACTGATTCTGAAAGGGAGAATACAGAAAG
GGAAGAGCAAGAGAAGAAAGAGGCCGAGGAAAGAGCAAGAAAAGAAGCAGAGGAAAAAGCTGTGGAAGAGCGGTTGCTCAAGCGAAGGGTGGACAAGGGCAAAAATGTTG
CTGGAGCATCAGAAGAGCACGATGAAATAGAAGAGCAACAGTTACCGGATGATCGCTTTGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTTCTGAAAAGAGACTTC
CTATTTGAGAGGGGATTTAGTGGTGAGCTTCCACATTTTCTGAGGACCGGTATTGCGAATCACGGTTGGGAACGATTCTGTTCGAAACCCGAATCTGTGAACGCGCAGTT
AGTACGCGAATTCTATGCAAATATCGAGAGAGAAGAAGGTTTCCTAGCAATTGTTCGAGGTATTGAGGTCGACTGGAGTCCGAGTGCTATCAACGCACTGTATAAACTTC
AGAACTTCCCCCATGTGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAATTAAGTGATGCTGTGCGGGAGGTAGGTATTGAAGGGGCACAGTGGCAGCTGTCC
AAGACGGAGAAGAGGAAATTCCAGTCGGCTTATTTGAAAAGGGAAGCAAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGACGGTTTC
TAGGGAACGGGTGCTTTTGGCTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATTGGGTTGTTGGAAAAAGAAGGTGGGGAAGC
TGTTTTTTCCGAATACCATTACAATGCTGTGCAGAAAAGCAGGGGTTCCAGAGGATGAAGGAGATGTGATTCTGTTTGACAAGGGAATCATCGACACGCCTAACTTGGCG
CGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTTCGGTATCAACACGATTCTAGAACAACTCGCACTTTCGGCCAAAAGGCAGGAGTTTGCCGAGAGGCA
AAGGACGACACTCGTTTATACGACACTTTATTATTACTTGACCCGTGCGCTTGCGGTAAGGAAAATTCCACATCAAGTCAAGTTTGACAAGGGAATCATCGACACGCCTA
ACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTTCGGTATCAACACGATTATAGAACAACTCGCACTTTCGGCCAAAAGGCAGGAGTTTGCC
GAGAGGCAAGCTTTAACCTTTTGGAGCTATGTTAGGAATCGTGATGCCAATCTGAAGAAGGCGCTTCAAGAGAATTTTTCCAAGCCATATCCAGCCCTTCCTGCATTCCC
TGAGGATTTGTTGAACCCCTGGATTCCACCCCCACCAGTTGAAAGAGGAGAAGAGGATGATGAAAATGAGCCAGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERDNEEEEIPVTPEEQKAKTKKKKTPEEKEAKRRRRQQRAEDQEAIQKAAEDVAAPIVEEDPKEPEVQNPERPEPRIADTVENTEEVQEGNTEETRVEVIMP
EVPKRRRIKRKAGRVKVVRTDTPSPPTTDSERENTEREEQEKKEAEERARKEAEEKAVEERLLKRRVDKGKNVAGASEEHDEIEEQQLPDDRFVNNFARAKYAELLKRDF
LFERGFSGELPHFLRTGIANHGWERFCSKPESVNAQLVREFYANIEREEGFLAIVRGIEVDWSPSAINALYKLQNFPHVAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS
KTEKRKFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEILGCWKKKVGKLFFPNTITMLCRKAGVPEDEGDVILFDKGIIDTPNLA
RLQRTQEARQGGLVFGINTILEQLALSAKRQEFAERQRTTLVYTTLYYYLTRALAVRKIPHQVKFDKGIIDTPNLARLQRTQEARQGGLVFGINTIIEQLALSAKRQEFA
ERQALTFWSYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVERGEEDDENEPGQED