; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg024636 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg024636
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold12:19852397..19854091
RNA-Seq ExpressionSpg024636
SyntenySpg024636
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8680640.1 hypothetical protein F3Y22_tig00111372pilonHSYRG00020 [Hibiscus syriacus]2.3e-2427.74Show/hide
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F ++ A+A++Q   K+   FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++P A+   F LQ
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        D    HA F E   + + D++   + ++  E  +W                          + +L+PT++++TVS  R+LL  +I  S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L             +++    Q      +  ++E++    + +  +  +++
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ

Query:  TFWSYVKRRD
         F+ YVK RD
Subjt:  TFWSYVKRRD

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.2e-2526.57Show/hide
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F N+ A+A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++  A+N  F LQ
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA+F E      +++    + ++  E  +W                          + +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L             +++    +      +  ++E +    +++  +   ++
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ

Query:  TFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREED
         F+ YVK RD  +    Q          P FPD++L  +      E E D
Subjt:  TFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREED

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.3e-2936.86Show/hide
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V + GA+W +                           LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.6e-4033.88Show/hide
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V   GA+W +                          RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H 
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-

Query:  SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE
         S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L         E ++D   +  E
Subjt:  SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.1e-3336.91Show/hide
Query:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------
        VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P   +L   +  V   GA+W +                          
Subjt:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------

Query:  RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE--
        RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+  
Subjt:  RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE--

Query:  ---------ARQGGLVCGIQQIQELLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE
                 +R  G V  +QQ++ L Q   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L         E ++D   +  E
Subjt:  ---------ARQGGLVCGIQQIQELLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.1e-2936.86Show/hide
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V + GA+W +                           LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)3.2e-4033.88Show/hide
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V   GA+W +                          RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H 
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-

Query:  SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE
         S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L         E ++D   +  E
Subjt:  SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE

A0A2P5DXM3 Uncharacterized protein9.9e-3436.91Show/hide
Query:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------
        VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P   +L   +  V   GA+W +                          
Subjt:  VREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQD--FPHAVFNEMVVAPSNDQLSAAVREVGIEGAQWRL--------------------------

Query:  RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE--
        RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+  
Subjt:  RLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL------QRTQE--

Query:  ---------ARQGGLVCGIQQIQELLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE
                 +R  G V  +QQ++ L Q   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L         E ++D   +  E
Subjt:  ---------ARQGGLVCGIQQIQELLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQE

A0A6A2YMQ9 Uncharacterized protein1.1e-2427.74Show/hide
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F ++ A+A++Q   K+   FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++P A+   F LQ
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        D    HA F E   + + D++   + ++  E  +W                          + +L+PT++++TVS  R+LL  +I  S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L             +++    Q      +  ++E++    + +  +  +++
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ

Query:  TFWSYVKRRD
         F+ YVK RD
Subjt:  TFWSYVKRRD

A0A6A3BU96 Uncharacterized protein5.8e-2626.57Show/hide
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F N+ A+A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++  A+N  F LQ
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA+F E      +++    + ++  E  +W                          + +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSAAVREVGIEGAQW--------------------------RLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L             +++    +      +  ++E +    +++  +   ++
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQ

Query:  TFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREED
         F+ YVK RD  +    Q          P FPD++L  +      E E D
Subjt:  TFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAGTGAGGAGGAGGAGGTACCGGTTACTCCGGAAATGCACAAAGGGAAAACGAAGAAGAAGAGAACACCGGAAGAAAA
AGAAGCCAAAAGAAGGAAAAGGCAACAACGGGCTGAGGAGCAAGAAGCTGTTCAGAAAGCGACAGAAGATGTTACTACTGCAGTGGGAGAAGGAAATCCGAAAGAACCTG
AAGTGCAGAACCCAGAACAGAATGAGTCGAGAATTGCAGATACAGAGGAAAATACAGAAAGAGTTCAAGAAAAGAACACTGAAGAGATTCAAGAAAAACAGGCTGAGGAG
GTGCAGGAACATCATGCAGAGGTTGCACCTGAAGAAGGCAATGAGCAAGAGCAGGAGGCTCGAGTGGAGGTGATCATGCCGGAGGTACCCAAACGTCGCCGCATTAAAAG
AAAAGCGGGTCGCGTCAAGGTAGTCCGAACTGATACCCCCTCGCCTCCAACCACTGATTCTGAAAGAGAGAATGCAGAGAGAGAAGAGCGTGAAAAGAAGGAAGCTGAAG
ACAAAGCAAAGGAAGAAGAAGCAAAGAAGGCTGAGGAAGAGATTTTGCGCAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCAGAGGCATCAGGTGCGGCTGATGAGGTT
GAGGCACAAGGGTTACCTTTTATTCGCTTCGTCAACAACCTTGCTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAACGAGGATTTGGCAATGAATT
GCCACGGTTCTTGAGGACTGGAATAGAGAACCTCGGCTGGAGCCAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGCAAATCTTGACG
ATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTGGATTGGAGCCCAGAAGCTGTTAATGAATTGTTTGATCTCCAGGATTTTCCGCATGCAGTCTTCAATGAG
ATGGTGGTTGCCCCATCTAACGATCAGTTAAGTGCGGCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAATGGAGGTTGCGTTTACTGCCGACTACGCATGACTCCACAGT
ATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTTCGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGA
AGCTGTTTTTCCCCAACACTATCACGATGTTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGATATGATATTACCAGATAAGGGAATAATTGACACGCCAAATTTG
GCTAGGCTTCAGAGAACACAGGAAGCACGCCAAGGGGGTTTGGTGTGCGGCATCCAACAAATTCAGGAGCTGTTGCAACTGCATTCCAGTAGGATGGAATTTGTTGAAAG
ACAATTGCAGACTTTCTGGAGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACG
ACCTACTGAACCCCTGGATCCCGCCCCCACCTGTTGAACGAGAGGAAGATGATGAAGAGCAGGGTCAGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAGTGAGGAGGAGGAGGTACCGGTTACTCCGGAAATGCACAAAGGGAAAACGAAGAAGAAGAGAACACCGGAAGAAAA
AGAAGCCAAAAGAAGGAAAAGGCAACAACGGGCTGAGGAGCAAGAAGCTGTTCAGAAAGCGACAGAAGATGTTACTACTGCAGTGGGAGAAGGAAATCCGAAAGAACCTG
AAGTGCAGAACCCAGAACAGAATGAGTCGAGAATTGCAGATACAGAGGAAAATACAGAAAGAGTTCAAGAAAAGAACACTGAAGAGATTCAAGAAAAACAGGCTGAGGAG
GTGCAGGAACATCATGCAGAGGTTGCACCTGAAGAAGGCAATGAGCAAGAGCAGGAGGCTCGAGTGGAGGTGATCATGCCGGAGGTACCCAAACGTCGCCGCATTAAAAG
AAAAGCGGGTCGCGTCAAGGTAGTCCGAACTGATACCCCCTCGCCTCCAACCACTGATTCTGAAAGAGAGAATGCAGAGAGAGAAGAGCGTGAAAAGAAGGAAGCTGAAG
ACAAAGCAAAGGAAGAAGAAGCAAAGAAGGCTGAGGAAGAGATTTTGCGCAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCAGAGGCATCAGGTGCGGCTGATGAGGTT
GAGGCACAAGGGTTACCTTTTATTCGCTTCGTCAACAACCTTGCTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAACGAGGATTTGGCAATGAATT
GCCACGGTTCTTGAGGACTGGAATAGAGAACCTCGGCTGGAGCCAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGCAAATCTTGACG
ATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTGGATTGGAGCCCAGAAGCTGTTAATGAATTGTTTGATCTCCAGGATTTTCCGCATGCAGTCTTCAATGAG
ATGGTGGTTGCCCCATCTAACGATCAGTTAAGTGCGGCTGTCCGAGAGGTTGGCATTGAGGGGGCCCAATGGAGGTTGCGTTTACTGCCGACTACGCATGACTCCACAGT
ATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTTCGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGA
AGCTGTTTTTCCCCAACACTATCACGATGTTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGATATGATATTACCAGATAAGGGAATAATTGACACGCCAAATTTG
GCTAGGCTTCAGAGAACACAGGAAGCACGCCAAGGGGGTTTGGTGTGCGGCATCCAACAAATTCAGGAGCTGTTGCAACTGCATTCCAGTAGGATGGAATTTGTTGAAAG
ACAATTGCAGACTTTCTGGAGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACG
ACCTACTGAACCCCTGGATCCCGCCCCCACCTGTTGAACGAGAGGAAGATGATGAAGAGCAGGGTCAGGAAGATTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERESEEEEVPVTPEMHKGKTKKKRTPEEKEAKRRKRQQRAEEQEAVQKATEDVTTAVGEGNPKEPEVQNPEQNESRIADTEENTERVQEKNTEEIQEKQAEE
VQEHHAEVAPEEGNEQEQEARVEVIMPEVPKRRRIKRKAGRVKVVRTDTPSPPTTDSERENAEREEREKKEAEDKAKEEEAKKAEEEILRKRREDKGKGIAEASGAADEV
EAQGLPFIRFVNNLARAKYQEMLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQDFPHAVFNE
MVVAPSNDQLSAAVREVGIEGAQWRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNL
ARLQRTQEARQGGLVCGIQQIQELLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQGQED