CuGenDBv2

Spg032264 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg032264
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold2:35787088..35790247
RNA-Seq Expression: Spg032264
Synteny: Spg032264
Gene Ontology terms: NA
InterPro domains: NA
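
The genome location above follows a scaffold:start..end pattern. A minimal sketch of how such a string might be split and its span computed is given here. The helper names parse_location and span_length are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    scaffold, start, end = parse_location("scaffold2:35787088..35790247")
    # Under the inclusive-coordinate assumption this prints a span of 3160 bp.
    print(scaffold, start, end, span_length(start, end))
```

Under that assumption the gene spans 3160 bp of scaffold2. If the coordinates were instead 0-based half-open, the span would be one base shorter.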


Homology
GenBank top hits | e value | % identity
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | 3.9e-24 | 26.93
Query:  YNRFVNNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQDF---------------
        + +F N+ A+A++     R+  FE           GFG D+       ++ L W +F   P  VN+++V+EFYAN+    +   F               
Subjt:  YNRFVNNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQDF---------------

Query:  ---------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGK
                  HA F E      +++ +  + ++  E  +W   +T + +     L+  A  W  F+K +LMPT+H++TVS  R+LL  +++ S  IDVG+
Subjt:  ---------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGK

Query:  IISSEILDCWRKKVGKLFFPNTITMLCRRVEVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-L
        II  ++ DC  KK   L FPN IT LCR+ +V E+              D +PL+        K  +   ++   +   E R   L   + Q Q QL  L
Subjt:  IISSEILDCWRKKVGKLFFPNTITMLCRRVEVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-L

Query:  HSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL
        H          ++ F+ YVK RD  +    Q          P FPD +L
Subjt:  HSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | 1.1e-23 | 34.39
Query:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYANL D  E             ++  +A F  
Subjt:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--

Query:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
               +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  L+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI 
Subjt:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQRTQE
         C  +K G LFFP+ IT LCR    P   ++  L + G ID   +AR+  TQE
Subjt:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | 8.3e-35 | 32.95
Query:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYANL D EE             ++  +A F  
Subjt:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--

Query:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
               +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RL+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI 
Subjt:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SS
         C  +K G LFFP+ IT LCR    P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  S
Subjt:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SS

Query:  RMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL
         ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP  +L
Subjt:  RMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] | 3.9e-24 | 37.81
Query:  LKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLAR
        L   A  W  F+K RL+PTTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L   G ID   +AR
Subjt:  LKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNL
        +  TQE +                     G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP  L
Subjt:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNL

Query:  L
        L
Subjt:  L

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | 5.2e-29 | 34.18
Query:  IVREFYANLDDQEEFQDF------------------------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIK
        +VREFYANL D EE   +                         H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANLDDQEEFQDF------------------------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIK

Query:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARL------------
         RL+PTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR      +E+   L + G ID   +AR+            
Subjt:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARL------------

Query:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL
           R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP  +L
Subjt:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL

TrEMBL top hits | e value | % identity
A0A2P5AGA5 Uncharacterized protein (Fragment) | 5.5e-24 | 34.39
Query:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYANL D  E             ++  +A F  
Subjt:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--

Query:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
               +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  L+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI 
Subjt:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQRTQE
         C  +K G LFFP+ IT LCR    P   ++  L + G ID   +AR+  TQE
Subjt:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment) | 4.0e-35 | 32.95
Query:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYANL D EE             ++  +A F  
Subjt:  RFVNNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEE------------FQDFPHAAF--

Query:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
               +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RL+PTTH  TVS+DR+LL  ++L   SI+VG++I SEI 
Subjt:  -------NEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SS
         C  +K G LFFP+ IT LCR    P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  S
Subjt:  DCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QLH-SS

Query:  RMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL
         ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP  +L
Subjt:  RMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL

A0A2P5CEY2 Uncharacterized protein | 1.9e-24 | 37.81
Query:  LKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLAR
        L   A  W  F+K RL+PTTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L   G ID   +AR
Subjt:  LKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNL
        +  TQE +                     G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP  L
Subjt:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNL

Query:  L
        L
Subjt:  L

A0A2P5DXM3 Uncharacterized protein | 2.5e-29 | 34.18
Query:  IVREFYANLDDQEEFQDF------------------------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIK
        +VREFYANL D EE   +                         H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANLDDQEEFQDF------------------------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIK

Query:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARL------------
         RL+PTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR      +E+   L + G ID   +AR+            
Subjt:  LRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDKGIIDTPNLARL------------

Query:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL
           R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP  +L
Subjt:  --QRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL

A0A6A3BU96 Uncharacterized protein | 1.9e-24 | 26.93
Query:  YNRFVNNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQDF---------------
        + +F N+ A+A++     R+  FE           GFG D+       ++ L W +F   P  VN+++V+EFYAN+    +   F               
Subjt:  YNRFVNNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQDF---------------

Query:  ---------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGK
                  HA F E      +++ +  + ++  E  +W   +T + +     L+  A  W  F+K +LMPT+H++TVS  R+LL  +++ S  IDVG+
Subjt:  ---------PHAAFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGK

Query:  IISSEILDCWRKKVGKLFFPNTITMLCRRVEVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-L
        II  ++ DC  KK   L FPN IT LCR+ +V E+              D +PL+        K  +   ++   +   E R   L   + Q Q QL  L
Subjt:  IISSEILDCWRKKVGKLFFPNTITMLCRRVEVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-L

Query:  HSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL
        H          ++ F+ YVK RD  +    Q          P FPD +L
Subjt:  HSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLL

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGCGAAAACGAGAGCTAGAAAAGAGAGGGAGAATGAGGACGAAGAGGTACCTGTTACCCTTGTAGCACAGAAAGCGAAAACGAAAAAGAAGAAAACGCCAGAGGAGAA
AGAAGCGAAAAGGAGGAGAAAGCAACAGAAGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCGACTGTTACTGCCACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGG
AAAATACCGAGCAGAGGGTCACGGATACAGAGGAAGAACGAACAGAAGAGGTGCAAGAAGATCGGACCGAGGAAGTTCAAGAAGAACTTACAGAGGAAGTTCAAGAACAG
CAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTGCAGGACGCTCAAGTAGAGGTGATCATGCCGAAGGTGCCAAAGCGTCG
ACGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTGTGACCACGGATTCTGAAAGAGAGAATGCAGAGAGAGTAGAGCGTGAAAAGA
AGGAAGCTGAGGACAAGGGAAGAGAAGAAGAAGCGAAGAAAGCAGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAAGGCAAAAGCGTGGCGGAAGCATCAGAAGAA
CCTGAGGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCGTCAATAACCTTGCTCGGGCAAAGTATGTTGAGATGCTGAGAAGGGATTTCCTGTTTGAACGAGGATT
TGGTGATGATCTGCCGCGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCGAAGCCGGAGCCTGTTAATTCCAACATTGTTCGGGAGTTTTACG
CAAATCTTGACGATCAGGAAGAATTTCAGGATTTTCCGCATGCGGCCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAATTAAATGCGGCTGTCCGAGAGGTTGGC
ATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGCATGGATGGATTTCATCAAGCTGCGCTTAAT
GCCGACAACTCACGACTCAACGGTATCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTGG
ATTGCTGGAGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGTAGAGGTGCCAGAAGATGAGGATGATGTGCCGTTAATAGACAAG
GGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTC
CAGCAGGATGGAATTTGTTGAAAGACAATTGCAGACTTTCTGGAGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCCTTGCAGTCGAATTTTTCCAAGCCATATC
CGGCTTTGCCCGTATTCCCTGACAACCTACTGAACCCCTGGATCCCACCCCCACCTGTTGAACGAGAGGAAGTTGATGAAGAGCAGGACACCTTTTGCTTGAGCATTTTC
TCTGACCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTATTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTATTTGTCTTCGCGTCAAAAG
AGTATTGAGCCTAATTGGTGATAAGTTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCGTGCTTAAGAGCTTATGACTGTAAGGCTGCTTTAAGTCTGAAGA
ACAAAAATTTAAACCCCTTGAAAATATGTTTTGATATATCTGATAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGG
AAGAATTATTTTGCTGCATCAGAGCTTGGTTTTGCAGAGTGCTCAGAATATGTTGTTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAAAGCTG
CCACGTCACAGCTCGTTAG
mRNA sequence
ATGGCGAAAACGAGAGCTAGAAAAGAGAGGGAGAATGAGGACGAAGAGGTACCTGTTACCCTTGTAGCACAGAAAGCGAAAACGAAAAAGAAGAAAACGCCAGAGGAGAA
AGAAGCGAAAAGGAGGAGAAAGCAACAGAAGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCGACTGTTACTGCCACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGG
AAAATACCGAGCAGAGGGTCACGGATACAGAGGAAGAACGAACAGAAGAGGTGCAAGAAGATCGGACCGAGGAAGTTCAAGAAGAACTTACAGAGGAAGTTCAAGAACAG
CAGGCCGAGGATGTTCAAATGCAACAGGCAGAAGAGGTTCAGGTACCGGATAATGAGCCAGTGCAGGACGCTCAAGTAGAGGTGATCATGCCGAAGGTGCCAAAGCGTCG
ACGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCTGTGACCACGGATTCTGAAAGAGAGAATGCAGAGAGAGTAGAGCGTGAAAAGA
AGGAAGCTGAGGACAAGGGAAGAGAAGAAGAAGCGAAGAAAGCAGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAAAAAGGCAAAAGCGTGGCGGAAGCATCAGAAGAA
CCTGAGGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCGTCAATAACCTTGCTCGGGCAAAGTATGTTGAGATGCTGAGAAGGGATTTCCTGTTTGAACGAGGATT
TGGTGATGATCTGCCGCGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCGAAGCCGGAGCCTGTTAATTCCAACATTGTTCGGGAGTTTTACG
CAAATCTTGACGATCAGGAAGAATTTCAGGATTTTCCGCATGCGGCCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAATTAAATGCGGCTGTCCGAGAGGTTGGC
ATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGCATGGATGGATTTCATCAAGCTGCGCTTAAT
GCCGACAACTCACGACTCAACGGTATCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTGG
ATTGCTGGAGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGTAGAGGTGCCAGAAGATGAGGATGATGTGCCGTTAATAGACAAG
GGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTC
CAGCAGGATGGAATTTGTTGAAAGACAATTGCAGACTTTCTGGAGCTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCCTTGCAGTCGAATTTTTCCAAGCCATATC
CGGCTTTGCCCGTATTCCCTGACAACCTACTGAACCCCTGGATCCCACCCCCACCTGTTGAACGAGAGGAAGTTGATGAAGAGCAGGACACCTTTTGCTTGAGCATTTTC
TCTGACCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTATTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTATTTGTCTTCGCGTCAAAAG
AGTATTGAGCCTAATTGGTGATAAGTTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCGTGCTTAAGAGCTTATGACTGTAAGGCTGCTTTAAGTCTGAAGA
ACAAAAATTTAAACCCCTTGAAAATATGTTTTGATATATCTGATAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGG
AAGAATTATTTTGCTGCATCAGAGCTTGGTTTTGCAGAGTGCTCAGAATATGTTGTTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAAAGCTG
CCACGTCACAGCTCGTTAG
Protein sequence
MAKTRARKERENEDEEVPVTLVAQKAKTKKKKTPEEKEAKRRRKQQKAEEQEKATEVATVTATVEEESPKQPEENTEQRVTDTEEERTEEVQEDRTEEVQEELTEEVQEQ
QAEDVQMQQAEEVQVPDNEPVQDAQVEVIMPKVPKRRRVKRKAGRARVVRTDTPSPVTTDSERENAERVEREKKEAEDKGREEEAKKAEEEILLKRRAEKGKSVAEASEE
PEEIEESRFPYNRFVNNLARAKYVEMLRRDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANLDDQEEFQDFPHAAFNEMVVAPSNDQLNAAVREVG
IEGAQWRLSKTEKRTFQAAYLKSEANAWMDFIKLRLMPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRVEVPEDEDDVPLIDK
GIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKRRDAALRVALQSNFSKPYPALPVFPDNLLNPWIPPPPVEREEVDEEQDTFCLSIF
SDLVVAAAKKILEVVLTYVIRFKLRSSPALICLRVKRVLSLIGDKFEARVYCTIKWVIPCLRAYDCKAALSLKNKNLNPLKICFDISDNRAKLWQVLRIELKVVIICPCR
KNYFAASELGFAECSEYVVGRLEGANSVLQQNWEQSCHVTAR
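
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS and comparing the result with the protein. The sketch below is one possible way to do that. It assumes the standard genetic code (NCBI translation table 1), which this page does not state, and the names translate and cds_matches_protein are hypothetical helpers rather than part of any CuGenDB tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks ignored) codon by codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def cds_matches_protein(cds: str, protein: str) -> bool:
    """True if the translated CDS, minus one trailing stop, equals the protein."""
    translated = translate(cds)
    if translated.endswith("*"):
        translated = translated[:-1]
    return translated == "".join(protein.split()).upper()


if __name__ == "__main__":
    # First 27 bases of the CDS above and the first 9 residues of the protein.
    print(translate("ATGGCGAAAACGAGAGCTAGAAAAGAG"))  # MAKTRARKE
    # Last 18 bases of the CDS (including the TAG stop) against the final residues.
    print(cds_matches_protein("CACGTCACAGCTCGTTAG", "HVTAR"))  # True
```

Passing the full CDS and protein text from this page to cds_matches_protein would run the same check over the whole gene. Line breaks in the pasted sequences are ignored by both functions.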