; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019497 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019497
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold1:39595252..39608371
RNA-Seq ExpressionSpg019497
SyntenySpg019497
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.6e-2132.56Show/hide
Query:  EPRLPTGIVN-HGWSQFCAKPESVNSNIVREFYANIDD--QE--------------------GFQAIV-RFNEMVVAPSNDQLNATVREVGIEGAQWRLS
        +P   T +++ HGW QFC  P +    +VREFYAN+ D  QE                    G + +V  + +     +++QL   + EV IEGA W++S
Subjt:  EPRLPTGIVN-HGWSQFCAKPESVNSNIVREFYANIDD--QE--------------------GFQAIV-RFNEMVVAPSNDQLNATVREVGIEGAQWRLS

Query:  KTEKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDC-WRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLM
             T     LK  A  W  F+  R  P+TH  TV++DRVLL ++IL  +S+++ +I   EI  C   +K G L+FP+ IT L  +  VP  +D+  + 
Subjt:  KTEKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDC-WRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLM

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.6e-2435.35Show/hide
Query:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDD----------------QEGFQAIV-------RFNEMVVAPSNDQLNATVREVGIEGAQWRLSKT
        P +   I  H W QFCA PE     +VREFYAN+ D                +E   A+          +E +   +   L   +  V + GA+W +S  
Subjt:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDD----------------QEGFQAIV-------RFNEMVVAPSNDQLNATVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG
           T   + L   A  W  F+K  L PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L + G
Subjt:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG

Query:  IIDTPNLARLQRTQE
         ID   +AR+  TQE
Subjt:  IIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.2e-3533.44Show/hide
Query:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKT
        P +   I  H W QFCA PE     +VREFYAN+ D E     VR                        +E +   +   L   +  V   GA+W +S  
Subjt:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG
           T   + L   A  W  F+K RL PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L + G
Subjt:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG

Query:  IIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPAL
         ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  S ++   +Q Q FW Y K+RD AL+ ALQ+NF++P P  
Subjt:  IIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPAL

Query:  PIFPDDLL
        P FP ++L
Subjt:  PIFPDDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]2.6e-2437.81Show/hide
Query:  LKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLAR
        L   A  W  F+K RL PTTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L   G ID   +AR
Subjt:  LKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDL
        +  TQE +                     G ++  +  ++++L      Q H  S ++   +Q Q FW Y K+RD AL+ ALQ+NF++P P  P FP +L
Subjt:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDL

Query:  L
        L
Subjt:  L

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.7e-2833.46Show/hide
Query:  IVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRL
        +VREFYAN+ D E     VR                        +E +   +  +L   +  V   GA+W +S     T   + L   A  W  F+K RL
Subjt:  IVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRL

Query:  RPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLARL--------------Q
         PTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L + G ID   +AR+               
Subjt:  RPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQMQEQLQMHSSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDLL
        R   A        + Q  + L+   S+ E   +Q Q FW Y K+RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  RTQEARQGGLVCGIHQMQEQLQMHSSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.2e-2435.35Show/hide
Query:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDD----------------QEGFQAIV-------RFNEMVVAPSNDQLNATVREVGIEGAQWRLSKT
        P +   I  H W QFCA PE     +VREFYAN+ D                +E   A+          +E +   +   L   +  V + GA+W +S  
Subjt:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDD----------------QEGFQAIV-------RFNEMVVAPSNDQLNATVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG
           T   + L   A  W  F+K  L PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L + G
Subjt:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG

Query:  IIDTPNLARLQRTQE
         ID   +AR+  TQE
Subjt:  IIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)2.0e-3533.44Show/hide
Query:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKT
        P +   I  H W QFCA PE     +VREFYAN+ D E     VR                        +E +   +   L   +  V   GA+W +S  
Subjt:  PRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG
           T   + L   A  W  F+K RL PTTH  TVS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L + G
Subjt:  EKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKG

Query:  IIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPAL
         ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  S ++   +Q Q FW Y K+RD AL+ ALQ+NF++P P  
Subjt:  IIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPAL

Query:  PIFPDDLL
        P FP ++L
Subjt:  PIFPDDLL

A0A2P5CEY2 Uncharacterized protein1.2e-2437.81Show/hide
Query:  LKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLAR
        L   A  W  F+K RL PTTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L   G ID   +AR
Subjt:  LKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDL
        +  TQE +                     G ++  +  ++++L      Q H  S ++   +Q Q FW Y K+RD AL+ ALQ+NF++P P  P FP +L
Subjt:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QMH-SSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDL

Query:  L
        L
Subjt:  L

A0A2P5DXM3 Uncharacterized protein8.3e-2933.46Show/hide
Query:  IVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRL
        +VREFYAN+ D E     VR                        +E +   +  +L   +  V   GA+W +S     T   + L   A  W  F+K RL
Subjt:  IVREFYANIDDQEGFQAIVR-----------------------FNEMVVAPSNDQLNATVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRL

Query:  RPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLARL--------------Q
         PTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR    P   ++  L + G ID   +AR+               
Subjt:  RPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQMQEQLQMHSSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDLL
        R   A        + Q  + L+   S+ E   +Q Q FW Y K+RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  RTQEARQGGLVCGIHQMQEQLQMHSSRMEFAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDLL

W9QTD9 Uncharacterized protein7.5e-2232.56Show/hide
Query:  EPRLPTGIVN-HGWSQFCAKPESVNSNIVREFYANIDD--QE--------------------GFQAIV-RFNEMVVAPSNDQLNATVREVGIEGAQWRLS
        +P   T +++ HGW QFC  P +    +VREFYAN+ D  QE                    G + +V  + +     +++QL   + EV IEGA W++S
Subjt:  EPRLPTGIVN-HGWSQFCAKPESVNSNIVREFYANIDD--QE--------------------GFQAIV-RFNEMVVAPSNDQLNATVREVGIEGAQWRLS

Query:  KTEKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDC-WRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLM
             T     LK  A  W  F+  R  P+TH  TV++DRVLL ++IL  +S+++ +I   EI  C   +K G L+FP+ IT L  +  VP  +D+  + 
Subjt:  KTEKRTFQAAYLKSEANTWMGFIKLRLRPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDC-WRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLM

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCCGCAATTGGAGAGAAAGAATTTGCAGAGCAAAAGAATCGAAGTTGAAAATTTGCGCTTGCATCTGGTCCTGAAGATGTGGAATATGCATTCAATATGCACCTG
CAAACGTATAAGGAAAGTAGCCCCAACCGAAGATAGAAGTGTCAAGGCAGTGCACAGAAGAGATTTTGCGCCCGATTTGGGACCGCAACTGGGCTTTTCGACAGAATTCC
TAGATTCGTGGGATCTTCACAGAGTTATCCGAAATTCCCAAATCTCGGCATTTGCGGGTGCAACTGGAGGATTCTACCTGCACCATGTCTATGTGGACAACATAGATAGT
GTTCTTCAGCCGTTTGCAGTGTATGAGGTTGCTTTTAGATACATATTATATGTTTTAACCCATTTGGAAGAATGGAAGCTTTGGAAATTATTTTGTGCAGAATATGTTGC
TGGGCGACTTGAGGGAGCAAACTCTGTGCTGGAGCAAAGCTGGGAGCAAAAACTGCCACTTTGGTGCATGAACGATCCGCCTGAGGTAAGGTTCGAGCTTGATCCAGAAA
TCGAGAGGACATTTAGGATAAGAAGGAGAGAGCAGCGTAGACAGCAAAATCAAATGGCTAACGTGTCGCGTCTCCCGCAGGGTCCAGAAGATCCAGTTGATCCCCAGCAG
AATCAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCGATAAAGATTCTGGAGCATCTGTTACCCCTAAGGCACAAAAAGTGAAAACGAAGAAAAATAAAACGCCGGAGAA
AAAAGAAGCCAAACGAAGGAGAAGACAGCAGAGGGCTGAGGACCAAGAAGTTGTCCAGAAGGCGGCGGAAGATGTTGCTGCTACGTTAGTTGAAGAAGGAAATCTGAAAG
AACCAGAGGGACAGAACACTGAGCTGAGTGACCTAGTAGTTGCAGATACGGAGGAAGTTCAAGAAGAACAAACAGAGGACGTTCAAGAAAAACAGGCTGAAGATACGCAA
GAAGGTAGGACAGAGGATGTTCAGGAAGCAGGTAATGAGCAGGTGGAGCAAGAGCAAGAGGCTCGAGTGGAGGTTATCATGCCGGAGGTGCCACGACGTCGCCGCGTGAA
GCGCAAGGCAGGACGCGTCAAGGTAGTCAGAACTGATACTCCCTCGCCTCCAACCATTGATTCTGAAAGAGAAAATGCAGAAAGAGAGGAACGAGAGAAAAAAGAAGCCG
AGGACAAAGCAATAGAGGAAGAAGCGAAGAAGGCTAAGGAAGAGATTTTGCCCAAACAAACGGAAGATAGGGGCAAAGGTATTGATGAGGCATCGGGTGAAGCTGACGAG
ATTGAGGAGCCGCGACTGCCGACTGGAATAGTGAACCATGGCTGGAGTCAATTTTGTGCAAAGCCAGAGTCGGTTAATTCCAACATTGTTCGGGAATTCTATGCAAATAT
TGATGATCAAGAGGGATTTCAGGCTATCGTCCGCTTTAATGAGATGGTGGTAGCACCATCTAACGATCAGTTAAATGCGACTGTCCGAGAAGTTGGCATCGAGGGGGCTC
AGTGGAGATTGTCAAAGACAGAGAAGCGCACGTTCCAGGCCGCTTATTTGAAGAGTGAAGCCAATACATGGATGGGCTTCATCAAGCTACGCTTGCGGCCAACAACTCAC
GATTCAACGGTGTCTCGGGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCAATGAGTATCGATGTGGGTAAAATCATTTCGTCTGAAATTCATGACTGCTGGCGGAA
AAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGTAGGGGTGCCAGGGAGTGAGGATGATGTGAGTTTAATGGACAAGGGAATAATCGACA
CACCAAATCTGGCTAGGCTTCAAAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGATGCATTCCAGCAGGATGGAA
TTTGCTGAAAGGCAAGTCCAGACCTTTTGGAATTATGTGAAGAAAAGGGATGCCGCTTTGAGGGTGGCCTTGCAATCAAACTTTTCTAAACCATATCCGGCCTTACCCAT
ATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGATGTAGAAGAAGATCCTGAAACCTTTTGCTTGAGCAATTCTTCTGAATTTT
TGAAGCTTGGACTGGTCACAGCTACGGCAAAGAAGATTTTGGAGGTAGTGTTGACTTATTTTATCCACTTTAAGCTTACAATTATTTTGCTGCAGCAGAGCTTGGTTTTG
CAGAATGCTCAAGTAAAGGTTGAAGGTAGTGTTGGATTATCTGTTGTGATTAAGCTATGCGTTGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACCCGCAATTGGAGAGAAAGAATTTGCAGAGCAAAAGAATCGAAGTTGAAAATTTGCGCTTGCATCTGGTCCTGAAGATGTGGAATATGCATTCAATATGCACCTG
CAAACGTATAAGGAAAGTAGCCCCAACCGAAGATAGAAGTGTCAAGGCAGTGCACAGAAGAGATTTTGCGCCCGATTTGGGACCGCAACTGGGCTTTTCGACAGAATTCC
TAGATTCGTGGGATCTTCACAGAGTTATCCGAAATTCCCAAATCTCGGCATTTGCGGGTGCAACTGGAGGATTCTACCTGCACCATGTCTATGTGGACAACATAGATAGT
GTTCTTCAGCCGTTTGCAGTGTATGAGGTTGCTTTTAGATACATATTATATGTTTTAACCCATTTGGAAGAATGGAAGCTTTGGAAATTATTTTGTGCAGAATATGTTGC
TGGGCGACTTGAGGGAGCAAACTCTGTGCTGGAGCAAAGCTGGGAGCAAAAACTGCCACTTTGGTGCATGAACGATCCGCCTGAGGTAAGGTTCGAGCTTGATCCAGAAA
TCGAGAGGACATTTAGGATAAGAAGGAGAGAGCAGCGTAGACAGCAAAATCAAATGGCTAACGTGTCGCGTCTCCCGCAGGGTCCAGAAGATCCAGTTGATCCCCAGCAG
AATCAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCGATAAAGATTCTGGAGCATCTGTTACCCCTAAGGCACAAAAAGTGAAAACGAAGAAAAATAAAACGCCGGAGAA
AAAAGAAGCCAAACGAAGGAGAAGACAGCAGAGGGCTGAGGACCAAGAAGTTGTCCAGAAGGCGGCGGAAGATGTTGCTGCTACGTTAGTTGAAGAAGGAAATCTGAAAG
AACCAGAGGGACAGAACACTGAGCTGAGTGACCTAGTAGTTGCAGATACGGAGGAAGTTCAAGAAGAACAAACAGAGGACGTTCAAGAAAAACAGGCTGAAGATACGCAA
GAAGGTAGGACAGAGGATGTTCAGGAAGCAGGTAATGAGCAGGTGGAGCAAGAGCAAGAGGCTCGAGTGGAGGTTATCATGCCGGAGGTGCCACGACGTCGCCGCGTGAA
GCGCAAGGCAGGACGCGTCAAGGTAGTCAGAACTGATACTCCCTCGCCTCCAACCATTGATTCTGAAAGAGAAAATGCAGAAAGAGAGGAACGAGAGAAAAAAGAAGCCG
AGGACAAAGCAATAGAGGAAGAAGCGAAGAAGGCTAAGGAAGAGATTTTGCCCAAACAAACGGAAGATAGGGGCAAAGGTATTGATGAGGCATCGGGTGAAGCTGACGAG
ATTGAGGAGCCGCGACTGCCGACTGGAATAGTGAACCATGGCTGGAGTCAATTTTGTGCAAAGCCAGAGTCGGTTAATTCCAACATTGTTCGGGAATTCTATGCAAATAT
TGATGATCAAGAGGGATTTCAGGCTATCGTCCGCTTTAATGAGATGGTGGTAGCACCATCTAACGATCAGTTAAATGCGACTGTCCGAGAAGTTGGCATCGAGGGGGCTC
AGTGGAGATTGTCAAAGACAGAGAAGCGCACGTTCCAGGCCGCTTATTTGAAGAGTGAAGCCAATACATGGATGGGCTTCATCAAGCTACGCTTGCGGCCAACAACTCAC
GATTCAACGGTGTCTCGGGACCGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCAATGAGTATCGATGTGGGTAAAATCATTTCGTCTGAAATTCATGACTGCTGGCGGAA
AAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGTAGGGGTGCCAGGGAGTGAGGATGATGTGAGTTTAATGGACAAGGGAATAATCGACA
CACCAAATCTGGCTAGGCTTCAAAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGATGCATTCCAGCAGGATGGAA
TTTGCTGAAAGGCAAGTCCAGACCTTTTGGAATTATGTGAAGAAAAGGGATGCCGCTTTGAGGGTGGCCTTGCAATCAAACTTTTCTAAACCATATCCGGCCTTACCCAT
ATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGATGTAGAAGAAGATCCTGAAACCTTTTGCTTGAGCAATTCTTCTGAATTTT
TGAAGCTTGGACTGGTCACAGCTACGGCAAAGAAGATTTTGGAGGTAGTGTTGACTTATTTTATCCACTTTAAGCTTACAATTATTTTGCTGCAGCAGAGCTTGGTTTTG
CAGAATGCTCAAGTAAAGGTTGAAGGTAGTGTTGGATTATCTGTTGTGATTAAGCTATGCGTTGTCTAA
Protein sequenceShow/hide protein sequence
MHPQLERKNLQSKRIEVENLRLHLVLKMWNMHSICTCKRIRKVAPTEDRSVKAVHRRDFAPDLGPQLGFSTEFLDSWDLHRVIRNSQISAFAGATGGFYLHHVYVDNIDS
VLQPFAVYEVAFRYILYVLTHLEEWKLWKLFCAEYVAGRLEGANSVLEQSWEQKLPLWCMNDPPEVRFELDPEIERTFRIRRREQRRQQNQMANVSRLPQGPEDPVDPQQ
NQLESGQGAGGSDKDSGASVTPKAQKVKTKKNKTPEKKEAKRRRRQQRAEDQEVVQKAAEDVAATLVEEGNLKEPEGQNTELSDLVVADTEEVQEEQTEDVQEKQAEDTQ
EGRTEDVQEAGNEQVEQEQEARVEVIMPEVPRRRRVKRKAGRVKVVRTDTPSPPTIDSERENAEREEREKKEAEDKAIEEEAKKAKEEILPKQTEDRGKGIDEASGEADE
IEEPRLPTGIVNHGWSQFCAKPESVNSNIVREFYANIDDQEGFQAIVRFNEMVVAPSNDQLNATVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLRPTTH
DSTVSRDRVLLAFAILRSMSIDVGKIISSEIHDCWRKKVGKLFFPNTITMLCRRVGVPGSEDDVSLMDKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQMHSSRME
FAERQVQTFWNYVKKRDAALRVALQSNFSKPYPALPIFPDDLLNPWIPPPPVEREGDVEEDPETFCLSNSSEFLKLGLVTATAKKILEVVLTYFIHFKLTIILLQQSLVL
QNAQVKVEGSVGLSVVIKLCVV