CuGenDBv2

CmoCh13G010840 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh13G010840
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Usp domain-containing protein
Genome location: Cmo_Chr13:8971326..8973711
RNA-Seq Expression: CmoCh13G010840
Synteny: CmoCh13G010840
Gene Ontology terms: NA
InterPro domains: IPR006016 - UspA; IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
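
The genome location given above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the length of the gene span could be computed as follows. The function name `parse_location` is hypothetical and not part of any database tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Cmo_Chr13:8971326..8973711")
# Assumption: 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(chrom, start, end, span)  # Cmo_Chr13 8971326 8973711 2386
```

If the coordinates turned out to be 0-based or half-open, the `+ 1` would need to be dropped.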


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584375.1 hypothetical protein SDJN03_20307, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.9e-107 | % identity: 98.58
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE-GGGDEEGRKIAAVVRE
        MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPT RSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE GGGDEEGRKIAAVVRE
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE-GGGDEEGRKIAAVVRE

Query:  IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII
        IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVI AAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII
Subjt:  IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII

Query:  WRSKKSRTRWTL
        WRSKKSRTRWTL
Subjt:  WRSKKSRTRWTL

KAG7019961.1 hypothetical protein SDJN02_18928 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.7e-108 | % identity: 99.06
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE-GGGDEEGRKIAAVVRE
        MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE GGGDEEGRKIAAVVRE
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE-GGGDEEGRKIAAVVRE

Query:  IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVI-AAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAI
        IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVI AAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAI
Subjt:  IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVI-AAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAI

Query:  IWRSKKSRTRWTL
        IWRSKKSRTRWTL
Subjt:  IWRSKKSRTRWTL

XP_022923726.1 uncharacterized protein LOC111431346 [Cucurbita moschata] | e value: 2.8e-111 | % identity: 100
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
        MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI

Query:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIW
        GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIW
Subjt:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIW

Query:  RSKKSRTRWTL
        RSKKSRTRWTL
Subjt:  RSKKSRTRWTL

XP_023000727.1 uncharacterized protein LOC111495088 [Cucurbita maxima] | e value: 5.6e-104 | % identity: 95.33
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGG---DEEGRKIAAVV
        MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVF TTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGG   DEEGRKIAAVV
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGG---DEEGRKIAAVV

Query:  REIGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA
        REIGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEE QKTKNVEVIAAA    DTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA
Subjt:  REIGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA

Query:  IIWRSKKSRTRWTL
        IIWRSK+SRTRWTL
Subjt:  IIWRSKKSRTRWTL

XP_023519721.1 uncharacterized protein LOC111783074 [Cucurbita pepo subsp. pepo] | e value: 7.9e-106 | % identity: 97.17
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE-GGGDEEGRKIAAVVRE
        MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE GGGDEEGRKIA VVRE
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTE-GGGDEEGRKIAAVVRE

Query:  IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII
        IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAA   DT SSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII
Subjt:  IGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII

Query:  WRSKKSRTRWTL
        WRSKKSRTRWTL
Subjt:  WRSKKSRTRWTL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQQ9 Usp domain-containing protein | e value: 1.1e-84 | % identity: 78.97
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
        MDLRKIVVIVEDVE ARTALKW LNNLMRYGDLITLLHVFP+TRSKS+SK+R+ RLNGYQLAL+F+DLC TFPNTKVEI+VTE  GD+EGRKI A+VREI
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI

Query:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSST---EEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA
        GASVLVVGLH  SFLYKMA+EE+D+ R F CKVLAIK +T   EE QKTK+VEVIAA      T  STNM+FSQIEIAKLQAPE+P QKIPYRICPDP A
Subjt:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSST---EEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA

Query:  IIWRSKKSRTRWTL
        IIWRSKKS  RWTL
Subjt:  IIWRSKKSRTRWTL

A0A1S3C3C8 uncharacterized protein LOC103496179 | e value: 2.0e-83 | % identity: 78.14
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
        MDLRKIVVIVEDVE ARTALKW LNNLMRYGDLITLLHVFP+TRSKS+SK+R+ RLNGYQLAL+F+DLC TFPNTKVEIIVTE  GD+EGRK AA+VREI
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI

Query:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSST----EEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPS
        GASVLVVGLH  SFLYKMA+EE+D+ R F CKVLAIK +T    +E QKTKNVEVIAA      T  STNM+FSQIEI KLQAPE P QKIPYRICPDP 
Subjt:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSST----EEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPS

Query:  AIIWRSKKSRTRWTL
        AIIWRS+KS  RWTL
Subjt:  AIIWRSKKSRTRWTL

A0A5D3BIR9 UspA | e value: 2.0e-83 | % identity: 78.14
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
        MDLRKIVVIVEDVE ARTALKW LNNLMRYGDLITLLHVFP+TRSKS+SK+R+ RLNGYQLAL+F+DLC TFPNTKVEIIVTE  GD+EGRK AA+VREI
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI

Query:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSST----EEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPS
        GASVLVVGLH  SFLYKMA+EE+D+ R F CKVLAIK +T    +E QKTKNVEVIAA      T  STNM+FSQIEI KLQAPE P QKIPYRICPDP 
Subjt:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSST----EEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPS

Query:  AIIWRSKKSRTRWTL
        AIIWRS+KS  RWTL
Subjt:  AIIWRSKKSRTRWTL

A0A6J1E7J0 uncharacterized protein LOC111431346 | e value: 1.4e-111 | % identity: 100
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
        MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI

Query:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIW
        GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIW
Subjt:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIW

Query:  RSKKSRTRWTL
        RSKKSRTRWTL
Subjt:  RSKKSRTRWTL

A0A6J1KNG3 uncharacterized protein LOC111495088 | e value: 2.7e-104 | % identity: 95.33
Query:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGG---DEEGRKIAAVV
        MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVF TTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGG   DEEGRKIAAVV
Subjt:  MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGG---DEEGRKIAAVV

Query:  REIGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA
        REIGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEE QKTKNVEVIAAA    DTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA
Subjt:  REIGASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSA

Query:  IIWRSKKSRTRWTL
        IIWRSK+SRTRWTL
Subjt:  IIWRSKKSRTRWTL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G48960.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 1.6e-56 | % identity: 55.71
Query:  DLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVF-PTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI
        D+R+IVV+VED +AARTAL+W L+NL+R GD+I LLHV+ P  R K ++  R LR +GY LALSF+++C +F NT  EIIV E  GD++GR IA VV+EI
Subjt:  DLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVF-PTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREI

Query:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEE---PQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIP-PQKIPYRICPDPS
        GAS+L+VGLH  SFLY+ A+   D+ARNF CKV+AIK  + E   P K K  +   A A A  +   TN DFSQIEI+ LQ PEIP P K+PYR+CP P 
Subjt:  GASVLVVGLHDRSFLYKMAVEEDDIARNFKCKVLAIKSSTEE---PQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIP-PQKIPYRICPDPS

Query:  AIIWRSKKSR
        AI+WR++  R
Subjt:  AIIWRSKKSR

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 6.4e-05 | % identity: 29.91
Query:  RKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSA-----------SKLRHLRLNGYQLALSFKDLC-TTFPNTKVEIIVTEGGGDEEGR
        R+I+V+V+    A+ AL WTL++  +  D I LLH      S+S            S  +       +   + K +C    P  K E++  +  GDE+G 
Subjt:  RKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSA-----------SKLRHLRLNGYQLALSFKDLC-TTFPNTKVEIIVTEGGGDEEGR

Query:  KIAAVVREIGASVLVVG
         I    RE  AS+LV+G
Subjt:  KIAAVVREIGASVLVVG

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 4.9e-05 | % identity: 30.48
Query:  RKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREIGAS
        R+I+V+V+    A+ AL WTL++  +  D I LLH      S+S       +  G   +             K E++  +  GDE+G  I    RE  AS
Subjt:  RKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREIGAS

Query:  VLVVG
        +LV+G
Subjt:  VLVVG

AT3G62550.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 3.8e-05 | % identity: 25.81
Query:  RKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVF-----PTTRSKSAS-------KLRHLRLNGYQLALSFKDLC-TTFPNTKVEIIVTEGGGDEEG
        RKIVV V++ E +  AL W+L+NL  YG   TL+ ++     P   S  A+        +  L+   Y+L  S      T + + + +I +    G  + 
Subjt:  RKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVF-----PTTRSKSAS-------KLRHLRLNGYQLALSFKDLC-TTFPNTKVEIIVTEGGGDEEG

Query:  RK-IAAVVREIGASVLVVGLHDRSFLYK--MAVEEDDIARNFKCKVLAIKSSTEE
        ++ I   V+++   +LV+G HD  F  +  +    +  A+  KC V+ +K   ++
Subjt:  RK-IAAVVREIGASVLVVGLHDRSFLYK--MAVEEDDIARNFKCKVLAIKSSTEE
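
The % identity values reported for the hits above summarize each whole alignment. As a rough sketch of how such a figure relates to the alignment rows, the snippet below counts identical positions in one short segment only (the final `AIIWRSKKSR` row of the AT1G48960.1 alignment). It assumes the BLAST-style convention that a letter in the middle row marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap; the function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of columns in one alignment row whose midline is a letter."""
    if not query_row:
        raise ValueError("empty alignment row")
    # Trailing spaces in the midline may have been stripped; pad to full width.
    padded = midline.ljust(len(query_row))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query_row)


query_row = "AIIWRSKKSR"
midline = "AI+WR++  R"
print(percent_identity(query_row, midline))  # 50.0
```

This single-row figure is not expected to match the reported 55.71, which covers the full alignment.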


Sequences
CDS sequence
ATGGATTTGAGGAAAATCGTGGTGATTGTTGAGGATGTTGAAGCAGCTAGAACGGCATTGAAGTGGACGCTCAATAACCTAATGCGCTACGGCGATTTGATTACTCTTCT
CCATGTATTTCCGACTACAAGATCCAAAAGCGCCTCCAAACTTCGTCATCTCCGATTGAACGGCTATCAATTAGCCCTATCATTCAAAGACCTCTGTACCACTTTCCCCA
ATACAAAGGTAGAGATTATTGTGACGGAAGGCGGCGGCGACGAAGAAGGTAGAAAGATCGCGGCCGTTGTTAGAGAGATTGGAGCTTCTGTGCTTGTGGTTGGCCTCCAT
GATCGCAGCTTTCTGTACAAGATGGCTGTGGAGGAAGATGATATAGCAAGGAACTTCAAGTGTAAAGTTCTGGCAATCAAGAGCTCAACAGAAGAACCACAGAAAACCAA
AAACGTGGAGGTTATAGCAGCAGCAGCAGCAGCAGTGGACACGGGCAGTTCAACAAACATGGACTTTTCCCAGATCGAGATTGCCAAATTACAAGCTCCTGAAATTCCTC
CGCAGAAAATTCCATACAGAATCTGCCCCGACCCTTCTGCCATTATTTGGAGATCCAAGAAATCAAGAACAAGGTGGACCTTGTGA
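
The CDS above and the protein sequence on this page can be cross-checked by translation. The following is a minimal sketch, assuming the standard genetic code; it translates only the first ten codons and the last four codons of the CDS, copied from the sequence above, rather than the full record. The helper name `translate` is hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGATTTGAGGAAAATCGTGGTGATTGTT"))  # MDLRKIVVIV
print(translate("TGGACCTTGTGA"))  # WTL
```

Both results agree with the start (`MDLRKIVVIV`) and end (`WTL`) of the listed protein sequence; the final `TGA` codon is the stop.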
mRNA sequence
TTTAGCGACAGCAAAATCCCAGTCCAAACGCTACAGAATTTTCCCATCTATTTCCTTTCGGGTCTCTCAATTAATTGCCTCCTCTCTCTCTCTCTCTGTCAATTCAGCCA
TGGCGAGATTTGATTTACAGAGATAGGGCTTCATAATATACGAATCGCTCGTTTCGTTTCTCAAATTTGAACTCATCAAAACGCGAAAAGAGCGATGGATTTGAGGAAAA
TCGTGGTGATTGTTGAGGATGTTGAAGCAGCTAGAACGGCATTGAAGTGGACGCTCAATAACCTAATGCGCTACGGCGATTTGATTACTCTTCTCCATGTATTTCCGACT
ACAAGATCCAAAAGCGCCTCCAAACTTCGTCATCTCCGATTGAACGGCTATCAATTAGCCCTATCATTCAAAGACCTCTGTACCACTTTCCCCAATACAAAGGTAGAGAT
TATTGTGACGGAAGGCGGCGGCGACGAAGAAGGTAGAAAGATCGCGGCCGTTGTTAGAGAGATTGGAGCTTCTGTGCTTGTGGTTGGCCTCCATGATCGCAGCTTTCTGT
ACAAGATGGCTGTGGAGGAAGATGATATAGCAAGGAACTTCAAGTGTAAAGTTCTGGCAATCAAGAGCTCAACAGAAGAACCACAGAAAACCAAAAACGTGGAGGTTATA
GCAGCAGCAGCAGCAGCAGTGGACACGGGCAGTTCAACAAACATGGACTTTTCCCAGATCGAGATTGCCAAATTACAAGCTCCTGAAATTCCTCCGCAGAAAATTCCATA
CAGAATCTGCCCCGACCCTTCTGCCATTATTTGGAGATCCAAGAAATCAAGAACAAGGTGGACCTTGTGACATCGGCCCTCTCTCTTTATTTATCTTCTCTCTCGCTTTA
CACACCTTCTTAGACATCCCCAATAATGGAGTTTTTTTTTTCTTTTCTTTTTTCATGGAGGTTTTCAGGTTGTCGTTGTTAATATCATAAGCCCTGCTCACCACACACCC
GTAATACATTG
Protein sequence
MDLRKIVVIVEDVEAARTALKWTLNNLMRYGDLITLLHVFPTTRSKSASKLRHLRLNGYQLALSFKDLCTTFPNTKVEIIVTEGGGDEEGRKIAAVVREIGASVLVVGLH
DRSFLYKMAVEEDDIARNFKCKVLAIKSSTEEPQKTKNVEVIAAAAAAVDTGSSTNMDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKKSRTRWTL