; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G015580 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G015580
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionUsp domain-containing protein
Genome locationCicolChr01:28405636..28409463
RNA-Seq ExpressionCcUC01G015580
SyntenyCcUC01G015580
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137615.1 uncharacterized protein LOC101206357 [Cucumis sativus]5.2e-9485.99Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MDLRKIVVIVEDVE ARTALKWAL+NLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+D+C TFPNTKVEI+VTEGDQEGRKI A+VREIGA
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK
        SVLVVGLH+HSFLYKMAM E+DL RIFNCKVLAIKQAT + EESQKTK+VE+IAA  + STN++FSQIEIAKLQAPE+P QKIPYRICPDP AIIWRSKK
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK

Query:  SRRRWTL
        S RRWTL
Subjt:  SRRRWTL

XP_008456196.1 PREDICTED: uncharacterized protein LOC103496179 [Cucumis melo]6.4e-9285.58Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MDLRKIVVIVEDVE ARTALKWAL+NLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+D+C TFPNTKVEIIVTEGDQEGRK AA+VREIGA
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQAT-TSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSK
        SVLVVGLH+HSFLYKMAM E+DL RIFNCKVLAIKQAT T+ +ESQKTKNVE+IAA  + STN++FSQIEI KLQAPE P QKIPYRICPDP AIIWRS+
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQAT-TSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSK

Query:  KSRRRWTL
        KS RRWTL
Subjt:  KSRRRWTL

XP_022137469.1 uncharacterized protein LOC111008906 [Momordica charantia]5.9e-9084.31Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MD+RKI V+VEDVEAARTALKWAL+NLMRYGDLI LLHVFPSTRSKS +K RH RLKGYQLAL+FKD+C  FPNTKVEI+VTEGD++GRKIAAM+REIGA
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK
        S LVVGLH+HSFLYKMAM +DD+AR FNCKVLAIKQATTS EES K+KNV++I AAMDSSTN+DFSQIEIAKLQAPEI PQKIPYRICP+PSAIIWRSKK
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK

Query:  SRRR
        SRRR
Subjt:  SRRR

XP_023519721.1 uncharacterized protein LOC111783074 [Cucurbita pepo subsp. pepo]2.3e-8983.96Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTE---GDQEGRKIAAMVRE
        MDLRKIVVIVEDVEAARTALKW L+NLMRYGDLITLLHVFP+TRSKS+SK+RH RL GYQLAL+FKD+C TFPNTKVEIIVTE   GD+EGRKIA +VRE
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTE---GDQEGRKIAAMVRE

Query:  IGASVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMD--SSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII
        IGASVLVVGLH+ SFLYKMA+ EDD+AR F CKVLAIK   +STEE QKTKNVE+IAAA D  SSTN+DFSQIEIAKLQAPEIPPQKIPYRICPDPSAII
Subjt:  IGASVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMD--SSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAII

Query:  WRSKKSRRRWTL
        WRSKKSR RWTL
Subjt:  WRSKKSRRRWTL

XP_038893894.1 uncharacterized protein LOC120082691 isoform X2 [Benincasa hispida]8.8e-10292.75Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MDLRKI VIVEDVE ARTALKW L+NLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKD+CITFPNTKVEIIVTEGDQEGRKIAA+V+EIG 
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK
        SVLVVGLHN+SFLYKMAMGEDDLARIFNCKVLAIKQA+TS EES KTKNVE+IAAAMDSSTN+DFSQIEIAKLQAPEI PQKIPYRICPDPSAIIWRSKK
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK

Query:  SRRRWTL
        SRRRWTL
Subjt:  SRRRWTL

TrEMBL top hitse value%identityAlignment
A0A0A0LQQ9 Usp domain-containing protein2.5e-9485.99Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MDLRKIVVIVEDVE ARTALKWAL+NLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+D+C TFPNTKVEI+VTEGDQEGRKI A+VREIGA
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK
        SVLVVGLH+HSFLYKMAM E+DL RIFNCKVLAIKQAT + EESQKTK+VE+IAA  + STN++FSQIEIAKLQAPE+P QKIPYRICPDP AIIWRSKK
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK

Query:  SRRRWTL
        S RRWTL
Subjt:  SRRRWTL

A0A1S3C3C8 uncharacterized protein LOC1034961793.1e-9285.58Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MDLRKIVVIVEDVE ARTALKWAL+NLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+D+C TFPNTKVEIIVTEGDQEGRK AA+VREIGA
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQAT-TSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSK
        SVLVVGLH+HSFLYKMAM E+DL RIFNCKVLAIKQAT T+ +ESQKTKNVE+IAA  + STN++FSQIEI KLQAPE P QKIPYRICPDP AIIWRS+
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQAT-TSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSK

Query:  KSRRRWTL
        KS RRWTL
Subjt:  KSRRRWTL

A0A5D3BIR9 UspA3.1e-9285.58Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MDLRKIVVIVEDVE ARTALKWAL+NLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+D+C TFPNTKVEIIVTEGDQEGRK AA+VREIGA
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQAT-TSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSK
        SVLVVGLH+HSFLYKMAM E+DL RIFNCKVLAIKQAT T+ +ESQKTKNVE+IAA  + STN++FSQIEI KLQAPE P QKIPYRICPDP AIIWRS+
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQAT-TSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSK

Query:  KSRRRWTL
        KS RRWTL
Subjt:  KSRRRWTL

A0A6J1C7B3 uncharacterized protein LOC1110089062.9e-9084.31Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        MD+RKI V+VEDVEAARTALKWAL+NLMRYGDLI LLHVFPSTRSKS +K RH RLKGYQLAL+FKD+C  FPNTKVEI+VTEGD++GRKIAAM+REIGA
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK
        S LVVGLH+HSFLYKMAM +DD+AR FNCKVLAIKQATTS EES K+KNV++I AAMDSSTN+DFSQIEIAKLQAPEI PQKIPYRICP+PSAIIWRSKK
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKK

Query:  SRRR
        SRRR
Subjt:  SRRR

A0A6J1E7J0 uncharacterized protein LOC1114313464.2e-8983.18Show/hide
Query:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTE--GDQEGRKIAAMVREI
        MDLRKIVVIVEDVEAARTALKW L+NLMRYGDLITLLHVFP+TRSKS+SK+RH RL GYQLAL+FKD+C TFPNTKVEIIVTE  GD+EGRKIAA+VREI
Subjt:  MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTE--GDQEGRKIAAMVREI

Query:  GASVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAA-----MDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSA
        GASVLVVGLH+ SFLYKMA+ EDD+AR F CKVLAIK   +STEE QKTKNVE+IAAA       SSTN+DFSQIEIAKLQAPEIPPQKIPYRICPDPSA
Subjt:  GASVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAA-----MDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSA

Query:  IIWRSKKSRRRWTL
        IIWRSKKSR RWTL
Subjt:  IIWRSKKSRRRWTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.7e-0528.85Show/hide
Query:  RKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDIC-ITFPNTKVEIIVTEGDQEGRKIAAMVREIGASV
        ++++V+V++   ++ A+ WAL++L   GDL+TLLHV       + S           LA +   +C    P   VE +V +G +    + + V+++  SV
Subjt:  RKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDIC-ITFPNTKVEIIVTEGDQEGRKIAAMVREIGASV

Query:  LVVG
        LV+G
Subjt:  LVVG

AT1G48960.1 Adenine nucleotide alpha hydrolases-like superfamily protein7.1e-5755.77Show/hide
Query:  DLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVF-PSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA
        D+R+IVV+VED +AARTAL+WAL NL+R GD+I LLHV+ P  R K S+  R  R  GY LAL+F++IC +F NT  EIIV EGD +GR IA +V+EIGA
Subjt:  DLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVF-PSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGA

Query:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTK----NVELIAAAMDSSTNLDFSQIEIAKLQAPEIP-PQKIPYRICPDPSAII
        S+L+VGLH +SFLY+ A+   D+AR FNCKV+AIKQ +       K K    +     A  D  TN DFSQIEI+ LQ PEIP P K+PYR+CP P AI+
Subjt:  SVLVVGLHNHSFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTK----NVELIAAAMDSSTNLDFSQIEIAKLQAPEIP-PQKIPYRICPDPSAII

Query:  WRSKKSRR
        WR++  RR
Subjt:  WRSKKSRR

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.1e-0726.77Show/hide
Query:  RKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRR-----------LKGYQLALTFKDIC-ITFPNTKVEIIVTEGDQEGRKI
        R+I+V+V+    A+ AL W LS+  +  D I LLH   +  S+S                    +  +     K +C +  P  K E++  +GD++G  I
Subjt:  RKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRR-----------LKGYQLALTFKDIC-ITFPNTKVEIIVTEGDQEGRKI

Query:  AAMVREIGASVLVVGLHNHSFLYKMAM
            RE  AS+LV+G       +++ M
Subjt:  AAMVREIGASVLVVGLHNHSFLYKMAM

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein2.0e-0627.83Show/hide
Query:  RKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGASVL
        R+I+V+V+    A+ AL W LS+  +  D I LLH   +  S+S       + +G   +             K E++  +GD++G  I    RE  AS+L
Subjt:  RKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGASVL

Query:  VVGLHNHSFLYKMAM
        V+G       +++ M
Subjt:  VVGLHNHSFLYKMAM

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.6e-0833.01Show/hide
Query:  VVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKV-RHRRLKGYQLALTFKDIC-ITFPNTKVEIIVTE-GDQEGRKIAAMVREIGASVL
        +V+V+     + AL+WAL++ ++  D ITLLHV  +   ++  +  R R  + ++L    K+ C +  PN K EI+V E  +++G+ I    ++ GA VL
Subjt:  VVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKV-RHRRLKGYQLALTFKDIC-ITFPNTKVEIIVTE-GDQEGRKIAAMVREIGASVL

Query:  VVG
        V+G
Subjt:  VVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGAGGAAAATCGTTGTGATTGTTGAGGACGTTGAAGCAGCAAGAACAGCATTGAAATGGGCGCTCAGTAACCTAATGCGCTATGGCGATTTGATTACTCTTCT
CCATGTATTTCCTTCTACAAGATCCAAAAGTAGTTCCAAAGTTCGTCATCGCCGATTGAAGGGCTATCAATTAGCCCTTACCTTCAAAGACATCTGTATCACTTTCCCCA
ATACAAAGGTAGAGATTATTGTGACGGAAGGCGATCAAGAAGGTAGAAAGATCGCGGCCATGGTTAGAGAGATTGGCGCTTCTGTGCTTGTAGTCGGCCTCCATAACCAT
AGCTTTCTATACAAGATGGCTATGGGGGAAGATGATTTAGCAAGGATTTTCAATTGCAAAGTTCTAGCAATCAAGCAAGCAACGACCTCAACAGAAGAGTCACAGAAAAC
CAAAAATGTTGAACTTATAGCTGCAGCTATGGACAGTTCAACCAACTTGGACTTTTCCCAGATCGAGATTGCCAAATTACAAGCTCCTGAAATTCCTCCACAGAAAATTC
CATACAGAATCTGCCCCGACCCATCTGCGATTATTTGGAGATCGAAGAAATCAAGAAGAAGATGGACATTGTGA
mRNA sequenceShow/hide mRNA sequence
TCACTCTCTCAATCTCTTCTCAAATATAAAGTCAGCTTCCGTGGCTTTAAGAGAGAGCTTATCAATTACAAATTATCCATTAATGCCTAAAACAAGCTCTTACAGTCTCT
TCAATGGCAGACGCGCAAGAGATTTGATCACAAAACTCCGCCATTAAACTTTGGAGCGAAGAAGAACAGAGGATTATCATTACAGAGCAGCCCGAGGCCCACGTTCGCAG
CAGAATCCCAGTCAAAACAAGATAGGGCTTCATATAATCGACGAATCGCGAGTTTCTTTTCGCGAATTTCCATCTCATAGAAAGGCCAAAAGAGAGATGGATTTGAGGAA
AATCGTTGTGATTGTTGAGGACGTTGAAGCAGCAAGAACAGCATTGAAATGGGCGCTCAGTAACCTAATGCGCTATGGCGATTTGATTACTCTTCTCCATGTATTTCCTT
CTACAAGATCCAAAAGTAGTTCCAAAGTTCGTCATCGCCGATTGAAGGGCTATCAATTAGCCCTTACCTTCAAAGACATCTGTATCACTTTCCCCAATACAAAGGTAGAG
ATTATTGTGACGGAAGGCGATCAAGAAGGTAGAAAGATCGCGGCCATGGTTAGAGAGATTGGCGCTTCTGTGCTTGTAGTCGGCCTCCATAACCATAGCTTTCTATACAA
GATGGCTATGGGGGAAGATGATTTAGCAAGGATTTTCAATTGCAAAGTTCTAGCAATCAAGCAAGCAACGACCTCAACAGAAGAGTCACAGAAAACCAAAAATGTTGAAC
TTATAGCTGCAGCTATGGACAGTTCAACCAACTTGGACTTTTCCCAGATCGAGATTGCCAAATTACAAGCTCCTGAAATTCCTCCACAGAAAATTCCATACAGAATCTGC
CCCGACCCATCTGCGATTATTTGGAGATCGAAGAAATCAAGAAGAAGATGGACATTGTGACAACCAGACCCTCTGAATTTATCTCAAACATCGTTATTCCACTAATGGCC
TGTTCTTTCTCTTTCCCACACCTTCTTTTTAGACATTGCCAATAATGGAGTCTTTCTTTTTTTCAAGGAGGTTTTTGAGGTTGTCATTGTCGATAACAACATATGCCCTG
TACACCACACACCCAAAATAGATGAAAAAAGATGCATTCAAATTGTAATCTATTTTATCTGATTTATATATATATGTAGGTTTTGC
Protein sequenceShow/hide protein sequence
MDLRKIVVIVEDVEAARTALKWALSNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDICITFPNTKVEIIVTEGDQEGRKIAAMVREIGASVLVVGLHNH
SFLYKMAMGEDDLARIFNCKVLAIKQATTSTEESQKTKNVELIAAAMDSSTNLDFSQIEIAKLQAPEIPPQKIPYRICPDPSAIIWRSKKSRRRWTL