CuGenDBv2

Bhi08G001004 (gene) of Wax gourd (B227) v1 genome

Gene ID: Bhi08G001004
Organism: Benincasa hispida cv. B227 (Wax gourd (B227) v1)
Description: Usp domain-containing protein
Genome location: chr8:37475489..37479587
RNA-Seq Expression: Bhi08G001004
Synteny: Bhi08G001004
Gene Ontology terms: NA
InterPro domains: IPR006016 - UspA; IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology
GenBank top hits (e value, % identity, alignment)
XP_004137615.1 uncharacterized protein LOC101206357 [Cucumis sativus] | e value: 4.9e-92 | % identity: 85.51
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MDLRKI VIVEDVEVARTALKW LNNLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+DLC TFPNTKVEI+VTEGDQEGRKI AIV+EIG 
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK
        SVLVVGLH++SFLYKMAM E+DL RIFNCKVLAIKQA+ +AEES KTK+VEVIAA  + STNM+FSQIEIAKLQAPE+  QKIPYRICPDP AIIWRSKK
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK

Query:  SRRRWTL
        S RRWTL
Subjt:  SRRRWTL

XP_008456196.1 PREDICTED: uncharacterized protein LOC103496179 [Cucumis melo] | e value: 1.7e-89 | % identity: 84.62
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MDLRKI VIVEDVEVARTALKW LNNLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+DLC TFPNTKVEIIVTEGDQEGRK AAIV+EIG 
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSA-EESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSK
        SVLVVGLH++SFLYKMAM E+DL RIFNCKVLAIKQA+ +A +ES KTKNVEVIAA  + STNM+FSQIEI KLQAPE   QKIPYRICPDP AIIWRS+
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSA-EESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSK

Query:  KSRRRWTL
        KS RRWTL
Subjt:  KSRRRWTL

XP_022137469.1 uncharacterized protein LOC111008906 [Momordica charantia] | e value: 3.5e-90 | % identity: 84.31
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MD+RKIAV+VEDVE ARTALKW LNNLMRYGDLI LLHVFPSTRSKS +K RH RLKGYQLAL+FKDLC  FPNTKVEI+VTEGD++GRKIAA+++EIG 
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK
        S LVVGLH++SFLYKMAM +DD+AR FNCKVLAIKQA+TS EESHK+KNV+VI AAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICP+PSAIIWRSKK
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK

Query:  SRRR
        SRRR
Subjt:  SRRR

XP_038893893.1 uncharacterized protein LOC120082691 isoform X1 [Benincasa hispida] | e value: 5.6e-88 | % identity: 100
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQ
        SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQ
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQ

XP_038893894.1 uncharacterized protein LOC120082691 isoform X2 [Benincasa hispida] | e value: 5.7e-109 | % identity: 100
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK
        SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK

Query:  SRRRWTL
        SRRRWTL
Subjt:  SRRRWTL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQQ9 Usp domain-containing protein | e value: 2.4e-92 | % identity: 85.51
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MDLRKI VIVEDVEVARTALKW LNNLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+DLC TFPNTKVEI+VTEGDQEGRKI AIV+EIG 
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK
        SVLVVGLH++SFLYKMAM E+DL RIFNCKVLAIKQA+ +AEES KTK+VEVIAA  + STNM+FSQIEIAKLQAPE+  QKIPYRICPDP AIIWRSKK
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK

Query:  SRRRWTL
        S RRWTL
Subjt:  SRRRWTL

A0A1S3C3C8 uncharacterized protein LOC103496179 | e value: 8.4e-90 | % identity: 84.62
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MDLRKI VIVEDVEVARTALKW LNNLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+DLC TFPNTKVEIIVTEGDQEGRK AAIV+EIG 
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSA-EESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSK
        SVLVVGLH++SFLYKMAM E+DL RIFNCKVLAIKQA+ +A +ES KTKNVEVIAA  + STNM+FSQIEI KLQAPE   QKIPYRICPDP AIIWRS+
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSA-EESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSK

Query:  KSRRRWTL
        KS RRWTL
Subjt:  KSRRRWTL

A0A5D3BIR9 UspA | e value: 8.4e-90 | % identity: 84.62
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MDLRKI VIVEDVEVARTALKW LNNLMRYGDLITLLHVFPSTRSKSSSKVR+RRL GYQLALTF+DLC TFPNTKVEIIVTEGDQEGRK AAIV+EIG 
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSA-EESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSK
        SVLVVGLH++SFLYKMAM E+DL RIFNCKVLAIKQA+ +A +ES KTKNVEVIAA  + STNM+FSQIEI KLQAPE   QKIPYRICPDP AIIWRS+
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSA-EESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSK

Query:  KSRRRWTL
        KS RRWTL
Subjt:  KSRRRWTL

A0A6J1C7B3 uncharacterized protein LOC111008906 | e value: 1.7e-90 | % identity: 84.31
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        MD+RKIAV+VEDVE ARTALKW LNNLMRYGDLI LLHVFPSTRSKS +K RH RLKGYQLAL+FKDLC  FPNTKVEI+VTEGD++GRKIAA+++EIG 
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK
        S LVVGLH++SFLYKMAM +DD+AR FNCKVLAIKQA+TS EESHK+KNV+VI AAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICP+PSAIIWRSKK
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKK

Query:  SRRR
        SRRR
Subjt:  SRRR

A0A6J1E7J0 uncharacterized protein LOC111431346 | e value: 5.1e-87 | % identity: 82.24
Query:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTE--GDQEGRKIAAIVKEI
        MDLRKI VIVEDVE ARTALKWTLNNLMRYGDLITLLHVFP+TRSKS+SK+RH RL GYQLAL+FKDLC TFPNTKVEIIVTE  GD+EGRKIAA+V+EI
Subjt:  MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTE--GDQEGRKIAAIVKEI

Query:  GVSVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAA-----MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSA
        G SVLVVGLH+ SFLYKMA+ EDD+AR F CKVLAIK   +S EE  KTKNVEVIAAA       SSTNMDFSQIEIAKLQAPEI PQKIPYRICPDPSA
Subjt:  GVSVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAA-----MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSA

Query:  IIWRSKKSRRRWTL
        IIWRSKKSR RWTL
Subjt:  IIWRSKKSRRRWTL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G16760.1 Protein kinase protein with adenine nucleotide alpha hydrolases-like domain | e value: 6.9e-04 | % identity: 27.62
Query:  IAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFP---NTK----VEIIVTEGDQEGRKIAAIVKEI
        +A+ ++  + ++ A+KWTL NL   G  + L+HV P  +S+SS  +        Q+    KDL ++F    + K    +++++ + D    K+ AIV+ +
Subjt:  IAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFP---NTK----VEIIVTEGDQEGRKIAAIVKEI

Query:  GVSVL
         VS +
Subjt:  GVSVL

AT1G48960.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 5.1e-55 | % identity: 54.07
Query:  DLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVF-PSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV
        D+R+I V+VED + ARTAL+W L+NL+R GD+I LLHV+ P  R K S+  R  R  GY LAL+F+++C +F NT  EIIV EGD +GR IA +VKEIG 
Subjt:  DLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVF-PSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGV

Query:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAE-----ESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEI-LPQKIPYRICPDPSAI
        S+L+VGLH  SFLY+ A+   D+AR FNCKV+AIKQ S         + HKT      A + D  TN DFSQIEI+ LQ PEI  P K+PYR+CP P AI
Subjt:  SVLVVGLHNYSFLYKMAMGEDDLARIFNCKVLAIKQASTSAE-----ESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEI-LPQKIPYRICPDPSAI

Query:  IWRSKKSRR
        +WR++  RR
Subjt:  IWRSKKSRR

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 1.3e-05 | % identity: 25.2
Query:  RKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRR-----------LKGYQLALTFKDLC-ITFPNTKVEIIVTEGDQEGRKI
        R+I V+V+    A+ AL WTL++  +  D I LLH   +  S+S                    +  +     K +C +  P  K E++  +GD++G  I
Subjt:  RKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRR-----------LKGYQLALTFKDLC-ITFPNTKVEIIVTEGDQEGRKI

Query:  AAIVKEIGVSVLVVGLHNYSFLYKMAM
            +E   S+LV+G       +++ M
Subjt:  AAIVKEIGVSVLVVGLHNYSFLYKMAM

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein | e value: 6.3e-05 | % identity: 26.09
Query:  RKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGVSVL
        R+I V+V+    A+ AL WTL++  +  D I LLH   +  S+S       + +G   +             K E++  +GD++G  I    +E   S+L
Subjt:  RKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGVSVL

Query:  VVGLHNYSFLYKMAM
        V+G       +++ M
Subjt:  VVGLHNYSFLYKMAM

AT2G07020.1 Protein kinase protein with adenine nucleotide alpha hydrolases-like domain | e value: 1.8e-04 | % identity: 22.75
Query:  IAVIVEDVEVARTALKWTLNNLMRYGDLITLLHV-FPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGVSVLV
        +A+ ++  + ++ ALKW ++NL+  G+ +TL+HV    T + + ++         +L L F+  C T  +   E +V E       I   V+E  + +LV
Subjt:  IAVIVEDVEVARTALKWTLNNLMRYGDLITLLHV-FPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGVSVLV

Query:  VGLHNYSFLYKMAMGEDDLARIFN----CKVLAIKQASTSAEESHKTKNVEV------IAAAMDSSTNMDFSQIEIAKLQAPEILPQKI
        +G    + L ++   +   A I      C V AI +   S+  S  +    +      + A   ++ N +FS     +LQ+ + +  +I
Subjt:  VGLHNYSFLYKMAMGEDDLARIFN----CKVLAIKQASTSAEESHKTKNVEV------IAAAMDSSTNMDFSQIEIAKLQAPEILPQKI
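
In the alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities for a single block. The function name `midline_identity` is hypothetical, and the value it gives for one block is not the whole-alignment % identity reported for a hit.

```python
def midline_identity(query, midline):
    """Return (identical positions, aligned columns) for one alignment block.

    Assumes BLAST-style midlines: an identical residue is repeated as the
    same letter, '+' is a conservative substitution, ' ' is a mismatch or gap.
    """
    # Pad the midline in case trailing spaces were lost when copying the text.
    midline = midline.ljust(len(query))
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    return identical, len(query)


# Last block of the first GenBank hit on this page (XP_004137615.1).
ident, cols = midline_identity("SRRRWTL", "S RRWTL")
print(f"{ident}/{cols} identical ({100 * ident / cols:.2f}%)")
# prints: 6/7 identical (85.71%)
```

Summing the two counts over every block of a hit, then dividing, would give a per-hit figure comparable to the % identity column.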


Sequences
CDS sequence
ATGGATTTGAGGAAAATCGCGGTGATTGTTGAGGATGTTGAAGTAGCGAGAACGGCGTTGAAATGGACGCTCAATAACCTAATGCGCTATGGCGATTTGATTACTCTTCT
TCATGTATTTCCTTCTACAAGATCCAAAAGTAGTTCCAAAGTTCGTCATCGCCGATTGAAGGGCTATCAATTAGCCCTAACTTTCAAAGACCTCTGCATCACTTTCCCCA
ATACAAAGGTAGAGATTATTGTGACGGAAGGCGATCAAGAAGGTAGAAAGATCGCGGCCATTGTTAAAGAAATTGGAGTTTCCGTGCTTGTAGTTGGCCTCCATAACTAT
AGCTTTCTGTACAAAATGGCTATGGGGGAAGATGATTTAGCAAGGATTTTCAATTGCAAAGTTCTGGCAATCAAACAAGCATCAACCTCAGCAGAAGAATCACATAAAAC
AAAAAATGTGGAAGTTATAGCTGCAGCTATGGACAGTTCAACCAACATGGACTTTTCCCAGATTGAGATTGCCAAATTACAAGCTCCTGAAATTCTTCCGCAGAAAATTC
CATACAGAATCTGCCCCGACCCTTCTGCGATTATTTGGAGATCAAAGAAATCAAGAAGAAGGTGGACTTTGTGA
mRNA sequence
GAAAAAGCTCCAAAAGAAAATAAAAGGTTGCAATAATTTACTTAAGAAGATTAATGCTTGTCACCATCTCAATCTCTTCTCAAATATAAAGTCAGCTTCTGTGGCTTTAA
GAGAGAACGTATCAATTCCAAATTATCCATTAACGCTTAAAACAAGCTTTTCCAGTTTCTTCAATGGCAGACGCCAGAGATTTGATCACAAAATTCCGCCATTAATCTTT
CGAGCGAAGGAGAACAGAGGATTATCATTACATTACAGAGCAACCTGAGGCCCACGAACACAGCAGAGAGAAGAGAAATTCATATTTTAGAGACAGCAAAATCCCAGTCA
AAATAGTACAGAATTTCCCCAACTGTTTCCCTTGCGATCTTTCAATTAATTGCATTGTATAATCGTCTCTCTCTTTCTGTCAATTCTTCCATGGCGAGAATTGATTTACA
GAGATAGATAGGGCTTCATAATCGACGAATCGCGAGTTTCTTTTCTCAATTTTCATCTCAAAGAAAAGCGAAAAGAGTGATGGATTTGAGGAAAATCGCGGTGATTGTTG
AGGATGTTGAAGTAGCGAGAACGGCGTTGAAATGGACGCTCAATAACCTAATGCGCTATGGCGATTTGATTACTCTTCTTCATGTATTTCCTTCTACAAGATCCAAAAGT
AGTTCCAAAGTTCGTCATCGCCGATTGAAGGGCTATCAATTAGCCCTAACTTTCAAAGACCTCTGCATCACTTTCCCCAATACAAAGGTAGAGATTATTGTGACGGAAGG
CGATCAAGAAGGTAGAAAGATCGCGGCCATTGTTAAAGAAATTGGAGTTTCCGTGCTTGTAGTTGGCCTCCATAACTATAGCTTTCTGTACAAAATGGCTATGGGGGAAG
ATGATTTAGCAAGGATTTTCAATTGCAAAGTTCTGGCAATCAAACAAGCATCAACCTCAGCAGAAGAATCACATAAAACAAAAAATGTGGAAGTTATAGCTGCAGCTATG
GACAGTTCAACCAACATGGACTTTTCCCAGATTGAGATTGCCAAATTACAAGCTCCTGAAATTCTTCCGCAGAAAATTCCATACAGAATCTGCCCCGACCCTTCTGCGAT
TATTTGGAGATCAAAGAAATCAAGAAGAAGGTGGACTTTGTGACGACGGGACCATCTTTATTTATCTCAAACTTTATTTCTACTAATGTTTGCCCCTGTTTTTTCTCTTT
CCCACACCTTCTTTTTAGACATTGCCAATAATGGAGACTGTCTGGTTTTTGAGGTTGTCATTGTCGATGACAAACGTGCCCTGTACACCACACACCCAGAGTAGGAAAAA
AAATGCATTAAAATTTGTAATCCATTTGATCTGATTTATACATATAGGTTTTGCAGGTCCCCCCACTACCCTGATCTTGTTTCTTTTCTCATTCGTATCAGTGTTCAATT
TTAGTGTATTGTGTATAGTAATAACGTATGAGTATAGTAAGTTTTAATTTGGTAAATTATATATATGTGTGTTTATATATGGAAGCTTAAAT
Protein sequence
MDLRKIAVIVEDVEVARTALKWTLNNLMRYGDLITLLHVFPSTRSKSSSKVRHRRLKGYQLALTFKDLCITFPNTKVEIIVTEGDQEGRKIAAIVKEIGVSVLVVGLHNY
SFLYKMAMGEDDLARIFNCKVLAIKQASTSAEESHKTKNVEVIAAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPDPSAIIWRSKKSRRRWTL
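
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 21 nucleotides and last 24 nucleotides of the CDS listed on this page.
print(translate("ATGGATTTGAGGAAAATCGCG"))     # prints: MDLRKIA
print(translate("TCAAGAAGAAGGTGGACTTTGTGA"))  # prints: SRRRWTL
```

Passing the full CDS, with its line breaks removed, should reproduce the protein sequence listed above if the table 1 assumption holds.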