CuGenDBv2

MC03g1222 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1222
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Usp domain-containing protein
Genome location: MC03:18276151..18281741
RNA-Seq Expression: MC03g1222
Synteny: MC03g1222
Gene Ontology terms: NA
InterPro domains: IPR006016 - UspA; IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137469.1 uncharacterized protein LOC111008906 [Momordica charantia] (e value: 2.69e-140; % identity: 100)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
        MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
        SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS

Query:  RRRTL
        RRRTL
Subjt:  RRRTL

XP_022923726.1 uncharacterized protein LOC111431346 [Cucurbita moschata] (e value: 2.75e-107; % identity: 79.15)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG--DKDGRKIAAMLREI
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVFP+TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTEG  D++GRKIAA++REI
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG--DKDGRKIAAMLREI

Query:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM------DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA
        GAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EE  K+KNV+VIAA        SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSA
Subjt:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM------DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA

Query:  IIWRSKKSRRR
        IIWRSKKSR R
Subjt:  IIWRSKKSRRR

XP_023000727.1 uncharacterized protein LOC111495088 [Cucurbita maxima] (e value: 1.87e-107; % identity: 79.52)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAML
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVF +TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTEG     D++GRKIAA++
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAML

Query:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI
        REIGAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EES K+KNV+VIAA D  SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSAI
Subjt:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI

Query:  IWRSKKSRRR
        IWRSK+SR R
Subjt:  IWRSKKSRRR

XP_023519721.1 uncharacterized protein LOC111783074 [Cucurbita pepo subsp. pepo] (e value: 3.64e-107; % identity: 79.43)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG---DKDGRKIAAMLRE
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVFP+TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTEG   D++GRKIA ++RE
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG---DKDGRKIAAMLRE

Query:  IGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM---DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAII
        IGAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EE  K+KNV+VIAA     SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSAII
Subjt:  IGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM---DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAII

Query:  WRSKKSRRR
        WRSKKSR R
Subjt:  WRSKKSRRR

XP_038893894.1 uncharacterized protein LOC120082691 isoform X2 [Benincasa hispida] (e value: 1.49e-117; % identity: 84.31)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
        MD+RKIAV+VEDVE ARTALKW LNNLMRYGDLI LLHVFPSTRSKS +K RH RLKGYQLAL+FKDLC  FPNTKVEI+VTEGD++GRKIAA+++EIG 
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA-MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK
        S LVVGLH++SFLYKMAM +DD+AR FNCKVLAIKQA+TS EESHK+KNV+VIAA MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICP+PSAIIWRSKK
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA-MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK

Query:  SRRR
        SRRR
Subjt:  SRRR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQQ9 Usp domain-containing protein (e value: 1.11e-105; % identity: 77.45)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
        MD+RKI V+VEDVE ARTALKWALNNLMRYGDLI LLHVFPSTRSKS +K R+ RL GYQLAL+F+DLC  FPNTKVEIVVTEGD++GRKI A++REIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM-DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK
        S LVVGLH HSFLYKMAM ++D+ R FNCKVLAIKQAT + EES K+K+V+VIAA  + STNM+FSQIEIAKLQAPE+  QKIPYRICP+P AIIWRSKK
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM-DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK

Query:  SRRR
        S RR
Subjt:  SRRR

A0A5D3BIR9 UspA (e value: 1.27e-102; % identity: 76.1)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
        MD+RKI V+VEDVE ARTALKWALNNLMRYGDLI LLHVFPSTRSKS +K R+ RL GYQLAL+F+DLC  FPNTKVEI+VTEGD++GRK AA++REIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQAT-TSIEESHKSKNVQVIAAMDS-STNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSK
        S LVVGLH HSFLYKMAM ++D+ R FNCKVLAIKQAT T+ +ES K+KNV+VIAA  + STNM+FSQIEI KLQAPE   QKIPYRICP+P AIIWRS+
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQAT-TSIEESHKSKNVQVIAAMDS-STNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSK

Query:  KSRRR
        KS RR
Subjt:  KSRRR

A0A6J1C7B3 uncharacterized protein LOC111008906 (e value: 1.30e-140; % identity: 100)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
        MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
        SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS

Query:  RRRTL
        RRRTL
Subjt:  RRRTL

A0A6J1E7J0 uncharacterized protein LOC111431346 (e value: 1.33e-107; % identity: 79.15)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG--DKDGRKIAAMLREI
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVFP+TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTEG  D++GRKIAA++REI
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG--DKDGRKIAAMLREI

Query:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM------DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA
        GAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EE  K+KNV+VIAA        SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSA
Subjt:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAM------DSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA

Query:  IIWRSKKSRRR
        IIWRSKKSR R
Subjt:  IIWRSKKSRRR

A0A6J1KNG3 uncharacterized protein LOC111495088 (e value: 9.06e-108; % identity: 79.52)
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAML
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVF +TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTEG     D++GRKIAA++
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAML

Query:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI
        REIGAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EES K+KNV+VIAA D  SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSAI
Subjt:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI

Query:  IWRSKKSRRR
        IWRSK+SR R
Subjt:  IWRSKKSRRR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 1.1e-04; % identity: 28.85)
Query:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAF-PNTKVEIVVTEGDKDGRKIAAMLREIGASA
        +++ V+V++   ++ A+ WAL +L   GDL+ LLHV       + +           LA S   LC A  P   VE +V +G K    + + ++++  S 
Subjt:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAF-PNTKVEIVVTEGDKDGRKIAAMLREIGASA

Query:  LVVG
        LV+G
Subjt:  LVVG

AT1G48960.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 6.8e-60; % identity: 55.77)
Query:  DVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVF-PSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA
        DVR+I V+VED +AARTAL+WAL+NL+R GD+I+LLHV+ P  R K    AR LR  GY LALSF+++C +F NT  EI+V EGD DGR IA +++EIGA
Subjt:  DVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVF-PSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIE-----ESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEI-LPQKIPYRICPNPSAII
        S L+VGLH +SFLY+ A++  D+ARNFNCKV+AIKQ +  +      + HK+      A  D  TN DFSQIEI+ LQ PEI  P K+PYR+CP+P AI+
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIE-----ESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEI-LPQKIPYRICPNPSAII

Query:  WRSKKSRR
        WR++  RR
Subjt:  WRSKKSRR

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 6.6e-07; % identity: 28.36)
Query:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKA-----------RHLRLKGYQLALSFKDLC-TAFPNTKVEIVVTEGDKDGRKI
        R+I V+V+    A+ AL W L++  +  D I+LLH   +  S+SG  A           +    +  +   + K +C    P  K E+V  +GD+ G  I
Subjt:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKA-----------RHLRLKGYQLALSFKDLC-TAFPNTKVEIVVTEGDKDGRKI

Query:  AAMLREIGASALVVGLHDHSFLYKMAMAQDDIAR
            RE  AS LV+G       +++ M     AR
Subjt:  AAMLREIGASALVVGLHDHSFLYKMAMAQDDIAR

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 2.3e-07; % identity: 30.33)
Query:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGASAL
        R+I V+V+    A+ AL W L++  +  D I+LLH   +  S+SG  A   + +G   +        A    K E+V  +GD+ G  I    RE  AS L
Subjt:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGASAL

Query:  VVGLHDHSFLYKMAMAQDDIAR
        V+G       +++ M     AR
Subjt:  VVGLHDHSFLYKMAMAQDDIAR

AT5G57035.1 U-box domain-containing protein kinase family protein (e value: 1.4e-04; % identity: 26.72)
Query:  AARTALKWALNNLMRYGDLIILLHVFPSTR---SKSGAK--------------ARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLRE
        A+R AL+W + N +   D ++L+HV P+     S SG+K               R LR +  Q+ + FK +C    + KVE ++ E     + +   + +
Subjt:  AARTALKWALNNLMRYGDLIILLHVFPSTR---SKSGAK--------------ARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLRE

Query:  IGASALVVGLHDHSFL
             LV+G    +FL
Subjt:  IGASALVVGLHDHSFL
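
The % identity values listed with each hit can be related to the middle line of an alignment block, where a letter marks an identical residue, "+" a conservative substitution, and a blank a mismatch or gap. The following is a minimal sketch, assuming the usual convention of identical positions divided by aligned columns; the function name block_identity is hypothetical, and the input is only the final block of the Cucurbita moschata alignment, so the result is a per-block figure and not the whole-alignment value reported above.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    return identical, len(query)


# Final block of the XP_022923726.1 alignment (query line and middle line).
query = "IIWRSKKSRRR"
midline = "IIWRSKKSR R"

identical, columns = block_identity(query, midline)
percent = 100.0 * identical / columns
print(f"{identical}/{columns} identical ({percent:.2f}%)")
```

To estimate identity for a whole hit, the counts from every block would be summed before dividing.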


Sequences
CDS sequence
ATGGATGTGAGGAAAATCGCGGTGATGGTGGAGGATGTCGAAGCAGCAAGAACGGCATTGAAATGGGCGCTCAACAATCTCATGCGCTATGGCGATTTGATTATTCTACT
CCATGTATTTCCCTCCACAAGATCCAAAAGCGGCGCCAAGGCTCGCCATCTCCGATTGAAGGGCTATCAATTAGCCCTCTCCTTCAAAGACCTCTGCACCGCCTTCCCCA
ATACAAAGGTAGAGATCGTCGTGACGGAAGGCGATAAAGATGGTAGAAAGATCGCAGCCATGCTCAGAGAGATTGGAGCTTCGGCTCTTGTTGTTGGCCTCCATGACCAT
AGCTTCCTCTACAAGATGGCTATGGCCCAAGATGACATAGCAAGGAATTTCAATTGCAAAGTTCTGGCAATCAAGCAAGCAACGACATCAATAGAAGAGTCACATAAAAG
CAAAAATGTGCAAGTAATAGCAGCTATGGACAGTTCAACCAACATGGACTTTTCTCAGATAGAGATTGCCAAATTACAAGCTCCTGAAATTCTTCCGCAGAAAATTCCAT
ACAGAATCTGCCCGAACCCTTCTGCGATTATATGGAGATCGAAGAAATCAAGAAGAAGGACTTTGTGA
mRNA sequence
CACATAAAAAAGGTATTATAAGAAAAGATTTACATTTAATTGGGAGATCAATGCTTGATGTAAAGTCAGCTTTTATGATTTGAGAGAGAGCGTATCAATTTCAAATTATT
ATCATAAACCTAAAGCTTTGAAACTCTCTTCAATGGCAGGCGCGCGAGAGATTTGACAAAATTCATTCATAAACCTTCGGGAGGGAACAGAGGGTTATCTTTCCAGGGCG
GCCGAGGCCCACGAACACAGCAGAAATTCATATTTTAGACCCACGATCAGACCTCCCAAAAATGCCAAAAGTTGAAGGATCAACGTCGATTTAAAGACAGCAAAATCCCA
GTGGGAAACAGTACATAATTTCTCCGCTGTTCCTTTGGGGTCTGGAAATTAATTGCATTCCACAGTCGTCTCTCTGATTCTGAGTCCTCACCCGCCCCTCTCTCCATTGT
TCTTCCTCTAATTCGAGCCATGGCGACAACTGATTTACAGAAGTAGGGCTTCGATTGTATAGAGAAGGAGCCGGGGGCCATAATCGATGAGTTGTGAGTTTCTTTTCACA
AACTTCAACTCATAGAACAATCCTAGAGAGAGAGAGAGATATAGAGATGGATGTGAGGAAAATCGCGGTGATGGTGGAGGATGTCGAAGCAGCAAGAACGGCATTGAAAT
GGGCGCTCAACAATCTCATGCGCTATGGCGATTTGATTATTCTACTCCATGTATTTCCCTCCACAAGATCCAAAAGCGGCGCCAAGGCTCGCCATCTCCGATTGAAGGGC
TATCAATTAGCCCTCTCCTTCAAAGACCTCTGCACCGCCTTCCCCAATACAAAGGTAGAGATCGTCGTGACGGAAGGCGATAAAGATGGTAGAAAGATCGCAGCCATGCT
CAGAGAGATTGGAGCTTCGGCTCTTGTTGTTGGCCTCCATGACCATAGCTTCCTCTACAAGATGGCTATGGCCCAAGATGACATAGCAAGGAATTTCAATTGCAAAGTTC
TGGCAATCAAGCAAGCAACGACATCAATAGAAGAGTCACATAAAAGCAAAAATGTGCAAGTAATAGCAGCTATGGACAGTTCAACCAACATGGACTTTTCTCAGATAGAG
ATTGCCAAATTACAAGCTCCTGAAATTCTTCCGCAGAAAATTCCATACAGAATCTGCCCGAACCCTTCTGCGATTATATGGAGATCGAAGAAATCAAGAAGAAGGACTTT
GTGACAATTAAGGGACCCTCTTAATTTATCTCAAACTTCACTTCTCTAATATGGCTTGTTCTCTCTCTTTCCACAGCTTCTTAGATATTGAAAATAATGGAGTTTGCTCT
TTTTTTTCTTTTTTTTTTCTCTCTTTTTTTCTTTTTTCAAGGAGGTGTTGAGGTTGTCAACATATGCCCTACACACCAACACCCAACATAGATGAAAAAATGAACTGAAA
AGGTAATCCATTTGGTCTGATTTATATAGGTTTTGCAGGTTTTTTTGTATATATTTTTATAATTAATACCCTAATCTTGTTTCCTTGTATGCTCCGGGAATTTGATTTGA
TAACTTCTTTTATCTTCGTCTCTGTTTGAGAAACTTCTTTTTAGTTTCCTGTTTTGTGTAAGCAATTTTAATTTTAAAAATAAAAAACTAAAAGATTGCAGTGTTGTGTA
CAGACCATCGTGCCCTTTTCTTTTCTTTTCTTTTCTTTTTAACTAACGTTATTTATTAACAGTGATTTATTTTTTAATCTTGTGTTTTTAATTTCATAGCCTAATTTTAG
AAGCGTGCGTGTATTAAAAAACATGTTCTAGAAAAATATTGGTTAGGAATTCAAATATTTTTTTTTAGTAAAGAATTCAAATATGATTTAAAATGGTACAGTATCAAATT
AGCCTTGATCTCAAATAATGAATATAATAAATAATATTCCATATTATAATAGTATTATTAATTTATCATTAATTTTAAAAGTTCAATGATGTAAGTTCTTTAGAAAAATT
GGTTGTTTATATTATCACTTATTAGTATTAGTATCCAAATCTGGTTTGAATTATTTTCCGCTGACCTCATGTCCAATGGAAAGTTTATTTTATCGTGAATTTGAATGGAA
AGTTTATTTTATACCTGACCAATGGCCAGGTTGCCCGATGTTTTGAAAATTGAACTCTTCTCACAAATATAGAAAAATCAAGAGAATTCAATTTACAATTAAAAACATCT
TATATATACATATTTGAATAATTTACTATTTTTATCGATTTAAACATATAACTTCATGATAAGACATCAATTACCTAATATGAAGGTTGGATCCCCCACCCCGGTGAACT
AAAAAAAAATTACTGTTTGTCAACCAATATTAACATTCAAATATTCAAATATAGATTTATTGATATATGTTTTTTTGACTTTGATTCATTTTATAGTCTCTATATTATCA
AAATGGTTATTGTACTTTCAACTTTTGTTTGTTTTAATTACTATGTTTCTAAAATTTCCGTGAAAAAGTGACTAATTTGATCTTTATACATCGAAAAAACCATTAAATAC
AAATCGATGACTATGTTTAAAAAAATCAATATTAAATGTTACACCGAAATTATAATCGAAAGAAATAAAATAAAAGTACAGTTACAAAATAGTATTTAATCATACAATAA
ATTGTACCTATTTTAAAATGAATAATACTTACTCCTCGTTGTTGAGGATGAGTCTTGATCTAGAGAAAGAAATACAAAAATGATTTTTTTTTATTTTTTATTTTTTATGA
AAAGAGTGAATTAATCCTCACGGAGGTAAGTATTTTTCTTTTACAACAAACTTCACCAAACATACTAATTCTTGGGGATTTCATTGTCTGTGATGAGATAATTTTAGTAA
AGCGAATTCCTTTGACAATTGTTACTCTTGAGTAGACAAAACAAAACATAACATATTTTACATGCACAACAAAATTACTCTCACTCTGTACAAAAGAAACAGAAACTTGC
AAAACCAGGGCCAAAATAAAAACGAAGGCATATCCTATAATCCAATCACATCCTCAGGTGACGTACCTGAAATCACCCCCGCGTTACGTTAACCAATTGAAATCTGAAGC
TCCTTTTAGGTCTAATTCAGTATACCTGGCGATGGCCGGTTGCGAGATAGGAAGGCAGAGGACTGGCACGCGGCGACCGTGGAGGAACTTCGTTCAGCTGCTTCCCCTTA
CTATCGTAATAAATTACGCCCGAGAAATTCAACGGCTCACACTTATCCCCCATCAGGATCGCCTTCTGCCAGATTCCTCCGTCTCCCCACTCGTCTTCCGCATCGCATCC
GCCGCGCTTCGTCTTCTTGTTCTTGTTCTTCTGCATCAGCGTGATTGCCGTGTCGCTTATGCTCTTCAGGAACCGCTTCGGCCGGACCTGCGGCGATCGCAATTCTAGAC
GCGGAAATTGCTTCGATTTCAGCTTCGTCTGAAGCTTCTTCGACACTCTGTTCGCTTCCTTCGCACACAGAGCCACCAGAGTCGATAGCGAGATCAGCAAGGTCGCCGCC
GCCCGCTTCGGCGACGCCAGGACGGAGCCGGAGCCAGAGCCAGAGCCGCCGCGTTCCGGTGTGGCGTGGCCCATGATGCAGAAACCGGAAGCGGAAGGTAAACGAAATGC
GAAATCTGTTTGGGAGATTATTCGGATGAAGAAGATTTTGAATTTGGGAATTTTGGTGTGTATTTATATTGGGGAGTTTGTGGTCCTGGTCCTCGTGGGGTCTGCGACTG
CGCTACAGTTATATAGAAAGAAACGACGGCGTACAGCCTCAGTAGTTCACCGATATATGTTTTAAGTTTATTGAAGAAAAATGAGAGTTCGTTTTTGCGTTAAAAATACG
AGTCCACTTATTCATCTAGGGGCCAAATTTGAAGGCGTGACATAGTACGGAATCTATTTTTTATTTGGAAAAGGAATTGCGAATATAAATAATAAGTAAACAATTTTAGT
TTTGCAATCATAAAAAATTCAAATAAAAACTTT
Protein sequence
MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMLREIGASALVVGLHDH
SFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKSRRRTL
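
As a quick consistency check between the CDS and protein sequences above, the following sketch translates the first 30 and last 24 nucleotides of the CDS. It assumes the standard genetic code (this page does not state which table applies), and the names CODON_TABLE and translate are illustrative, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 24 nt of the CDS sequence listed above.
cds_start = "ATGGATGTGAGGAAAATCGCGGTGATGGTG"
cds_end = "AAATCAAGAAGAAGGACTTTGTGA"

print(translate(cds_start))  # MDVRKIAVMV
print(translate(cds_end))    # KSRRRTL*
```

Both fragments agree with the start (MDVRKIAVMV) and end (KSRRRTL) of the protein sequence, with TGA as the stop codon.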