; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004035 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004035
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUsp domain-containing protein
Genome locationscaffold1597:648942..651167
RNA-Seq ExpressionMS004035
SyntenyMS004035
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137469.1 uncharacterized protein LOC111008906 [Momordica charantia]3.5e-10699.51Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA
        MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAM+REIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
        SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS

Query:  RRR
        RRR
Subjt:  RRR

XP_022923726.1 uncharacterized protein LOC111431346 [Cucurbita moschata]2.6e-8579.91Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTE--GDKDGRKIAAMVREI
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVFP+TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTE  GD++GRKIAA+VREI
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTE--GDKDGRKIAAMVREI

Query:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA------MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA
        GAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EE  K+KNV+VIAA        SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSA
Subjt:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA------MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA

Query:  IIWRSKKSRRRWTL
        IIWRSKKSR RWTL
Subjt:  IIWRSKKSRRRWTL

XP_023000727.1 uncharacterized protein LOC111495088 [Cucurbita maxima]2.0e-8580.28Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAMV
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVF +TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTEG     D++GRKIAA+V
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAMV

Query:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI
        REIGAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EES K+KNV+VIAA D  SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSAI
Subjt:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI

Query:  IWRSKKSRRRWTL
        IWRSK+SR RWTL
Subjt:  IWRSKKSRRRWTL

XP_023519721.1 uncharacterized protein LOC111783074 [Cucurbita pepo subsp. pepo]3.4e-8580.19Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTE---GDKDGRKIAAMVRE
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVFP+TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTE   GD++GRKIA +VRE
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTE---GDKDGRKIAAMVRE

Query:  IGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA---MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAII
        IGAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EE  K+KNV+VIAA     SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSAII
Subjt:  IGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA---MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAII

Query:  WRSKKSRRRWTL
        WRSKKSR RWTL
Subjt:  WRSKKSRRRWTL

XP_038893894.1 uncharacterized protein LOC120082691 isoform X2 [Benincasa hispida]4.4e-9385.02Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA
        MD+RKIAV+VEDVE ARTALKW LNNLMRYGDLI LLHVFPSTRSKS +K RH RLKGYQLAL+FKDLC  FPNTKVEI+VTEGD++GRKIAA+V+EIG 
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVI-AAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK
        S LVVGLH++SFLYKMAM +DD+AR FNCKVLAIKQA+TS EESHK+KNV+VI AAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICP+PSAIIWRSKK
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVI-AAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK

Query:  SRRRWTL
        SRRRWTL
Subjt:  SRRRWTL

TrEMBL top hitse value%identityAlignment
A0A0A0LQQ9 Usp domain-containing protein4.0e-8478.26Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA
        MD+RKI V+VEDVE ARTALKWALNNLMRYGDLI LLHVFPSTRSKS +K R+ RL GYQLAL+F+DLC  FPNTKVEIVVTEGD++GRKI A+VREIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA-MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK
        S LVVGLH HSFLYKMAM ++D+ R FNCKVLAIKQAT + EES K+K+V+VIAA  + STNM+FSQIEIAKLQAPE+  QKIPYRICP+P AIIWRSKK
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA-MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKK

Query:  SRRRWTL
        S RRWTL
Subjt:  SRRRWTL

A0A5D3BIR9 UspA8.4e-8276.92Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA
        MD+RKI V+VEDVE ARTALKWALNNLMRYGDLI LLHVFPSTRSKS +K R+ RL GYQLAL+F+DLC  FPNTKVEI+VTEGD++GRK AA+VREIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQAT-TSIEESHKSKNVQVIAAMDS-STNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSK
        S LVVGLH HSFLYKMAM ++D+ R FNCKVLAIKQAT T+ +ES K+KNV+VIAA  + STNM+FSQIEI KLQAPE   QKIPYRICP+P AIIWRS+
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQAT-TSIEESHKSKNVQVIAAMDS-STNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSK

Query:  KSRRRWTL
        KS RRWTL
Subjt:  KSRRRWTL

A0A6J1C7B3 uncharacterized protein LOC1110089061.7e-10699.51Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA
        MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAM+REIGA
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
        SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKS

Query:  RRR
        RRR
Subjt:  RRR

A0A6J1E7J0 uncharacterized protein LOC1114313461.2e-8579.91Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTE--GDKDGRKIAAMVREI
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVFP+TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTE  GD++GRKIAA+VREI
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTE--GDKDGRKIAAMVREI

Query:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA------MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA
        GAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EE  K+KNV+VIAA        SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSA
Subjt:  GASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAA------MDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSA

Query:  IIWRSKKSRRRWTL
        IIWRSKKSR RWTL
Subjt:  IIWRSKKSRRRWTL

A0A6J1KNG3 uncharacterized protein LOC1114950889.5e-8680.28Show/hide
Query:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAMV
        MD+RKI V+VEDVEAARTALKW LNNLMRYGDLI LLHVF +TRSKS +K RHLRL GYQLALSFKDLCT FPNTKVEI+VTEG     D++GRKIAA+V
Subjt:  MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEG-----DKDGRKIAAMV

Query:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI
        REIGAS LVVGLHD SFLYKMA+ +DDIARNF CKVLAIK +T   EES K+KNV+VIAA D  SSTNMDFSQIEIAKLQAPEI PQKIPYRICP+PSAI
Subjt:  REIGASALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMD--SSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAI

Query:  IWRSKKSRRRWTL
        IWRSK+SR RWTL
Subjt:  IWRSKKSRRRWTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.3e-0529.81Show/hide
Query:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAF-PNTKVEIVVTEGDKDGRKIAAMVREIGASA
        +++ V+V++   ++ A+ WAL +L   GDL+ LLHV       + +           LA S   LC A  P   VE +V +G K    + + V+++  S 
Subjt:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAF-PNTKVEIVVTEGDKDGRKIAAMVREIGASA

Query:  LVVG
        LV+G
Subjt:  LVVG

AT1G48960.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.1e-6056.25Show/hide
Query:  DVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVF-PSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA
        DVR+I V+VED +AARTAL+WAL+NL+R GD+I+LLHV+ P  R K    AR LR  GY LALSF+++C +F NT  EI+V EGD DGR IA +V+EIGA
Subjt:  DVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVF-PSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGA

Query:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIE-----ESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEI-LPQKIPYRICPNPSAII
        S L+VGLH +SFLY+ A++  D+ARNFNCKV+AIKQ +  +      + HK+      A  D  TN DFSQIEI+ LQ PEI  P K+PYR+CP+P AI+
Subjt:  SALVVGLHDHSFLYKMAMAQDDIARNFNCKVLAIKQATTSIE-----ESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEI-LPQKIPYRICPNPSAII

Query:  WRSKKSRR
        WR++  RR
Subjt:  WRSKKSRR

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.7e-0728.36Show/hide
Query:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKA-----------RHLRLKGYQLALSFKDLC-TAFPNTKVEIVVTEGDKDGRKI
        R+I V+V+    A+ AL W L++  +  D I+LLH   +  S+SG  A           +    +  +   + K +C    P  K E+V  +GD+ G  I
Subjt:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKA-----------RHLRLKGYQLALSFKDLC-TAFPNTKVEIVVTEGDKDGRKI

Query:  AAMVREIGASALVVGLHDHSFLYKMAMAQDDIAR
            RE  AS LV+G       +++ M     AR
Subjt:  AAMVREIGASALVVGLHDHSFLYKMAMAQDDIAR

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein2.3e-0730.33Show/hide
Query:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGASAL
        R+I V+V+    A+ AL W L++  +  D I+LLH   +  S+SG  A   + +G   +        A    K E+V  +GD+ G  I    RE  AS L
Subjt:  RKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGASAL

Query:  VVGLHDHSFLYKMAMAQDDIAR
        V+G       +++ M     AR
Subjt:  VVGLHDHSFLYKMAMAQDDIAR

AT2G07020.1 Protein kinase protein with adenine nucleotide alpha hydrolases-like domain1.1e-0425.79Show/hide
Query:  IAVMVEDVEAARTALKWALNNLMRYGDLIILLHV-FPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGASALV
        +A+ ++  + ++ ALKWA++NL+  G+ + L+HV    T + +G +         +L L F+  CT   +   E VV E       I   V+E     LV
Subjt:  IAVMVEDVEAARTALKWALNNLMRYGDLIILLHV-FPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGASALV

Query:  VGLHDHSFLYKM-----AMAQDDIARNFNCKVLAIKQATTSIEESHKSK-------NVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKI
        +G    + L ++       A    A NF C V AI +   S   S  S          Q+ A   ++ N +FS     +LQ+ + +  +I
Subjt:  VGLHDHSFLYKM-----AMAQDDIARNFNCKVLAIKQATTSIEESHKSK-------NVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTGAGGAAAATCGCGGTGATGGTGGAGGATGTCGAAGCAGCAAGAACGGCATTGAAATGGGCGCTCAACAATCTCATGCGCTATGGCGATTTGATTATTCTACT
CCATGTATTTCCCTCCACAAGATCCAAAAGCGGCGCCAAGGCTCGCCATCTCCGATTGAAGGGCTATCAATTAGCCCTCTCCTTCAAAGACCTCTGCACCGCCTTCCCCA
ATACAAAGGTAGAGATCGTCGTGACGGAAGGCGATAAAGATGGTAGAAAGATCGCAGCCATGGTCAGAGAGATTGGAGCTTCGGCTCTTGTTGTTGGCCTCCATGACCAT
AGCTTCCTCTACAAGATGGCTATGGCCCAAGATGACATAGCAAGGAATTTCAATTGCAAAGTTCTGGCAATCAAGCAAGCAACGACATCAATAGAAGAGTCACATAAAAG
CAAAAATGTGCAAGTAATAGCAGCTATGGACAGTTCAACCAACATGGACTTTTCTCAGATAGAGATTGCCAAATTACAAGCTCCTGAAATTCTTCCGCAGAAAATTCCAT
ACAGAATCTGCCCGAACCCTTCTGCGATTATATGGAGATCGAAGAAATCAAGAAGAAGGTGGACTTTG
mRNA sequenceShow/hide mRNA sequence
ATGGATGTGAGGAAAATCGCGGTGATGGTGGAGGATGTCGAAGCAGCAAGAACGGCATTGAAATGGGCGCTCAACAATCTCATGCGCTATGGCGATTTGATTATTCTACT
CCATGTATTTCCCTCCACAAGATCCAAAAGCGGCGCCAAGGCTCGCCATCTCCGATTGAAGGGCTATCAATTAGCCCTCTCCTTCAAAGACCTCTGCACCGCCTTCCCCA
ATACAAAGGTAGAGATCGTCGTGACGGAAGGCGATAAAGATGGTAGAAAGATCGCAGCCATGGTCAGAGAGATTGGAGCTTCGGCTCTTGTTGTTGGCCTCCATGACCAT
AGCTTCCTCTACAAGATGGCTATGGCCCAAGATGACATAGCAAGGAATTTCAATTGCAAAGTTCTGGCAATCAAGCAAGCAACGACATCAATAGAAGAGTCACATAAAAG
CAAAAATGTGCAAGTAATAGCAGCTATGGACAGTTCAACCAACATGGACTTTTCTCAGATAGAGATTGCCAAATTACAAGCTCCTGAAATTCTTCCGCAGAAAATTCCAT
ACAGAATCTGCCCGAACCCTTCTGCGATTATATGGAGATCGAAGAAATCAAGAAGAAGGTGGACTTTG
Protein sequenceShow/hide protein sequence
MDVRKIAVMVEDVEAARTALKWALNNLMRYGDLIILLHVFPSTRSKSGAKARHLRLKGYQLALSFKDLCTAFPNTKVEIVVTEGDKDGRKIAAMVREIGASALVVGLHDH
SFLYKMAMAQDDIARNFNCKVLAIKQATTSIEESHKSKNVQVIAAMDSSTNMDFSQIEIAKLQAPEILPQKIPYRICPNPSAIIWRSKKSRRRWTL