; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g32360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g32360
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCysteine dioxygenase
Genome locationchr1:22770589..22782089
RNA-Seq ExpressionMoc01g32360
SyntenyMoc01g32360
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0017172 - cysteine dioxygenase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR011051 - RmlC-like cupin domain superfamily
IPR012864 - Cysteine oxygenase/2-aminoethanethiol dioxygenase
IPR014710 - RmlC-like jelly roll fold
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581360.1 Plant cysteine oxidase 5, partial [Cucurbita argyrosperma subsp. sororia]1.3e-16673.04Show/hide
Query:  PIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIP
        P+  +   + KASFS S PVSEEAL+KV++LLDELKPSNVGLE+ESQLARGWKGS N TN KK R G+HQYP TI+YLHLHEC+RFSIGIFCMPPGSIIP
Subjt:  PIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIP

Query:  LHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR
        LHNHPGMTV SKLLYGALHVRSYDWLDLP+F DLSQARPAKLVRDCEMIAPCGTTILYPDR GNIH FKAITPCAIFDILSPPYSS DGRHCSYFRRSPR
Subjt:  LHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR

Query:  REISGLDQFCGPES--SEVTWLEEIQPPENFVVEYLKWVSAFRSRNLKFLLLKNRYCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSS----RLFDLN
        REISGLDQ CG E+  S VTWL+EIQPPENFV                           NLL+NFRDRR+R ++  DE  A+EKLNP +S     LFDLN
Subjt:  REISGLDQFCGPES--SEVTWLEEIQPPENFVVEYLKWVSAFRSRNLKFLLLKNRYCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSS----RLFDLN

Query:  EEAMVEESDH---------EERKYGSISTNNNNNNSSN------GNG----RRTTVRQYVRSKVPRLRWTPELHLNFVHAVERLGGQERATPKLVLQLMN
        EEA VE+ DH         EERK+G  ST +NNN ++N      GNG    RRT VRQYVRSKVPRLRWTPELHLNFVHAV+RLGGQERATPKLVLQLMN
Subjt:  EEAMVEESDH---------EERKYGSISTNNNNNNSSN------GNG----RRTTVRQYVRSKVPRLRWTPELHLNFVHAVERLGGQERATPKLVLQLMN

Query:  VRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNG
        V+GLSIAHVKSHLQMYRSKKLDQTGQVI EA  G
Subjt:  VRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNG

XP_022142149.1 uncharacterized protein LOC111012344 isoform X1 [Momordica charantia]1.7e-18299.11Show/hide
Query:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
        YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
Subjt:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV

Query:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
        HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
Subjt:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR

Query:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK
        FHSHFPYHIRPNTSFS   RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK
Subjt:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK

Query:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
        SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
Subjt:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS

XP_022142150.1 putative Myb family transcription factor At1g14600 isoform X2 [Momordica charantia]4.0e-184100Show/hide
Query:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
        YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
Subjt:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV

Query:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
        HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
Subjt:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR

Query:  FHSHFPYHIRPNTSFSRQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSKSKS
        FHSHFPYHIRPNTSFSRQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSKSKS
Subjt:  FHSHFPYHIRPNTSFSRQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSKSKS

Query:  GNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
        GNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
Subjt:  GNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS

XP_022142151.1 putative two-component response regulator ARR20 isoform X3 [Momordica charantia]7.9e-16491.99Show/hide
Query:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
        YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
Subjt:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV

Query:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
        HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
Subjt:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR

Query:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK
        FHSHFPYHIRPNTSFS   RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSN    
Subjt:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK

Query:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
                            GGGRGSTQEITTQLSLS
Subjt:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS

XP_022142152.1 plant cysteine oxidase 4-like isoform X1 [Momordica charantia]7.7e-135100Show/hide
Query:  MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII
        MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII
Subjt:  MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII

Query:  PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP
        PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP
Subjt:  PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP

Query:  RREISGLDQFCGPESSEVTWLEEIQPPENFVV
        RREISGLDQFCGPESSEVTWLEEIQPPENFVV
Subjt:  RREISGLDQFCGPESSEVTWLEEIQPPENFVV

TrEMBL top hitse value%identityAlignment
A0A6J1CK39 putative Myb family transcription factor At1g14600 isoform X22.0e-184100Show/hide
Query:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
        YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
Subjt:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV

Query:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
        HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
Subjt:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR

Query:  FHSHFPYHIRPNTSFSRQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSKSKS
        FHSHFPYHIRPNTSFSRQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSKSKS
Subjt:  FHSHFPYHIRPNTSFSRQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSKSKS

Query:  GNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
        GNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
Subjt:  GNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS

A0A6J1CKR9 uncharacterized protein LOC111012344 isoform X18.2e-18399.11Show/hide
Query:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
        YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
Subjt:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV

Query:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
        HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
Subjt:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR

Query:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK
        FHSHFPYHIRPNTSFS   RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK
Subjt:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK

Query:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
        SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
Subjt:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS

A0A6J1CLD5 putative two-component response regulator ARR20 isoform X33.8e-16491.99Show/hide
Query:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
        YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV
Subjt:  YCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRTTVRQYVRSKVPRLRWTPELHLNFV

Query:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
        HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR
Subjt:  HAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIHSHSHSHSFGAVGTR

Query:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK
        FHSHFPYHIRPNTSFS   RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSN    
Subjt:  FHSHFPYHIRPNTSFS---RQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKSK

Query:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS
                            GGGRGSTQEITTQLSLS
Subjt:  SKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS

A0A6J1CMJ3 Cysteine dioxygenase3.7e-135100Show/hide
Query:  MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII
        MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII
Subjt:  MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII

Query:  PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP
        PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP
Subjt:  PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP

Query:  RREISGLDQFCGPESSEVTWLEEIQPPENFVV
        RREISGLDQFCGPESSEVTWLEEIQPPENFVV
Subjt:  RREISGLDQFCGPESSEVTWLEEIQPPENFVV

A0A6J1IUW9 Cysteine dioxygenase8.4e-11988.46Show/hide
Query:  MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII
        MPI+QKLYDACKASFS S PVSEEAL+KV++LLDELKPSNVGLE+ESQLARGWKGS N TN KK R G HQYP TI+YLHLHEC+RFSIGIFCMPPGSII
Subjt:  MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII

Query:  PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP
        PLHNHPGMTVLSKLLYGALHVRSYDWLDLP+F DLSQARPAKLVRDCEMIAPCGTTILYPDR GNIH FKAITPCAIFDILSPPYSS DGRHCSYFRRSP
Subjt:  PLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP

Query:  RREISGLDQFCGPES--SEVTWLEEIQPPENFVV
        RREISGLDQ CG E+  SEVTWL+EIQPPENFVV
Subjt:  RREISGLDQFCGPES--SEVTWLEEIQPPENFVV

SwissProt top hitse value%identityAlignment
Q1G3U6 Plant cysteine oxidase 31.0e-4439.58Show/hide
Query:  PIIQKLYDACKASFSASGP-VSEEALEKVLSLLDELKPSNVGLEQESQ-LARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSI
        P +Q+LYD CK +F+   P  +  A++K+ S+LD + P++VGLE+ SQ   RG+     G +     N + ++   I +L +HEC+ F++ IFC P  S+
Subjt:  PIIQKLYDACKASFSASGP-VSEEALEKVLSLLDELKPSNVGLEQESQ-LARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSI

Query:  IPLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQ-------ARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRH
        IPLH+HP M V SK+LYG+LHV++YDW++ P      +       AR AKLV D  +        LYP   GN+HCF A+TPCA+ DILSPPY    GR 
Subjt:  IPLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQ-------ARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRH

Query:  CSYFRRSPRREI---SGLDQFCGPESSEVTWLEEIQPPEN
        CSY+   P       +G+ +    +  E  WL +I  P++
Subjt:  CSYFRRSPRREI---SGLDQFCGPESSEVTWLEEIQPPEN

Q8LGJ5 Plant cysteine oxidase 21.6e-4542.5Show/hide
Query:  IQKLYDACKASFS--ASGPV-SEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII
        +QKL+D CK  F+   SG V S+E +E + ++LDE+KP +VG+  +    R         ++   R+ L      + YLH++ C RFSI IFC+PP  +I
Subjt:  IQKLYDACKASFS--ASGPV-SEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSII

Query:  PLHNHPGMTVLSKLLYGALHVRSYDWL-DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRS
        PLHNHP MTV SKLL+G +H++SYDW+ D PQ    S  R AK+  D +  APC T+ILYP   GN+HCF A T CA+ D++ PPYS   GRHC+Y+   
Subjt:  PLHNHPGMTVLSKLLYGALHVRSYDWL-DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRS

Query:  PRREISGLDQFCGPESSE-VTWLEE-IQPPENFVVEYLKW
        P    S        E  E   WL+E  + PE+  V  L +
Subjt:  PRREISGLDQFCGPESSE-VTWLEE-IQPPENFVVEYLKW

Q9LXG9 Plant cysteine oxidase 11.5e-4843.17Show/hide
Query:  IQKLYDACKASFSASGP---VSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPN-GTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSI
        +++L++ CK  FS  GP    SE+ ++++  +LD++KP +VGL       R     PN G  ++            I YLHLH+C++FSIGIFC+PP  +
Subjt:  IQKLYDACKASFSASGP---VSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPN-GTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSI

Query:  IPLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRS
        IPLHNHPGMTV SKLL+G +H++SYDW+      D S+ R AKL  D    APC  +ILYP+  GN+H F AIT CA+ D+L PPY + +GRHC+YF   
Subjt:  IPLHNHPGMTVLSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRS

Query:  PRREISGLDQ---FCGPESSEVTWLEE
        P  ++S  D        E     WL+E
Subjt:  PRREISGLDQ---FCGPESSEVTWLEE

Q9LXT4 Plant cysteine oxidase 53.9e-8162.5Show/hide
Query:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH
        IQ+L++ CK+S S +GPVSEEAL+KV ++L+++KPS+VGLEQE+QL R W G  N      +RNG H     IKYL LHEC+ FSIGIFCMPPGSIIPLH
Subjt:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH

Query:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR
        NHPGMTVLSKL+YG++HV+SYDW   D  +  D  QARPAKLV+D +M +P   T LYP   GNIHCFKAIT CAIFDILSPPYSS  GRHC+YFR+SP 
Subjt:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR

Query:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV
         ++ G ++   G   S VTWLEE QPP+NFV+
Subjt:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV

Q9SJI9 Plant cysteine oxidase 44.3e-8060.59Show/hide
Query:  QKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLHN
        Q+LY+ CKASFS+ GP++E+ALEKV ++L+++KPS+VG+EQ++QLAR   G  N      +RNG +Q P  IKYLHLHEC+ FSIGIFCMPP S+IPLHN
Subjt:  QKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLHN

Query:  HPGMTVLSKLLYGALHVRSYDWLDLPQFT---DLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR
        HPGMTVLSKL+YG++HV+SYDWL+ PQ T   D SQARPAKLV+D EM A    T LYP   GNIHCFKAIT CAI DIL+PPYSS   RHC+YFR+S R
Subjt:  HPGMTVLSKLLYGALHVRSYDWLDLPQFT---DLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR

Query:  REISGLDQFCGPESSEVTWLEEIQPPENFVVEYLKW
         ++ G  +  G   ++VTWLEE QPP++FV+  + +
Subjt:  REISGLDQFCGPESSEVTWLEEIQPPENFVVEYLKW

Arabidopsis top hitse value%identityAlignment
AT2G42670.1 Protein of unknown function (DUF1637)3.1e-8160.59Show/hide
Query:  QKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLHN
        Q+LY+ CKASFS+ GP++E+ALEKV ++L+++KPS+VG+EQ++QLAR   G  N      +RNG +Q P  IKYLHLHEC+ FSIGIFCMPP S+IPLHN
Subjt:  QKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLHN

Query:  HPGMTVLSKLLYGALHVRSYDWLDLPQFT---DLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR
        HPGMTVLSKL+YG++HV+SYDWL+ PQ T   D SQARPAKLV+D EM A    T LYP   GNIHCFKAIT CAI DIL+PPYSS   RHC+YFR+S R
Subjt:  HPGMTVLSKLLYGALHVRSYDWLDLPQFT---DLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR

Query:  REISGLDQFCGPESSEVTWLEEIQPPENFVVEYLKW
         ++ G  +  G   ++VTWLEE QPP++FV+  + +
Subjt:  REISGLDQFCGPESSEVTWLEEIQPPENFVVEYLKW

AT2G42670.2 Protein of unknown function (DUF1637)5.8e-8059.07Show/hide
Query:  QKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLHN
        Q+LY+ CKASFS+ GP++E+ALEKV ++L+++KPS+VG+EQ++QLAR   G  N      +RNG +Q P  IKYLHLHEC+ FSIGIFCMPP S+IPLHN
Subjt:  QKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLHN

Query:  HPGMTVLSKLLYGALHVRSYDWLDLPQFTD----LSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP
        HPGMTVLSKL+YG++HV+SYDWL+ PQ T+      +ARPAKLV+D EM A    T LYP   GNIHCFKAIT CAI DIL+PPYSS   RHC+YFR+S 
Subjt:  HPGMTVLSKLLYGALHVRSYDWLDLPQFTD----LSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSP

Query:  RREISGLDQFCGPESSEVTWLEEIQPPENFVVEYLKW
        R ++ G  +  G   ++VTWLEE QPP++FV+  + +
Subjt:  RREISGLDQFCGPESSEVTWLEEIQPPENFVVEYLKW

AT3G58670.1 Protein of unknown function (DUF1637)2.8e-8262.5Show/hide
Query:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH
        IQ+L++ CK+S S +GPVSEEAL+KV ++L+++KPS+VGLEQE+QL R W G  N      +RNG H     IKYL LHEC+ FSIGIFCMPPGSIIPLH
Subjt:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH

Query:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR
        NHPGMTVLSKL+YG++HV+SYDW   D  +  D  QARPAKLV+D +M +P   T LYP   GNIHCFKAIT CAIFDILSPPYSS  GRHC+YFR+SP 
Subjt:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR

Query:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV
         ++ G ++   G   S VTWLEE QPP+NFV+
Subjt:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV

AT3G58670.2 Protein of unknown function (DUF1637)2.8e-8262.5Show/hide
Query:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH
        IQ+L++ CK+S S +GPVSEEAL+KV ++L+++KPS+VGLEQE+QL R W G  N      +RNG H     IKYL LHEC+ FSIGIFCMPPGSIIPLH
Subjt:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH

Query:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR
        NHPGMTVLSKL+YG++HV+SYDW   D  +  D  QARPAKLV+D +M +P   T LYP   GNIHCFKAIT CAIFDILSPPYSS  GRHC+YFR+SP 
Subjt:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR

Query:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV
         ++ G ++   G   S VTWLEE QPP+NFV+
Subjt:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV

AT3G58670.3 Protein of unknown function (DUF1637)2.8e-8262.5Show/hide
Query:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH
        IQ+L++ CK+S S +GPVSEEAL+KV ++L+++KPS+VGLEQE+QL R W G  N      +RNG H     IKYL LHEC+ FSIGIFCMPPGSIIPLH
Subjt:  IQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLH

Query:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR
        NHPGMTVLSKL+YG++HV+SYDW   D  +  D  QARPAKLV+D +M +P   T LYP   GNIHCFKAIT CAIFDILSPPYSS  GRHC+YFR+SP 
Subjt:  NHPGMTVLSKLLYGALHVRSYDWL--DLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPR

Query:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV
         ++ G ++   G   S VTWLEE QPP+NFV+
Subjt:  REISG-LDQFCGPESSEVTWLEEIQPPENFVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCATAATTCAGAAGTTGTATGATGCTTGCAAAGCATCATTTTCTGCCAGTGGTCCGGTGTCAGAAGAGGCTCTAGAGAAAGTCCTTTCTCTCTTAGATGAACTGAA
GCCGTCTAATGTGGGTCTCGAACAGGAGTCACAGTTAGCTCGTGGTTGGAAAGGTTCGCCGAATGGTACTAATAGCAAGAAAGATCGAAATGGCTTACATCAATATCCAT
CAACAATAAAGTACCTACATTTGCATGAATGTGAAAGATTCTCGATAGGAATCTTTTGCATGCCCCCAGGTTCTATCATTCCACTTCATAATCATCCTGGAATGACCGTA
TTGAGCAAGCTTCTATATGGTGCTTTACACGTACGATCATATGATTGGCTTGATCTACCTCAGTTCACTGATCTTTCTCAAGCCAGACCTGCAAAACTTGTTAGGGACTG
TGAGATGATTGCACCTTGTGGGACAACAATTCTTTATCCAGATCGAGCTGGCAACATTCATTGTTTCAAAGCCATAACTCCCTGCGCAATATTCGACATTCTCTCACCGC
CTTACTCTTCTGTAGATGGGCGACACTGCTCTTATTTCCGGAGGTCTCCCAGGAGAGAAATTTCAGGTCTCGACCAATTTTGTGGACCCGAATCCTCAGAAGTTACGTGG
TTGGAAGAGATTCAACCACCTGAAAACTTTGTGGTTGAATATTTGAAGTGGGTATCAGCTTTCAGATCAAGGAACTTGAAGTTTCTTCTTCTCAAGAATCGGTATTGCAG
CAGCAACCTCTTATTGAATTTTCGAGACCGGCGGAAACGGGACCAGAGCGACGACGATGAACGGCGAGCAAAAGAAAAGTTGAATCCATTCTCATCTAGATTGTTTGATT
TGAATGAAGAGGCCATGGTTGAAGAGAGCGATCATGAAGAAAGAAAATATGGAAGCATTTCAACTAATAATAATAACAACAACTCATCAAATGGAAATGGAAGGAGGACA
ACAGTGAGACAATATGTTAGATCAAAAGTGCCTCGCCTACGTTGGACTCCTGAGCTGCATCTCAATTTTGTTCATGCTGTGGAAAGGCTTGGTGGCCAAGAGAGAGCAAC
CCCCAAGTTGGTTCTTCAGCTGATGAATGTTAGAGGACTGAGTATTGCCCATGTTAAAAGCCACTTACAGATGTATCGAAGTAAGAAGCTTGACCAAACTGGGCAAGTTA
TACGAGAGGCATGCAATGGGACAATGCATGGAGGAGGATATTACAGTAATAATAATGGCAGCATGAGCATACATTCTTCTCGTTATCTTCCACTGGCAAGGCATATCCAT
AGCCATAGCCATAGCCACTCGTTTGGTGCTGTTGGAACTCGTTTTCACTCTCACTTCCCATATCACATCAGACCCAACACCTCGTTCTCAAGGCAAATTTTGGAGCAAAA
GAGATGGAGTAGTTTGGGAGAAAGGGCATGGAAGATGATGAGAAGAACAAATAACAATAATCGGAGTGAGAGGAAGGATCCAATAATAATTAGTGATGATGCAAAGATAA
GAGATGATGAGTTGCGAGGAATAGGAAATAATTGGGAAGCAGAAGCAGAAGCAGAGGCAGAGCTGCTGCAGCTAGGAATAGGATTAAGCAGAAGCAGCAACGGTAAGAGT
AAGAGTAAGAGTGGTAACAATAATATTAATAATAAAGAGAGTTATTGTTGGTCATATGGTTTAGGAGGTGGCAGAGGCAGCACTCAAGAAATTACCACCCAGCTTTCCCT
TTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCCATAATTCAGAAGTTGTATGATGCTTGCAAAGCATCATTTTCTGCCAGTGGTCCGGTGTCAGAAGAGGCTCTAGAGAAAGTCCTTTCTCTCTTAGATGAACTGAA
GCCGTCTAATGTGGGTCTCGAACAGGAGTCACAGTTAGCTCGTGGTTGGAAAGGTTCGCCGAATGGTACTAATAGCAAGAAAGATCGAAATGGCTTACATCAATATCCAT
CAACAATAAAGTACCTACATTTGCATGAATGTGAAAGATTCTCGATAGGAATCTTTTGCATGCCCCCAGGTTCTATCATTCCACTTCATAATCATCCTGGAATGACCGTA
TTGAGCAAGCTTCTATATGGTGCTTTACACGTACGATCATATGATTGGCTTGATCTACCTCAGTTCACTGATCTTTCTCAAGCCAGACCTGCAAAACTTGTTAGGGACTG
TGAGATGATTGCACCTTGTGGGACAACAATTCTTTATCCAGATCGAGCTGGCAACATTCATTGTTTCAAAGCCATAACTCCCTGCGCAATATTCGACATTCTCTCACCGC
CTTACTCTTCTGTAGATGGGCGACACTGCTCTTATTTCCGGAGGTCTCCCAGGAGAGAAATTTCAGGTCTCGACCAATTTTGTGGACCCGAATCCTCAGAAGTTACGTGG
TTGGAAGAGATTCAACCACCTGAAAACTTTGTGGTTGAATATTTGAAGTGGGTATCAGCTTTCAGATCAAGGAACTTGAAGTTTCTTCTTCTCAAGAATCGGTATTGCAG
CAGCAACCTCTTATTGAATTTTCGAGACCGGCGGAAACGGGACCAGAGCGACGACGATGAACGGCGAGCAAAAGAAAAGTTGAATCCATTCTCATCTAGATTGTTTGATT
TGAATGAAGAGGCCATGGTTGAAGAGAGCGATCATGAAGAAAGAAAATATGGAAGCATTTCAACTAATAATAATAACAACAACTCATCAAATGGAAATGGAAGGAGGACA
ACAGTGAGACAATATGTTAGATCAAAAGTGCCTCGCCTACGTTGGACTCCTGAGCTGCATCTCAATTTTGTTCATGCTGTGGAAAGGCTTGGTGGCCAAGAGAGAGCAAC
CCCCAAGTTGGTTCTTCAGCTGATGAATGTTAGAGGACTGAGTATTGCCCATGTTAAAAGCCACTTACAGATGTATCGAAGTAAGAAGCTTGACCAAACTGGGCAAGTTA
TACGAGAGGCATGCAATGGGACAATGCATGGAGGAGGATATTACAGTAATAATAATGGCAGCATGAGCATACATTCTTCTCGTTATCTTCCACTGGCAAGGCATATCCAT
AGCCATAGCCATAGCCACTCGTTTGGTGCTGTTGGAACTCGTTTTCACTCTCACTTCCCATATCACATCAGACCCAACACCTCGTTCTCAAGGCAAATTTTGGAGCAAAA
GAGATGGAGTAGTTTGGGAGAAAGGGCATGGAAGATGATGAGAAGAACAAATAACAATAATCGGAGTGAGAGGAAGGATCCAATAATAATTAGTGATGATGCAAAGATAA
GAGATGATGAGTTGCGAGGAATAGGAAATAATTGGGAAGCAGAAGCAGAAGCAGAGGCAGAGCTGCTGCAGCTAGGAATAGGATTAAGCAGAAGCAGCAACGGTAAGAGT
AAGAGTAAGAGTGGTAACAATAATATTAATAATAAAGAGAGTTATTGTTGGTCATATGGTTTAGGAGGTGGCAGAGGCAGCACTCAAGAAATTACCACCCAGCTTTCCCT
TTCTTAA
Protein sequenceShow/hide protein sequence
MPIIQKLYDACKASFSASGPVSEEALEKVLSLLDELKPSNVGLEQESQLARGWKGSPNGTNSKKDRNGLHQYPSTIKYLHLHECERFSIGIFCMPPGSIIPLHNHPGMTV
LSKLLYGALHVRSYDWLDLPQFTDLSQARPAKLVRDCEMIAPCGTTILYPDRAGNIHCFKAITPCAIFDILSPPYSSVDGRHCSYFRRSPRREISGLDQFCGPESSEVTW
LEEIQPPENFVVEYLKWVSAFRSRNLKFLLLKNRYCSSNLLLNFRDRRKRDQSDDDERRAKEKLNPFSSRLFDLNEEAMVEESDHEERKYGSISTNNNNNNSSNGNGRRT
TVRQYVRSKVPRLRWTPELHLNFVHAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDQTGQVIREACNGTMHGGGYYSNNNGSMSIHSSRYLPLARHIH
SHSHSHSFGAVGTRFHSHFPYHIRPNTSFSRQILEQKRWSSLGERAWKMMRRTNNNNRSERKDPIIISDDAKIRDDELRGIGNNWEAEAEAEAELLQLGIGLSRSSNGKS
KSKSGNNNINNKESYCWSYGLGGGRGSTQEITTQLSLS