; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g09720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g09720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPOLIIIAc domain-containing protein
Genome locationchr4:7177864..7190171
RNA-Seq ExpressionMoc04g09720
SyntenyMoc04g09720
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0090503 - RNA phosphodiester bond hydrolysis, exonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004534 - 5'-3' exoribonuclease activity (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR003141 - Polymerase/histidinol phosphatase, N-terminal
IPR004013 - PHP domain
IPR016195 - Polymerase/histidinol phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588776.1 hypothetical protein SDJN03_17341, partial [Cucurbita argyrosperma subsp. sororia]8.3e-21484.87Show/hide
Query:  MVGDGH---PPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSK
        MVGD H    PPNS             K++KKKKKR G+ KKMTS+Q  AFKYVTEWV+LD+SNSLAS+AA+SVVDDFGVQK+LGKGG+KVVF+LHSHSK
Subjt:  MVGDGH---PPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSK

Query:  FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL
        FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFS SGN ESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL
Subjt:  FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL

Query:  RAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAV
        RAKNMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARAMVEAGYVENLKQAF+RYLFDGGPAYSTGSEPCAE+AIQ+IH+TGGV VLAHPWALKNPVA+
Subjt:  RAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAV

Query:  IRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFG
        IRRLKDAGL GLEVYRSDGKLAAYSDLAD  GLLKLGGSDFHGRGG+SESE+GSVNLPVLAMHDFLK+ARPIWC AIRDIL+SY EEPSDSNLA ITRFG
Subjt:  IRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFG

Query:  RTRVSKSSS-SASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        RTRV K  S   S   D ID CLTSWLTNEEKQ+AEFEAIRLKL+H+SM  QEVQ+
Subjt:  RTRVSKSSS-SASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

XP_004136869.1 uncharacterized protein LOC101218042 [Cucumis sativus]9.8e-21584.51Show/hide
Query:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
        MVGD H P    +SP++  +KP      KKKKR G+ KKMTS+Q  AFKYVTEW +LD+SNSLASSAA+SVVDDFGVQK++GKGG+KVVFELHSHSK SD
Subjt:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD

Query:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
        GFL+PSKLVERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS  G+ ESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
Subjt:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK

Query:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
        NMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARA+VEAGYVENLKQAF+RYLFDGGPAYSTGSEPCA EAIQ+IHDTGG+ VLAHPWALKNPVAVIRR
Subjt:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR

Query:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
        LKDAGL GLEVYRSDG+LAAYSDLAD+YGLLKLGGSDFHGRGGHSESE+GSVNLPVLAMHDFLK ARP+WC AIRDILESY EEPS+SNLA ITRFGRTR
Subjt:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR

Query:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        V K  SS  SG D I+RCLT WLTNEEKQN EFEAIRLKL+H+S+N QEVQ+
Subjt:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

XP_008455216.1 PREDICTED: 3',5'-nucleoside bisphosphate phosphatase [Cucumis melo]1.7e-21484.73Show/hide
Query:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
        MVGD H P +S+ S          K++ KKKKR G+ KKMTS+Q  AFKYVTEWV+LD+SNSLASSAA+SVVDDFGVQKSLGKGG+KVVFELHSHSK SD
Subjt:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD

Query:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
        GFL+PSKLVERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS  G+ ESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
Subjt:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK

Query:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
        NMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARA+VEAGYVENLKQAF+RYLFDGGPAYSTGSEPCA EAIQ+I DTGG+ VLAHPWALKNPVAVIRR
Subjt:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR

Query:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
        LKDAGL GLEVYRSDG+LAAYSDLAD+YGLLKLGGSDFHGRGGHSESE+GSVNLPVLAMHDFLK ARP+WC AIRDILE Y EEPS+SNLA ITRFGRTR
Subjt:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR

Query:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        V K  SS SSG D I+RCLT WLTNEEKQN EFEAIRLKL+H+S+N QEVQ+
Subjt:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

XP_022141958.1 uncharacterized protein LOC111012205 [Momordica charantia]1.0e-25699.78Show/hide
Query:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
        MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
Subjt:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD

Query:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
        GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
Subjt:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK

Query:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
        NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
Subjt:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR

Query:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
        LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
Subjt:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR

Query:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQ+
Subjt:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

XP_038886806.1 3',5'-nucleoside bisphosphate phosphatase [Benincasa hispida]1.1e-21886.5Show/hide
Query:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
        MVGD H P ++ +S          K++ KKKKR GS KKMTS+QA AFKYVTEWV+LD+SNSLASSAA+SVVDDFGVQKSLGKGG+KVVFELHSHSKFSD
Subjt:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD

Query:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
        GFL+PSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFS  G+ ESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
Subjt:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK

Query:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
        NMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARAMVEAGYVENLKQAF+RYLFDGGPAYS GSEPCA EAIQ+IHDTGGV VLAHPWALKNPVA+IRR
Subjt:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR

Query:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
        LKDAGLQGLEVYRSDGKLAAY DLAD+YGLLKLGGSDFHGRGGHSESE+GSVNLPVLAMHDFLK ARPIWC AIRDILE Y EEPS+SNLA ITRFGRTR
Subjt:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR

Query:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        V K  SS SS  D IDRCLTSWLTNEEKQNAEFEAIRLKL+H+S+N QEVQ+
Subjt:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

TrEMBL top hitse value%identityAlignment
A0A0A0K205 POLIIIAc domain-containing protein4.7e-21584.51Show/hide
Query:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
        MVGD H P    +SP++  +KP      KKKKR G+ KKMTS+Q  AFKYVTEW +LD+SNSLASSAA+SVVDDFGVQK++GKGG+KVVFELHSHSK SD
Subjt:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD

Query:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
        GFL+PSKLVERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS  G+ ESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
Subjt:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK

Query:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
        NMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARA+VEAGYVENLKQAF+RYLFDGGPAYSTGSEPCA EAIQ+IHDTGG+ VLAHPWALKNPVAVIRR
Subjt:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR

Query:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
        LKDAGL GLEVYRSDG+LAAYSDLAD+YGLLKLGGSDFHGRGGHSESE+GSVNLPVLAMHDFLK ARP+WC AIRDILESY EEPS+SNLA ITRFGRTR
Subjt:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR

Query:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        V K  SS  SG D I+RCLT WLTNEEKQN EFEAIRLKL+H+S+N QEVQ+
Subjt:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

A0A1S3C0E5 3',5'-nucleoside bisphosphate phosphatase8.1e-21584.73Show/hide
Query:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
        MVGD H P +S+ S          K++ KKKKR G+ KKMTS+Q  AFKYVTEWV+LD+SNSLASSAA+SVVDDFGVQKSLGKGG+KVVFELHSHSK SD
Subjt:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD

Query:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
        GFL+PSKLVERAHGNGVKVLALTDHDTMSGIPEA+EAARRFGIKIIPGVEISTIFS  G+ ESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
Subjt:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK

Query:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
        NMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARA+VEAGYVENLKQAF+RYLFDGGPAYSTGSEPCA EAIQ+I DTGG+ VLAHPWALKNPVAVIRR
Subjt:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR

Query:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
        LKDAGL GLEVYRSDG+LAAYSDLAD+YGLLKLGGSDFHGRGGHSESE+GSVNLPVLAMHDFLK ARP+WC AIRDILE Y EEPS+SNLA ITRFGRTR
Subjt:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR

Query:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        V K  SS SSG D I+RCLT WLTNEEKQN EFEAIRLKL+H+S+N QEVQ+
Subjt:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

A0A6J1CM23 uncharacterized protein LOC1110122055.0e-25799.78Show/hide
Query:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
        MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD
Subjt:  MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSD

Query:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
        GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK
Subjt:  GFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAK

Query:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
        NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR
Subjt:  NMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRR

Query:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
        LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR
Subjt:  LKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTR

Query:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQ+
Subjt:  VSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

A0A6J1EMA5 uncharacterized protein LOC1114346461.3e-21284.43Show/hide
Query:  MVGDGH---PPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSK
        MVGD H    PPNS             K++KKKKKR G+ KKMTS+Q  AFKYVTEWV+LD+SNSLAS+AA+SVVDDFGVQK+LGKGG+KVVF+LHSHSK
Subjt:  MVGDGH---PPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSK

Query:  FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL
        FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFS SGN ESEEPVHILAYYSSCGPAKIEKLE FLENIREGRFL
Subjt:  FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL

Query:  RAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAV
        RAKNMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARAMVEAGYVENLKQAF+RYLFDGGPAYSTGSEPCAE+AIQ+IH+TGGV VLAHPWALKNPVA+
Subjt:  RAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAV

Query:  IRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFG
        IRRLKDAGL GLEVYRSDGKLA YSDLAD  GLLKLGGSDFHGRGG+SESE+GSVNLP LAMHDFLK+ARPIWC AIRDILESY EEPSDSNLA ITRFG
Subjt:  IRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFG

Query:  RTRVSKSSS-SASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        RTRV K  S   S   D ID CLTSWLTNEEKQ+AEFEAIRLKL+H+SM  QEVQ+
Subjt:  RTRVSKSSS-SASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

A0A6J1JMJ7 uncharacterized protein LOC1114860222.6e-21384.87Show/hide
Query:  MVGDGH---PPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSK
        MVGDGH    PPNS             K++KKKKKR GS KKMTS+Q  AFKYVTEWV+LD+SNSLAS+AA+SVVDDFGVQK+LGKGG+KVVFELHSHSK
Subjt:  MVGDGH---PPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSK

Query:  FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL
        FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFS SGN ESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL
Subjt:  FSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFL

Query:  RAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAV
        RAKNMVSKLNELKLPLKWDHVA+ITGKGVAPGRLHVARAMVEAGYVENLKQAF+RYLFDGGPAYSTGSEPCAE+AIQ+IH+TGGV VLAHPWALKNPVA+
Subjt:  RAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAV

Query:  IRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFG
        IRRLKDAGL GLEVYRSDGKLAAYSDLAD  GLLKLGGSDFHGRGG+SESE+GSVNLPVLAMHDFLK+AR IWC AIRDILESY EEPS+SNLA ITRFG
Subjt:  IRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFG

Query:  RTRVSKSSS-SASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI
        RTRV K  S   S   D ID CL SWLTNEEKQ+AEFEAIRLKL+H+SM  QEV++
Subjt:  RTRVSKSSS-SASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKLAHVSMNHQEVQI

SwissProt top hitse value%identityAlignment
C8WJZ5 Phosphoribosyl 1,2-cyclic phosphate 1,2-diphosphodiesterase9.4e-1928.08Show/hide
Query:  VVFELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKF
        ++ +LH HS  SDG  +  +++E+A   GV+ LA T+HDT +G+  A E   R G++++ G+E+S       + E    VHIL      G   +  L   
Subjt:  VVFELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKF

Query:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLF-DGGPAYSTGSEPCAEEAIQMIHDTGGVTVLA
          +  E R   +   + +L E    +  +    +        + H+  A+    Y     +   R LF +GG          A +A++++ + GG+ VLA
Subjt:  LENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLF-DGGPAYSTGSEPCAEEAIQMIHDTGGVTVLA

Query:  HPWALKNPVAVIRRLKDAGLQGLEVYRSDGKLAAY---SDLADSYGLLKLGGSDFHGRGG
        HP  L +   ++  L + GL G+E +  D  LA +   ++LA  Y L+  GGSD+HG+ G
Subjt:  HPWALKNPVAVIRRLKDAGLQGLEVYRSDGKLAAY---SDLADSYGLLKLGGSDFHGRGG

O54453 5'-3' exoribonuclease7.2e-2734.6Show/hide
Query:  VVFELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSTSGNPESEEPVHILAY-YSSCGPAKIEKL
        V+++LHSH+  SDG L+P  LV RA    V  LA+TDHDT + IP A E   R G  + +IPGVEIST++      E+ E +HI+        PA    +
Subjt:  VVFELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSTSGNPESEEPVHILAY-YSSCGPAKIEKL

Query:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTV
          FL    E R  R + +  +L +  +P  W+   R+   G A  R H AR +VE G    +   F +YL  G   Y        E+AI +IH +GG  V
Subjt:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTV

Query:  LAHP--------WALKNPVAVIRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFH
        LAHP        W LK  VA         ++  +  +S  +    + LA  + L    GSDFH
Subjt:  LAHP--------WALKNPVAVIRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFH

P44176 5'-3' exoribonuclease2.9e-2835.27Show/hide
Query:  FELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAY-YSSCGPAKIEKLEKFL
        ++LH HS  SDG LSP++LV RA+  GV VLAL DHDT++GI EA  AA+  GI++I GVEIST +   G       +HI+   +    P    K+   L
Subjt:  FELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAY-YSSCGPAKIEKLEKFL

Query:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHP
        ++ +  R  RA  +  KL +  +P  +D    +    V   R H AR +V+ G V N  QAF RYL  G  A+          AI+ IH  GG+ ++AHP
Subjt:  ENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHP

Query:  WALKNPVAVIRRL----KDAGLQGLEVY---RSDGKLAAYSDLADSYGLLKLGGSDFH
                 +R+L    K  G  G+E+    ++  +    +  A  + L    GSDFH
Subjt:  WALKNPVAVIRRL----KDAGLQGLEVY---RSDGKLAAYSDLADSYGLLKLGGSDFH

P77766 5'-3' exoribonuclease1.1e-2434.22Show/hide
Query:  VVFELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSTSGNPESEEPVHILAY-YSSCGPAKIEKL
        V+++LHSH+  SDG L+P  LV RA    V  LA+TDHDT + I  A E   R G  + +IPGVEIST++      E+ E +HI+        P   E  
Subjt:  VVFELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFG--IKIIPGVEISTIFSTSGNPESEEPVHILAY-YSSCGPAKIEKL

Query:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTV
          FL    E R  RA+ +  +L + ++P   +   R+  +G A  R H AR +VE G   ++   F +YL  G   Y        E+AI +IH +GG  V
Subjt:  EKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTV

Query:  LAHP--------WALKNPVAVIRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFH
        LAHP        W LK  VA         ++  +  +S  +    + LA  + L    GSDFH
Subjt:  LAHP--------WALKNPVAVIRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGLLKLGGSDFH

Q7NXD4 3',5'-nucleoside bisphosphate phosphatase1.8e-3033.45Show/hide
Query:  ELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLEN
        +LH HS+ SDG L+P+++++RA      +LALTDHD   G+ EA  AA R GI  + GVE+S  +           VHI+       PA+   L   L++
Subjt:  ELHSHSKFSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLEN

Query:  IREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWA
        IREGR  RA+ M + L    +   +D   R         R H AR +V++G V++++  F +YL  G P Y +      E+A+  I   GG+ V+AHP  
Subjt:  IREGRFLRAKNMVSKLNELKLPLKWDHVARITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWA

Query:  LKNPVAVIRRL----KDAGLQGLEVYRSDGKL---AAYSDLADSYGLLKLGGSDFHGRG------GHSESELGSVNLPVLAMHDFLKLARPIW
              +I RL    + AG QG+EV      L     ++  AD +GL    GSDFH  G      GH+E              D   + RPIW
Subjt:  LKNPVAVIRRL----KDAGLQGLEVYRSDGKL---AAYSDLADSYGLLKLGGSDFHGRG------GHSESELGSVNLPVLAMHDFLKLARPIW

Arabidopsis top hitse value%identityAlignment
AT2G13840.1 Polymerase/histidinol phosphatase-like4.2e-15563.36Show/hide
Query:  KDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLG--KGGDKVVFELHSHSKFSDGFLSPSKLVERAHGNGVKVL
        K K  KKKK+  G+ +KMT++Q+ AFK +T+W+ L  S SL+SS+     DDF V  + G  + G+KVVFELHSHS  SDGFLSPSK+VERA+ NGVKVL
Subjt:  KDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLG--KGGDKVVFELHSHSKFSDGFLSPSKLVERAHGNGVKVL

Query:  ALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAR
        +LTDHDTM+G+PEA+EA RRFGIKIIPG+EIST+F    +  SEEPVHILAYY + GPA  ++LE FL  IR+GRF+R + MV KLN+LK+PLKW+HV R
Subjt:  ALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAR

Query:  ITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRRLKDAGLQGLEVYRSDGKLAA
        I GK VAPGR+HVARA++EAGYVENL+QAF +YL DGGPAY+TG+EP AEEA+++I  TGGV VLAHPWALKN V +IRRLKDAGL G+EVYRSDGKL  
Subjt:  ITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRRLKDAGLQGLEVYRSDGKLAA

Query:  YSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTRVSKSSSSASSGEDFIDRCLT
        +S+LAD+Y LLKLGGSD+HG+GG +ESELGSVNLPV A+ DFL + RPIWC AI+  + ++ ++PSDSNL+NI RF + R+ K +S+ S G++ +DRCL 
Subjt:  YSDLADSYGLLKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTRVSKSSSSASSGEDFIDRCLT

Query:  SWLTNEEKQNAEFEAIRLKLAHV
         WLT++E+ + +FEA+RLKL+ V
Subjt:  SWLTNEEKQNAEFEAIRLKLAHV

AT5G64160.1 unknown protein1.8e-5753.44Show/hide
Query:  MSLALIQGYSSA-EDEAEDNSLLDRTSSDDDEDLPTAAAAASSSATVNLSIRDRSLFELPQPSSHPGLPSAFDAFSEVSGPPEFLNNSVEEYAATKDVDQ
        MSL L+QGYSSA E+EAE+ +  D  +SD+D D       +SS    + S            + + GLPSA D FS++SGPPEFLNN  E   A  +   
Subjt:  MSLALIQGYSSA-EDEAEDNSLLDRTSSDDDEDLPTAAAAASSSATVNLSIRDRSLFELPQPSSHPGLPSAFDAFSEVSGPPEFLNNSVEEYAATKDVDQ

Query:  PRGNHGGRRNRKEKKDLPTGAVLEAKAQLVGIHERVRSDVESNQPSNPSISNATQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGD
            H  R +RK+KK  P G V+EAK QLVGIHERVR+D+++  PS+ S        KR++TA NPNAE++A+LLRMC+ CG+PKT+++ARGM CP+CGD
Subjt:  PRGNHGGRRNRKEKKDLPTGAVLEAKAQLVGIHERVRSDVESNQPSNPSISNATQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGD

Query:  RPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMHLRQQFD
        R P P+ + KKKGST+KDKEK KRMRGQSSHA+WKSETEM LRQ FD
Subjt:  RPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMHLRQQFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGCGACGGCCACCCTCCTCCTAATTCCGCTGCCTCTCCCCACAATTGTAACAACAAACCCAAAGACAAGAACAGGAAGAAGAAGAAGAAGCGCCGCGGGAGCAA
CAAGAAGATGACTTCCGACCAGGCTCTGGCCTTCAAGTACGTCACCGAGTGGGTGTTTTTGGATCGCTCTAATTCTCTTGCTTCCTCTGCTGCCTCCTCTGTGGTCGATG
ATTTTGGAGTGCAGAAGAGTCTCGGCAAGGGTGGGGACAAGGTGGTGTTCGAACTGCACTCCCACTCCAAATTCAGTGATGGGTTTCTCTCCCCTTCCAAGCTTGTTGAG
AGAGCTCATGGAAATGGGGTGAAAGTTCTTGCTTTGACAGATCATGACACAATGTCTGGCATCCCTGAGGCTATAGAAGCAGCTCGTAGATTCGGCATCAAAATAATTCC
AGGGGTTGAAATCAGTACGATATTCTCTACAAGTGGAAACCCAGAATCAGAAGAACCAGTACACATCCTTGCATATTACAGCAGTTGTGGACCAGCAAAGATTGAGAAGC
TGGAAAAATTCTTAGAAAATATAAGGGAGGGGCGGTTTCTGCGTGCAAAGAACATGGTGTCAAAACTGAATGAGCTGAAGCTGCCTCTTAAATGGGATCATGTGGCTAGG
ATTACTGGTAAAGGAGTAGCTCCTGGGAGACTCCATGTGGCTCGTGCCATGGTTGAAGCAGGCTATGTGGAGAATCTAAAACAAGCATTTGCTCGTTACCTTTTTGATGG
TGGACCTGCTTACTCAACGGGATCAGAGCCTTGTGCAGAGGAAGCAATACAAATGATACACGACACAGGTGGTGTGACCGTACTAGCCCATCCGTGGGCCTTGAAGAATC
CTGTTGCCGTTATTAGAAGACTGAAAGATGCTGGCCTCCAGGGGCTGGAGGTCTACAGGAGTGATGGAAAATTGGCAGCATACAGTGACCTAGCAGACAGTTATGGGCTT
CTGAAACTTGGAGGATCAGATTTTCATGGAAGAGGTGGACACAGTGAGTCTGAACTTGGAAGTGTAAACCTTCCTGTTCTTGCTATGCACGACTTCCTCAAGCTTGCTCG
GCCAATTTGGTGCGGTGCTATTCGGGATATTTTAGAGAGTTATGCCGAAGAGCCTTCCGACTCAAATCTAGCAAATATTACTAGATTTGGGAGGACCCGGGTTTCAAAGA
GCAGCTCTTCAGCGAGTAGCGGAGAGGACTTCATTGATCGTTGCTTGACTTCGTGGCTGACAAATGAAGAGAAGCAAAATGCTGAGTTTGAGGCTATCAGGTTAAAGCTC
GCCCACGTTTCAATGAATCATCAGGAAGTTCAGATCATTCTTTTGGAAATGTTGACAAGGAATCTGGACATGTTGGAAGTGAAGGTATGTTATGAGATTCATTGTAGTTC
TCAAACTATGAAGAAGAAACATAACCATATATTAAGTCCACCGGACAAAAGTTCGGAATCATCAACGAGGAGTTCAAAGCGAAGCAGTTCAATCTCCATTAACCCTGGCG
CCATGAGTCTGGCGCTCATCCAAGGCTATTCTTCAGCCGAAGATGAAGCTGAAGACAACTCTCTCCTCGACCGCACCTCTTCCGACGACGACGAAGATCTACCCACCGCC
GCCGCCGCCGCCTCCTCCTCGGCTACCGTCAATCTTTCCATACGCGACAGGTCACTTTTCGAACTTCCACAGCCCTCCTCTCATCCCGGCTTGCCTTCCGCCTTCGACGC
TTTCTCCGAAGTTTCAGGACCGCCGGAGTTTCTAAATAATTCGGTCGAGGAGTACGCTGCAACGAAAGATGTCGATCAGCCGCGAGGGAACCATGGGGGTCGTAGGAATC
GGAAGGAGAAGAAAGATTTGCCTACTGGTGCTGTACTGGAGGCAAAAGCTCAACTAGTTGGGATTCATGAACGAGTAAGGAGTGATGTTGAGAGTAATCAACCATCAAAT
CCATCCATTTCAAACGCAACACAGGAAGGCAAGCGTGTGGCAACTGCAGCCAATCCAAATGCTGAAGACGCTGCAGAGCTACTGAGAATGTGCCTGCATTGTGGCATTCC
CAAGACCTTTTCAAATGCCCGAGGGATGTTTTGCCCTCTATGTGGTGATCGTCCTCCAGAGCCGAACAGCGAGACTAAAAAGAAGGGTTCTACAGTCAAAGATAAGGAAA
AGGTAAAGAGAATGAGGGGACAGTCATCTCATGCTACTTGGAAGAGCGAAACCGAGATGCATCTTAGACAACAGTTTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGGCGACGGCCACCCTCCTCCTAATTCCGCTGCCTCTCCCCACAATTGTAACAACAAACCCAAAGACAAGAACAGGAAGAAGAAGAAGAAGCGCCGCGGGAGCAA
CAAGAAGATGACTTCCGACCAGGCTCTGGCCTTCAAGTACGTCACCGAGTGGGTGTTTTTGGATCGCTCTAATTCTCTTGCTTCCTCTGCTGCCTCCTCTGTGGTCGATG
ATTTTGGAGTGCAGAAGAGTCTCGGCAAGGGTGGGGACAAGGTGGTGTTCGAACTGCACTCCCACTCCAAATTCAGTGATGGGTTTCTCTCCCCTTCCAAGCTTGTTGAG
AGAGCTCATGGAAATGGGGTGAAAGTTCTTGCTTTGACAGATCATGACACAATGTCTGGCATCCCTGAGGCTATAGAAGCAGCTCGTAGATTCGGCATCAAAATAATTCC
AGGGGTTGAAATCAGTACGATATTCTCTACAAGTGGAAACCCAGAATCAGAAGAACCAGTACACATCCTTGCATATTACAGCAGTTGTGGACCAGCAAAGATTGAGAAGC
TGGAAAAATTCTTAGAAAATATAAGGGAGGGGCGGTTTCTGCGTGCAAAGAACATGGTGTCAAAACTGAATGAGCTGAAGCTGCCTCTTAAATGGGATCATGTGGCTAGG
ATTACTGGTAAAGGAGTAGCTCCTGGGAGACTCCATGTGGCTCGTGCCATGGTTGAAGCAGGCTATGTGGAGAATCTAAAACAAGCATTTGCTCGTTACCTTTTTGATGG
TGGACCTGCTTACTCAACGGGATCAGAGCCTTGTGCAGAGGAAGCAATACAAATGATACACGACACAGGTGGTGTGACCGTACTAGCCCATCCGTGGGCCTTGAAGAATC
CTGTTGCCGTTATTAGAAGACTGAAAGATGCTGGCCTCCAGGGGCTGGAGGTCTACAGGAGTGATGGAAAATTGGCAGCATACAGTGACCTAGCAGACAGTTATGGGCTT
CTGAAACTTGGAGGATCAGATTTTCATGGAAGAGGTGGACACAGTGAGTCTGAACTTGGAAGTGTAAACCTTCCTGTTCTTGCTATGCACGACTTCCTCAAGCTTGCTCG
GCCAATTTGGTGCGGTGCTATTCGGGATATTTTAGAGAGTTATGCCGAAGAGCCTTCCGACTCAAATCTAGCAAATATTACTAGATTTGGGAGGACCCGGGTTTCAAAGA
GCAGCTCTTCAGCGAGTAGCGGAGAGGACTTCATTGATCGTTGCTTGACTTCGTGGCTGACAAATGAAGAGAAGCAAAATGCTGAGTTTGAGGCTATCAGGTTAAAGCTC
GCCCACGTTTCAATGAATCATCAGGAAGTTCAGATCATTCTTTTGGAAATGTTGACAAGGAATCTGGACATGTTGGAAGTGAAGGTATGTTATGAGATTCATTGTAGTTC
TCAAACTATGAAGAAGAAACATAACCATATATTAAGTCCACCGGACAAAAGTTCGGAATCATCAACGAGGAGTTCAAAGCGAAGCAGTTCAATCTCCATTAACCCTGGCG
CCATGAGTCTGGCGCTCATCCAAGGCTATTCTTCAGCCGAAGATGAAGCTGAAGACAACTCTCTCCTCGACCGCACCTCTTCCGACGACGACGAAGATCTACCCACCGCC
GCCGCCGCCGCCTCCTCCTCGGCTACCGTCAATCTTTCCATACGCGACAGGTCACTTTTCGAACTTCCACAGCCCTCCTCTCATCCCGGCTTGCCTTCCGCCTTCGACGC
TTTCTCCGAAGTTTCAGGACCGCCGGAGTTTCTAAATAATTCGGTCGAGGAGTACGCTGCAACGAAAGATGTCGATCAGCCGCGAGGGAACCATGGGGGTCGTAGGAATC
GGAAGGAGAAGAAAGATTTGCCTACTGGTGCTGTACTGGAGGCAAAAGCTCAACTAGTTGGGATTCATGAACGAGTAAGGAGTGATGTTGAGAGTAATCAACCATCAAAT
CCATCCATTTCAAACGCAACACAGGAAGGCAAGCGTGTGGCAACTGCAGCCAATCCAAATGCTGAAGACGCTGCAGAGCTACTGAGAATGTGCCTGCATTGTGGCATTCC
CAAGACCTTTTCAAATGCCCGAGGGATGTTTTGCCCTCTATGTGGTGATCGTCCTCCAGAGCCGAACAGCGAGACTAAAAAGAAGGGTTCTACAGTCAAAGATAAGGAAA
AGGTAAAGAGAATGAGGGGACAGTCATCTCATGCTACTTGGAAGAGCGAAACCGAGATGCATCTTAGACAACAGTTTGATTAG
Protein sequenceShow/hide protein sequence
MVGDGHPPPNSAASPHNCNNKPKDKNRKKKKKRRGSNKKMTSDQALAFKYVTEWVFLDRSNSLASSAASSVVDDFGVQKSLGKGGDKVVFELHSHSKFSDGFLSPSKLVE
RAHGNGVKVLALTDHDTMSGIPEAIEAARRFGIKIIPGVEISTIFSTSGNPESEEPVHILAYYSSCGPAKIEKLEKFLENIREGRFLRAKNMVSKLNELKLPLKWDHVAR
ITGKGVAPGRLHVARAMVEAGYVENLKQAFARYLFDGGPAYSTGSEPCAEEAIQMIHDTGGVTVLAHPWALKNPVAVIRRLKDAGLQGLEVYRSDGKLAAYSDLADSYGL
LKLGGSDFHGRGGHSESELGSVNLPVLAMHDFLKLARPIWCGAIRDILESYAEEPSDSNLANITRFGRTRVSKSSSSASSGEDFIDRCLTSWLTNEEKQNAEFEAIRLKL
AHVSMNHQEVQIILLEMLTRNLDMLEVKVCYEIHCSSQTMKKKHNHILSPPDKSSESSTRSSKRSSSISINPGAMSLALIQGYSSAEDEAEDNSLLDRTSSDDDEDLPTA
AAAASSSATVNLSIRDRSLFELPQPSSHPGLPSAFDAFSEVSGPPEFLNNSVEEYAATKDVDQPRGNHGGRRNRKEKKDLPTGAVLEAKAQLVGIHERVRSDVESNQPSN
PSISNATQEGKRVATAANPNAEDAAELLRMCLHCGIPKTFSNARGMFCPLCGDRPPEPNSETKKKGSTVKDKEKVKRMRGQSSHATWKSETEMHLRQQFD