CuGenDBv2

MC01g0430 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0430
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Avr9/Cf-9 rapidly elicited protein
Genome location: MC01:10956123..10957511
RNA-Seq Expression: MC01g0430
Synteny: MC01g0430
Gene Ontology terms: GO:0045927 - positive regulation of growth (biological process)
InterPro domains: IPR007700 - Domain of unknown function DUF668; IPR021864 - Domain of unknown function DUF3475


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0032674.1 Avr9/Cf-9 rapidly elicited protein [Cucumis melo var. makuwa] (e value: 9.07e-247, % identity: 76.51)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIRA+DDIGIG  +  S    
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA +K  VRAKLRR NWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_004142198.1 uncharacterized protein LOC101204955 [Cucumis sativus] (e value: 3.68e-246, % identity: 76.08)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+++NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+S  KNVAKL+ARMEKLV  T+ELHSAME L EME SE+K+QKW++ +PKQ PPVNFE FDKK+A+QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG
        RL+C+IY RI TVFGP V D    L H+P +RILRDRVW WNFYG  +RK     E RLVTQSGPI KKGKKELVRFPSGIRA+DD+GIGY +  S    
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP TTVGGSGLS+NYANVIL AERCLHAPATIG+EARG+LYEMLPA +K  VRAKLRR NWVKRGE  EE+GSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E+M WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_008449997.1 PREDICTED: uncharacterized protein LOC103491711, partial [Cucumis melo] (e value: 3.38e-247, % identity: 76.51)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIRA+DDIGIG  +  S    
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA +K  VRAKLRR NWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_022931864.1 uncharacterized protein LOC111438146 [Cucurbita moschata] (e value: 1.63e-221, % identity: 69.46)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        +PP TL+IL+F  AKTMA L+SL+RSL+D +I  L++H + S+G++YLNS   E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD++FSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIF+ GKS+S  KNVAKL+ +MEKLV ATAELHSAME L+EMEASE+K+QK +   PKQ  P+ F+ FDKK+A+QRKDVKH+KEISLWNQSFD+AVGLMT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGED-RLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAG
        RL+CVIYARIF VF PFVSD    LD +P ++ L +RVW WNF+G +HRK G G ++ +LVTQSGPI K GKKEL+RFPSGIR +++  I Y +  S   
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGED-RLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAG

Query:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRG--EEEMGSGGDGHSLAAGWREAV
           A +NRVYTSAP TTVGGSGLS+NYANVIL AERCL+   TIG++ARG+LY+MLPAR+K  VRAKLRR NW KRG  +EEM S  DGHSLA GWREA+
Subjt:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRG--EEEMGSGGDGHSLAAGWREAV

Query:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        EE+M WLGPLAHDTVRWQSERN+EKQRFD   T LLMQTLHYSDLEK EAAIVEVLVGL CI+RY
Subjt:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_038883935.1 uncharacterized protein LOC120074765 [Benincasa hispida] (e value: 4.40e-249, % identity: 77.97)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++H LRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+S  KNVAKL+ARMEKLV  TAELHSAME+L+EMEASE+K+QKWR+ +PKQ PPVNFE FDKK+A+Q+KDVKH+KEISLWNQSFD+AVGLMT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG
        RL+C IYARI TVFGP   D    + H+P +RILRDRVW WNFYGG +RK GD  E RLVTQSGPI KKGKKELVRFPSGIRA+DDIGIGY +  S  G 
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRG-EEEMGSGGDGHSLAAGWREAVEE
            +NRVYTSAP TTVGGSGLSMNYANVIL AERCL APATIGEEARG+LYEMLPAR+K  VRAKLRR NWVKRG EEEMG G DG+SLAAGWREAVEE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRG-EEEMGSGGDGHSLAAGWREAVEE

Query:  IMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        +M WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  IMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L0Z9 Uncharacterized protein (e value: 1.78e-246, % identity: 76.08)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+++NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+S  KNVAKL+ARMEKLV  T+ELHSAME L EME SE+K+QKW++ +PKQ PPVNFE FDKK+A+QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG
        RL+C+IY RI TVFGP V D    L H+P +RILRDRVW WNFYG  +RK     E RLVTQSGPI KKGKKELVRFPSGIRA+DD+GIGY +  S    
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP TTVGGSGLS+NYANVIL AERCLHAPATIG+EARG+LYEMLPA +K  VRAKLRR NWVKRGE  EE+GSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E+M WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A1S3BMP0 uncharacterized protein LOC103491711 (e value: 1.63e-247, % identity: 76.51)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIRA+DDIGIG  +  S    
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA +K  VRAKLRR NWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A5D3DIU8 Avr9/Cf-9 rapidly elicited protein (e value: 4.39e-247, % identity: 76.51)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIRA+DDIGIG  +  S    
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA +K  VRAKLRR NWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A6J1EVF1 uncharacterized protein LOC111438146 (e value: 7.91e-222, % identity: 69.46)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        +PP TL+IL+F  AKTMA L+SL+RSL+D +I  L++H + S+G++YLNS   E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD++FSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIF+ GKS+S  KNVAKL+ +MEKLV ATAELHSAME L+EMEASE+K+QK +   PKQ  P+ F+ FDKK+A+QRKDVKH+KEISLWNQSFD+AVGLMT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGED-RLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAG
        RL+CVIYARIF VF PFVSD    LD +P ++ L +RVW WNF+G +HRK G G ++ +LVTQSGPI K GKKEL+RFPSGIR +++  I Y +  S   
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGED-RLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAG

Query:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRG--EEEMGSGGDGHSLAAGWREAV
           A +NRVYTSAP TTVGGSGLS+NYANVIL AERCL+   TIG++ARG+LY+MLPAR+K  VRAKLRR NW KRG  +EEM S  DGHSLA GWREA+
Subjt:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRG--EEEMGSGGDGHSLAAGWREAV

Query:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        EE+M WLGPLAHDTVRWQSERN+EKQRFD   T LLMQTLHYSDLEK EAAIVEVLVGL CI+RY
Subjt:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A6J1GEN0 uncharacterized protein LOC111453484 (e value: 1.45e-219, % identity: 69.4)
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        SPP TL ILSF  AKTMA L+SL+RSLSD++I  L++  +RSQG+ YLNS + + FLL LACSERLE+L+NAASSVSRLS+KCADLGL+RFD++FS+MK 
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIF+S KS S  KNVAKL+ RMEKLV +TAELHS+MEAL EMEASE+K+Q WR+ +P Q PPVNFE   +K+A QRKDVKH KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAG
        RLICVIYARI TVF P+V + D    H+P   I RDR W WNFYGG HRK G G GE +  TQSGPI K+GKKELVRFPS IR  D  G    +L S   
Subjt:  RLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAG

Query:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWV-KRGEEEMGSGGDGHSLAAGWREAVE
             +NRVYTSAP TTVGG+GLS+NYANVIL AERCLH+P TIGEEARG  YEMLPAR+K  +RAKLRR NW+ K GEE+MGS  D  SLAAGW+EAVE
Subjt:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWV-KRGEEEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        ++M WLGPLAHDT+RWQSERN+EKQRFD SPT LLMQTLHYSDLEK +AAIVE+LVGLSCI+++
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

SwissProt top hits (columns: e value, % identity, alignment)
P0DO24 Protein PSK SIMULATOR 3 (e value: 6.1e-05, % identity: 24.83)
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GL+++YAN+I+  +  +   ++I   AR  LY+ LP  +K  +R+K++  N             D        ++ +E  + WL P+A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  ---WQSERNLEKQRFDASPTA---LLMQTLHYSDLEKAEAAIVEVLVGL
           W  E       F + P+    L ++TL+++  EK E  I+  ++ L
Subjt:  ---WQSERNLEKQRFDASPTA---LLMQTLHYSDLEKAEAAIVEVLVGL

Q9SA91 Protein PSK SIMULATOR 2 (e value: 2.8e-05, % identity: 27.1)
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GLS++YAN+I   +     P+++    R  LY  LPA VK  +R +L+  +     EEE+             +  +E+ + WL P A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  ---WQSE---------RNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL
           W  E         +   K   + +PT L  QTLH++D    ++ ++E++V L
Subjt:  ---WQSE---------RNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL

Q9XID5 Protein PSK SIMULATOR 1 (e value: 6.6e-07, % identity: 27.33)
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GL+++YAN+I   +  +   +T+    R  LY+ LP  +K  +R++++      + +EE+             +  +E+ + WL P+A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL
               W S  +   QR  A  T L + TLH++D EK EA I++++V L
Subjt:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G30755.1 Protein of unknown function (DUF668) (e value: 2.0e-06, % identity: 27.1)
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GLS++YAN+I   +     P+++    R  LY  LPA VK  +R +L+  +     EEE+             +  +E+ + WL P A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  ---WQSE---------RNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL
           W  E         +   K   + +PT L  QTLH++D    ++ ++E++V L
Subjt:  ---WQSE---------RNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL

AT1G34320.1 Protein of unknown function (DUF668) (e value: 4.7e-08, % identity: 27.33)
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GL+++YAN+I   +  +   +T+    R  LY+ LP  +K  +R++++      + +EE+             +  +E+ + WL P+A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL
               W S  +   QR  A  T L + TLH++D EK EA I++++V L
Subjt:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL

AT3G23160.1 Protein of unknown function (DUF668) (e value: 7.8e-64, % identity: 35.66)
Query:  SSPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMK
        S P  T+ ILSF  A  M+  + LHRSLSD +I  LK     S+G+  L S+  EN LL L+ SE+L+ L   AS VSRL +KC +  L  F+ ++ D+ 
Subjt:  SSPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMK

Query:  LGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLM
         G     K     K++  +V +ME+ V+AT  L+  ME + E+E +  K+Q+ + +        + + F++K+  QR+DVK  ++ SLWNQ++D  V ++
Subjt:  LGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLM

Query:  TRLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRA----------------
         R +C IY RI TVFG         L     V + RDR  +        R    G +D   +++    + G      FP G                   
Subjt:  TRLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRA----------------

Query:  --EDDIGIGYRQLKSPAGGEAAPSN------RVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKR
          +DD   G R     +      SN      R+   A A+T+GGS LS++YANV++V E+ L  P  IGEEAR DLY+MLP  +K  ++A LR  +++K 
Subjt:  --EDDIGIGYRQLKSPAGGEAAPSN------RVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKR

Query:  GEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRFDASPT-ALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
                     LA  W+E ++ I++WL PLAH+ +RWQSERN E+Q      T  LL+QTL+++D EK EAAI ++LVGL+ I  Y
Subjt:  GEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRFDASPT-ALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

AT5G04550.1 Protein of unknown function (DUF668) (e value: 2.0e-43, % identity: 26.9)
Query:  ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSD-MKLGI
        A L +L+F  A  ++ L+ L +SLSD  +  L+D    S G+  L S   ++F++ L   E +E ++N A +V+RL+RKC D  L  F+  FSD MK G 
Subjt:  ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSD-MKLGI

Query:  FHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMTRL
           G      K + K   +ME+ +S+ A L+   E L ++E + ++++   S         N  ++ KK+  +R +VK+ +++SLWN+++D+ V L+ R 
Subjt:  FHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMTRL

Query:  ICVIYARIFTVFG--------PFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHR---------------------------------------------
        +  I +R   VFG           S D  ++     V  +   V   +   G  R                                             
Subjt:  ICVIYARIFTVFG--------PFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHR---------------------------------------------

Query:  -----KYGDGGEDRLVTQSGPIAKKGKK--------------------------ELVRFPSGIRAEDDIGIGYRQLKSPAGGEAA---------------
             K+  G   ++ ++SGP+   GK                           ++  F   + + D I     + ++ A   +A               
Subjt:  -----KYGDGGEDRLVTQSGPIAKKGKK--------------------------ELVRFPSGIRAEDDIGIGYRQLKSPAGGEAA---------------

Query:  ----PSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLR--RKNWVKRGEEEMGSGGDGHSLAAGWREAV
            PS    + A   T+G + L+++YANVI+V ER + +P  IG++AR DLY MLPA V+  +R +L+   KN       + G       LA  W +A+
Subjt:  ----PSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLR--RKNWVKRGEEEMGSGGDGHSLAAGWREAV

Query:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
          I+ WLGPLAH+ ++WQSER+ E Q   +    +L QTL +++ +K EA I E+LVGL+ ++R+
Subjt:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

AT5G51670.1 Protein of unknown function (DUF668) (e value: 1.0e-47, % identity: 31.39)
Query:  SSPP-----ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDIL
        SSPP     +++ +LSF  A+ M  LL L  SL+D  + T +DH+L  +GL  +     E F L+L C+E  + L +AA+SVSRLS +C    L  F  L
Subjt:  SSPP-----ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDIL

Query:  FSDM-KLGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERK--------IQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEI
        F +   +G    G   +C    AK   ++E+ VS T  L+  ME +  +E S RK         ++   Y  K+   +       KI  Q++ VK+ K+ 
Subjt:  FSDM-KLGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERK--------IQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEI

Query:  SLWNQSFDHAVGLMTRLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAED
        SLWN+SFD  V ++ R +    AR+ +VF    +    Y+       + R    S +     H    D   D+  T S  + +                 
Subjt:  SLWNQSFDHAVGLMTRLICVIYARIFTVFGPFVSDDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAED

Query:  DIGIGYRQLKSPAGGEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGD
              R LK                 P TT+GG+G++++YAN+I+V E+ +  P  +G +AR DLY MLPA V+  +R++L+           +G    
Subjt:  DIGIGYRQLKSPAGGEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGD

Query:  GHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRF----DASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
           LA  W+ A+  I+ WL PLA + +RWQSER+ E+Q      ++    +L+QTL ++D  K EAAI E+LVGL+ I+R+
Subjt:  GHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRF----DASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY


Sequences
CDS sequence
TCCTCTCCCCCGGCCACCCTCCAAATCCTCTCCTTCCACGCCGCCAAAACCATGGCCACTCTCCTCTCCCTCCACCGCTCCCTCTCCGACGACCAAATCCGAACCCTCAA
AGACCACACCCTCCGCTCCCAGGGCCTCCTCTACTTGAATTCCACCCGCCACGAAAACTTCCTCCTTACCCTCGCCTGCTCCGAGCGCCTCGAGCAGCTCGACAATGCCG
CCTCCTCTGTCTCCCGCCTCAGCCGCAAGTGCGCCGATTTGGGCCTCTCCCGGTTCGACATCCTCTTCTCCGACATGAAACTCGGGATCTTCCATTCTGGGAAATCCGAG
TCCTGCTCCAAGAATGTGGCCAAGCTCGTCGCCAGGATGGAGAAGCTTGTTTCGGCCACCGCCGAGCTCCACTCTGCCATGGAAGCTCTGCTCGAGATGGAGGCCTCCGA
GAGGAAGATTCAGAAGTGGAGATCCTACGCCCCTAAGCAGTGCCCCCCGGTCAATTTCGAGCAATTTGATAAAAAAATCGCAGCCCAGAGGAAAGATGTGAAGCATTACA
AAGAAATTTCGCTATGGAACCAGAGTTTCGATCACGCCGTCGGGCTCATGACCCGCCTGATTTGCGTCATCTACGCGAGGATTTTCACCGTCTTCGGCCCGTTCGTTTCC
GACGATGATTTTTATCTGGATCACGACCCGCACGTTCGGATCCTCCGGGACCGGGTTTGGAGCTGGAATTTCTATGGCGGGCAGCATCGGAAGTACGGCGACGGCGGGGA
AGACAGGCTCGTGACGCAATCGGGCCCAATCGCGAAGAAAGGGAAGAAGGAATTGGTCCGATTTCCGAGTGGGATTAGAGCGGAAGACGATATTGGAATTGGGTACCGGC
AATTGAAATCGCCGGCGGGGGGAGAAGCGGCGCCGAGTAACAGAGTATACACATCCGCGCCGGCAACGACGGTGGGCGGGTCCGGACTGTCGATGAACTACGCGAATGTG
ATACTGGTGGCGGAGAGGTGTCTGCACGCGCCGGCGACGATAGGGGAGGAGGCGCGTGGGGATCTGTACGAGATGCTGCCGGCGAGAGTAAAAGGGATGGTGAGGGCGAA
GTTGAGGAGGAAAAATTGGGTGAAGAGAGGGGAGGAGGAAATGGGGAGCGGCGGCGACGGGCATTCGCTGGCGGCGGGGTGGAGGGAGGCGGTGGAGGAGATTATGGCGT
GGTTGGGGCCGCTGGCGCACGACACTGTGCGGTGGCAATCGGAGAGAAATTTAGAGAAACAGAGGTTCGATGCGAGTCCCACGGCGCTGCTGATGCAGACGCTGCATTAT
TCCGACTTGGAGAAGGCGGAGGCGGCCATTGTCGAGGTTCTGGTGGGCCTCAGTTGTATATTTAGGTAC
mRNA sequence
TCCTCTCCCCCGGCCACCCTCCAAATCCTCTCCTTCCACGCCGCCAAAACCATGGCCACTCTCCTCTCCCTCCACCGCTCCCTCTCCGACGACCAAATCCGAACCCTCAA
AGACCACACCCTCCGCTCCCAGGGCCTCCTCTACTTGAATTCCACCCGCCACGAAAACTTCCTCCTTACCCTCGCCTGCTCCGAGCGCCTCGAGCAGCTCGACAATGCCG
CCTCCTCTGTCTCCCGCCTCAGCCGCAAGTGCGCCGATTTGGGCCTCTCCCGGTTCGACATCCTCTTCTCCGACATGAAACTCGGGATCTTCCATTCTGGGAAATCCGAG
TCCTGCTCCAAGAATGTGGCCAAGCTCGTCGCCAGGATGGAGAAGCTTGTTTCGGCCACCGCCGAGCTCCACTCTGCCATGGAAGCTCTGCTCGAGATGGAGGCCTCCGA
GAGGAAGATTCAGAAGTGGAGATCCTACGCCCCTAAGCAGTGCCCCCCGGTCAATTTCGAGCAATTTGATAAAAAAATCGCAGCCCAGAGGAAAGATGTGAAGCATTACA
AAGAAATTTCGCTATGGAACCAGAGTTTCGATCACGCCGTCGGGCTCATGACCCGCCTGATTTGCGTCATCTACGCGAGGATTTTCACCGTCTTCGGCCCGTTCGTTTCC
GACGATGATTTTTATCTGGATCACGACCCGCACGTTCGGATCCTCCGGGACCGGGTTTGGAGCTGGAATTTCTATGGCGGGCAGCATCGGAAGTACGGCGACGGCGGGGA
AGACAGGCTCGTGACGCAATCGGGCCCAATCGCGAAGAAAGGGAAGAAGGAATTGGTCCGATTTCCGAGTGGGATTAGAGCGGAAGACGATATTGGAATTGGGTACCGGC
AATTGAAATCGCCGGCGGGGGGAGAAGCGGCGCCGAGTAACAGAGTATACACATCCGCGCCGGCAACGACGGTGGGCGGGTCCGGACTGTCGATGAACTACGCGAATGTG
ATACTGGTGGCGGAGAGGTGTCTGCACGCGCCGGCGACGATAGGGGAGGAGGCGCGTGGGGATCTGTACGAGATGCTGCCGGCGAGAGTAAAAGGGATGGTGAGGGCGAA
GTTGAGGAGGAAAAATTGGGTGAAGAGAGGGGAGGAGGAAATGGGGAGCGGCGGCGACGGGCATTCGCTGGCGGCGGGGTGGAGGGAGGCGGTGGAGGAGATTATGGCGT
GGTTGGGGCCGCTGGCGCACGACACTGTGCGGTGGCAATCGGAGAGAAATTTAGAGAAACAGAGGTTCGATGCGAGTCCCACGGCGCTGCTGATGCAGACGCTGCATTAT
TCCGACTTGGAGAAGGCGGAGGCGGCCATTGTCGAGGTTCTGGTGGGCCTCAGTTGTATATTTAGGTAC
Protein sequence
SSPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKLGIFHSGKSE
SCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMTRLICVIYARIFTVFGPFVS
DDDFYLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRAEDDIGIGYRQLKSPAGGEAAPSNRVYTSAPATTVGGSGLSMNYANV
ILVAERCLHAPATIGEEARGDLYEMLPARVKGMVRAKLRRKNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHY
SDLEKAEAAIVEVLVGLSCIFRY
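
The protein sequence above appears to be the translation of the CDS sequence in reading frame 1 under the standard genetic code. The following is a minimal sketch of that relationship, not the database's own pipeline: the function name `translate` and the codon-table construction are illustrative assumptions, and only the first 30 bases of the CDS are used as input.

```python
# Minimal sketch (assumption: standard genetic code, reading frame 1).
# `translate` is a hypothetical helper, not part of any CuGenDB tool.
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the CDS sequence listed above.
cds_start = "TCCTCTCCCCCGGCCACCCTCCAAATCCTC"
print(translate(cds_start))
```

The printed residues can be compared with the start of the protein sequence listed above (SSPPATLQIL...).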