CuGenDBv2

Moc03g23930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g23930
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF3537)
Genome location: chr3:16950494..16959373
RNA-Seq Expression: Moc03g23930
Synteny: Moc03g23930
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008700 - RIN4, pathogenic type III effector avirulence factor Avr cleavage site
                  IPR021924 - Protein of unknown function DUF3537
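
The genome location above is written as `chromosome:start..end`. A minimal sketch of reading that field follows; the function names are illustrative, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chr3:16950494..16959373' style string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end = parse_location("chr3:16950494..16959373")
print(chrom, start, end, span_length(start, end))  # chr3 16950494 16959373 8880
```

Under that coordinate assumption, the locus spans 8880 bp on chr3.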


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6573486.1 hypothetical protein SDJN03_27373, partial [Cucurbita argyrosperma subsp. sororia] (e-value 7.7e-176, 75.23% identity)
Query:  MEDEKKK----ASSQIDSQQHQLLDPEKSEAAAESESRSA----FESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF
        MED +KK    + S   S     ++  KSE   ESES SA    FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH RPF
Subjt:  MEDEKKK----ASSQIDSQQHQLLDPEKSEAAAESESRSA----FESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF

Query:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC
        HVVVQLSLSA ATLSF CLS WLR  GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAAN+IPYYG NMY+SYITSC
Subjt:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC

Query:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA
         LEL SWLYRTSIFFFVCILFRLVCRLQM+ LEDF   FHRE++VG IL QHLGLRRTL ++SHRFRVFM LSL+LVTASQFISLLMTTRS+A  NLSK 
Subjt:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA

Query:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALV
        GQLALCS+SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A INTFDDLD ETPTA+ + +   ++ DDE ED+DD DD KLMPVFAHTISFQKRQALV
Subjt:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALV

Query:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
         YL+NNK GITVYGF+VDRTWLKS+FAIELAL LWLLNKT
Subjt:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

XP_008458982.1 PREDICTED: uncharacterized protein LOC103498231 [Cucumis melo] (e-value 3.6e-181, 77.73% identity)
Query:  DEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA
        +E+KK+  Q+DSQ+ +  + E SE+  E+     FES+LKWIC  I+D SN +RAS+SC VFFV  IAVP+ASHFAL+C++CDEDH RPFHVVVQLSLSA
Subjt:  DEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA

Query:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR
         ATLSF CLS WLRL+GL RFLFLDKL EA+ K+R EY +QLQRSMEL+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISYITSC LELCSWLYR
Subjt:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR

Query:  TSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL
        TSIFFFVCILFRL+C LQM+ LEDF   F  ETEVG IL QHLGLRRT  I+SHRFRVFMLLSL+LVTASQFISLLMTTRS AHVNLSKAGQLALCS+SL
Subjt:  TSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL

Query:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG
        VTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFDDLD ET PTA+ V + + SNSDDE  D+DDLDD KLMPVFAHTISFQKRQALV YLRNNKAG
Subjt:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        ITVYGFMVDRTWLKSIFAIELAL LWLLNKT
Subjt:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKT
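
In these alignment blocks the unlabeled middle line marks matches between query and subject. Under the usual BLAST convention (assumed here, not stated on the page), a letter means an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A minimal sketch of counting identities from that middle line, using the final block of the alignment above; the function name is illustrative:

```python
def midline_identity(query, midline):
    """Return (identical, aligned_length, percent) for one alignment block."""
    # Scraped midlines may have lost trailing spaces; pad to the query length.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    # Assumption: a letter on the midline marks an identical residue.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query), 100.0 * identical / len(query)

query   = "ITVYGFMVDRTWLKSIFAIELALVLWLLNKT"
midline = "ITVYGFMVDRTWLKSIFAIELAL LWLLNKT"
identical, length, pct = midline_identity(query, midline)
print(identical, length, round(pct, 2))  # 30 31 96.77
```

The percentage reported for a hit covers the whole alignment, so a single block such as this one will generally not reproduce it.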

XP_011660297.1 uncharacterized protein LOC101203162 [Cucumis sativus] (e-value 1.2e-176, 75.17% identity)
Query:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        ME  +KK+  QIDSQ+ +  + E    A E       ES+L+WIC  I+D SN +RASVSC +FFV  IAVP+ASHF L+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA ATLSF CLS WLR++GL RFLFLDKLCEA+ K+R EY +QLQ+SM+L+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISY+TSC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVCI FRL+C LQM+ LEDF  +F  ETEVG IL QHLGLRRT  ++SHRFRVFMLLSL+LVTASQFISLLMTTRS AH NLSK+GQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--YEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRN
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFD+LD E TPTA+ V + + SNSDDE   ED+DDLDDAKLMPVFAHTISFQKRQALV YLRN
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--YEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRN

Query:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        NKAGITVYGFMVDRTWLKSIFAIELAL LWLLNKT
Subjt:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

XP_022954540.1 uncharacterized protein LOC111456780 [Cucurbita moschata] (e-value 1.5e-176, 74.77% identity)
Query:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF
        MED +KK+ S   S     ++  KSE   ESES S         FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH RPF
Subjt:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF

Query:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC
        HVVVQLSLSA ATLSF CLS WLR  GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAAN+IPYYG NMY+SYITSC
Subjt:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC

Query:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA
         LEL SWLYRTSIFFFVCILFRLVCRLQM+ LEDF   FHRE++VG IL QHLGLRRTL  +SHRFRVFM LSL+LVTASQFISLLMTTRS+A  NLSK 
Subjt:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA

Query:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALV
        GQLALCS+SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A INTFDDLD ETPTA+ + +   ++ DDE +D+DD DD KLMPVFAHTISFQKRQALV
Subjt:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALV

Query:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
         YL+NNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKT
Subjt:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida] (e-value 2.0e-179, 76.39% identity)
Query:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        ME E+KK+  QIDSQ  +L    +SE  +E+     F+S LKWIC  I D SNP+RAS+SC VFFV AIAVP+ASHFAL+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA ATLSF CLS WLR +GL RFLFLDKL EA+ +VR EY +QLQRSM L+ FFLLPCF AEA YK+WWY+SAA +IPYY NN+YISYI SC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVCILFRL+C LQM+ LEDF   FHRE EVG IL QHL LRRT  I+SHRFR F+LLSL+LVTASQFISLLMTTRS AHVNLSKAGQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKA
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFDDLD ETP  A ++S IV ++ DE  D+DDLDDAKLMPVFAHTISFQKRQALV YLRNNKA
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKA

Query:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        GITVYGF VDRTWLKSIFAIELAL LWLLNKT
Subjt:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKT

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0M3M6 Uncharacterized protein (e-value 5.7e-177, 75.17% identity)
Query:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        ME  +KK+  QIDSQ+ +  + E    A E       ES+L+WIC  I+D SN +RASVSC +FFV  IAVP+ASHF L+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA ATLSF CLS WLR++GL RFLFLDKLCEA+ K+R EY +QLQ+SM+L+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISY+TSC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVCI FRL+C LQM+ LEDF  +F  ETEVG IL QHLGLRRT  ++SHRFRVFMLLSL+LVTASQFISLLMTTRS AH NLSK+GQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--YEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRN
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFD+LD E TPTA+ V + + SNSDDE   ED+DDLDDAKLMPVFAHTISFQKRQALV YLRN
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--YEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRN

Query:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        NKAGITVYGFMVDRTWLKSIFAIELAL LWLLNKT
Subjt:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

A0A1S3C949 uncharacterized protein LOC103498231 (e-value 1.7e-181, 77.73% identity)
Query:  DEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA
        +E+KK+  Q+DSQ+ +  + E SE+  E+     FES+LKWIC  I+D SN +RAS+SC VFFV  IAVP+ASHFAL+C++CDEDH RPFHVVVQLSLSA
Subjt:  DEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA

Query:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR
         ATLSF CLS WLRL+GL RFLFLDKL EA+ K+R EY +QLQRSMEL+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISYITSC LELCSWLYR
Subjt:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR

Query:  TSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL
        TSIFFFVCILFRL+C LQM+ LEDF   F  ETEVG IL QHLGLRRT  I+SHRFRVFMLLSL+LVTASQFISLLMTTRS AHVNLSKAGQLALCS+SL
Subjt:  TSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL

Query:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG
        VTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFDDLD ET PTA+ V + + SNSDDE  D+DDLDD KLMPVFAHTISFQKRQALV YLRNNKAG
Subjt:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        ITVYGFMVDRTWLKSIFAIELAL LWLLNKT
Subjt:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKT

A0A6J1EG16 uncharacterized protein LOC111433990 (e-value 2.5e-172, 74.02% identity)
Query:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        M++  KK+ S +DSQ+        SE+  E+      ES+LKWIC  I DQSNP+RAS+SC +FF+ AIAVPLASHFAL+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA A LSF  LS WLRL+G  RFLFLDKL +A+ +VR EYS+QLQRS ELIC F++PCF AEAAYK+WWY++AA QIPYY NNMY+SYITSC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVC+LFRL+C LQM+ LEDF   FHRET+VG IL  HLGLRRT  I+SHRFR F+LLSL+LVTASQFISLLMTT + AHVNLSKAGQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--YEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRN
        SLVTG+FICLRSAAKITHKAQSITCLAAKWH++A+++TFDDLD + TPTAA     I SNSDDE    D+DDLDDAKLMPVFA TISFQKRQALV+YLRN
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--YEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRN

Query:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        NKAGITVYGFMVDRTWLKSIFAIELAL LWLLNKT
Subjt:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

A0A6J1GSP8 uncharacterized protein LOC111456780 (e-value 7.5e-177, 74.77% identity)
Query:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF
        MED +KK+ S   S     ++  KSE   ESES S         FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH RPF
Subjt:  MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF

Query:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC
        HVVVQLSLSA ATLSF CLS WLR  GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAAN+IPYYG NMY+SYITSC
Subjt:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC

Query:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA
         LEL SWLYRTSIFFFVCILFRLVCRLQM+ LEDF   FHRE++VG IL QHLGLRRTL  +SHRFRVFM LSL+LVTASQFISLLMTTRS+A  NLSK 
Subjt:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA

Query:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALV
        GQLALCS+SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A INTFDDLD ETPTA+ + +   ++ DDE +D+DD DD KLMPVFAHTISFQKRQALV
Subjt:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALV

Query:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
         YL+NNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKT
Subjt:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

A0A6J1K314 uncharacterized protein LOC111490209 (e-value 4.9e-176, 73.65% identity)
Query:  MEDEKKK------------ASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDH
        MED +KK            + SQI+S++ +L    +SE+ ++S     FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH
Subjt:  MEDEKKK------------ASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDH

Query:  SRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISY
         RPFHVVVQLSLSA ATLSF CLS WLR +GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAA +IPYYG NMY+SY
Subjt:  SRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISY

Query:  ITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVN
        ITSC LEL SWLYRTSIFFFVCILFRLVCRLQM+ LEDF   FHRE++VG IL QHLGLRRTL I+SHRFRVFM LSL+LVTASQFI LLMTTRS+A  N
Subjt:  ITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVN

Query:  LSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKR
        LSK GQLALCS+SLVTG+FICLRSAAKI+HKAQSITCLAAKWHV+A INTFDDLD ETPT + + +   ++ DDE +D+DD DD KLMPVFAHTISFQKR
Subjt:  LSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKR

Query:  QALVVYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        QALV YLRNNKAGITVYGF+VDRTWLKS+FAIELALVLWLLNKT
Subjt:  QALVVYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

SwissProt top hits (e-value, % identity, alignment)
O22633 Protein NOI4 (e-value 3.9e-05, 56.76% identity)
Query:  VPKFGSWDARDPKSGDGYTAIFNKVKMEKKIGGSNSS
        +PKFG WD  DP S +G+T IFNK + EKK GG   S
Subjt:  VPKFGSWDARDPKSGDGYTAIFNKVKMEKKIGGSNSS

Q8GYN5 RPM1-interacting protein 4 (e-value 1.9e-15, 34.72% identity)
Query:  SHVPKFGNWDTDD-VPYTICFENAGELRAAG-VTFDPNDPDTYPPEAFPAANHRSNPSDRRRHHDHHRRPREQPNRDRRSSGSEKLSSERSGSDYSLLKE
        S+VPKFGNW+ ++ VPYT  F+ A + RA G    +PNDP+        A  H  +   +    D  RR RE   R R  S  ++       S+ +  K 
Subjt:  SHVPKFGNWDTDD-VPYTICFENAGELRAAG-VTFDPNDPDTYPPEAFPAANHRSNPSDRRRHHDHHRRPREQPNRDRRSSGSEKLSSERSGSDYSLLKE

Query:  PGGRRKKINNGVDGISRFPPTAAGHGGNPNPNPRARR--NEGEMVASVPKFGSWDARDPKSGDGYTAIFNKVKMEKKIGGSNSSQAVPPLTNQTKQQPIP
         G  R   NN  D  S     +    G   P P   R     E V  VPKFG WD  +P S DGYT IFNKV+ E+  G + S  +  P T+Q+ + P  
Subjt:  PGGRRKKINNGVDGISRFPPTAAGHGGNPNPNPRARR--NEGEMVASVPKFGSWDARDPKSGDGYTAIFNKVKMEKKIGGSNSSQAVPPLTNQTKQQPIP

Query:  QIVHGSTSFVSKVCCC
             +TS     CCC
Subjt:  QIVHGSTSFVSKVCCC

Arabidopsis top hits (e-value, % identity, alignment)
AT1G50630.1 Protein of unknown function (DUF3537) (e-value 1.7e-117, 52.02% identity)
Query:  ESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKL
        + +   +F  YL+W+C   VD S+PW A +S  +F V  + VP  SHF LACA+CD  HSRP+  VVQLSLS+ AT+SF CL+ ++  YGLRRFLF DKL
Subjt:  ESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKL

Query:  CEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGR
         + +  VR  Y+ QL  S+ ++ +F++PCF+A +AYK+WWY S  ++IP+ GN + +S   +C +ELCSWLYRT++ F VC+LFRL+C LQ++ L+DF +
Subjt:  CEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVCLEDFGR

Query:  AFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAK
         F  +++VG+IL +HL +RR LRI+SHR+R F+L  L+LVT SQF SLL+TT++   VN+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAK
Subjt:  AFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAK

Query:  WHVAAIINTFD------DLDGETPT-AAR----------VMSTIVSNSDDEYEDDDDLDDAKLMPVFA-HTISFQKRQALVVYLRNNKAGITVYGFMVDR
        WHV A + +FD      D   ETPT  AR          V++   S+SD+  +++DDLD+  ++PV+A  T+SFQKRQALV Y  NN AGITVYGF +DR
Subjt:  WHVAAIINTFD------DLDGETPT-AAR----------VMSTIVSNSDDEYEDDDDLDDAKLMPVFA-HTISFQKRQALVVYLRNNKAGITVYGFMVDR

Query:  TWLKSIFAIELALVLWLLNKT
          L +IF +EL+LVLWLL KT
Subjt:  TWLKSIFAIELALVLWLLNKT

AT3G20300.1 Protein of unknown function (DUF3537) (e-value 4.4e-121, 53.33% identity)
Query:  KSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRF
        +S + A+ E  S F  YL+W+C   VDQS+PW A +S  +F V  + VP  SHF LAC++CD  HSRP+  VVQLSLS+ A LSF CLS ++  YGLRRF
Subjt:  KSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRF

Query:  LFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVC
        LF DKL + +  VR  Y+ QL RS++++ +F+ PCF A ++YK+WWY S A+QIP+ G N+ +S   +C +ELCSWLYRT++ F VC+LFRL+C LQ++ 
Subjt:  LFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVC

Query:  LEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSI
        L+DF + F  +++VG+IL +HL +RR LRI+SHR+R F+LLSL+LVT SQF SLL+TT++ A +N+ +AG+LALCS++LVT + I LRSA+KITHKAQ++
Subjt:  LEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSI

Query:  TCLAAKWHVAAIINTFDDLDGETPTAARVMS----------TIVSNSDDEYEDDDDLDDAKLMPVFAH-TISFQKRQALVVYLRNNKAGITVYGFMVDRT
        TCLAAKWHV A I +F+ +DGETP      S             S+S+D  +++DD D+  L+P +A+ TISFQKRQALV Y  NN++GITV+GF +DR+
Subjt:  TCLAAKWHVAAIINTFDDLDGETPTAARVMS----------TIVSNSDDEYEDDDDLDDAKLMPVFAH-TISFQKRQALVVYLRNNKAGITVYGFMVDRT

Query:  WLKSIFAIELALVLWLLNKT
         L +IF IE++LVLWLL KT
Subjt:  WLKSIFAIELALVLWLLNKT

AT4G03820.1 Protein of unknown function (DUF3537) (e-value 8.6e-109, 49.54% identity)
Query:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS
        +S   Q+L+    P+  +   ES  ++ +F     W      DQSN  +  +S  +FF+LA+ VP+ SHF L CA+CD  H RP+  +VQLSLS  A +S
Subjt:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS

Query:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF
        F  LS W + YG+RRFLF DKL + + KVR  Y  ++QRSM+L+  F+LP  T +A Y++WWY S  NQIPY   N  +S++ +C L+L SWLYRTS+F 
Subjt:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF

Query:  FVCILFRLVCRLQMVCLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI
          CIL++ +C LQ++ L++F R F  E  +  +IL +HL +RR L+IVSHRFR F+LLSL  VTA+QF++LL T R++   N+ + G+LALCS SLV+G+
Subjt:  FVCILFRLVCRLQMVCLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI

Query:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLR
        FICL+SA ++THKAQS+T +A KW+V A ++TFD L DGETP        ++++S    +V +SDD+ E + D +D ++ P+FA  IS QKRQALV YL 
Subjt:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLR

Query:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        NN+AGITVYGF+VD+TWL+ IF+IELAL+LWLL KT
Subjt:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

AT4G03820.2 Protein of unknown function (DUF3537) (e-value 8.6e-109, 49.54% identity)
Query:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS
        +S   Q+L+    P+  +   ES  ++ +F     W      DQSN  +  +S  +FF+LA+ VP+ SHF L CA+CD  H RP+  +VQLSLS  A +S
Subjt:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS

Query:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF
        F  LS W + YG+RRFLF DKL + + KVR  Y  ++QRSM+L+  F+LP  T +A Y++WWY S  NQIPY   N  +S++ +C L+L SWLYRTS+F 
Subjt:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF

Query:  FVCILFRLVCRLQMVCLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI
          CIL++ +C LQ++ L++F R F  E  +  +IL +HL +RR L+IVSHRFR F+LLSL  VTA+QF++LL T R++   N+ + G+LALCS SLV+G+
Subjt:  FVCILFRLVCRLQMVCLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI

Query:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLR
        FICL+SA ++THKAQS+T +A KW+V A ++TFD L DGETP        ++++S    +V +SDD+ E + D +D ++ P+FA  IS QKRQALV YL 
Subjt:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLR

Query:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        NN+AGITVYGF+VD+TWL+ IF+IELAL+LWLL KT
Subjt:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

AT4G22270.1 Protein of unknown function (DUF3537) (e-value 9.5e-124, 54.5% identity)
Query:  SQQHQLLDPEKSEAAAESE------SRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS
        S +H L +  +  A    +      SR  F S + W      DQSN   A +S  VFF+L + VPL SHF L C++CD  H RP+ V+VQLSLS  A +S
Subjt:  SQQHQLLDPEKSEAAAESE------SRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS

Query:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF
        F  LS W R +G+RRFLFLDKL + + KVR EY  ++QRS++ +  F+LP  T EA Y++WWY+S  NQIPY  N + +S++ +C L+L SWLYR S+F 
Subjt:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF

Query:  FVCILFRLVCRLQMVCLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI
         VCIL+++ C LQ + L+DF R F  E T+V + L +H  +RR LRIVSHRFR F+LLSL+LVTA+QF++LL TTR++  VN+ + G+LALCS+SLVTG+
Subjt:  FVCILFRLVCRLQMVCLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI

Query:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMS-------TIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNK
        FICLRSA KITHKAQS+T LAAKW+V A +++FD LDGETPT + + S        I ++ D+E E DDDLD+ K+ P++A+TIS+QKRQALV YL NNK
Subjt:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMS-------TIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNK

Query:  AGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        AGITVYGF+VDR+WL +IF IELAL+LWLLNKT
Subjt:  AGITVYGFMVDRTWLKSIFAIELALVLWLLNKT


Sequences
CDS sequence
ATGGAAGACGAGAAGAAGAAAGCCTCGTCTCAAATCGATTCGCAACAGCATCAATTGCTCGATCCGGAAAAATCAGAAGCAGCAGCCGAATCCGAATCGAGAAGC
GCCTTCGAATCTTACCTGAAATGGATTTGCAGAATCATCGTGGATCAGTCGAATCCATGGCGCGCTTCGGTCTCCTGCTTCGTGTTCTTCGTCTTAGCAATCGCC
GTCCCCCTCGCCTCGCACTTCGCTCTCGCTTGCGCCAATTGCGACGAAGATCACAGCAGGCCTTTCCATGTTGTCGTCCAGCTCTCGCTCTCCGCCGCCGCGACG
CTCTCCTTCGCCTGCCTCTCTTTTTGGCTCCGTCTCTACGGATTGAGGCGGTTTCTGTTCCTCGATAAGCTCTGTGAAGCGAATCGGAAGGTTCGGGATGAGTAT
TCGAAGCAATTGCAGAGATCAATGGAGCTGATCTGCTTCTTCCTCCTCCCATGTTTCACGGCAGAAGCAGCGTACAAAGTGTGGTGGTACGTCTCAGCAGCCAAC
CAAATCCCATACTACGGCAACAACATGTACATCAGCTACATCACCAGCTGTGCACTGGAGCTCTGCTCCTGGCTCTACAGAACCTCCATCTTCTTCTTCGTCTGC
ATCCTCTTCCGCCTCGTCTGCCGCCTCCAGATGGTGTGCCTCGAAGATTTCGGGCGCGCCTTCCATCGGGAGACGGAGGTGGGCGCCATCTTGAGGCAGCATTTG
GGGCTCAGAAGAACCCTCAGAATCGTCAGCCATCGGTTCAGAGTGTTCATGTTGCTGTCTCTGGTTTTGGTCACCGCCAGTCAATTTATATCTCTCTTGATGACT
ACCAGATCCAATGCCCATGTCAATCTCTCCAAGGCTGGACAACTTGCGCTATGCTCCGTCAGCCTGGTGACAGGCATCTTCATATGCCTCCGAAGTGCTGCAAAG
ATCACCCACAAGGCACAGTCCATCACGTGCCTTGCTGCCAAGTGGCACGTCGCCGCCATCATAAACACCTTCGACGACCTCGACGGTGAGACGCCGACGGCAGCT
CGGGTCATGTCAACCATTGTATCTAATTCTGATGACGAATATGAGGATGATGACGACTTGGACGATGCCAAATTAATGCCAGTTTTTGCCCACACAATCTCATTC
CAAAAGAGGCAGGCATTAGTGGTATATTTGAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCCATT
GAACTTGCACTTGTCCTGTGGCTGCTCAACAAGACTGCAAAATCGTCATCTCACGTTCCGAAATTCGGCAATTGGGACACCGACGACGTACCGTACACCATCTGC
TTCGAGAACGCCGGCGAACTCCGAGCCGCTGGCGTAACCTTCGATCCCAACGATCCGGACACCTATCCGCCGGAGGCGTTCCCCGCCGCGAACCACAGATCTAAT
CCCTCAGACCGCCGGCGGCATCACGACCACCACCGGCGGCCTCGCGAGCAGCCTAATCGCGACCGGCGAAGCTCCGGATCGGAGAAATTGAGCAGCGAGAGGAGC
GGTTCGGATTACTCGCTTCTGAAGGAGCCGGGGGGGAGGAGGAAGAAGATCAATAATGGAGTGGACGGAATCAGCCGGTTTCCTCCGACGGCGGCCGGCCATGGC
GGGAACCCTAACCCTAACCCTAGGGCTCGGCGGAATGAGGGGGAAATGGTGGCGTCGGTGCCGAAATTTGGGAGCTGGGATGCGAGGGATCCGAAATCGGGAGAT
GGATACACGGCGATTTTCAATAAAGTGAAGATGGAGAAGAAAATTGGAGGTTCGAATAGTTCTCAGGCTGTGCCGCCATTGACGAATCAGACCAAGCAACAGCCC
ATCCCTCAGATAGTCCATGGATCCACTTCCTTTGTATCCAAGGTATGTTGTTGTTTGTGTCCAAAGTGA
mRNA sequence
ATGGAAGACGAGAAGAAGAAAGCCTCGTCTCAAATCGATTCGCAACAGCATCAATTGCTCGATCCGGAAAAATCAGAAGCAGCAGCCGAATCCGAATCGAGAAGC
GCCTTCGAATCTTACCTGAAATGGATTTGCAGAATCATCGTGGATCAGTCGAATCCATGGCGCGCTTCGGTCTCCTGCTTCGTGTTCTTCGTCTTAGCAATCGCC
GTCCCCCTCGCCTCGCACTTCGCTCTCGCTTGCGCCAATTGCGACGAAGATCACAGCAGGCCTTTCCATGTTGTCGTCCAGCTCTCGCTCTCCGCCGCCGCGACG
CTCTCCTTCGCCTGCCTCTCTTTTTGGCTCCGTCTCTACGGATTGAGGCGGTTTCTGTTCCTCGATAAGCTCTGTGAAGCGAATCGGAAGGTTCGGGATGAGTAT
TCGAAGCAATTGCAGAGATCAATGGAGCTGATCTGCTTCTTCCTCCTCCCATGTTTCACGGCAGAAGCAGCGTACAAAGTGTGGTGGTACGTCTCAGCAGCCAAC
CAAATCCCATACTACGGCAACAACATGTACATCAGCTACATCACCAGCTGTGCACTGGAGCTCTGCTCCTGGCTCTACAGAACCTCCATCTTCTTCTTCGTCTGC
ATCCTCTTCCGCCTCGTCTGCCGCCTCCAGATGGTGTGCCTCGAAGATTTCGGGCGCGCCTTCCATCGGGAGACGGAGGTGGGCGCCATCTTGAGGCAGCATTTG
GGGCTCAGAAGAACCCTCAGAATCGTCAGCCATCGGTTCAGAGTGTTCATGTTGCTGTCTCTGGTTTTGGTCACCGCCAGTCAATTTATATCTCTCTTGATGACT
ACCAGATCCAATGCCCATGTCAATCTCTCCAAGGCTGGACAACTTGCGCTATGCTCCGTCAGCCTGGTGACAGGCATCTTCATATGCCTCCGAAGTGCTGCAAAG
ATCACCCACAAGGCACAGTCCATCACGTGCCTTGCTGCCAAGTGGCACGTCGCCGCCATCATAAACACCTTCGACGACCTCGACGGTGAGACGCCGACGGCAGCT
CGGGTCATGTCAACCATTGTATCTAATTCTGATGACGAATATGAGGATGATGACGACTTGGACGATGCCAAATTAATGCCAGTTTTTGCCCACACAATCTCATTC
CAAAAGAGGCAGGCATTAGTGGTATATTTGAGGAATAACAAAGCAGGAATTACAGTGTATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCCATT
GAACTTGCACTTGTCCTGTGGCTGCTCAACAAGACTGCAAAATCGTCATCTCACGTTCCGAAATTCGGCAATTGGGACACCGACGACGTACCGTACACCATCTGC
TTCGAGAACGCCGGCGAACTCCGAGCCGCTGGCGTAACCTTCGATCCCAACGATCCGGACACCTATCCGCCGGAGGCGTTCCCCGCCGCGAACCACAGATCTAAT
CCCTCAGACCGCCGGCGGCATCACGACCACCACCGGCGGCCTCGCGAGCAGCCTAATCGCGACCGGCGAAGCTCCGGATCGGAGAAATTGAGCAGCGAGAGGAGC
GGTTCGGATTACTCGCTTCTGAAGGAGCCGGGGGGGAGGAGGAAGAAGATCAATAATGGAGTGGACGGAATCAGCCGGTTTCCTCCGACGGCGGCCGGCCATGGC
GGGAACCCTAACCCTAACCCTAGGGCTCGGCGGAATGAGGGGGAAATGGTGGCGTCGGTGCCGAAATTTGGGAGCTGGGATGCGAGGGATCCGAAATCGGGAGAT
GGATACACGGCGATTTTCAATAAAGTGAAGATGGAGAAGAAAATTGGAGGTTCGAATAGTTCTCAGGCTGTGCCGCCATTGACGAATCAGACCAAGCAACAGCCC
ATCCCTCAGATAGTCCATGGATCCACTTCCTTTGTATCCAAGGTATGTTGTTGTTTGTGTCCAAAGTGA
Protein sequence
MEDEKKKASSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAAT
LSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVC
ILFRLVCRLQMVCLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAK
ITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEYEDDDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAGITVYGFMVDRTWLKSIFAI
ELALVLWLLNKTAKSSSHVPKFGNWDTDDVPYTICFENAGELRAAGVTFDPNDPDTYPPEAFPAANHRSNPSDRRRHHDHHRRPREQPNRDRRSSGSEKLSSERS
GSDYSLLKEPGGRRKKINNGVDGISRFPPTAAGHGGNPNPNPRARRNEGEMVASVPKFGSWDARDPKSGDGYTAIFNKVKMEKKIGGSNSSQAVPPLTNQTKQQP
IPQIVHGSTSFVSKVCCCLCPK
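
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal sketch, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes; the function and variable names are illustrative and not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last three codons of the CDS listed above.
print(translate("ATGGAAGACGAGAAGAAGAAAGCC"))  # MEDEKKKA
print(translate("CCAAAGTGA"))                 # PK
```

Both fragments agree with the start (`MEDEKKKA`) and end (`PK`) of the protein sequence listed above. Passing the full CDS, line breaks included, translates the whole record the same way.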