CuGenDBv2

MS013313 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013313
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF3537)
Genome location: scaffold402:241316..245082
RNA-Seq Expression: MS013313
Synteny: MS013313
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR021924 - Protein of unknown function DUF3537
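
The genome location above uses a `seqid:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the string can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold402:241316..245082")
print(seqid, span_length(start, end))  # scaffold402 3767
```

Under that assumption the gene spans 3767 positions on scaffold402; with 0-based half-open coordinates the figure would be 3766 instead.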


Homology
GenBank top hits | e value | % identity
XP_008458982.1 PREDICTED: uncharacterized protein LOC103498231 [Cucumis melo] | 3.5e-185 | 78.39
Query:  DEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA
        +E+KK+  Q+DSQ+ +  + E SE+  E+     FES+LKWIC  I+D SN +RAS+SC VFFV  IAVP+ASHFAL+C++CDEDH RPFHVVVQLSLSA
Subjt:  DEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA

Query:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR
         ATLSF CLS WLRL+GL RFLFLDKL EA+ K+R EY +QLQRSMEL+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISYITSC LELCSWLYR
Subjt:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR

Query:  TSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL
        TSIFFFVCILFRL+C LQM+RLEDF   F  ETEVG IL QHLGLRRT  I+SHRFRVFMLLSL+LVTASQFISLLMTTRS AHVNLSKAGQLALCS+SL
Subjt:  TSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL

Query:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG
        VTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFDDLD ET PTA+ V + + SNSDDED DEDDLDD KLMPVFAHTISFQKRQALV YLRNNKAG
Subjt:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        ITVYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

XP_011660297.1 uncharacterized protein LOC101203162 [Cucumis sativus] | 9.0e-181 | 75.85
Query:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        ME  +KK+  QIDSQ+ +  + E    A E       ES+L+WIC  I+D SN +RASVSC +FFV  IAVP+ASHF L+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA ATLSF CLS WLR++GL RFLFLDKLCEA+ K+R EY +QLQ+SM+L+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISY+TSC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVCI FRL+C LQM+RLEDF  +F  ETEVG IL QHLGLRRT  ++SHRFRVFMLLSL+LVTASQFISLLMTTRS AH NLSK+GQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--DEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRN
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFD+LD E TPTA+ V + + SNSDDE  DEDEDDLDDAKLMPVFAHTISFQKRQALV YLRN
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--DEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRN

Query:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        NKAGITVYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

XP_022954540.1 uncharacterized protein LOC111456780 [Cucurbita moschata] | 4.5e-180 | 75.45
Query:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF
        MED +KK+ S   S     ++  KSE   ESES S         FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH RPF
Subjt:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF

Query:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC
        HVVVQLSLSA ATLSF CLS WLR  GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAAN+IPYYG NMY+SYITSC
Subjt:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC

Query:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA
         LEL SWLYRTSIFFFVCILFRLVCRLQM+RLEDF   FHRE++VG IL QHLGLRRTL  +SHRFRVFM LSL+LVTASQFISLLMTTRS+A  NLSK 
Subjt:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA

Query:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALV
        GQLALCS+SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A INTFDDLD ETPTA+ + +   ++ DDE++DEDD DD KLMPVFAHTISFQKRQALV
Subjt:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALV

Query:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
         YL+NNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKTVGIS
Subjt:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

XP_022994504.1 uncharacterized protein LOC111490209 [Cucurbita maxima] | 2.0e-180 | 77.05
Query:  SQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFA
        SQI+S++ +L    +SE+ ++S     FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH RPFHVVVQLSLSA ATLSF 
Subjt:  SQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFA

Query:  CLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFV
        CLS WLR +GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAA +IPYYG NMY+SYITSC LEL SWLYRTSIFFFV
Subjt:  CLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFV

Query:  CILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFIC
        CILFRLVCRLQM+RLEDF   FHRE++VG IL QHLGLRRTL I+SHRFRVFM LSL+LVTASQFI LLMTTRS+A  NLSK GQLALCS+SLVTG+FIC
Subjt:  CILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFIC

Query:  LRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAGITVYGFMV
        LRSAAKI+HKAQSITCLAAKWHV+A INTFDDLD ETPT + + +   ++ DDED+DEDD DD KLMPVFAHTISFQKRQALV YLRNNKAGITVYGF+V
Subjt:  LRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAGITVYGFMV

Query:  DRTWLKSIFAIELALVLWLLNKTVGIS
        DRTWLKS+FAIELALVLWLLNKTVGIS
Subjt:  DRTWLKSIFAIELALVLWLLNKTVGIS

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida] | 5.7e-183 | 76.83
Query:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        ME E+KK+  QIDSQ  +L    +SE  +E+     F+S LKWIC  I D SNP+RAS+SC VFFV AIAVP+ASHFAL+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA ATLSF CLS WLR +GL RFLFLDKL EA+ +VR EY +QLQRSM L+ FFLLPCF AEA YK+WWY+SAA +IPYY NN+YISYI SC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVCILFRL+C LQM+RLEDF   FHRE EVG IL QHL LRRT  I+SHRFR F+LLSL+LVTASQFISLLMTTRS AHVNLSKAGQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKA
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFDDLD ETP  A ++S IV ++ DE+ DEDDLDDAKLMPVFAHTISFQKRQALV YLRNNKA
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKA

Query:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        GITVYGF VDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

TrEMBL top hits | e value | % identity
A0A0A0M3M6 Uncharacterized protein | 4.4e-181 | 75.85
Query:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        ME  +KK+  QIDSQ+ +  + E    A E       ES+L+WIC  I+D SN +RASVSC +FFV  IAVP+ASHF L+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA ATLSF CLS WLR++GL RFLFLDKLCEA+ K+R EY +QLQ+SM+L+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISY+TSC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVCI FRL+C LQM+RLEDF  +F  ETEVG IL QHLGLRRT  ++SHRFRVFMLLSL+LVTASQFISLLMTTRS AH NLSK+GQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--DEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRN
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFD+LD E TPTA+ V + + SNSDDE  DEDEDDLDDAKLMPVFAHTISFQKRQALV YLRN
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--DEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRN

Query:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        NKAGITVYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

A0A1S3C949 uncharacterized protein LOC103498231 | 1.7e-185 | 78.39
Query:  DEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA
        +E+KK+  Q+DSQ+ +  + E SE+  E+     FES+LKWIC  I+D SN +RAS+SC VFFV  IAVP+ASHFAL+C++CDEDH RPFHVVVQLSLSA
Subjt:  DEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSA

Query:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR
         ATLSF CLS WLRL+GL RFLFLDKL EA+ K+R EY +QLQRSMEL+ FFLLPCF AEA YK+WWY+SAA +IPYY NNMYISYITSC LELCSWLYR
Subjt:  AATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYR

Query:  TSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL
        TSIFFFVCILFRL+C LQM+RLEDF   F  ETEVG IL QHLGLRRT  I+SHRFRVFMLLSL+LVTASQFISLLMTTRS AHVNLSKAGQLALCS+SL
Subjt:  TSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSL

Query:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG
        VTG+FICLRSAAKITHKAQSITCLAAKWHV+A+INTFDDLD ET PTA+ V + + SNSDDED DEDDLDD KLMPVFAHTISFQKRQALV YLRNNKAG
Subjt:  VTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGET-PTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAG

Query:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        ITVYGFMVDRTWLKSIFAIELAL LWLLNKTVG+S
Subjt:  ITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

A0A6J1EG16 uncharacterized protein LOC111433990 | 1.9e-176 | 74.94
Query:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL
        M++  KK+ S +DSQ+        SE+  E+      ES+LKWIC  I DQSNP+RAS+SC +FF+ AIAVPLASHFAL+C++CDEDH RPFHVVVQLSL
Subjt:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSL

Query:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL
        SA A LSF  LS WLRL+G  RFLFLDKL +A+ +VR EYS+QLQRS ELIC F++PCF AEAAYK+WWY++AA QIPYY NNMY+SYITSC LELCSWL
Subjt:  SAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWL

Query:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV
        YRTSIFFFVC+LFRL+C LQM+RLEDF   FHRET+VG IL  HLGLRRT  I+SHRFR F+LLSL+LVTASQFISLLMTT + AHVNLSKAGQLALCS+
Subjt:  YRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSV

Query:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--DEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRN
        SLVTG+FICLRSAAKITHKAQSITCLAAKWH++A+++TFDDLD + TPTAA     I SNSDDE  D DEDDLDDAKLMPVFA TISFQKRQALV+YLRN
Subjt:  SLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGE-TPTAARVMSTIVSNSDDE--DEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRN

Query:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
        NKAGITVYGFMVDRTWLKSIFAIELAL LWLLNKTVGIS
Subjt:  NKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

A0A6J1GSP8 uncharacterized protein LOC111456780 | 2.2e-180 | 75.45
Query:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF
        MED +KK+ S   S     ++  KSE   ESES S         FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH RPF
Subjt:  MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRS--------AFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPF

Query:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC
        HVVVQLSLSA ATLSF CLS WLR  GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAAN+IPYYG NMY+SYITSC
Subjt:  HVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSC

Query:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA
         LEL SWLYRTSIFFFVCILFRLVCRLQM+RLEDF   FHRE++VG IL QHLGLRRTL  +SHRFRVFM LSL+LVTASQFISLLMTTRS+A  NLSK 
Subjt:  ALELCSWLYRTSIFFFVCILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKA

Query:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALV
        GQLALCS+SLVTG+FICLRSAAKITHKAQSITCLAAKWHV+A INTFDDLD ETPTA+ + +   ++ DDE++DEDD DD KLMPVFAHTISFQKRQALV
Subjt:  GQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALV

Query:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
         YL+NNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKTVGIS
Subjt:  VYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS

A0A6J1K314 uncharacterized protein LOC111490209 | 9.7e-181 | 77.05
Query:  SQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFA
        SQI+S++ +L    +SE+ ++S     FES+LKWIC  I+D SNP+ A++SCF+F   AIAVP+ASHFAL+C++CDEDH RPFHVVVQLSLSA ATLSF 
Subjt:  SQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFA

Query:  CLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFV
        CLS WLR +GL RFLFLDKLCE++ K RDEYSKQL+RSMELI FFLLPCF AEAAYK+WWYVSAA +IPYYG NMY+SYITSC LEL SWLYRTSIFFFV
Subjt:  CLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFV

Query:  CILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFIC
        CILFRLVCRLQM+RLEDF   FHRE++VG IL QHLGLRRTL I+SHRFRVFM LSL+LVTASQFI LLMTTRS+A  NLSK GQLALCS+SLVTG+FIC
Subjt:  CILFRLVCRLQMVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFIC

Query:  LRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAGITVYGFMV
        LRSAAKI+HKAQSITCLAAKWHV+A INTFDDLD ETPT + + +   ++ DDED+DEDD DD KLMPVFAHTISFQKRQALV YLRNNKAGITVYGF+V
Subjt:  LRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAGITVYGFMV

Query:  DRTWLKSIFAIELALVLWLLNKTVGIS
        DRTWLKS+FAIELALVLWLLNKTVGIS
Subjt:  DRTWLKSIFAIELALVLWLLNKTVGIS

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G50630.1 Protein of unknown function (DUF3537) | 1.5e-120 | 52.71
Query:  ESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKL
        + +   +F  YL+W+C   VD S+PW A +S  +F V  + VP  SHF LACA+CD  HSRP+  VVQLSLS+ AT+SF CL+ ++  YGLRRFLF DKL
Subjt:  ESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRFLFLDKL

Query:  CEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVRLEDFGR
         + +  VR  Y+ QL  S+ ++ +F++PCF+A +AYK+WWY S  ++IP+ GN + +S   +C +ELCSWLYRT++ F VC+LFRL+C LQ++RL+DF +
Subjt:  CEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVRLEDFGR

Query:  AFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAK
         F  +++VG+IL +HL +RR LRI+SHR+R F+L  L+LVT SQF SLL+TT++   VN+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAK
Subjt:  AFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAK

Query:  WHVAAIINTFD------DLDGETPT-AAR----------VMSTIVSNSDDEDEDEDDLDDAKLMPVFA-HTISFQKRQALVVYLRNNKAGITVYGFMVDR
        WHV A + +FD      D   ETPT  AR          V++   S+SD+  ++EDDLD+  ++PV+A  T+SFQKRQALV Y  NN AGITVYGF +DR
Subjt:  WHVAAIINTFD------DLDGETPT-AAR----------VMSTIVSNSDDEDEDEDDLDDAKLMPVFA-HTISFQKRQALVVYLRNNKAGITVYGFMVDR

Query:  TWLKSIFAIELALVLWLLNKTVGIS
          L +IF +EL+LVLWLL KT+GIS
Subjt:  TWLKSIFAIELALVLWLLNKTVGIS

AT3G20300.1 Protein of unknown function (DUF3537) | 3.7e-124 | 54.01
Query:  KSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRF
        +S + A+ E  S F  YL+W+C   VDQS+PW A +S  +F V  + VP  SHF LAC++CD  HSRP+  VVQLSLS+ A LSF CLS ++  YGLRRF
Subjt:  KSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFACLSFWLRLYGLRRF

Query:  LFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVR
        LF DKL + +  VR  Y+ QL RS++++ +F+ PCF A ++YK+WWY S A+QIP+ G N+ +S   +C +ELCSWLYRT++ F VC+LFRL+C LQ++R
Subjt:  LFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQMVR

Query:  LEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSI
        L+DF + F  +++VG+IL +HL +RR LRI+SHR+R F+LLSL+LVT SQF SLL+TT++ A +N+ +AG+LALCS++LVT + I LRSA+KITHKAQ++
Subjt:  LEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSI

Query:  TCLAAKWHVAAIINTFDDLDGETPTAARVMS----------TIVSNSDDEDEDEDDLDDAKLMPVFAH-TISFQKRQALVVYLRNNKAGITVYGFMVDRT
        TCLAAKWHV A I +F+ +DGETP      S             S+S+D  ++EDD D+  L+P +A+ TISFQKRQALV Y  NN++GITV+GF +DR+
Subjt:  TCLAAKWHVAAIINTFDDLDGETPTAARVMS----------TIVSNSDDEDEDEDDLDDAKLMPVFAH-TISFQKRQALVVYLRNNKAGITVYGFMVDRT

Query:  WLKSIFAIELALVLWLLNKTVGIS
         L +IF IE++LVLWLL KT+GIS
Subjt:  WLKSIFAIELALVLWLLNKTVGIS

AT4G03820.1 Protein of unknown function (DUF3537) | 1.4e-110 | 49.89
Query:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS
        +S   Q+L+    P+  +   ES  ++ +F     W      DQSN  +  +S  +FF+LA+ VP+ SHF L CA+CD  H RP+  +VQLSLS  A +S
Subjt:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS

Query:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF
        F  LS W + YG+RRFLF DKL + + KVR  Y  ++QRSM+L+  F+LP  T +A Y++WWY S  NQIPY   N  +S++ +C L+L SWLYRTS+F 
Subjt:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF

Query:  FVCILFRLVCRLQMVRLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI
          CIL++ +C LQ++RL++F R F  E  +  +IL +HL +RR L+IVSHRFR F+LLSL  VTA+QF++LL T R++   N+ + G+LALCS SLV+G+
Subjt:  FVCILFRLVCRLQMVRLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI

Query:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLR
        FICL+SA ++THKAQS+T +A KW+V A ++TFD L DGETP        ++++S    +V +SDD++E E D +D ++ P+FA  IS QKRQALV YL 
Subjt:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLR

Query:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV
        NN+AGITVYGF+VD+TWL+ IF+IELAL+LWLL KT+
Subjt:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV

AT4G03820.2 Protein of unknown function (DUF3537) | 1.4e-110 | 49.89
Query:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS
        +S   Q+L+    P+  +   ES  ++ +F     W      DQSN  +  +S  +FF+LA+ VP+ SHF L CA+CD  H RP+  +VQLSLS  A +S
Subjt:  DSQQHQLLD----PEKSEAAAES-ESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS

Query:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF
        F  LS W + YG+RRFLF DKL + + KVR  Y  ++QRSM+L+  F+LP  T +A Y++WWY S  NQIPY   N  +S++ +C L+L SWLYRTS+F 
Subjt:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF

Query:  FVCILFRLVCRLQMVRLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI
          CIL++ +C LQ++RL++F R F  E  +  +IL +HL +RR L+IVSHRFR F+LLSL  VTA+QF++LL T R++   N+ + G+LALCS SLV+G+
Subjt:  FVCILFRLVCRLQMVRLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI

Query:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLR
        FICL+SA ++THKAQS+T +A KW+V A ++TFD L DGETP        ++++S    +V +SDD++E E D +D ++ P+FA  IS QKRQALV YL 
Subjt:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDL-DGETPTA------ARVMS---TIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLR

Query:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV
        NN+AGITVYGF+VD+TWL+ IF+IELAL+LWLL KT+
Subjt:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV

AT4G22270.1 Protein of unknown function (DUF3537) | 1.3e-124 | 54.61
Query:  SQQHQLLDPEKSEAAAESE------SRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS
        S +H L +  +  A    +      SR  F S + W      DQSN   A +S  VFF+L + VPL SHF L C++CD  H RP+ V+VQLSLS  A +S
Subjt:  SQQHQLLDPEKSEAAAESE------SRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLS

Query:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF
        F  LS W R +G+RRFLFLDKL + + KVR EY  ++QRS++ +  F+LP  T EA Y++WWY+S  NQIPY  N + +S++ +C L+L SWLYR S+F 
Subjt:  FACLSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFF

Query:  FVCILFRLVCRLQMVRLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI
         VCIL+++ C LQ +RL+DF R F  E T+V + L +H  +RR LRIVSHRFR F+LLSL+LVTA+QF++LL TTR++  VN+ + G+LALCS+SLVTG+
Subjt:  FVCILFRLVCRLQMVRLEDFGRAFHRE-TEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGI

Query:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTI------VSNSDDED-EDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNK
        FICLRSA KITHKAQS+T LAAKW+V A +++FD LDGETPT + + S +      +  SDDE+ E +DDLD+ K+ P++A+TIS+QKRQALV YL NNK
Subjt:  FICLRSAAKITHKAQSITCLAAKWHVAAIINTFDDLDGETPTAARVMSTI------VSNSDDED-EDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNK

Query:  AGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV
        AGITVYGF+VDR+WL +IF IELAL+LWLLNKT+
Subjt:  AGITVYGFMVDRTWLKSIFAIELALVLWLLNKTV


Sequences
CDS sequence
ATGGAAGACGAGAAGAAGAAAGCCTGGTCTCAAATCGATTCGCAACAGCATCAATTGCTCGATCCGGAAAAATCAGAAGCAGCAGCCGAATCCGAATCGAGAAGCGCCTT
CGAATCTTACCTGAAATGGATTTGCAGAATCATCGTGGATCAGTCGAATCCATGGCGCGCTTCGGTCTCCTGCTTCGTGTTCTTCGTCTTAGCAATCGCCGTCCCCCTCG
CCTCGCACTTCGCTCTCGCTTGCGCCAATTGCGACGAAGATCACAGCAGGCCTTTCCATGTTGTCGTCCAGCTCTCGCTCTCCGCCGCCGCGACGCTCTCCTTCGCCTGC
CTCTCTTTTTGGCTCCGTCTCTACGGATTGAGGCGGTTTCTGTTCCTCGATAAGCTCTGTGAAGCGAATCGGAAGGTTCGGGATGAGTATTCGAAGCAATTGCAGAGATC
AATGGAGCTGATCTGCTTCTTCCTCCTCCCATGTTTCACGGCAGAAGCAGCGTACAAAGTGTGGTGGTACGTCTCAGCAGCCAACCAAATCCCATACTACGGCAACAACA
TGTACATCAGCTACATCACCAGCTGTGCACTGGAGCTCTGCTCCTGGCTCTACAGAACCTCCATCTTCTTCTTCGTCTGCATCCTCTTCCGCCTCGTCTGCCGCCTCCAG
ATGGTGCGCCTCGAAGATTTCGGGCGCGCCTTCCATCGGGAGACGGAGGTGGGCGCCATCTTGAGGCAGCATTTGGGGCTCAGAAGAACCCTCAGAATCGTCAGCCATCG
GTTCAGAGTGTTCATGTTGCTGTCTCTGGTTTTGGTCACCGCCAGTCAATTTATATCTCTCTTGATGACTACCAGATCCAATGCCCATGTCAATCTCTCCAAGGCTGGAC
AACTTGCGCTATGCTCCGTCAGCCTGGTGACAGGCATCTTCATATGCCTCCGAAGTGCTGCAAAGATCACCCACAAGGCACAGTCCATCACGTGCCTTGCTGCCAAGTGG
CACGTCGCCGCCATCATAAACACCTTCGACGACCTCGACGGTGAGACGCCGACGGCAGCTCGGGTCATGTCAACCATTGTATCTAATTCTGATGATGAAGATGAGGATGA
AGACGACTTGGACGATGCCAAACTAATGCCAGTTTTTGCCCACACAATCTCATTCCAAAAGAGGCAGGCATTAGTGGTATATTTGAGGAATAACAAAGCAGGAATTACAG
TGTATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCCATTGAACTTGCACTTGTCCTGTGGCTGCTCAACAAGACTGTTGGTATTTCT
mRNA sequence
ATGGAAGACGAGAAGAAGAAAGCCTGGTCTCAAATCGATTCGCAACAGCATCAATTGCTCGATCCGGAAAAATCAGAAGCAGCAGCCGAATCCGAATCGAGAAGCGCCTT
CGAATCTTACCTGAAATGGATTTGCAGAATCATCGTGGATCAGTCGAATCCATGGCGCGCTTCGGTCTCCTGCTTCGTGTTCTTCGTCTTAGCAATCGCCGTCCCCCTCG
CCTCGCACTTCGCTCTCGCTTGCGCCAATTGCGACGAAGATCACAGCAGGCCTTTCCATGTTGTCGTCCAGCTCTCGCTCTCCGCCGCCGCGACGCTCTCCTTCGCCTGC
CTCTCTTTTTGGCTCCGTCTCTACGGATTGAGGCGGTTTCTGTTCCTCGATAAGCTCTGTGAAGCGAATCGGAAGGTTCGGGATGAGTATTCGAAGCAATTGCAGAGATC
AATGGAGCTGATCTGCTTCTTCCTCCTCCCATGTTTCACGGCAGAAGCAGCGTACAAAGTGTGGTGGTACGTCTCAGCAGCCAACCAAATCCCATACTACGGCAACAACA
TGTACATCAGCTACATCACCAGCTGTGCACTGGAGCTCTGCTCCTGGCTCTACAGAACCTCCATCTTCTTCTTCGTCTGCATCCTCTTCCGCCTCGTCTGCCGCCTCCAG
ATGGTGCGCCTCGAAGATTTCGGGCGCGCCTTCCATCGGGAGACGGAGGTGGGCGCCATCTTGAGGCAGCATTTGGGGCTCAGAAGAACCCTCAGAATCGTCAGCCATCG
GTTCAGAGTGTTCATGTTGCTGTCTCTGGTTTTGGTCACCGCCAGTCAATTTATATCTCTCTTGATGACTACCAGATCCAATGCCCATGTCAATCTCTCCAAGGCTGGAC
AACTTGCGCTATGCTCCGTCAGCCTGGTGACAGGCATCTTCATATGCCTCCGAAGTGCTGCAAAGATCACCCACAAGGCACAGTCCATCACGTGCCTTGCTGCCAAGTGG
CACGTCGCCGCCATCATAAACACCTTCGACGACCTCGACGGTGAGACGCCGACGGCAGCTCGGGTCATGTCAACCATTGTATCTAATTCTGATGATGAAGATGAGGATGA
AGACGACTTGGACGATGCCAAACTAATGCCAGTTTTTGCCCACACAATCTCATTCCAAAAGAGGCAGGCATTAGTGGTATATTTGAGGAATAACAAAGCAGGAATTACAG
TGTATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCCATTGAACTTGCACTTGTCCTGTGGCTGCTCAACAAGACTGTTGGTATTTCT
Protein sequence
MEDEKKKAWSQIDSQQHQLLDPEKSEAAAESESRSAFESYLKWICRIIVDQSNPWRASVSCFVFFVLAIAVPLASHFALACANCDEDHSRPFHVVVQLSLSAAATLSFAC
LSFWLRLYGLRRFLFLDKLCEANRKVRDEYSKQLQRSMELICFFLLPCFTAEAAYKVWWYVSAANQIPYYGNNMYISYITSCALELCSWLYRTSIFFFVCILFRLVCRLQ
MVRLEDFGRAFHRETEVGAILRQHLGLRRTLRIVSHRFRVFMLLSLVLVTASQFISLLMTTRSNAHVNLSKAGQLALCSVSLVTGIFICLRSAAKITHKAQSITCLAAKW
HVAAIINTFDDLDGETPTAARVMSTIVSNSDDEDEDEDDLDDAKLMPVFAHTISFQKRQALVVYLRNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTVGIS
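
The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch of that check is given here, assuming the standard nuclear genetic code applies; the `translate` function and the `CODON_TABLE` name are illustrative helpers, not something provided by the database.

```python
# Standard genetic code, built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 nucleotides of the CDS listed above.
print(translate("ATGGAAGACGAGAAGAAGAAAGCC"))  # MEDEKKKA
```

Applied to the full CDS with line breaks included, the same function can be compared against the listed protein sequence; the CDS as shown ends in `ATTTCT` (Ile, Ser), so no trailing `*` is expected in the output.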