CuGenDBv2

Spg039804 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg039804
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein of unknown function, DUF547
Genome location: scaffold10:42854571..42856779
RNA-Seq Expression: Spg039804
Synteny: Spg039804
Gene Ontology terms: NA
InterPro domains: IPR006869 - Domain of unknown function DUF547
                  IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology
GenBank top hits | e value | % identity | Alignment
TYK22784.1 DUF547 domain-containing protein/Lzipper-MIP1 domain-containing protein [Cucumis melo var. makuwa] | e value: 1.7e-204 | % identity: 89.1
Query:  MEPT-GEHQR-KLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQ
        MEPT  EHQ+ KLDLES+V+KLQAELHGEQALNKALHWALHGPLLSHPHV+S+LPPQVQL+MEELG VEREIDRLE+KVEELK NLYKE+EQNKEWEIQQ
Subjt:  MEPT-GEHQR-KLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQ

Query:  RLRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCL
        RLR L QQNLLL+GPEIN+ S++NGQRSRSQHYDE+RKDIMLSERRFSSSAASDIQI+MS   STGARKN TRSR QSQ EK TCIETPNE+SE+LIKCL
Subjt:  RLRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCL

Query:  IGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIR
        I IYLDLNQ S +SQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS+PNPYSILLDSEGTV+DIGPYKNFIHITRTSFDI RLPECS SIR
Subjt:  IGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIR

Query:  KLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRHAYG
        KLRVLIHKL SVDLTFLTYKQKLAFWINIYNSSIMHAFLEHG PST EKLLALMNKAALNVGGIVLNALAIEHFILRHPS  ETKYP+DEKEMLLRHAYG
Subjt:  KLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRHAYG

Query:  LGYPEPNVTFALCRGSWSSPAV
        LGYPEPNVTFALCRGSWSSPA+
Subjt:  LGYPEPNVTFALCRGSWSSPAV

XP_022134548.1 uncharacterized protein LOC111006766 [Momordica charantia] | e value: 2.5e-208 | % identity: 89.6
Query:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL
        MEPTGEHQRKLDLE+EVVKLQAELHGEQ LNKALHWALHGPL+SHPHVSSSLPPQVQLLMEELGVVEREIDRLE+KVEELKLNLYKEREQNKEWEIQQRL
Subjt:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL

Query:  RRLWQQNLLL-SGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLI
         RLWQ NLLL +G EINN+SVLNGQRSRSQH+DE+RKDIML+ERRFSSSAASDIQISMS   STG+RKN  RSRKQSQLEKE+CI+TPNELSEQL+KCLI
Subjt:  RRLWQQNLLL-SGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLI

Query:  GIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPN----PYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECST
        GIYLDLNQ S +SQ+SP +PKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS+PN    PYSILLDSEG+VRD GPY+N IHITR SFDI RLP CST
Subjt:  GIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPN----PYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECST

Query:  SIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAY
        SIRKLR+LIHKL+SVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS+TKYPVDEKEMLLRHAY
Subjt:  SIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAY

Query:  GLGYPEPNVTFALCRGSWSSPAV
        GLGYPEPNVTFALCRGSWSSPA+
Subjt:  GLGYPEPNVTFALCRGSWSSPAV

XP_022960572.1 uncharacterized protein LOC111461289 isoform X3 [Cucurbita moschata] | e value: 5.0e-225 | % identity: 84.91
Query:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL
        M+PT  HQR+LDL+ EVVKL+ +LHGEQ LNKALHWALHGP LSHPHVSSSLPPQ+Q+LMEELGVVEREI+RLERKVEELKLNLYKEREQNKEWEIQQRL
Subjt:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL

Query:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG
        R LWQ NLLL+GP+INN SVL GQRSRS HYDE+RKDIMLSERRFSSSAASDIQ           RKN TRSRKQSQLEKETC ETPNELSEQLIKCLIG
Subjt:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG

Query:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRK
        IY+DLNQ SYSS+TSPNIPKHGLSCI+SKRCIAKTSFSCKAPQLTLSFDY+SS+PNPYSI LLDSEG VR+ GPYKNFIHITRTSFDI RLPECSTSIRK
Subjt:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRK

Query:  LRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGLGY
        LR+LIHKL++V+LTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTT+KLLALMNKAALNVGG+VLNALAIEHFILRH +ETKYPVDEKEMLLRHAYGLGY
Subjt:  LRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGLGY

Query:  PEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL
        PEPNVTFALCRGSWSSPA   GVY+RGS ER+G GKS + GS+ GDD+QEEDNGA+ SS+AHER CRRHGI VGVDL
Subjt:  PEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL

XP_022990125.1 uncharacterized protein LOC111487111 isoform X3 [Cucurbita maxima] | e value: 2.1e-223 | % identity: 84.76
Query:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL
        M+PT  HQRKLDL+ EVVKL+AELHGEQ LNKALHWALHGP LSHPH SSSLPPQ+Q+LMEELGVVEREI+RLERKVEELKLNLYKEREQNKEWEIQQRL
Subjt:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL

Query:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG
        R LWQ NLLL+GP+INN SVL  QRSRS HYDE+RKDIMLSERRFSSSAASDIQ           RKN TRSRKQSQLE+ETC ETPNELSEQLIKCLIG
Subjt:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG

Query:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS--SPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSI
        IY+DLNQ SYSS+TSPNIPKHGLSCI+SKRCIAKTSFSCKAPQLTLSFDY+SS  +PNPYSI LLDSEG VR++GPYKNFIHITRTSFDI RLPECSTSI
Subjt:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS--SPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSI

Query:  RKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGL
        RKLR+LIHKL+SV+LTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTT+KLLALMNKAALNVGG+VLNALAIEHFILRH +ETKYPVDEKEMLLRHAYGL
Subjt:  RKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGL

Query:  GYPEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL
        GYPEPNVTFALCRGSWSSPA   GVY+RGS ER+G GKS + GSI GDD+QEEDNGA+ SS+AHER CRR+GI VGVDL
Subjt:  GYPEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL

XP_038891096.1 uncharacterized protein LOC120080496 [Benincasa hispida] | e value: 6.8e-206 | % identity: 88.94
Query:  MEPT-GEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQR
        MEPT  EHQ KLDLE++V+KLQAELHGEQALNKALHWALHGPLLSHPH SSSLPPQVQL+MEELGVVEREIDRLE+KVEELK NLYKEREQNKEWEIQQR
Subjt:  MEPT-GEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQR

Query:  LRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLI
        LR LWQ N+LL+GPEINN S++NGQRSRSQHYDE+RKDIMLSERRFSSSAASDIQI+MS   STGARKN TRSRKQSQLEKETCIETPNE+SEQLIKCLI
Subjt:  LRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLI

Query:  GIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS----SPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECST
         IYLDLNQ S +SQTSPNIPK GLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS    +PNPYSILLDSEGTVRDIGPYKNFIHITRTSFDI RLPECS+
Subjt:  GIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS----SPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECST

Query:  SIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRH
        SIRKLRVLIHKL +VDL+FLTYKQKLAFWINIYNSSIMHAFLEHG PST EKLLALMNKA LNVGGIVLNALAIEHFILRHPS  ETKYP+DEKEMLL+H
Subjt:  SIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRH

Query:  AYGLGYPEPNVTFALCRGSWSSPAV
        AYGLGYPEPNVTFALCRGSWSSPA+
Subjt:  AYGLGYPEPNVTFALCRGSWSSPAV

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C8E6 uncharacterized protein LOC103498145 | e value: 3.1e-204 | % identity: 89.1
Query:  MEPT-GEHQR-KLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQ
        MEPT  EHQ+ KLDLES+V+KLQAELHGEQALNKALHWALHGPLLSHPHV+S+LPPQVQL+MEELG VEREIDRLE+KVEELK NLYKE+EQNKEWEIQQ
Subjt:  MEPT-GEHQR-KLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQ

Query:  RLRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCL
        RLR L QQNLLL+GPEIN+ S++NGQRSRSQHYDE+RKDIMLSERRFSSSAASDIQI+MS   STGARKN TRSR QSQ EK TCIETPNE+SEQLIKCL
Subjt:  RLRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCL

Query:  IGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIR
        I IYLDLNQ S +SQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS+PNPYSILLDSEGTV+DIGPYKNFIHITRTSFDI RLPECS SIR
Subjt:  IGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIR

Query:  KLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRHAYG
        KLRVLIHKL SVDLTFLTYKQKLAFWINIYNSSIMHAFLEHG PST EKL ALMNKAALNVGGIVLNALAIEHFILRHPS  ETKYP+DEKEMLLRHAYG
Subjt:  KLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRHAYG

Query:  LGYPEPNVTFALCRGSWSSPAV
        LGYPEPNVTFALCRGSWSSPA+
Subjt:  LGYPEPNVTFALCRGSWSSPAV

A0A5D3DGN5 DUF547 domain-containing protein/Lzipper-MIP1 domain-containing protein | e value: 8.1e-205 | % identity: 89.1
Query:  MEPT-GEHQR-KLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQ
        MEPT  EHQ+ KLDLES+V+KLQAELHGEQALNKALHWALHGPLLSHPHV+S+LPPQVQL+MEELG VEREIDRLE+KVEELK NLYKE+EQNKEWEIQQ
Subjt:  MEPT-GEHQR-KLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQ

Query:  RLRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCL
        RLR L QQNLLL+GPEIN+ S++NGQRSRSQHYDE+RKDIMLSERRFSSSAASDIQI+MS   STGARKN TRSR QSQ EK TCIETPNE+SE+LIKCL
Subjt:  RLRRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCL

Query:  IGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIR
        I IYLDLNQ S +SQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS+PNPYSILLDSEGTV+DIGPYKNFIHITRTSFDI RLPECS SIR
Subjt:  IGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIR

Query:  KLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRHAYG
        KLRVLIHKL SVDLTFLTYKQKLAFWINIYNSSIMHAFLEHG PST EKLLALMNKAALNVGGIVLNALAIEHFILRHPS  ETKYP+DEKEMLLRHAYG
Subjt:  KLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS--ETKYPVDEKEMLLRHAYG

Query:  LGYPEPNVTFALCRGSWSSPAV
        LGYPEPNVTFALCRGSWSSPA+
Subjt:  LGYPEPNVTFALCRGSWSSPAV

A0A6J1C2A6 uncharacterized protein LOC111006766 | e value: 1.2e-208 | % identity: 89.6
Query:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL
        MEPTGEHQRKLDLE+EVVKLQAELHGEQ LNKALHWALHGPL+SHPHVSSSLPPQVQLLMEELGVVEREIDRLE+KVEELKLNLYKEREQNKEWEIQQRL
Subjt:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL

Query:  RRLWQQNLLL-SGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLI
         RLWQ NLLL +G EINN+SVLNGQRSRSQH+DE+RKDIML+ERRFSSSAASDIQISMS   STG+RKN  RSRKQSQLEKE+CI+TPNELSEQL+KCLI
Subjt:  RRLWQQNLLL-SGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLI

Query:  GIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPN----PYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECST
        GIYLDLNQ S +SQ+SP +PKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS+PN    PYSILLDSEG+VRD GPY+N IHITR SFDI RLP CST
Subjt:  GIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPN----PYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECST

Query:  SIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAY
        SIRKLR+LIHKL+SVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPS+TKYPVDEKEMLLRHAY
Subjt:  SIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAY

Query:  GLGYPEPNVTFALCRGSWSSPAV
        GLGYPEPNVTFALCRGSWSSPA+
Subjt:  GLGYPEPNVTFALCRGSWSSPAV

A0A6J1H7T1 uncharacterized protein LOC111461289 isoform X3 | e value: 2.4e-225 | % identity: 84.91
Query:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL
        M+PT  HQR+LDL+ EVVKL+ +LHGEQ LNKALHWALHGP LSHPHVSSSLPPQ+Q+LMEELGVVEREI+RLERKVEELKLNLYKEREQNKEWEIQQRL
Subjt:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL

Query:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG
        R LWQ NLLL+GP+INN SVL GQRSRS HYDE+RKDIMLSERRFSSSAASDIQ           RKN TRSRKQSQLEKETC ETPNELSEQLIKCLIG
Subjt:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG

Query:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRK
        IY+DLNQ SYSS+TSPNIPKHGLSCI+SKRCIAKTSFSCKAPQLTLSFDY+SS+PNPYSI LLDSEG VR+ GPYKNFIHITRTSFDI RLPECSTSIRK
Subjt:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRK

Query:  LRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGLGY
        LR+LIHKL++V+LTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTT+KLLALMNKAALNVGG+VLNALAIEHFILRH +ETKYPVDEKEMLLRHAYGLGY
Subjt:  LRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGLGY

Query:  PEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL
        PEPNVTFALCRGSWSSPA   GVY+RGS ER+G GKS + GS+ GDD+QEEDNGA+ SS+AHER CRRHGI VGVDL
Subjt:  PEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL

A0A6J1JP96 uncharacterized protein LOC111487111 isoform X3 | e value: 1.0e-223 | % identity: 84.76
Query:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL
        M+PT  HQRKLDL+ EVVKL+AELHGEQ LNKALHWALHGP LSHPH SSSLPPQ+Q+LMEELGVVEREI+RLERKVEELKLNLYKEREQNKEWEIQQRL
Subjt:  MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRL

Query:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG
        R LWQ NLLL+GP+INN SVL  QRSRS HYDE+RKDIMLSERRFSSSAASDIQ           RKN TRSRKQSQLE+ETC ETPNELSEQLIKCLIG
Subjt:  RRLWQQNLLLSGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIG

Query:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS--SPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSI
        IY+DLNQ SYSS+TSPNIPKHGLSCI+SKRCIAKTSFSCKAPQLTLSFDY+SS  +PNPYSI LLDSEG VR++GPYKNFIHITRTSFDI RLPECSTSI
Subjt:  IYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSS--SPNPYSI-LLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSI

Query:  RKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGL
        RKLR+LIHKL+SV+LTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTT+KLLALMNKAALNVGG+VLNALAIEHFILRH +ETKYPVDEKEMLLRHAYGL
Subjt:  RKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGL

Query:  GYPEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL
        GYPEPNVTFALCRGSWSSPA   GVY+RGS ER+G GKS + GSI GDD+QEEDNGA+ SS+AHER CRR+GI VGVDL
Subjt:  GYPEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFGSISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT2G39690.1 Protein of unknown function, DUF547 | e value: 2.7e-83 | % identity: 48.31
Query:  LQAELHGEQALNKALHWALHGPLLSHPHVSS-SLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRLRRLWQQNLLLSGPEINNK
        L+  L  E+A+ + L  A  G ++S P +SS  LPPQ   L++EL +VE EI  L+RK+EELKL LY E+ Q +E ++Q     + +Q   L+      +
Subjt:  LQAELHGEQALNKALHWALHGPLLSHPHVSS-SLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRLRRLWQQNLLLSGPEINNK

Query:  SVLN-----GQRSRSQHYDEIRKD-IMLSERRFSSSAASDIQISMSSNKSTG-----ARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQ
        S L       QRS S  Y     D    +  R S S A D   + SS   T       R    R RK  +L +    + PNE+SEQLI CLIGIYL+LN 
Subjt:  SVLN-----GQRSRSQHYDEIRKD-IMLSERRFSSSAASDIQISMSSNKSTG-----ARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQ

Query:  SSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTV-RDIGPYKNFIHITRTSFDINRLPE-CSTSIRKLRVLIH
           SS+T  ++             +++   SC     T S+  ++ + +PY +L DS G V RDIGPYKNFIHI+R+S D+      CS ++ +L VL+ 
Subjt:  SSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTV-RDIGPYKNFIHITRTSFDINRLPE-CSTSIRKLRVLIH

Query:  KLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETK-YPVDEKEMLLRHAYGLGYPEPNV
        KL+ VDL+FLTYKQKLAFWINIYN+ IMHAFLE+GLPS+  +LL LMNKA+LNVGGIVLNALAIEHF+LRHP E +   +DEKE LLRH YGLGY EPNV
Subjt:  KLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETK-YPVDEKEMLLRHAYGLGYPEPNV

Query:  TFALCRGSWSSPAV
        TFALCRGSWSSPA+
Subjt:  TFALCRGSWSSPAV

AT2G39690.2 Protein of unknown function, DUF547 | e value: 1.2e-70 | % identity: 52.79
Query:  QRSRSQHYDEIRKD-IMLSERRFSSSAASDIQISMSSNKSTG-----ARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQSSYSSQTSPN
        QRS S  Y     D    +  R S S A D   + SS   T       R    R RK  +L +    + PNE+SEQLI CLIGIYL+LN    SS+T  +
Subjt:  QRSRSQHYDEIRKD-IMLSERRFSSSAASDIQISMSSNKSTG-----ARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQSSYSSQTSPN

Query:  IPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTV-RDIGPYKNFIHITRTSFDINRLPE-CSTSIRKLRVLIHKLTSVDLTFL
        +             +++   SC     T S+  ++ + +PY +L DS G V RDIGPYKNFIHI+R+S D+      CS ++ +L VL+ KL+ VDL+FL
Subjt:  IPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTV-RDIGPYKNFIHITRTSFDINRLPE-CSTSIRKLRVLIHKLTSVDLTFL

Query:  TYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHP--SETKYPVDEKEMLLRHAYGLGYPEPNVTFALCRGSW
        TYKQKLAFWINIYN+ IMHAFLE+GLPS+  +LL LMNKA+LNVGGIVLNALAIEHF+LRHP   E K  +DEKE LLRH YGLGY EPNVTFALCRGSW
Subjt:  TYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHP--SETKYPVDEKEMLLRHAYGLGYPEPNVTFALCRGSW

Query:  SSPAV
        SSPA+
Subjt:  SSPAV

AT3G12540.1 Protein of unknown function, DUF547 | e value: 1.9e-73 | % identity: 42.48
Query:  VKLQAELHGEQALNKALHWALHGPLLSHPHVS-SSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWE----------IQQRLRRLWQQ
        +K+Q EL  EQALNKAL     GP++S P +S   LPPQVQ L+EEL  VE EI  LE+++++LKL++Y E+++NKE E          +    R L +Q
Subjt:  VKLQAELHGEQALNKALHWALHGPLLSHPHVS-SSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWE----------IQQRLRRLWQQ

Query:  NLLLSGPE---INNKSVLNGQRSRSQHYDE--IRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGI
        N L    +   I  +S    QRS+SQ Y +  + KDI ++  R  +S  S ++ S   + ST +    +R+++++ +++     TPN +SE L+KCL+GI
Subjt:  NLLLSGPE---INNKSVLNGQRSRSQHYDE--IRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGI

Query:  YLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRKLR
        YL+LN+SS   + S  + K  L+ + +       SF  K+      +D+++S+ +PY  ++ +  ++RDIG YKNFIHITRTS D++RL +CSTS+  LR
Subjt:  YLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRKLR

Query:  VLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVD--EKEMLLRHAYGLGY
        VL  KL+ VDL+FL +K+K+AFWIN YN+ +M+ FLEHGLPS+ EKLL ++  A ++VGG  L+AL IE  IL+ P E +  V   E E+ ++  YG   
Subjt:  VLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVD--EKEMLLRHAYGLGY

Query:  PEPNVTFALCRGSWSSPAV
         EPN+ F LCRG WSSPA+
Subjt:  PEPNVTFALCRGSWSSPAV

AT4G37080.1 Protein of unknown function, DUF547 | e value: 8.1e-40 | % identity: 31.84
Query:  QRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKE--------------------
        ++K+DL  +V KL+ +L  E+ +++AL  A   PL + P + S LP     L+ E+ V+E E+ RLE +V   +  LY+E                    
Subjt:  QRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKE--------------------

Query:  --------REQNKEW-------------EIQQRL-RRLWQQNLLLSGPEINNKS---VLNG-QRSRSQHYDEIRK----DIMLSERRFSSSAASDIQISM
                 +++K               + QQ L R +  + L  S   +N++S   V++G Q S   +   +      D+   E + SS+A+ D +   
Subjt:  --------REQNKEW-------------EIQQRL-RRLWQQNLLLSGPEINNKS---VLNG-QRSRSQHYDEIRK----DIMLSERRFSSSAASDIQISM

Query:  SSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSP--
        S  K  G R  T+  +K+  ++ E   +  +E ++  +   +    D  Q S S  +S +     L   N    +++    C    +T+    SSS    
Subjt:  SSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSP--

Query:  -NPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNK
         +PY+    SE   R++G YK+F  +  +S D+ R    S  I +L+ L++KL+ V+L  L+++QKLAFWIN YNS +M+AFLEHG+P+T E ++ALM K
Subjt:  -NPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNK

Query:  AALNVGGIVLNALAIEHFILRHPSETKY----PVDEKEMLLRHAYGLGYPEPNVTFALCRGSWSSPAV
        A + VGG  LNA+ IEHFILR P   K+        +EM     +GL + EP VTFAL  GSWSSPAV
Subjt:  AALNVGGIVLNALAIEHFILRHPSETKY----PVDEKEMLLRHAYGLGYPEPNVTFALCRGSWSSPAV

AT4G37080.2 Protein of unknown function, DUF547 | e value: 8.1e-40 | % identity: 31.84
Query:  QRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKE--------------------
        ++K+DL  +V KL+ +L  E+ +++AL  A   PL + P + S LP     L+ E+ V+E E+ RLE +V   +  LY+E                    
Subjt:  QRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKE--------------------

Query:  --------REQNKEW-------------EIQQRL-RRLWQQNLLLSGPEINNKS---VLNG-QRSRSQHYDEIRK----DIMLSERRFSSSAASDIQISM
                 +++K               + QQ L R +  + L  S   +N++S   V++G Q S   +   +      D+   E + SS+A+ D +   
Subjt:  --------REQNKEW-------------EIQQRL-RRLWQQNLLLSGPEINNKS---VLNG-QRSRSQHYDEIRK----DIMLSERRFSSSAASDIQISM

Query:  SSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSP--
        S  K  G R  T+  +K+  ++ E   +  +E ++  +   +    D  Q S S  +S +     L   N    +++    C    +T+    SSS    
Subjt:  SSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQSSYSSQTSPNIPKHGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSP--

Query:  -NPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNK
         +PY+    SE   R++G YK+F  +  +S D+ R    S  I +L+ L++KL+ V+L  L+++QKLAFWIN YNS +M+AFLEHG+P+T E ++ALM K
Subjt:  -NPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNSSIMHAFLEHGLPSTTEKLLALMNK

Query:  AALNVGGIVLNALAIEHFILRHPSETKY----PVDEKEMLLRHAYGLGYPEPNVTFALCRGSWSSPAV
        A + VGG  LNA+ IEHFILR P   K+        +EM     +GL + EP VTFAL  GSWSSPAV
Subjt:  AALNVGGIVLNALAIEHFILRHPSETKY----PVDEKEMLLRHAYGLGYPEPNVTFALCRGSWSSPAV


Sequences
CDS sequence
ATGGAGCCAACAGGAGAGCATCAGCGCAAACTTGATCTTGAGAGTGAAGTTGTTAAGTTGCAGGCAGAATTGCATGGTGAACAAGCACTCAACAAGGCTCTGCATTGGGC
TCTTCATGGCCCTCTCTTGTCTCATCCCCATGTTTCTTCTTCTTTACCACCTCAGGTGCAGTTGCTTATGGAAGAATTGGGAGTTGTGGAGAGGGAGATTGATAGGTTAG
AAAGAAAGGTTGAAGAGCTGAAGTTGAACTTGTACAAAGAGAGGGAACAGAACAAGGAGTGGGAAATTCAGCAGAGGCTCAGAAGGCTTTGGCAACAGAACCTGCTCTTG
AGTGGACCTGAAATTAATAACAAGTCTGTGCTTAATGGCCAGAGATCCAGATCTCAGCATTATGATGAAATTAGAAAGGATATAATGCTGAGTGAGAGGAGATTCTCTTC
AAGTGCTGCTTCTGATATCCAAATTAGTATGTCTTCCAATAAGTCTACAGGTGCAAGGAAGAACACGACGAGAAGCCGGAAGCAGAGTCAATTAGAGAAAGAAACTTGCA
TTGAGACACCAAATGAGCTTTCAGAACAGTTGATTAAGTGTCTCATAGGCATTTATCTTGATCTAAATCAGTCATCATACAGCAGCCAAACTTCACCAAATATCCCAAAA
CATGGCCTTTCCTGCATTAACTCAAAGCGCTGCATTGCAAAGACCAGCTTCAGTTGCAAAGCACCCCAACTCACCTTGTCTTTTGATTATAGCTCATCCAGTCCCAATCC
TTATAGCATACTGCTTGACTCAGAAGGCACTGTCAGGGACATTGGCCCTTATAAGAACTTCATCCACATCACGAGAACTTCATTCGACATTAATCGATTGCCCGAGTGCT
CAACTTCAATCAGAAAGTTGAGGGTTTTGATTCATAAGCTGACCAGTGTGGACTTAACCTTTTTGACGTACAAACAGAAATTAGCATTCTGGATAAACATTTATAACTCC
TCTATAATGCATGCATTTCTCGAACATGGGCTGCCGTCGACAACAGAAAAACTCTTGGCTTTGATGAACAAGGCTGCTCTTAATGTTGGTGGAATAGTTCTCAATGCTCT
GGCTATCGAACATTTCATTCTTCGGCATCCAAGCGAAACAAAATACCCTGTGGATGAGAAGGAAATGCTGCTTCGACATGCCTATGGCTTAGGATATCCAGAACCAAATG
TGACATTTGCTCTCTGCCGAGGCAGTTGGTCATCTCCAGCAGTAATTAAGGGTGTATACTCCAGAGGAAGTGATGAACGAATTGGGACTGGCAAAAGTGGAATATTTGGA
AGCATCAGTGGGGATGACCAGCAAGAAGAAGATAATGGTGCCAAAGATTCTTCAGTGGCACATGAAAGACTTTGCAGACGACATGGAATCGCTGTTGGAGTGGATTTATA
G
mRNA sequence
ATGGAGCCAACAGGAGAGCATCAGCGCAAACTTGATCTTGAGAGTGAAGTTGTTAAGTTGCAGGCAGAATTGCATGGTGAACAAGCACTCAACAAGGCTCTGCATTGGGC
TCTTCATGGCCCTCTCTTGTCTCATCCCCATGTTTCTTCTTCTTTACCACCTCAGGTGCAGTTGCTTATGGAAGAATTGGGAGTTGTGGAGAGGGAGATTGATAGGTTAG
AAAGAAAGGTTGAAGAGCTGAAGTTGAACTTGTACAAAGAGAGGGAACAGAACAAGGAGTGGGAAATTCAGCAGAGGCTCAGAAGGCTTTGGCAACAGAACCTGCTCTTG
AGTGGACCTGAAATTAATAACAAGTCTGTGCTTAATGGCCAGAGATCCAGATCTCAGCATTATGATGAAATTAGAAAGGATATAATGCTGAGTGAGAGGAGATTCTCTTC
AAGTGCTGCTTCTGATATCCAAATTAGTATGTCTTCCAATAAGTCTACAGGTGCAAGGAAGAACACGACGAGAAGCCGGAAGCAGAGTCAATTAGAGAAAGAAACTTGCA
TTGAGACACCAAATGAGCTTTCAGAACAGTTGATTAAGTGTCTCATAGGCATTTATCTTGATCTAAATCAGTCATCATACAGCAGCCAAACTTCACCAAATATCCCAAAA
CATGGCCTTTCCTGCATTAACTCAAAGCGCTGCATTGCAAAGACCAGCTTCAGTTGCAAAGCACCCCAACTCACCTTGTCTTTTGATTATAGCTCATCCAGTCCCAATCC
TTATAGCATACTGCTTGACTCAGAAGGCACTGTCAGGGACATTGGCCCTTATAAGAACTTCATCCACATCACGAGAACTTCATTCGACATTAATCGATTGCCCGAGTGCT
CAACTTCAATCAGAAAGTTGAGGGTTTTGATTCATAAGCTGACCAGTGTGGACTTAACCTTTTTGACGTACAAACAGAAATTAGCATTCTGGATAAACATTTATAACTCC
TCTATAATGCATGCATTTCTCGAACATGGGCTGCCGTCGACAACAGAAAAACTCTTGGCTTTGATGAACAAGGCTGCTCTTAATGTTGGTGGAATAGTTCTCAATGCTCT
GGCTATCGAACATTTCATTCTTCGGCATCCAAGCGAAACAAAATACCCTGTGGATGAGAAGGAAATGCTGCTTCGACATGCCTATGGCTTAGGATATCCAGAACCAAATG
TGACATTTGCTCTCTGCCGAGGCAGTTGGTCATCTCCAGCAGTAATTAAGGGTGTATACTCCAGAGGAAGTGATGAACGAATTGGGACTGGCAAAAGTGGAATATTTGGA
AGCATCAGTGGGGATGACCAGCAAGAAGAAGATAATGGTGCCAAAGATTCTTCAGTGGCACATGAAAGACTTTGCAGACGACATGGAATCGCTGTTGGAGTGGATTTATA
G
Protein sequence
MEPTGEHQRKLDLESEVVKLQAELHGEQALNKALHWALHGPLLSHPHVSSSLPPQVQLLMEELGVVEREIDRLERKVEELKLNLYKEREQNKEWEIQQRLRRLWQQNLLL
SGPEINNKSVLNGQRSRSQHYDEIRKDIMLSERRFSSSAASDIQISMSSNKSTGARKNTTRSRKQSQLEKETCIETPNELSEQLIKCLIGIYLDLNQSSYSSQTSPNIPK
HGLSCINSKRCIAKTSFSCKAPQLTLSFDYSSSSPNPYSILLDSEGTVRDIGPYKNFIHITRTSFDINRLPECSTSIRKLRVLIHKLTSVDLTFLTYKQKLAFWINIYNS
SIMHAFLEHGLPSTTEKLLALMNKAALNVGGIVLNALAIEHFILRHPSETKYPVDEKEMLLRHAYGLGYPEPNVTFALCRGSWSSPAVIKGVYSRGSDERIGTGKSGIFG
SISGDDQQEEDNGAKDSSVAHERLCRRHGIAVGVDL