; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g07050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g07050
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTPR_REGION domain-containing protein
Genome locationchr5:4963334..4966689
RNA-Seq ExpressionMoc05g07050
SyntenyMoc05g07050
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582196.1 Sperm-associated antigen 1A, partial [Cucurbita argyrosperma subsp. sororia]1.0e-20582.29Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA
        MNSN GKNF+FDLGLG+S SKSLNDQKNK+PSYSSYASS  SYSSTQTRPAW QP++ SWTHQ A+P L+N PSSMVGDIFGKSW ST NSG+TAGIGI 
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA

Query:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS
        EKNPNLFGDLVSS+LGSGKSN N+PLKNA P  AS SS AA N NSFSM NM DSLP+A+SNP K++GNWSF+NLSS NS   NQSNTTNIKAPNLGG S
Subjt:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS

Query:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE
        MSST GSGKTSSSKDPFGSLVDFGSKSSG LNSTS SQKINSSED FGDFQNAS PST  F SSG SGSGVGF GSSF+S +NM  FGMP M+FG KVQ+
Subjt:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE

Query:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA
        TVQT+ +DPLD+LFSSSKA  GGA +AS A G PQS+DADDWG+DSEFGGGGHD+GGSTTEIEGLP PPAGVT+SLA NKGVD YKQGQYADAIKWLSWA
Subjt:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA

Query:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        V+LFEKAGD+AAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANV+VLVQRALLYESMEKY+LG+EDLRAVLKIDPG
Subjt:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

XP_008438908.1 PREDICTED: epidermal growth factor receptor substrate 15 homolog [Cucumis melo]6.1e-20680.82Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA
        MNSNFGKNFEFDLGLGSS SKSLNDQKNK+ SYSSY SS  SYSST TRPAW QP++ SWTH     QAARP LSN+P+SMVGDIFGK+WGST  SGSTA
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA

Query:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN
        GIGI EKNPNLFGDLV SALGSGKSN+N+PLKN  P  AS SS AALNRNSFSMGNM DSLP+++SNPSK++GNWSF+NLS+YN+G +NQSNTTNIK PN
Subjt:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN

Query:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG
        LGG SMSST G GKTSS+KDPFGSLVDFGSKSSG LNST+ +Q I SSEDSFGDFQNA NPST TFPSS S+ +GV F GSSFNSG+NMGDFGMP M+F 
Subjt:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG

Query:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK
         KVQ+TVQTTASDPLD+LFSSSKAP  G ++AS   G  QS+DADDWG+DS+FGGGGHDVGGSTTEIEGLP PPAGVT+S A NKGVD Y+QGQYADAIK
Subjt:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK

Query:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        WLSWAVILFEK G++AAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKY+LGAEDLRAVLK+DPG
Subjt:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

XP_022137927.1 P17/29C-like protein DDB_G0287399 [Momordica charantia]1.4e-258100Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA
        MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA

Query:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS
        EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS
Subjt:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS

Query:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE
        MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE
Subjt:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE

Query:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA
        TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA
Subjt:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA

Query:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
Subjt:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

XP_022955813.1 nuclear pore complex protein NUP62-like [Cucurbita moschata]8.0e-20682.29Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA
        MNSN GKNF+FDLGLG+S SKSLNDQKNK+PSYSSYASS  SYSSTQTRPAW QP++ SWTHQ A+P LSN PSSMVGDIFGKSW ST NSGSTA IGI 
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA

Query:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS
        EKNPNLFGDLVSS+LGSGKSN N+PLKNA PAS S    AA N NSFSM NM DSLP+A+SNP K++GNWSF+NLSS NS   NQSNTTNIKAPNLGG S
Subjt:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS

Query:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE
        MSST GSGKTSS+KDPFGSLVDFGSKSSG LNSTS SQKINSSED FGDFQNAS PST  FPSSG SGSGVGF GSSF+S +N+  FGMP M+FG KVQ+
Subjt:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE

Query:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA
        TVQT+ +DPLD+LFSSSKA  GGA +AS A GVPQS+DADDWG+DSEFGGGGHD+GGSTTEIEGLP PPAGVT+SLA NKGVD YKQGQYADAIKWLSWA
Subjt:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA

Query:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        V+LFEKAGD+AAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANV+VLVQRALLYESMEKY+LG+EDLRAVLKIDPG
Subjt:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

XP_038901127.1 cell wall protein RBR3 [Benincasa hispida]1.2e-20680.12Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSP-SYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGST
        MNSNFGKNFEFDLGLGSS SKSLNDQKNK+P SYSSYASSG SYSSTQTRPAW QP++ SWTH     QAA P LSN+P+SMVGDIFGK+WGST  SGST
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSP-SYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGST

Query:  AGIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAP
        AGIGI EKNPNLFGDLV SALGS KSN+N PLKNA P  AS SS AALN+NSFSMGNM DSLP+++SNP K++GNWSFENLSSYNS ++NQSNTTNIK P
Subjt:  AGIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAP

Query:  NLGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQ----------------------NASNPSTRTFPSSGSSGSGVG
        NLGG SMSST GSGKTSSSKDPFGSLVDFGSKSSGKLNS S SQ INSSEDSFGDFQ                      NASNPST  FPSS SSG+G  
Subjt:  NLGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQ----------------------NASNPSTRTFPSSGSSGSGVG

Query:  FNGSSFNSGLNMGDFGMPPMNFGPKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGV
        F GSSFNSG+NMGDFGMP M+FG KVQ+TVQTTASDPLD+LFSSSKAP GGA +AS   G  QS+DADDWG+D EFGGGGHDVGGSTTEIEGLP PPAGV
Subjt:  FNGSSFNSGLNMGDFGMPPMNFGPKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGV

Query:  TSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLR
        T+SLA NKGVD YKQGQYADAIKWLSWAVILFEKAG++AA+VEVLSTRASCYKEVGEYKKAVVDCTKVLDQD ANVTVLVQRALLYESMEKYRLGAEDLR
Subjt:  TSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLR

Query:  AVLKIDPG
        AVLKIDPG
Subjt:  AVLKIDPG

TrEMBL top hitse value%identityAlignment
A0A1S3AXK7 epidermal growth factor receptor substrate 15 homolog3.0e-20680.82Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA
        MNSNFGKNFEFDLGLGSS SKSLNDQKNK+ SYSSY SS  SYSST TRPAW QP++ SWTH     QAARP LSN+P+SMVGDIFGK+WGST  SGSTA
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA

Query:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN
        GIGI EKNPNLFGDLV SALGSGKSN+N+PLKN  P  AS SS AALNRNSFSMGNM DSLP+++SNPSK++GNWSF+NLS+YN+G +NQSNTTNIK PN
Subjt:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN

Query:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG
        LGG SMSST G GKTSS+KDPFGSLVDFGSKSSG LNST+ +Q I SSEDSFGDFQNA NPST TFPSS S+ +GV F GSSFNSG+NMGDFGMP M+F 
Subjt:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG

Query:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK
         KVQ+TVQTTASDPLD+LFSSSKAP  G ++AS   G  QS+DADDWG+DS+FGGGGHDVGGSTTEIEGLP PPAGVT+S A NKGVD Y+QGQYADAIK
Subjt:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK

Query:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        WLSWAVILFEK G++AAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKY+LGAEDLRAVLK+DPG
Subjt:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

A0A5D3D165 Epidermal growth factor receptor substrate 15-like protein3.0e-20680.82Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA
        MNSNFGKNFEFDLGLGSS SKSLNDQKNK+ SYSSY SS  SYSST TRPAW QP++ SWTH     QAARP LSN+P+SMVGDIFGK+WGST  SGSTA
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA

Query:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN
        GIGI EKNPNLFGDLV SALGSGKSN+N+PLKN  P  AS SS AALNRNSFSMGNM DSLP+++SNPSK++GNWSF+NLS+YN+G +NQSNTTNIK PN
Subjt:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN

Query:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG
        LGG SMSST G GKTSS+KDPFGSLVDFGSKSSG LNST+ +Q I SSEDSFGDFQNA NPST TFPSS S+ +GV F GSSFNSG+NMGDFGMP M+F 
Subjt:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG

Query:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK
         KVQ+TVQTTASDPLD+LFSSSKAP  G ++AS   G  QS+DADDWG+DS+FGGGGHDVGGSTTEIEGLP PPAGVT+S A NKGVD Y+QGQYADAIK
Subjt:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK

Query:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        WLSWAVILFEK G++AAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKY+LGAEDLRAVLK+DPG
Subjt:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

A0A6J1C9M7 P17/29C-like protein DDB_G02873996.7e-259100Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA
        MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA

Query:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS
        EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS
Subjt:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS

Query:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE
        MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE
Subjt:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE

Query:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA
        TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA
Subjt:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA

Query:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
Subjt:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

A0A6J1GV19 nuclear pore complex protein NUP62-like3.9e-20682.29Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA
        MNSN GKNF+FDLGLG+S SKSLNDQKNK+PSYSSYASS  SYSSTQTRPAW QP++ SWTHQ A+P LSN PSSMVGDIFGKSW ST NSGSTA IGI 
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIA

Query:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS
        EKNPNLFGDLVSS+LGSGKSN N+PLKNA PAS S    AA N NSFSM NM DSLP+A+SNP K++GNWSF+NLSS NS   NQSNTTNIKAPNLGG S
Subjt:  EKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRS

Query:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE
        MSST GSGKTSS+KDPFGSLVDFGSKSSG LNSTS SQKINSSED FGDFQNAS PST  FPSSG SGSGVGF GSSF+S +N+  FGMP M+FG KVQ+
Subjt:  MSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQE

Query:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA
        TVQT+ +DPLD+LFSSSKA  GGA +AS A GVPQS+DADDWG+DSEFGGGGHD+GGSTTEIEGLP PPAGVT+SLA NKGVD YKQGQYADAIKWLSWA
Subjt:  TVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWA

Query:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        V+LFEKAGD+AAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANV+VLVQRALLYESMEKY+LG+EDLRAVLKIDPG
Subjt:  VILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

A0A6J1I9E6 jacalin-related lectin 5-like2.8e-20481.65Show/hide
Query:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA
        MNSNFGK+FEFDLG+GSS SKSLNDQKNK+ SYSSYASSG S SSTQTRPAW +P+  SWTH     QAARP LSN+P+SMVGDIFGKSWGST NSGS +
Subjt:  MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTH-----QAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTA

Query:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN
        GIGI EKNPNLFGDLV SALGSGKSNTN+ LKN   AS  ASS+AALNRNSFSMGNM++SLP+A+ NP+KS+ N SFENL+SYNSG +N+S TTNI+APN
Subjt:  GIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPN

Query:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG
          G SMSST GSGKTSSSKDPFGSLVDFGSK SG LNSTS SQKIN++EDSFGDFQNASNPST  FPSSGSSGSG+GFNGSSF+S LNMGDFGMP M+FG
Subjt:  LGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFG

Query:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK
         K QETVQT ASDPLD+LFSSSKA  GGA MA  A G PQS DAD WG DSEFGGG HDVGGSTTEIEGLP PPAGVT+SLA NKGVD YKQGQYADAIK
Subjt:  PKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIK

Query:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        WLSWAVILF+KAGD+AA VEVLSTRASCYKEVGEYKKAVVDCTKVLD DD NVTVLVQRALLYESMEKY+LGAEDLRAVLKIDPG
Subjt:  WLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

SwissProt top hitse value%identityAlignment
F1RBN2 Sperm-associated antigen 1A2.1e-0731.25Show/hide
Query:  GAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLA--NNKGVDFYKQGQYADAIKWLSWAVILFEKAG-DSAAIVEVL-S
        G   A+ +  VP S  A +  + +          GS  E   L  P   +   LA   N+G   +K GQ+ DA++  + A+    +AG DS   + VL S
Subjt:  GAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLA--NNKGVDFYKQGQYADAIKWLSWAVILFEKAG-DSAAIVEVL-S

Query:  TRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKID
         RA+C+ + G     + DCT+ L+    ++  L++RA+ YES+E+YR    D + VL+ID
Subjt:  TRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKID

Q07617 Sperm-associated antigen 17.7e-1031.93Show/hide
Query:  ATGVPQSMDADDWGMD----------SEFGGG--GHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAA--IVE
        A G PQ     + G D          +  GGG  GH  GG   E       PAG+ S     +G + ++ GQ+A+A    S A+ L E AG   A  +  
Subjt:  ATGVPQSMDADDWGMD----------SEFGGG--GHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAA--IVE

Query:  VLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPGL
        + S RA+CY + G     + DC + L+    ++  L++RA+ YE++E+Y     D + VL+ID GL
Subjt:  VLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPGL

Q80ZX8 Sperm-associated antigen 12.9e-0930.28Show/hide
Query:  GMDSEFGGGGHDVGGSTTEIEGLPTPP-AGVTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAA--IVEVLSTRASCYKEVGEYKKAVVDCTK
        G  +E  GG  +   ++T     P  P A    S    +G + ++ GQ+A+A    S A+   E  G + A  +  + S RA+CY + G  +  + DC +
Subjt:  GMDSEFGGGGHDVGGSTTEIEGLPTPP-AGVTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAA--IVEVLSTRASCYKEVGEYKKAVVDCTK

Query:  VLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPGL
         L+    +V  L++RA+ YE++E+YR    D + VL+ID G+
Subjt:  VLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPGL

Arabidopsis top hitse value%identityAlignment
AT1G56440.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-0526.12Show/hide
Query:  GGGGHDVGGSTTEIEGLPTPPAG---VTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDD
        G G +D       I  L +   G   + SS    +G +F+KQ ++ +AI   S ++ L   A          + RA  Y ++  Y++A VDCT+ L+ DD
Subjt:  GGGGHDVGGSTTEIEGLPTPPAG---VTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDD

Query:  ANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDP
          +    +RA   + +   +   ED    L+++P
Subjt:  ANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDP

AT1G56440.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-0526.12Show/hide
Query:  GGGGHDVGGSTTEIEGLPTPPAG---VTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDD
        G G +D       I  L +   G   + SS    +G +F+KQ ++ +AI   S ++ L   A          + RA  Y ++  Y++A VDCT+ L+ DD
Subjt:  GGGGHDVGGSTTEIEGLPTPPAG---VTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDD

Query:  ANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDP
          +    +RA   + +   +   ED    L+++P
Subjt:  ANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDP

AT3G16760.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-10350.6Show/hide
Query:  MNSNFGK-------NFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAA------RPTLSNAPSSMVGDIFGKSWGS
        MNSNFGK       +F+FDLGLGSS  + LN QK+++ SYSS        S++Q RPAW QP + SWTHQ A      R  + + P+SMVGDI GK+WGS
Subjt:  MNSNFGK-------NFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAA------RPTLSNAPSSMVGDIFGKSWGS

Query:  TPNSGSTAGIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSN
           SGS +GIGI  K+P+LFGDLV SA+G GKS+ N PLKNA P SAS SS     ++ +SMGN+ DSLP++ ++     G     N S ++ G+T+   
Subjt:  TPNSGSTAGIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSN

Query:  -TTNIKAPNLGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGV------GFNGSSFNS
           N+  P++   +  +  GSG  S+S DPFGSLV FGSKSSG   S       N+  D+FG+FQ  SN       S GS+G+        GF  S  N+
Subjt:  -TTNIKAPNLGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGV------GFNGSSFNS

Query:  GLNMGDFGMPPMNFGPKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNK
          + G F    ++FG +      ++A+D    +FS+SK         S A   PQ+   +DWG +S F GG     GSTTE++GLP PP GV+++ A NK
Subjt:  GLNMGDFGMPPMNFGPKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNK

Query:  GVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        G+D  +QGQYADAIKWLSWAVIL ++AGD A   EVLSTRASCYKEVGEYKKAV DCTKVLD D  NVT+LVQRALLYESMEKY+LGAEDLR VLKIDPG
Subjt:  GVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

AT3G16760.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-8947.4Show/hide
Query:  MNSNFGK-------NFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAA------RPTLSNAPSSMVGDIFGKSWGS
        MNSNFGK       +F+FDLGLGSS  + LN QK+++ SYSS        S++Q RPAW QP + SWTHQ A      R  + + P+SMVGDI GK+WGS
Subjt:  MNSNFGK-------NFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAA------RPTLSNAPSSMVGDIFGKSWGS

Query:  TPNSGSTAGIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSN
           SGS +GIGI  K+P+LFGDLV SA+G GKS+ N PLKNA P SAS SS     ++ +SMGN+ DSLP++ ++     G     N S ++ G+T+   
Subjt:  TPNSGSTAGIGIAEKNPNLFGDLVSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSN

Query:  -TTNIKAPNLGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGV------GFNGSSFNS
           N+  P++   +  +  GSG  S+S DPFGSLV FGSKSSG   S       N+  D+FG+FQ  SN       S GS+G+        GF  S  N+
Subjt:  -TTNIKAPNLGGRSMSSTTGSGKTSSSKDPFGSLVDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGV------GFNGSSFNS

Query:  GLNMGDFGMPPMNFGPKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNK
          + G F    ++FG +      ++A+D    +FS+SK         S A   PQ+   +DWG +S F GG     GSTTE++GLP PP GV+++ A NK
Subjt:  GLNMGDFGMPPMNFGPKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGATGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNK

Query:  GVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG
        G+D  +Q                   AGD A   EVLSTRASCYKEVGEYKKAV DCTKVLD D  NVT+LVQRALLYESMEKY+LGAEDLR VLKIDPG
Subjt:  GVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPG

AT3G25230.1 rotamase FKBP 14.5e-0530.11Show/hide
Query:  QYADAIKWLSWAVILFEKAGDSAAIVEVLS--TRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDP
        +Y  A+K++ +     E+    A  ++V      A+C  ++ +YK+A   CTKVL+ +  NV  L +RA  Y  +    L   D++  L+IDP
Subjt:  QYADAIKWLSWAVILFEKAGDSAAIVEVLS--TRASCYKEVGEYKKAVVDCTKVLDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCCAATTTCGGAAAGAATTTCGAGTTCGATTTGGGTCTCGGATCCTCTCCCTCCAAATCGCTCAACGATCAGAAGAACAAATCCCCATCGTATTCTTCATATGC
TTCCTCCGGTCCTTCATATTCGTCGACGCAGACCAGGCCGGCCTGGCAGCAGCCCAGCAGATCCTCGTGGACGCATCAGGCGGCGCGGCCCACCTTGTCCAATGCACCCT
CTTCAATGGTCGGCGATATATTTGGGAAGAGCTGGGGTTCCACGCCGAATTCTGGCTCCACTGCGGGCATTGGAATTGCGGAAAAGAATCCGAATCTGTTCGGCGATTTG
GTCAGTTCCGCCTTGGGATCAGGTAAGAGTAACACGAATTCTCCATTGAAAAATGCGACTCCGGCATCGGCATCGGCATCGTCAGCTGCTGCATTGAATAGGAATTCGTT
TTCGATGGGAAACATGACCGATTCGTTGCCAAGAGCCACTAGTAATCCGAGTAAAAGTACTGGAAATTGGAGTTTTGAGAATCTTAGCAGTTATAATAGTGGGCATACTA
ATCAGAGTAATACCACCAATATCAAGGCTCCAAATCTCGGAGGTCGGAGCATGAGTTCTACCACCGGTAGTGGTAAGACGAGCTCTAGCAAGGATCCCTTTGGTTCTTTA
GTTGACTTTGGATCTAAATCATCCGGAAAGCTGAATTCAACAAGTAGCAGTCAAAAGATCAATTCAAGCGAGGACTCATTTGGAGATTTCCAGAATGCTTCAAATCCAAG
CACTAGAACATTTCCTTCGAGTGGATCGAGTGGGAGTGGTGTTGGTTTCAATGGATCTAGTTTTAATTCTGGCTTAAACATGGGTGATTTCGGAATGCCCCCAATGAATT
TTGGTCCCAAGGTTCAAGAGACTGTTCAAACCACTGCCAGTGATCCGCTCGATGTGCTGTTTAGCTCATCCAAAGCCCCAGTTGGAGGTGCTGCAATGGCGTCTGGAGCA
ACTGGAGTGCCACAATCCATGGATGCCGATGATTGGGGAATGGATTCGGAGTTTGGGGGTGGTGGTCATGATGTGGGCGGCTCAACAACTGAGATTGAAGGACTTCCAAC
TCCTCCTGCAGGGGTGACATCTTCTTTGGCGAATAACAAGGGAGTCGATTTCTATAAGCAGGGACAATATGCTGATGCTATTAAGTGGCTTTCTTGGGCTGTAATTCTTT
TTGAGAAAGCTGGTGATAGTGCTGCCATAGTTGAAGTTTTGTCGACACGAGCTTCATGTTACAAAGAAGTTGGGGAATATAAGAAAGCAGTGGTTGATTGTACAAAGGTA
TTGGATCAGGATGATGCAAATGTAACCGTTCTCGTTCAACGTGCACTTCTGTACGAGAGTATGGAGAAGTACAGACTTGGAGCAGAAGACCTGAGGGCTGTCCTGAAGAT
CGATCCGGGATTACACTGGTTCCCCTCATGCTATCTAAATTTGCTGGGAATGACTACATTTTTCAATTACATGGAATTGGTTTTGTCCAGGATTCTAGTCAAATCGACCT
TCGCTCTTGGTGAGGCAAGGAATTCCATCAAATGTTCGTTGTTCAATATCTCGATTGCTCAAGAAGTTGGACTAGCAGCTCCTCGACCCGACTCGGAGTGGATTGACGGA
GAAGGGAAGGGCGGGCAGAGCCTCACGTTGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATTCCAATTTCGGAAAGAATTTCGAGTTCGATTTGGGTCTCGGATCCTCTCCCTCCAAATCGCTCAACGATCAGAAGAACAAATCCCCATCGTATTCTTCATATGC
TTCCTCCGGTCCTTCATATTCGTCGACGCAGACCAGGCCGGCCTGGCAGCAGCCCAGCAGATCCTCGTGGACGCATCAGGCGGCGCGGCCCACCTTGTCCAATGCACCCT
CTTCAATGGTCGGCGATATATTTGGGAAGAGCTGGGGTTCCACGCCGAATTCTGGCTCCACTGCGGGCATTGGAATTGCGGAAAAGAATCCGAATCTGTTCGGCGATTTG
GTCAGTTCCGCCTTGGGATCAGGTAAGAGTAACACGAATTCTCCATTGAAAAATGCGACTCCGGCATCGGCATCGGCATCGTCAGCTGCTGCATTGAATAGGAATTCGTT
TTCGATGGGAAACATGACCGATTCGTTGCCAAGAGCCACTAGTAATCCGAGTAAAAGTACTGGAAATTGGAGTTTTGAGAATCTTAGCAGTTATAATAGTGGGCATACTA
ATCAGAGTAATACCACCAATATCAAGGCTCCAAATCTCGGAGGTCGGAGCATGAGTTCTACCACCGGTAGTGGTAAGACGAGCTCTAGCAAGGATCCCTTTGGTTCTTTA
GTTGACTTTGGATCTAAATCATCCGGAAAGCTGAATTCAACAAGTAGCAGTCAAAAGATCAATTCAAGCGAGGACTCATTTGGAGATTTCCAGAATGCTTCAAATCCAAG
CACTAGAACATTTCCTTCGAGTGGATCGAGTGGGAGTGGTGTTGGTTTCAATGGATCTAGTTTTAATTCTGGCTTAAACATGGGTGATTTCGGAATGCCCCCAATGAATT
TTGGTCCCAAGGTTCAAGAGACTGTTCAAACCACTGCCAGTGATCCGCTCGATGTGCTGTTTAGCTCATCCAAAGCCCCAGTTGGAGGTGCTGCAATGGCGTCTGGAGCA
ACTGGAGTGCCACAATCCATGGATGCCGATGATTGGGGAATGGATTCGGAGTTTGGGGGTGGTGGTCATGATGTGGGCGGCTCAACAACTGAGATTGAAGGACTTCCAAC
TCCTCCTGCAGGGGTGACATCTTCTTTGGCGAATAACAAGGGAGTCGATTTCTATAAGCAGGGACAATATGCTGATGCTATTAAGTGGCTTTCTTGGGCTGTAATTCTTT
TTGAGAAAGCTGGTGATAGTGCTGCCATAGTTGAAGTTTTGTCGACACGAGCTTCATGTTACAAAGAAGTTGGGGAATATAAGAAAGCAGTGGTTGATTGTACAAAGGTA
TTGGATCAGGATGATGCAAATGTAACCGTTCTCGTTCAACGTGCACTTCTGTACGAGAGTATGGAGAAGTACAGACTTGGAGCAGAAGACCTGAGGGCTGTCCTGAAGAT
CGATCCGGGATTACACTGGTTCCCCTCATGCTATCTAAATTTGCTGGGAATGACTACATTTTTCAATTACATGGAATTGGTTTTGTCCAGGATTCTAGTCAAATCGACCT
TCGCTCTTGGTGAGGCAAGGAATTCCATCAAATGTTCGTTGTTCAATATCTCGATTGCTCAAGAAGTTGGACTAGCAGCTCCTCGACCCGACTCGGAGTGGATTGACGGA
GAAGGGAAGGGCGGGCAGAGCCTCACGTTGCAGTAG
Protein sequenceShow/hide protein sequence
MNSNFGKNFEFDLGLGSSPSKSLNDQKNKSPSYSSYASSGPSYSSTQTRPAWQQPSRSSWTHQAARPTLSNAPSSMVGDIFGKSWGSTPNSGSTAGIGIAEKNPNLFGDL
VSSALGSGKSNTNSPLKNATPASASASSAAALNRNSFSMGNMTDSLPRATSNPSKSTGNWSFENLSSYNSGHTNQSNTTNIKAPNLGGRSMSSTTGSGKTSSSKDPFGSL
VDFGSKSSGKLNSTSSSQKINSSEDSFGDFQNASNPSTRTFPSSGSSGSGVGFNGSSFNSGLNMGDFGMPPMNFGPKVQETVQTTASDPLDVLFSSSKAPVGGAAMASGA
TGVPQSMDADDWGMDSEFGGGGHDVGGSTTEIEGLPTPPAGVTSSLANNKGVDFYKQGQYADAIKWLSWAVILFEKAGDSAAIVEVLSTRASCYKEVGEYKKAVVDCTKV
LDQDDANVTVLVQRALLYESMEKYRLGAEDLRAVLKIDPGLHWFPSCYLNLLGMTTFFNYMELVLSRILVKSTFALGEARNSIKCSLFNISIAQEVGLAAPRPDSEWIDG
EGKGGQSLTLQ