; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019774 (gene) of Snake gourd v1 genome

Gene IDTan0019774
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionWD_REPEATS_REGION domain-containing protein
Genome locationLG09:59941643..59953335
RNA-Seq ExpressionTan0019774
SyntenyTan0019774
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0030687 - preribosome, large subunit precursor (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588180.1 Transcription factor basic helix-loop-helix 122, partial [Cucurbita argyrosperma subsp. sororia]1.4e-25992.93Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVPS+S+SQ PKP  RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHI+GV CPT MNRIANGAQ HQLPFKKQGISAVDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATKAGCLTVHDFESLYLQTNE GLSEDETKHLLHL LNEQLDFVRWNPANQDEVVCTSMKSKELKIFDI YISSKPVEVLRVRQ INN GS+NHKG
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAFIS+N+RLLASDTCGVI+MWDRRIGKLPCLELTSNSCSTLNRIQLN ENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLAS+I
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        EKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+MPTDQLFLRKPSWLPTDSIY+VGS SDEGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PDVRSPSHV+++DEL GA GENK+RQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKN+SC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

XP_022927843.1 uncharacterized protein LOC111434607 [Cucurbita moschata]6.1e-26092.72Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVPS+S+SQ PKP  RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHI+GV CPT MNRIANGAQDHQLPFKKQGIS VDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATKAGCLTVHDFESLYLQTNE GLSEDETKHLLHL LNEQLDFVRWNPANQDEVVCTSMKSKELKIFDI YISSKPVEVLRVRQ INN GS+NHKG
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAFIS+N+RLLASDTCGVI+MWDRRIGKLPCLELTSNSCSTLNRIQLN ENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLAS+I
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        EKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+MPTDQL LRKPSWLPTDSIY+VGS SDEGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PDVRSPSHV+++DE+ GAGGENK+RQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKN+SC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

XP_022967357.1 uncharacterized protein LOC111466905 [Cucurbita maxima]8.8e-25992.08Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVP +S+SQ PKP  RNRWKLSTVELNGKFDPKYR+DLSVLLVQSYTEVGAFPHLYHI+GV CPT MNRIANGAQDHQLPFKKQGISAVDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATKAGCLTVHDFESLYLQTNE GLSEDE KHLLHL LNEQLDFVRWNPANQDEVVCTSMKSKELKIFDI YISSKPVEVLRVRQ INN GS+NHKG
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAFIS+N+RLLASDTCGVI+MWDRRIGKLPCLELTSNSCSTLNRIQLN ENQIIFGAGKHG IYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLAS+I
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        EKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+MPTDQLFLRKP+WLPTDSIY+VGS SDEGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PDVRSPSHV+++DE+ GAGGENK+RQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKN+SC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

XP_023531333.1 uncharacterized protein LOC111793608 isoform X1 [Cucurbita pepo subsp. pepo]2.5e-26193.15Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVPS+S+SQ PKP  RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHI+GV CPT MNRIANGAQDHQLPFKKQGISAVDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATKAGCLTVHDFESLYLQTNE GLSEDETKHLLHL LNEQLDFVRWNPANQDEVVCTSMKSKELKIFDI YISSKPVEVLRVRQ INN GS+NHKG
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAFIS+N+RLLASDTCGVI+MWDRRIGKLPCLELTSNSCSTLNRIQLN ENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLAS+I
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        EKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+MPTDQLFLRKPSWLPTDSIY+VGS SDEGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PDVRSPSHV+++DE+ GAGGENK+RQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKN+SC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

XP_038878673.1 uncharacterized protein LOC120070854 isoform X1 [Benincasa hispida]2.0e-25891.43Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVPSDS+ Q PKPFPRNRWKLSTVELNGK DPKYRQDLSVLLVQSY EVG FPHLYHIDGVFCPT MNRI +GAQ HQLPFKKQGISAVDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATK GCLTVHDFESLYLQTNETGLSEDETKHL+HL LNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQ INN+GS+NHK 
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAFISD+SRLLASDTCGVINMWDRR G LPCLELTSNSCS LNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKS KLASMI
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        E+IGTLKEQANI+ KEIHSID NP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAW+NDS++PTDQLFLRKPSWLPTDSIY VGSSS EGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PD RSPSHVDY+D+LCGA  ENKKRQNR+V LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKN+SC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

TrEMBL top hitse value%identityAlignment
A0A0A0LX91 Uncharacterized protein6.4e-25590.36Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVPSDS+ Q PKPF RNRWKLSTVELNGK DPKYRQ+LS LLVQSY EVGAFPHLYHIDGVFCPT MNRI  GAQ HQLPFKKQGISAVDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATK GCLTVHDFESLYLQTNETGLSE+E K LLHL LNEQLDFVRWNP NQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQ INN+GS+NHKG
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAF SDNSRLLASDTCGVINMWDRRIG LPCLELTSNSC TLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKS KLAS+I
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        EKIGTLKEQ NI+ KEIHSID NP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAW+NDS++PTDQLFLRKPSWLPTDSIY VGSSSDEGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PD RSPSHV+Y+DELCGA  E+KKRQNRFV LSEGVT+CAAHPLNGTI AGTKNSSLIMISQK+QSC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

A0A6J1DJ92 uncharacterized protein LOC111021413 isoform X43.3e-25186.94Show/hide
Query:  MDKYLVPSDSVSQFPKPFP-RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKG
        MDKYLVPSDS SQ PKP P RNRWKLS VELNGKFDPKYRQDLSVLL+QSYTEVGAFPHLYHIDG+FCPT +NRIAN AQDH LPFKKQGISAVDFDNKG
Subjt:  MDKYLVPSDSVSQFPKPFP-RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKG

Query:  IYLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHK
        IYLVS TK GCLTVHDFESLY QTNE G SEDE KHLLHL L EQLDFVRWNPANQDEVVCTSMKSKEL+IFDIGYISSKPVEVLR RQ INNLG++NHK
Subjt:  IYLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHK

Query:  GLSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASM
        GLSDIAFISDNSRLLASDTCG INMWDRRIG LPCLELTSNSCSTLNRIQLNVENQIIFG+GKHG+IYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASM
Subjt:  GLSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASM

Query:  IEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDS-----------------
        IEKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+M +DQLFLRKPSWLPTDS                 
Subjt:  IEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDS-----------------

Query:  ------IYAVGSSSDEGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS
              I+AVGSSSDEGIHLLDFHPD RSPSHVDY+D+L  AGGENKKRQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQK QS
Subjt:  ------IYAVGSSSDEGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS

A0A6J1DKZ0 uncharacterized protein LOC111021413 isoform X53.8e-25591.22Show/hide
Query:  MDKYLVPSDSVSQFPKPFP-RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKG
        MDKYLVPSDS SQ PKP P RNRWKLS VELNGKFDPKYRQDLSVLL+QSYTEVGAFPHLYHIDG+FCPT +NRIAN AQDH LPFKKQGISAVDFDNKG
Subjt:  MDKYLVPSDSVSQFPKPFP-RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKG

Query:  IYLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHK
        IYLVS TK GCLTVHDFESLY QTNE G SEDE KHLLHL L EQLDFVRWNPANQDEVVCTSMKSKEL+IFDIGYISSKPVEVLR RQ INNLG++NHK
Subjt:  IYLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHK

Query:  GLSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASM
        GLSDIAFISDNSRLLASDTCG INMWDRRIG LPCLELTSNSCSTLNRIQLNVENQIIFG+GKHG+IYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASM
Subjt:  GLSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASM

Query:  IEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDF
        IEKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+M +DQLFLRKPSWLPTDSI+AVGSSSDEGIHLLDF
Subjt:  IEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDF

Query:  HPDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS
        HPD RSPSHVDY+D+L  AGGENKKRQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQK QS
Subjt:  HPDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS

A0A6J1EM59 uncharacterized protein LOC1114346073.0e-26092.72Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVPS+S+SQ PKP  RNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHI+GV CPT MNRIANGAQDHQLPFKKQGIS VDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATKAGCLTVHDFESLYLQTNE GLSEDETKHLLHL LNEQLDFVRWNPANQDEVVCTSMKSKELKIFDI YISSKPVEVLRVRQ INN GS+NHKG
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAFIS+N+RLLASDTCGVI+MWDRRIGKLPCLELTSNSCSTLNRIQLN ENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLAS+I
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        EKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+MPTDQL LRKPSWLPTDSIY+VGS SDEGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PDVRSPSHV+++DE+ GAGGENK+RQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKN+SC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

A0A6J1HQK8 uncharacterized protein LOC1114669054.3e-25992.08Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        MDKYLVP +S+SQ PKP  RNRWKLSTVELNGKFDPKYR+DLSVLLVQSYTEVGAFPHLYHI+GV CPT MNRIANGAQDHQLPFKKQGISAVDFDNKGI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG
        YLVSATKAGCLTVHDFESLYLQTNE GLSEDE KHLLHL LNEQLDFVRWNPANQDEVVCTSMKSKELKIFDI YISSKPVEVLRVRQ INN GS+NHKG
Subjt:  YLVSATKAGCLTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKG

Query:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI
        LSDIAFIS+N+RLLASDTCGVI+MWDRRIGKLPCLELTSNSCSTLNRIQLN ENQIIFGAGKHG IYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLAS+I
Subjt:  LSDIAFISDNSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMI

Query:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH
        EKIGTLKEQANII KEIHSIDLNP CPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDS+MPTDQLFLRKP+WLPTDSIY+VGS SDEGIHLLDFH
Subjt:  EKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFH

Query:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC
        PDVRSPSHV+++DE+ GAGGENK+RQNRFV LSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKN+SC
Subjt:  PDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQSC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G12920.1 Transducin/WD40 repeat-like superfamily protein8.0e-14150.82Show/hide
Query:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI
        M+KYLV  +      +   R  WK S +E+NG+ D  YR +L   +  SY+E+G F H YH++   C T M +I +    +Q P   +G++ +DFDN+GI
Subjt:  MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGI

Query:  YLVSATKAGCLTVHDFESLYLQTN-ETGLSEDETKHLLHLFL--NEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSEN
        +LVS T++GCL VHDFESLY Q+    G +EDE+KH++H       + D  RWNP+NQ+EV CTS K  ++ IFDI Y+S KP EVL+ RQ ++ +G + 
Subjt:  YLVSATKAGCLTVHDFESLYLQTN-ETGLSEDETKHLLHLFL--NEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSEN

Query:  HKGLSDIAFISD-NSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEV----------
         +GLSD+A  SD +SR+ + DT G++++WDRR G  PC+EL+++   ++  IQ+ V+NQ IFGAGK G+I+IWDLRGGR S AFQ+ K+V          
Subjt:  HKGLSDIAFISD-NSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEV----------

Query:  ---CHPPLKSLKLASMIEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSI
              PL SL LA  ++KI +LK Q+ I+ KEIHSID+NP  P+QLAFHLDDGWSG+LD+Y  +VTH+HCPPPAWL+ S+   D L LRKPSWLPT SI
Subjt:  ---CHPPLKSLKLASMIEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSI

Query:  YAVGSSSDEGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKR-----QNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS
        Y VGS S++GIH+LDFHP  RSP HVDYD++       N+KR     +N+FV+LSE VT CAAHPLNG I+AGT+NSSL++I+Q + S
Subjt:  YAVGSSSDEGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKR-----QNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS

AT5G12920.2 Transducin/WD40 repeat-like superfamily protein1.2e-13348.59Show/hide
Query:  WKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNK------------------------
        WK S +E+NG+ D  YR +L   +  SY+E+G F H YH++   C T M +I +    +Q P   +G++ +DFDN+                        
Subjt:  WKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNK------------------------

Query:  --------GIYLVSATKAGCLTVHDFESLYLQTN-ETGLSEDETKHLLHLFL--NEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVR
                GI+LVS T++GCL VHDFESLY Q+    G +EDE+KH++H       + D  RWNP+NQ+EV CTS K  ++ IFDI Y+S KP EVL+ R
Subjt:  --------GIYLVSATKAGCLTVHDFESLYLQTN-ETGLSEDETKHLLHLFL--NEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVR

Query:  QGINNLGSENHKGLSDIAFISD-NSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEV
        Q ++ +G +  +GLSD+A  SD +SR+ + DT G++++WDRR G  PC+EL+++   ++  IQ+ V+NQ IFGAGK G+I+IWDLRGGR S AFQ+ K+V
Subjt:  QGINNLGSENHKGLSDIAFISD-NSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEV

Query:  -------------CHPPLKSLKLASMIEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLR
                        PL SL LA  ++KI +LK Q+ I+ KEIHSID+NP  P+QLAFHLDDGWSG+LD+Y  +VTH+HCPPPAWL+ S+   D L LR
Subjt:  -------------CHPPLKSLKLASMIEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLR

Query:  KPSWLPTDSIYAVGSSSDEGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKR-----QNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS
        KPSWLPT SIY VGS S++GIH+LDFHP  RSP HVDYD++       N+KR     +N+FV+LSE VT CAAHPLNG I+AGT+NSSL++I+Q + S
Subjt:  KPSWLPTDSIYAVGSSSDEGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKR-----QNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS

AT5G12920.3 Transducin/WD40 repeat-like superfamily protein1.2e-13650.21Show/hide
Query:  WKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGIYLVSATKAGCLTVHDFESLYLQ
        WK S +E+NG+ D  YR +L   +  SY+E+G F H YH++   C T M +I +    +Q P   +G++ +DFDN+GI+LVS T++GCL VHDFESLY Q
Subjt:  WKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGIYLVSATKAGCLTVHDFESLYLQ

Query:  TN-ETGLSEDETKHLLHLFL--NEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVE---------------------------VLRVRQGINN
        +    G +EDE+KH++H       + D  RWNP+NQ+EV CTS K  ++ IFDI Y+S KP E                           VL+ RQ ++ 
Subjt:  TN-ETGLSEDETKHLLHLFL--NEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVE---------------------------VLRVRQGINN

Query:  LGSENHKGLSDIAFISD-NSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPL
        +G +  +GLSD+A  SD +SR+ + DT G++++WDRR G  PC+EL+++   ++  IQ+ V+NQ IFGAGK G+I+IWDLRGGR S AFQ+ K++   PL
Subjt:  LGSENHKGLSDIAFISD-NSRLLASDTCGVINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPL

Query:  KSLKLASMIEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSD
         SL LA  ++KI +LK Q+ I+ KEIHSID+NP  P+QLAFHLDDGWSG+LD+Y  +VTH+HCPPPAWL+ S+   D L LRKPSWLPT SIY VGS S+
Subjt:  KSLKLASMIEKIGTLKEQANIISKEIHSIDLNPVCPYQLAFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSD

Query:  EGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKR-----QNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS
        +GIH+LDFHP  RSP HVDYD++       N+KR     +N+FV+LSE VT CAAHPLNG I+AGT+NSSL++I+Q + S
Subjt:  EGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKR-----QNRFVNLSEGVTSCAAHPLNGTIIAGTKNSSLIMISQKNQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAGTACTTGGTTCCTTCCGATTCAGTATCCCAATTTCCTAAACCGTTCCCGAGGAATAGGTGGAAATTGAGCACCGTTGAATTGAATGGGAAGTTTGATCCGAA
GTACCGCCAGGACTTGTCCGTGCTTCTTGTGCAATCTTATACCGAGGTTGGAGCGTTTCCTCATTTGTATCATATAGACGGCGTTTTTTGTCCAACGGGTATGAATCGAA
TTGCCAATGGAGCTCAGGATCATCAACTTCCATTTAAGAAACAAGGTATATCTGCCGTGGACTTCGACAACAAGGGAATATACTTGGTTTCAGCAACGAAAGCAGGTTGT
TTAACAGTGCACGACTTTGAAAGTCTTTATTTGCAGACTAATGAAACAGGCTTGAGTGAAGACGAGACCAAACACTTGCTGCACCTTTTCCTAAATGAACAGCTTGACTT
TGTACGGTGGAATCCTGCCAACCAAGATGAGGTGGTCTGTACATCCATGAAAAGTAAGGAACTAAAGATCTTTGACATTGGTTATATTTCTTCAAAACCAGTCGAAGTGT
TGAGAGTAAGACAAGGAATCAATAATTTAGGGTCTGAAAATCATAAAGGTTTATCCGATATAGCATTTATTTCAGATAATTCGAGGTTACTTGCATCTGATACATGTGGT
GTAATCAATATGTGGGACAGAAGAATTGGAAAACTTCCATGCCTTGAGCTTACCAGTAATTCATGCAGTACCCTTAACAGAATCCAGCTGAATGTTGAAAATCAGATTAT
CTTTGGCGCTGGTAAGCATGGAGTCATCTATATTTGGGATCTTCGTGGAGGGAGAACATCTGGTGCTTTTCAAAATCATAAAGAGGTGTGTCATCCTCCTCTAAAATCAT
TGAAGTTAGCCTCAATGATAGAGAAAATTGGGACTTTGAAGGAGCAAGCGAATATCATTTCAAAGGAAATACACTCTATTGATCTCAACCCAGTTTGTCCTTATCAGTTG
GCCTTCCATCTTGACGACGGATGGTCAGGTATTTTGGATGTGTATAATTTTCAAGTCACACATATTCATTGTCCGCCCCCAGCTTGGTTAAATGATTCCAGCATGCCCAC
AGATCAATTATTTTTAAGGAAACCATCATGGCTGCCTACAGATTCTATCTATGCAGTTGGATCTTCTTCTGATGAAGGCATACATCTCTTAGATTTTCATCCCGACGTCC
GCTCTCCCAGCCATGTGGACTACGATGATGAGTTATGTGGTGCTGGAGGAGAAAACAAAAAACGACAGAACAGGTTTGTGAATTTGTCTGAAGGAGTTACTTCTTGTGCT
GCTCACCCCCTCAATGGCACCATCATAGCTGGAACCAAGAATTCGTCATTGATCATGATTTCCCAGAAGAATCAATCATGTTAG
mRNA sequenceShow/hide mRNA sequence
GAAAAGCTCACATTCCTTCCTCACAGTCACAACCAACAAGCAGCCGTTCACAATTCACAATTCACAATTCGCAGATTACATTCACCGGAATCAAGACTATTCGCCGGAAT
TATAAGCTGACGGAAGCGAAGGGAGCAGCCATTTCCGGTCGTTGTCGTCACTGTTACCCGGCCAAAACAAAAATGAAGCTGTGAGCCGTAAACAAGGTGGAGAAAAGGTC
TGGTGCGGAAGAACAAGTACAGCATCTCCTTCGAGCATCCTCAGTTTCGGGAACTCAAATTGAAGCTTCGTTTCATCAGCCATGGACAAGTACTTGGTTCCTTCCGATTC
AGTATCCCAATTTCCTAAACCGTTCCCGAGGAATAGGTGGAAATTGAGCACCGTTGAATTGAATGGGAAGTTTGATCCGAAGTACCGCCAGGACTTGTCCGTGCTTCTTG
TGCAATCTTATACCGAGGTTGGAGCGTTTCCTCATTTGTATCATATAGACGGCGTTTTTTGTCCAACGGGTATGAATCGAATTGCCAATGGAGCTCAGGATCATCAACTT
CCATTTAAGAAACAAGGTATATCTGCCGTGGACTTCGACAACAAGGGAATATACTTGGTTTCAGCAACGAAAGCAGGTTGTTTAACAGTGCACGACTTTGAAAGTCTTTA
TTTGCAGACTAATGAAACAGGCTTGAGTGAAGACGAGACCAAACACTTGCTGCACCTTTTCCTAAATGAACAGCTTGACTTTGTACGGTGGAATCCTGCCAACCAAGATG
AGGTGGTCTGTACATCCATGAAAAGTAAGGAACTAAAGATCTTTGACATTGGTTATATTTCTTCAAAACCAGTCGAAGTGTTGAGAGTAAGACAAGGAATCAATAATTTA
GGGTCTGAAAATCATAAAGGTTTATCCGATATAGCATTTATTTCAGATAATTCGAGGTTACTTGCATCTGATACATGTGGTGTAATCAATATGTGGGACAGAAGAATTGG
AAAACTTCCATGCCTTGAGCTTACCAGTAATTCATGCAGTACCCTTAACAGAATCCAGCTGAATGTTGAAAATCAGATTATCTTTGGCGCTGGTAAGCATGGAGTCATCT
ATATTTGGGATCTTCGTGGAGGGAGAACATCTGGTGCTTTTCAAAATCATAAAGAGGTGTGTCATCCTCCTCTAAAATCATTGAAGTTAGCCTCAATGATAGAGAAAATT
GGGACTTTGAAGGAGCAAGCGAATATCATTTCAAAGGAAATACACTCTATTGATCTCAACCCAGTTTGTCCTTATCAGTTGGCCTTCCATCTTGACGACGGATGGTCAGG
TATTTTGGATGTGTATAATTTTCAAGTCACACATATTCATTGTCCGCCCCCAGCTTGGTTAAATGATTCCAGCATGCCCACAGATCAATTATTTTTAAGGAAACCATCAT
GGCTGCCTACAGATTCTATCTATGCAGTTGGATCTTCTTCTGATGAAGGCATACATCTCTTAGATTTTCATCCCGACGTCCGCTCTCCCAGCCATGTGGACTACGATGAT
GAGTTATGTGGTGCTGGAGGAGAAAACAAAAAACGACAGAACAGGTTTGTGAATTTGTCTGAAGGAGTTACTTCTTGTGCTGCTCACCCCCTCAATGGCACCATCATAGC
TGGAACCAAGAATTCGTCATTGATCATGATTTCCCAGAAGAATCAATCATGTTAG
Protein sequenceShow/hide protein sequence
MDKYLVPSDSVSQFPKPFPRNRWKLSTVELNGKFDPKYRQDLSVLLVQSYTEVGAFPHLYHIDGVFCPTGMNRIANGAQDHQLPFKKQGISAVDFDNKGIYLVSATKAGC
LTVHDFESLYLQTNETGLSEDETKHLLHLFLNEQLDFVRWNPANQDEVVCTSMKSKELKIFDIGYISSKPVEVLRVRQGINNLGSENHKGLSDIAFISDNSRLLASDTCG
VINMWDRRIGKLPCLELTSNSCSTLNRIQLNVENQIIFGAGKHGVIYIWDLRGGRTSGAFQNHKEVCHPPLKSLKLASMIEKIGTLKEQANIISKEIHSIDLNPVCPYQL
AFHLDDGWSGILDVYNFQVTHIHCPPPAWLNDSSMPTDQLFLRKPSWLPTDSIYAVGSSSDEGIHLLDFHPDVRSPSHVDYDDELCGAGGENKKRQNRFVNLSEGVTSCA
AHPLNGTIIAGTKNSSLIMISQKNQSC