; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014013 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014013
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionThioredoxin domain-containing protein
Genome locationscaffold3:46249011..46253643
RNA-Seq ExpressionSpg014013
SyntenySpg014013
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10429.1 uncharacterized protein E5676_scaffold459G00750 [Cucumis melo var. makuwa]0.0e+0072.14Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M S  DGDV CSPSWS A NWTVAGGCLE+TV YESFYSPINDEETV+S PK PL+L RPSPESGPCEITL F EKHEIRQVYVRSTARVYEMY+ TN+Q
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        +ENEYFCTVRCGAALRDEEVLHT+GI  VSAH+NGSNG VAE +S+  SNLNTNED+WVEVK  DGPTL HK++S TS+SGA+SV I+QD YEATAEITD
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        ANPCTSLTIRLLSLQNKS+VYVDE+YVFANPVDLEEES  ENSA+NSQSSLMSMLVPTLLQLSKTTGSS NNDGR   SN EG+H+LPK+ S+ LDST+S
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV
        VT L+Q  KSCVT+DDEVKL E+ ESD  V QP++ LQVPVK++ H  +EPL  IE+ILGQLV RMDRIENCFLRFEENMLKPINSI+GRLKQVEQQLE+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV

Query:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGD--------------
        LTKESH SEWPSCYRMSAPSFSAN S SNSFYNSGN HPSCGPIE DRKE+HS +SPI L+ISNSV SSLL PSLVVTAPEF N D              
Subjt:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGD--------------

Query:  ----DDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESE
            D D+E VV TAPEFSNGNDD QENQ+ EV VD+SKSK KPS+DD LASALAQF LSSSSIST EHS+TVAV+ PDLPNE+GNNHKK  + +L  SE
Subjt:  ----DDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESE

Query:  TDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCD
         DH SCS EID+ Q T NSASAS+S ANG N S S+HD S+K    D E V+   EC Y+  + +V  A  E+S  GM  LGNVEV  ET ED VSE   
Subjt:  TDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCD

Query:  IPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAP
        I IH   HHP+N  D  NVD N D  T++V +   DIDI+HDVLGFSRD SIVNFEIPILDVSFTS  DS S++ LK+LLG+  ESS+ ASC KESDD  
Subjt:  IPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAP

Query:  PIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN
        P GEQ +LILVEEE +EN +STNGPISVDMNYYTIMS+P+I ADGEN
Subjt:  PIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN

XP_008461308.1 PREDICTED: uncharacterized protein LOC103499935 [Cucumis melo]0.0e+0070.64Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M S  DGDV CSPSWS A NWTVAGGCLE+TV YESFYSPINDEETV+S PK PL+L RPSPESGPCEITL F EKHEIRQVYVRSTARVYEMY+ TN+Q
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        +ENEYFCTVRCGAALRDEEVLHT+GI  VSAH+NGSNG VAE +S+  SNLNTNED+WVEVK  DGPTL HK++S TS+SGA+SV I+QD YEATAEITD
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        ANPCTSLTIRLLSLQNKS+VYVDE+YVFANPVDLEEESP ENSA+NSQSSLMSMLVPTLLQLSKTT SS NNDGR   SN EG+H+LPK+ S+ L+ST+S
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV
        VT L Q  KSCVT+DDEVKL E+ ESD  V QP++ LQVPVK++ H  +EPL RIE+ILGQLV RMDRIENCFLRFEENMLKPINSI+GRLKQVEQQLE+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV

Query:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVV-----------------------
        LTKESH SEWPSCYRMSAPSFSAN S SNSFYNSGN HPSCGPIE DRKE+HS +SPI L+ISNSV SSLL PSLVV                       
Subjt:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVV-----------------------

Query:  -------------TAPEFLNGDDDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPN
                     TAPEF N DD D+E VV TAPEFSNGNDD QENQ+ EV VD+SKSK KPS+DD LASALAQF LSSSSIST EHS+TVAV+ PDLPN
Subjt:  -------------TAPEFLNGDDDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPN

Query:  ENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELG
        E+GNNHKK  + +L  SE DH SCS EID+ Q T NSASAS+S ANG N S S+HD S+K    D E V+   EC Y+  + +V  A  E+S  GM  LG
Subjt:  ENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELG

Query:  NVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGD
        NVEV+ ET+ED VSE   I IH   HHP+N  D  NVD N D  T++V +   DIDI+HDVLGFSRD SIVNFEIPILDVSFTS  DS S++ LK+LLG+
Subjt:  NVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGD

Query:  MVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN
          ESS+ ASC KESDD  P GEQ +LILVEEE +EN +STNGPISVDMNYYTIMS+P+I ADGEN
Subjt:  MVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN

XP_022151152.1 uncharacterized protein LOC111019145 [Momordica charantia]0.0e+0072.97Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M SQNDGD+G  PSWSSAANWTV GGCLE+TV YESFYSPINDE  VDS PK PLVL RPSPES PCEITL+F EKHEIRQVYVRSTARVYEMYY TNSQ
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        +ENEYFCTVRCGAA+RDEEVLHTNG   VSAH +G NG VAE NSQ GSNLNTNEDDW+EVK    P LDHKN+S  S+SG DSVRIKQDFYEATAEI D
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        A PCTSLTIRLLSLQNKSIVYVDEVYVFANPVD EEE PVENS ++SQSSLMSMLVPTLLQLSKTTG     DG    SN EG+  LP++GSKALDST+S
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVLT
        VT LQQEGKSC T DD VKL EKNESDR VRQ E+PLQVP KERT+DEPL+RIES+L QLV RMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVL+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVLT

Query:  KESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFSN
        KESH SEWPS YRMSAPSFSANESGSNSFYNSGN HPS  PIEPDRKELHS ASPIAL+ISNSV SSLL PS VVTAPEF NGDDDD             
Subjt:  KESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFSN

Query:  GNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSA
             QE QVPEVA++S  SKPKPSIDDVLASALAQF LSSSSI+T E++K  AV   +  NE+ NN++K+SS DL E+E DHISCSQE D  QCT NSA
Subjt:  GNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSA

Query:  SASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVD
        SAS+S ANG N S SR D SS            D +CKYQS DGDV VAS     KG  E+G+ EV   TSEDC  ET   PIH LH HPEN PD    D
Subjt:  SASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVD

Query:  GNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEED-KENATS
         + D M+  +E  DIDI+HDVLGFSR TSIV+FEIPILDVSF SI DS  +D LK LLGDM E +YGA C++ESDDA PIGEQ +LILVEEE+ +EN+TS
Subjt:  GNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEED-KENATS

Query:  TNGPISVDMNYYTIMSEPVIADGEN
         NG IS+DMNY TIM EP+IADGEN
Subjt:  TNGPISVDMNYYTIMSEPVIADGEN

XP_022991852.1 uncharacterized protein LOC111488374 isoform X1 [Cucurbita maxima]0.0e+0076.06Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M SQNDGDV CSPSW   ANWTVAGGCLE+TVVYESFYSPI++EETV+ GPKSPLVL RPSPESGPCEITL F EKHEIRQVYVRSTARVYEMY  T+SQ
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVR-IKQDFYEATAEIT
        NENEY CTVRCGAALRDEEVLHTNGI   SA INGSNG V E N+Q GSNLN NEDDWVEVK  DG  LDHKNNS    SG DS+R IKQDFYEATAEIT
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVR-IKQDFYEATAEIT

Query:  DANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTN
        DANPCTSLTIRLLSLQNKSIVYVDE+YVFANPVDLEEESPV N+A++SQSSLMSMLVPTLLQLSKTT SS +N  R S    EGIHIL K+GS ALDST+
Subjt:  DANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTN

Query:  SVTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVL
        SVT L+QEGKSCVT+DDEV+  E+ E DR VRQPE+ LQVPV ER H+EPL RIE++LGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV+
Subjt:  SVTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVL

Query:  TKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFS
        TK SH SEWPSCYRMSAPSFSA ES SNSFYNS N HPSCG   PD+KEL S  SPIAL++S+S GSSL+ PSLVVTAPEF N DD D+ +VV+ A EFS
Subjt:  TKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFS

Query:  NGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNS
        NGNDD QEN   EV VDSSKSKPKPSIDDVLASALAQFALSSSSIS  EHSKTV VR PDL NE+GNNHKKS S DL E+  DHISCSQE+D TQCT NS
Subjt:  NGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNS

Query:  ASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINV
        AS S+S  N  N S SRHD SSK    DD+ V+   E KY+SADG +     EQSAKGM ELGNVEVI ETSED +SE   IPIH LHHHP+N  D  N 
Subjt:  ASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINV

Query:  DGNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATS
          N DT    +   DIDI+HDVLGFSRDTSIVNFEIPILDVSFTSI DS S++ LKDLLGDM ESSY AS  KE DD  P GEQ DLILVEEE +ENA+S
Subjt:  DGNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATS

Query:  TNGPISVDMNYYTIMSEPVIADGENDR
        TNGPISVDMNYYTIMS+PVI DGEN R
Subjt:  TNGPISVDMNYYTIMSEPVIADGENDR

XP_038898715.1 uncharacterized protein LOC120086241 [Benincasa hispida]0.0e+0079.26Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M+SQNDGDV CSPSWS AANWTVAGGCLE+TV YESFYSPIN +ETV+SG KSPLVL  PSPESGPCEITL F EKHEIRQVYVRSTARVYEMYYATNSQ
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        NENEYFCTVRCGAALRDEEVLHTNGI GVSAH+NGSNG V E NSQ GSNLNTNED+WVEVK  DG  LDHKNNS TS+SGADSVRIKQDFYEATAEITD
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        ANPCTSLTIRLLSLQNKS+VYVDE+YVFANPVDLEEESPVENS ++SQSSLMSMLVPTLLQLSKTTGSS NNDG  S SNAEGIH+LPK+GS+ L+STNS
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHD--EPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV
        VT LQQ+ KS  T DDEVKL E+ ESDRPVRQPE+ LQVP ++R HD  E LHRIE+ILGQLVSRMDRIENCFLRFEENMLKPINSI+GRLKQVEQQLE+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHD--EPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV

Query:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEF
        LTKESH SEWPSCYRMSAPSFSANESGSNSFYNSGN HPSCGPI PD+KELHS ASPIAL+ISNSV SSLL PSLVVTAPEF NGD+DD+E VVVTAPEF
Subjt:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEF

Query:  SNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTN
        SNGN DGQENQVPEV VD  K+KPKPSIDDVLASALAQF LSSSSIST EHSKTVAVRSPDLPNE GNNHKKS S DL ESE DH SCS EID TQCTTN
Subjt:  SNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTN

Query:  SASASMSYANGGNLSSSRHDRSSKADKEDDEDVIN--DPECKYQSADGDVIVASA--EQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKP
        SASAS+  ANG NLS S  D S K    D E V+   + +C+   AD DV V  A  E + +GM EL NVEVI ETS D VSE   IPIH L HHPEN+P
Subjt:  SASASMSYANGGNLSSSRHDRSSKADKEDDEDVIN--DPECKYQSADGDVIVASA--EQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKP

Query:  DSINVDGNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSF-TSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEED
        D  N D N DT++V +   DIDI+HDVLGFSRDTSIVNFEIPILDVSF T+I DS S+D LKD  G   ESSYGASCLKE  D  P+ EQ +LILVEEE 
Subjt:  DSINVDGNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSF-TSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEED

Query:  KENATSTNGPISVDMNYYTIMSEPVIA-DGENDR
        +ENATSTNGPISVDMNYYTIMS+P+IA DGEN R
Subjt:  KENATSTNGPISVDMNYYTIMSEPVIA-DGENDR

TrEMBL top hitse value%identityAlignment
A0A1S3CE18 uncharacterized protein LOC1034999350.0e+0070.64Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M S  DGDV CSPSWS A NWTVAGGCLE+TV YESFYSPINDEETV+S PK PL+L RPSPESGPCEITL F EKHEIRQVYVRSTARVYEMY+ TN+Q
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        +ENEYFCTVRCGAALRDEEVLHT+GI  VSAH+NGSNG VAE +S+  SNLNTNED+WVEVK  DGPTL HK++S TS+SGA+SV I+QD YEATAEITD
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        ANPCTSLTIRLLSLQNKS+VYVDE+YVFANPVDLEEESP ENSA+NSQSSLMSMLVPTLLQLSKTT SS NNDGR   SN EG+H+LPK+ S+ L+ST+S
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV
        VT L Q  KSCVT+DDEVKL E+ ESD  V QP++ LQVPVK++ H  +EPL RIE+ILGQLV RMDRIENCFLRFEENMLKPINSI+GRLKQVEQQLE+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV

Query:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVV-----------------------
        LTKESH SEWPSCYRMSAPSFSAN S SNSFYNSGN HPSCGPIE DRKE+HS +SPI L+ISNSV SSLL PSLVV                       
Subjt:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVV-----------------------

Query:  -------------TAPEFLNGDDDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPN
                     TAPEF N DD D+E VV TAPEFSNGNDD QENQ+ EV VD+SKSK KPS+DD LASALAQF LSSSSIST EHS+TVAV+ PDLPN
Subjt:  -------------TAPEFLNGDDDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPN

Query:  ENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELG
        E+GNNHKK  + +L  SE DH SCS EID+ Q T NSASAS+S ANG N S S+HD S+K    D E V+   EC Y+  + +V  A  E+S  GM  LG
Subjt:  ENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELG

Query:  NVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGD
        NVEV+ ET+ED VSE   I IH   HHP+N  D  NVD N D  T++V +   DIDI+HDVLGFSRD SIVNFEIPILDVSFTS  DS S++ LK+LLG+
Subjt:  NVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGD

Query:  MVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN
          ESS+ ASC KESDD  P GEQ +LILVEEE +EN +STNGPISVDMNYYTIMS+P+I ADGEN
Subjt:  MVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN

A0A5A7UUB4 Uncharacterized protein0.0e+0070.64Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M S  DGDV CSPSWS A NWTVAGGCLE+TV YESFYSPINDEETV+S PK PL+L RPSPESGPCEITL F EKHEIRQVYVRSTARVYEMY+ TN+Q
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        +ENEYFCTVRCGAALRDEEVLHT+GI  VSAH+NGSNG VAE +S+  SNLNTNED+WVEVK  DGPTL HK++S TS+SGA+SV I+QD YEATAEITD
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        ANPCTSLTIRLLSLQNKS+VYVDE+YVFANPVDLEEESP ENSA+NSQSSLMSMLVPTLLQLSKTT SS NNDGR   SN EG+H+LPK+ S+ L+ST+S
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV
        VT L Q  KSCVT+DDEVKL E+ ESD  V QP++ LQVPVK++ H  +EPL RIE+ILGQLV RMDRIENCFLRFEENMLKPINSI+GRLKQVEQQLE+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV

Query:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVV-----------------------
        LTKESH SEWPSCYRMSAPSFSAN S SNSFYNSGN HPSCGPIE DRKE+HS +SPI L+ISNSV SSLL PSLVV                       
Subjt:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVV-----------------------

Query:  -------------TAPEFLNGDDDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPN
                     TAPEF N DD D+E VV TAPEFSNGNDD QENQ+ EV VD+SKSK KPS+DD LASALAQF LSSSSIST EHS+TVAV+ PDLPN
Subjt:  -------------TAPEFLNGDDDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPN

Query:  ENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELG
        E+GNNHKK  + +L  SE DH SCS EID+ Q T NSASAS+S ANG N S S+HD S+K    D E V+   EC Y+  + +V  A  E+S  GM  LG
Subjt:  ENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELG

Query:  NVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGD
        NVEV+ ET+ED VSE   I IH   HHP+N  D  NVD N D  T++V +   DIDI+HDVLGFSRD SIVNFEIPILDVSFTS  DS S++ LK+LLG+
Subjt:  NVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGD

Query:  MVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN
          ESS+ ASC KESDD  P GEQ +LILVEEE +EN +STNGPISVDMNYYTIMS+P+I ADGEN
Subjt:  MVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN

A0A5D3CJN8 Uncharacterized protein0.0e+0072.14Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M S  DGDV CSPSWS A NWTVAGGCLE+TV YESFYSPINDEETV+S PK PL+L RPSPESGPCEITL F EKHEIRQVYVRSTARVYEMY+ TN+Q
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        +ENEYFCTVRCGAALRDEEVLHT+GI  VSAH+NGSNG VAE +S+  SNLNTNED+WVEVK  DGPTL HK++S TS+SGA+SV I+QD YEATAEITD
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        ANPCTSLTIRLLSLQNKS+VYVDE+YVFANPVDLEEES  ENSA+NSQSSLMSMLVPTLLQLSKTTGSS NNDGR   SN EG+H+LPK+ S+ LDST+S
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV
        VT L+Q  KSCVT+DDEVKL E+ ESD  V QP++ LQVPVK++ H  +EPL  IE+ILGQLV RMDRIENCFLRFEENMLKPINSI+GRLKQVEQQLE+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTH--DEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV

Query:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGD--------------
        LTKESH SEWPSCYRMSAPSFSAN S SNSFYNSGN HPSCGPIE DRKE+HS +SPI L+ISNSV SSLL PSLVVTAPEF N D              
Subjt:  LTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGD--------------

Query:  ----DDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESE
            D D+E VV TAPEFSNGNDD QENQ+ EV VD+SKSK KPS+DD LASALAQF LSSSSIST EHS+TVAV+ PDLPNE+GNNHKK  + +L  SE
Subjt:  ----DDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESE

Query:  TDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCD
         DH SCS EID+ Q T NSASAS+S ANG N S S+HD S+K    D E V+   EC Y+  + +V  A  E+S  GM  LGNVEV  ET ED VSE   
Subjt:  TDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCD

Query:  IPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAP
        I IH   HHP+N  D  NVD N D  T++V +   DIDI+HDVLGFSRD SIVNFEIPILDVSFTS  DS S++ LK+LLG+  ESS+ ASC KESDD  
Subjt:  IPIHLLHHHPENKPDSINVDGNVD--TMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAP

Query:  PIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN
        P GEQ +LILVEEE +EN +STNGPISVDMNYYTIMS+P+I ADGEN
Subjt:  PIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVI-ADGEN

A0A6J1DCQ5 uncharacterized protein LOC1110191450.0e+0072.97Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M SQNDGD+G  PSWSSAANWTV GGCLE+TV YESFYSPINDE  VDS PK PLVL RPSPES PCEITL+F EKHEIRQVYVRSTARVYEMYY TNSQ
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD
        +ENEYFCTVRCGAA+RDEEVLHTNG   VSAH +G NG VAE NSQ GSNLNTNEDDW+EVK    P LDHKN+S  S+SG DSVRIKQDFYEATAEI D
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITD

Query:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS
        A PCTSLTIRLLSLQNKSIVYVDEVYVFANPVD EEE PVENS ++SQSSLMSMLVPTLLQLSKTTG     DG    SN EG+  LP++GSKALDST+S
Subjt:  ANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNS

Query:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVLT
        VT LQQEGKSC T DD VKL EKNESDR VRQ E+PLQVP KERT+DEPL+RIES+L QLV RMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVL+
Subjt:  VTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVLT

Query:  KESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFSN
        KESH SEWPS YRMSAPSFSANESGSNSFYNSGN HPS  PIEPDRKELHS ASPIAL+ISNSV SSLL PS VVTAPEF NGDDDD             
Subjt:  KESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFSN

Query:  GNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSA
             QE QVPEVA++S  SKPKPSIDDVLASALAQF LSSSSI+T E++K  AV   +  NE+ NN++K+SS DL E+E DHISCSQE D  QCT NSA
Subjt:  GNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSA

Query:  SASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVD
        SAS+S ANG N S SR D SS            D +CKYQS DGDV VAS     KG  E+G+ EV   TSEDC  ET   PIH LH HPEN PD    D
Subjt:  SASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVD

Query:  GNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEED-KENATS
         + D M+  +E  DIDI+HDVLGFSR TSIV+FEIPILDVSF SI DS  +D LK LLGDM E +YGA C++ESDDA PIGEQ +LILVEEE+ +EN+TS
Subjt:  GNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEED-KENATS

Query:  TNGPISVDMNYYTIMSEPVIADGEN
         NG IS+DMNY TIM EP+IADGEN
Subjt:  TNGPISVDMNYYTIMSEPVIADGEN

A0A6J1JN18 uncharacterized protein LOC111488374 isoform X10.0e+0076.06Show/hide
Query:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ
        M SQNDGDV CSPSW   ANWTVAGGCLE+TVVYESFYSPI++EETV+ GPKSPLVL RPSPESGPCEITL F EKHEIRQVYVRSTARVYEMY  T+SQ
Subjt:  MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQ

Query:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVR-IKQDFYEATAEIT
        NENEY CTVRCGAALRDEEVLHTNGI   SA INGSNG V E N+Q GSNLN NEDDWVEVK  DG  LDHKNNS    SG DS+R IKQDFYEATAEIT
Subjt:  NENEYFCTVRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVR-IKQDFYEATAEIT

Query:  DANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTN
        DANPCTSLTIRLLSLQNKSIVYVDE+YVFANPVDLEEESPV N+A++SQSSLMSMLVPTLLQLSKTT SS +N  R S    EGIHIL K+GS ALDST+
Subjt:  DANPCTSLTIRLLSLQNKSIVYVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTN

Query:  SVTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVL
        SVT L+QEGKSCVT+DDEV+  E+ E DR VRQPE+ LQVPV ER H+EPL RIE++LGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEV+
Subjt:  SVTALQQEGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVL

Query:  TKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFS
        TK SH SEWPSCYRMSAPSFSA ES SNSFYNS N HPSCG   PD+KEL S  SPIAL++S+S GSSL+ PSLVVTAPEF N DD D+ +VV+ A EFS
Subjt:  TKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFS

Query:  NGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNS
        NGNDD QEN   EV VDSSKSKPKPSIDDVLASALAQFALSSSSIS  EHSKTV VR PDL NE+GNNHKKS S DL E+  DHISCSQE+D TQCT NS
Subjt:  NGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNS

Query:  ASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINV
        AS S+S  N  N S SRHD SSK    DD+ V+   E KY+SADG +     EQSAKGM ELGNVEVI ETSED +SE   IPIH LHHHP+N  D  N 
Subjt:  ASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINV

Query:  DGNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATS
          N DT    +   DIDI+HDVLGFSRDTSIVNFEIPILDVSFTSI DS S++ LKDLLGDM ESSY AS  KE DD  P GEQ DLILVEEE +ENA+S
Subjt:  DGNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATS

Query:  TNGPISVDMNYYTIMSEPVIADGENDR
        TNGPISVDMNYYTIMS+PVI DGEN R
Subjt:  TNGPISVDMNYYTIMSEPVIADGENDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G47940.1 unknown protein1.1e-7130.98Show/hide
Query:  WSSAANWTVAGGCLESTVVYESFY--SPINDE----ETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQNENEYFCT
        ++S  NW +  G L   + +ES +  +P +D+      VD   KSPL+L  P P   PCEIT+ F ++HE+RQ+Y+RS+ARVYE+YY    +++ EY CT
Subjt:  WSSAANWTVAGGCLESTVVYESFY--SPINDE----ETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQNENEYFCT

Query:  VRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITDANPCTSLT
        VRCG A+RDEEVL    I    +  +     + E       N  T+EDDWVEVK SD   L+++               KQDFYEATAEI DA PCTS+T
Subjt:  VRCGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITDANPCTSLT

Query:  IRLLSLQNKSIVYVDEVYVFANPVDLEEESPVE--NSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNSVTALQQ
        +RLLSLQ+K    VDEVYVFA+PVD  E    E   +  +S SSLM+M +P LLQLS+        D + S                  D +NS   +  
Subjt:  IRLLSLQNKSIVYVDEVYVFANPVDLEEESPVE--NSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNSVTALQQ

Query:  EGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVLTKESHVS
             +   D++ +      D   +Q +   +V   E     P + +E+IL QLV+++  IE   +RFE+ MLKPINSI+ RL+ VE++LE L K+S  S
Subjt:  EGKSCVTIDDEVKLPEKNESDRPVRQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVLTKESHVS

Query:  EWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFSNGNDDGQ
        +     ++       +++      +  +G      + PD   + +S     +   N + + L     + +    ++G++     ++   PE SN      
Subjt:  EWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCGPIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFSNGNDDGQ

Query:  ENQVPEVAVDSSKSKPKP--SIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASM
             E    S + KPK   SI+D LASALA   LSS SI+  ++S+ + + +P+  +E+    ++        +  D    + E  E + +++  S++ 
Subjt:  ENQVPEVAVDSSKSKPKP--SIDDVLASALAQFALSSSSISTLEHSKTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASM

Query:  SYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVD---G
        S    G      H       K+ D+    D E +         V S   +A     L    V S T  DC +E  ++       +   +PDS+  +    
Subjt:  SYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVELGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVD---G

Query:  NVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFT-SITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATST
        NV T K + E    D++  VLGF   TS V+F  P+LDV F     DS S+   + L  +  ES     C  +  D        +L+ VE+E++     T
Subjt:  NVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFT-SITDSCSEDMLKDLLGDMVESSYGASCLKESDDAPPIGEQYDLILVEEEDKENATST

Query:  NGPISVDMNYYTIMSEPVIADGE
        +   SV+MN+Y     P+  +GE
Subjt:  NGPISVDMNYYTIMSEPVIADGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCACAGAACGACGGGGACGTTGGTTGCAGTCCTTCATGGAGCTCCGCCGCGAATTGGACGGTCGCTGGTGGCTGTTTGGAGAGTACAGTCGTTTATGAATCGTT
CTATTCTCCGATCAACGATGAGGAGACGGTCGATTCCGGCCCCAAGTCGCCTCTTGTTCTGCACCGCCCCTCACCGGAGTCTGGTCCATGCGAGATCACTCTTCATTTTG
TGGAAAAGCACGAGATCCGGCAGGTTTATGTTAGAAGCACCGCTCGAGTCTATGAGATGTACTACGCCACCAATTCTCAGAATGAAAATGAATATTTTTGTACTGTTCGT
TGTGGTGCTGCATTGAGAGATGAAGAAGTGCTTCACACGAATGGTATTGGAGGTGTGTCTGCACATATAAATGGGTCTAATGGACACGTGGCTGAAACAAATTCACAATG
TGGCAGTAATTTGAACACGAATGAAGATGACTGGGTTGAAGTTAAACCTTCCGATGGCCCAACCCTTGATCATAAAAACAATTCTTTTACTTCCCAGTCTGGTGCAGACT
CAGTGAGGATTAAGCAGGACTTCTATGAGGCCACAGCAGAAATAACGGATGCAAATCCTTGCACATCTCTTACAATCCGTCTTCTTTCACTTCAGAATAAAAGTATTGTA
TATGTTGATGAAGTCTATGTGTTTGCAAATCCCGTTGATTTAGAAGAAGAGAGCCCAGTGGAGAACTCAGCCCGAAATTCTCAAAGTTCTTTGATGTCCATGCTTGTCCC
AACCCTTCTGCAGTTATCTAAAACTACTGGCAGTAGTACGAACAATGATGGGCGTACTTCTACTTCTAATGCAGAGGGAATACATATATTACCCAAGGTTGGGTCAAAAG
CTCTCGATTCAACTAATAGTGTAACTGCACTTCAGCAGGAAGGAAAATCTTGTGTAACCATTGATGATGAAGTGAAATTGCCAGAAAAAAATGAGTCTGATAGACCTGTA
CGTCAGCCTGAGATGCCTCTACAAGTTCCTGTTAAAGAGAGAACGCATGATGAACCACTTCATCGTATTGAAAGCATTTTGGGCCAGCTTGTTTCTCGAATGGACAGAAT
AGAGAATTGTTTTTTGAGGTTTGAAGAAAATATGCTAAAACCTATTAACAGCATTGAAGGAAGGCTGAAGCAGGTTGAGCAGCAACTTGAAGTTTTAACCAAGGAGTCAC
ATGTTTCAGAATGGCCATCTTGCTACAGAATGTCTGCTCCGAGCTTCTCTGCTAATGAATCGGGCTCTAACTCCTTCTATAACAGTGGGAACGGTCATCCCAGTTGTGGG
CCAATTGAACCGGATCGAAAGGAACTTCATTCAAGTGCATCACCCATTGCACTTGAAATATCCAACTCGGTGGGTTCTTCACTGTTGTGTCCGAGTCTTGTGGTCACCGC
TCCAGAGTTTTTGAACGGTGATGATGATGATCGGGAGAGTGTTGTGGTTACTGCTCCTGAGTTTTCCAACGGTAATGATGATGGCCAAGAGAATCAAGTCCCCGAGGTTG
CAGTGGATTCATCAAAGAGTAAACCAAAGCCATCCATTGATGATGTATTAGCATCTGCTCTTGCGCAGTTCGCATTATCATCATCTTCAATCAGCACTCTAGAACATTCA
AAGACTGTAGCTGTTAGATCTCCAGACCTTCCAAATGAAAATGGAAACAATCACAAGAAATCTTCATCAGATGATTTGTTTGAAAGTGAAACAGATCATATAAGCTGTTC
CCAAGAAATTGATGAGACACAGTGCACCACGAATTCTGCTTCAGCCTCTATGTCTTATGCCAATGGTGGGAACTTGAGCTCTTCCAGGCATGATCGTTCTTCTAAAGCTG
ATAAGGAAGATGACGAAGATGTTATCAACGACCCCGAATGCAAGTATCAGAGTGCCGATGGCGATGTCATAGTTGCTTCAGCTGAACAAAGCGCAAAAGGAATGGTGGAA
CTTGGAAATGTGGAAGTTATTAGTGAAACAAGTGAAGATTGTGTTTCAGAAACTTGTGATATTCCTATCCACCTCCTTCACCATCATCCCGAAAATAAACCTGATAGTAT
CAATGTTGATGGCAATGTCGATACCATGAAGGTTAGAGAAGAAGTTCGTGACATAGACATAATCCACGATGTTCTCGGATTTTCTCGTGACACATCTATTGTGAATTTCG
AGATTCCCATCCTCGATGTAAGCTTCACCTCCATTACAGATTCATGTTCCGAGGATATGCTTAAAGACCTTCTCGGGGACATGGTGGAATCAAGCTACGGAGCTTCTTGT
CTCAAGGAAAGTGATGATGCTCCTCCCATTGGCGAGCAATATGACCTCATTTTGGTTGAGGAAGAGGACAAGGAGAATGCTACCTCAACGAATGGCCCCATATCTGTTGA
TATGAACTACTATACCATAATGAGCGAGCCTGTAATTGCTGATGGCGAAAATGATAGACCCCCGATTAAACTTGTATATGGAAGAAGATGTAAGAAGAATAGTGGGGAGG
ACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCACAGAACGACGGGGACGTTGGTTGCAGTCCTTCATGGAGCTCCGCCGCGAATTGGACGGTCGCTGGTGGCTGTTTGGAGAGTACAGTCGTTTATGAATCGTT
CTATTCTCCGATCAACGATGAGGAGACGGTCGATTCCGGCCCCAAGTCGCCTCTTGTTCTGCACCGCCCCTCACCGGAGTCTGGTCCATGCGAGATCACTCTTCATTTTG
TGGAAAAGCACGAGATCCGGCAGGTTTATGTTAGAAGCACCGCTCGAGTCTATGAGATGTACTACGCCACCAATTCTCAGAATGAAAATGAATATTTTTGTACTGTTCGT
TGTGGTGCTGCATTGAGAGATGAAGAAGTGCTTCACACGAATGGTATTGGAGGTGTGTCTGCACATATAAATGGGTCTAATGGACACGTGGCTGAAACAAATTCACAATG
TGGCAGTAATTTGAACACGAATGAAGATGACTGGGTTGAAGTTAAACCTTCCGATGGCCCAACCCTTGATCATAAAAACAATTCTTTTACTTCCCAGTCTGGTGCAGACT
CAGTGAGGATTAAGCAGGACTTCTATGAGGCCACAGCAGAAATAACGGATGCAAATCCTTGCACATCTCTTACAATCCGTCTTCTTTCACTTCAGAATAAAAGTATTGTA
TATGTTGATGAAGTCTATGTGTTTGCAAATCCCGTTGATTTAGAAGAAGAGAGCCCAGTGGAGAACTCAGCCCGAAATTCTCAAAGTTCTTTGATGTCCATGCTTGTCCC
AACCCTTCTGCAGTTATCTAAAACTACTGGCAGTAGTACGAACAATGATGGGCGTACTTCTACTTCTAATGCAGAGGGAATACATATATTACCCAAGGTTGGGTCAAAAG
CTCTCGATTCAACTAATAGTGTAACTGCACTTCAGCAGGAAGGAAAATCTTGTGTAACCATTGATGATGAAGTGAAATTGCCAGAAAAAAATGAGTCTGATAGACCTGTA
CGTCAGCCTGAGATGCCTCTACAAGTTCCTGTTAAAGAGAGAACGCATGATGAACCACTTCATCGTATTGAAAGCATTTTGGGCCAGCTTGTTTCTCGAATGGACAGAAT
AGAGAATTGTTTTTTGAGGTTTGAAGAAAATATGCTAAAACCTATTAACAGCATTGAAGGAAGGCTGAAGCAGGTTGAGCAGCAACTTGAAGTTTTAACCAAGGAGTCAC
ATGTTTCAGAATGGCCATCTTGCTACAGAATGTCTGCTCCGAGCTTCTCTGCTAATGAATCGGGCTCTAACTCCTTCTATAACAGTGGGAACGGTCATCCCAGTTGTGGG
CCAATTGAACCGGATCGAAAGGAACTTCATTCAAGTGCATCACCCATTGCACTTGAAATATCCAACTCGGTGGGTTCTTCACTGTTGTGTCCGAGTCTTGTGGTCACCGC
TCCAGAGTTTTTGAACGGTGATGATGATGATCGGGAGAGTGTTGTGGTTACTGCTCCTGAGTTTTCCAACGGTAATGATGATGGCCAAGAGAATCAAGTCCCCGAGGTTG
CAGTGGATTCATCAAAGAGTAAACCAAAGCCATCCATTGATGATGTATTAGCATCTGCTCTTGCGCAGTTCGCATTATCATCATCTTCAATCAGCACTCTAGAACATTCA
AAGACTGTAGCTGTTAGATCTCCAGACCTTCCAAATGAAAATGGAAACAATCACAAGAAATCTTCATCAGATGATTTGTTTGAAAGTGAAACAGATCATATAAGCTGTTC
CCAAGAAATTGATGAGACACAGTGCACCACGAATTCTGCTTCAGCCTCTATGTCTTATGCCAATGGTGGGAACTTGAGCTCTTCCAGGCATGATCGTTCTTCTAAAGCTG
ATAAGGAAGATGACGAAGATGTTATCAACGACCCCGAATGCAAGTATCAGAGTGCCGATGGCGATGTCATAGTTGCTTCAGCTGAACAAAGCGCAAAAGGAATGGTGGAA
CTTGGAAATGTGGAAGTTATTAGTGAAACAAGTGAAGATTGTGTTTCAGAAACTTGTGATATTCCTATCCACCTCCTTCACCATCATCCCGAAAATAAACCTGATAGTAT
CAATGTTGATGGCAATGTCGATACCATGAAGGTTAGAGAAGAAGTTCGTGACATAGACATAATCCACGATGTTCTCGGATTTTCTCGTGACACATCTATTGTGAATTTCG
AGATTCCCATCCTCGATGTAAGCTTCACCTCCATTACAGATTCATGTTCCGAGGATATGCTTAAAGACCTTCTCGGGGACATGGTGGAATCAAGCTACGGAGCTTCTTGT
CTCAAGGAAAGTGATGATGCTCCTCCCATTGGCGAGCAATATGACCTCATTTTGGTTGAGGAAGAGGACAAGGAGAATGCTACCTCAACGAATGGCCCCATATCTGTTGA
TATGAACTACTATACCATAATGAGCGAGCCTGTAATTGCTGATGGCGAAAATGATAGACCCCCGATTAAACTTGTATATGGAAGAAGATGTAAGAAGAATAGTGGGGAGG
ACTAG
Protein sequenceShow/hide protein sequence
MESQNDGDVGCSPSWSSAANWTVAGGCLESTVVYESFYSPINDEETVDSGPKSPLVLHRPSPESGPCEITLHFVEKHEIRQVYVRSTARVYEMYYATNSQNENEYFCTVR
CGAALRDEEVLHTNGIGGVSAHINGSNGHVAETNSQCGSNLNTNEDDWVEVKPSDGPTLDHKNNSFTSQSGADSVRIKQDFYEATAEITDANPCTSLTIRLLSLQNKSIV
YVDEVYVFANPVDLEEESPVENSARNSQSSLMSMLVPTLLQLSKTTGSSTNNDGRTSTSNAEGIHILPKVGSKALDSTNSVTALQQEGKSCVTIDDEVKLPEKNESDRPV
RQPEMPLQVPVKERTHDEPLHRIESILGQLVSRMDRIENCFLRFEENMLKPINSIEGRLKQVEQQLEVLTKESHVSEWPSCYRMSAPSFSANESGSNSFYNSGNGHPSCG
PIEPDRKELHSSASPIALEISNSVGSSLLCPSLVVTAPEFLNGDDDDRESVVVTAPEFSNGNDDGQENQVPEVAVDSSKSKPKPSIDDVLASALAQFALSSSSISTLEHS
KTVAVRSPDLPNENGNNHKKSSSDDLFESETDHISCSQEIDETQCTTNSASASMSYANGGNLSSSRHDRSSKADKEDDEDVINDPECKYQSADGDVIVASAEQSAKGMVE
LGNVEVISETSEDCVSETCDIPIHLLHHHPENKPDSINVDGNVDTMKVREEVRDIDIIHDVLGFSRDTSIVNFEIPILDVSFTSITDSCSEDMLKDLLGDMVESSYGASC
LKESDDAPPIGEQYDLILVEEEDKENATSTNGPISVDMNYYTIMSEPVIADGENDRPPIKLVYGRRCKKNSGED