; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g32980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g32980
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptiondCTP pyrophosphatase 1
Genome locationchr3:23455491..23462406
RNA-Seq ExpressionMoc03g32980
SyntenyMoc03g32980
Gene Ontology termsGO:0009143 - nucleoside triphosphate catabolic process (biological process)
GO:0047429 - nucleoside-triphosphate diphosphatase activity (molecular function)
InterPro domainsIPR004518 - NTP pyrophosphohydrolase MazG, putative catalytic core
IPR008581 - Protein of unknown function DUF863, plant
IPR025984 - dCTP pyrophosphatase 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6584423.1 hypothetical protein SDJN03_20355, partial [Cucurbita argyrosperma subsp. sororia]6.6e-26570.25Show/hide
Query:  QIFISVGILKELLVAVYG-LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICIT
        ++ +S  I  +L   V G ++ Q +EDKM+ GGHSS  FLSPLSAGLY   GR+ALKQIML QEA FRDQIYELHRLYKRQ EFIAEMKRES+ HD+CIT
Subjt:  QIFISVGILKELLVAVYG-LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICIT

Query:  RTQSDFCKSQVSALCAQDAHDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEM
         T+SDF KS+V ALCAQDAHDFP Y++  ISGE KS L SS  K++Q  SD+  NGI SK PY SESKSKILG GMFD+ELP D K LN + E+++VPEM
Subjt:  RTQSDFCKSQVSALCAQDAHDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEM

Query:  SSHHLHRIPEIAR---KPFL-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRE
        SS HL R+P+I     KPFL K+DLNASLHRD ST DSL E+TK+SVDLNDPPN EEE GCRS+ L DATGHR ILFHDL+GK NSKFLVFSE D+HGR 
Subjt:  SSHHLHRIPEIAR---KPFL-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRE

Query:  NGLSNNGYSESSSFYDRSKTYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKS
        NG SNNGYSES SFYDRSK YQPDKDI NSSLSSST S+ KS Q  IG              NAMLAEDDLCSVKN RS  G  SV+  LEGSFCNGSKS
Subjt:  NGLSNNGYSESSSFYDRSKTYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKS

Query:  GSIKEESHPSFKASANWTGGQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSV
          +KEES    KASANW  G+IDLNVCINE+FLATPCCS EMKLEVP +P K KHSST GEF  NQVG+HFLKSV+DDGKP EDL  I AEALVSISSSV
Subjt:  GSIKEESHPSFKASANWTGGQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSV

Query:  AQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGK
        AQN +K T   QS Q   ESLCW AEIVSSM  +PE  E+A+K KD  DS ELL N +DDFE+MTLKLKET  E CSLT S+HQEEA KNVSSPSCQLGK
Subjt:  AQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGK

Query:  GRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVI
        G  RRG+ KNFQTEILPSL TLSRYEVTEDIQTIGGLME+ SS SI GV KT S    +WTRGKRR C +SSK TE +  SIMDQVSS NE+ENKER VI
Subjt:  GRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVI

Query:  VWENITRRRRGQRYPACNRKIILGQV
        VW NITRRRRG+RYPAC RK ILGQV
Subjt:  VWENITRRRRGQRYPACNRKIILGQV

XP_022137621.1 uncharacterized protein LOC111009019 [Momordica charantia]0.0e+00100Show/hide
Query:  MVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRS
        MVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRS
Subjt:  MVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRS

Query:  SISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRD
        SISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRD
Subjt:  SISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRD

Query:  FSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSL
        FSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSL
Subjt:  FSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSL

Query:  SSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDF
        SSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDF
Subjt:  SSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDF

Query:  LATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMA
        LATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMA
Subjt:  LATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMA

Query:  DDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQ
        DDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQ
Subjt:  DDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQ

Query:  TIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKIILGQV
        TIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKIILGQV
Subjt:  TIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKIILGQV

XP_023519938.1 uncharacterized protein LOC111783256 isoform X1 [Cucurbita pepo subsp. pepo]2.1e-26671.71Show/hide
Query:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA
        ++ Q +EDKM+ GGHSS  FLSPLSAGLY   GR+ALKQIML QEA FRDQIYELHRLYKRQ EFIAEMKRES+ HD+CIT T+SDF KS+V ALCAQDA
Subjt:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA

Query:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPE---IARKPF
        H+FP Y++  I GE KS L SS  K++Q  SD+  NGI SK PY SESKSKILG GMFD+ELP D K LN + E+++VPEMSS HL R+P+   I+ KP 
Subjt:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPE---IARKPF

Query:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK
        L K+DLNASLHRD ST DSL E+TK+SVDLNDPPN EEE GCRS+ L DATGHREILFHDL+GK NSKFLVFSE D+HGR NG SNNGYSES SFYDRSK
Subjt:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK

Query:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG
         YQPDKDI NSSLSSST S+ KS Q  IG              NAMLAEDDLCSVKN RS  GS SV+  LEGSFCNGSKS  +KEES    KASANW  
Subjt:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG

Query:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE
        G+IDLNVCINE+FLA PCCS EMKLEVPV+P K KHSST GEF  NQVG+HFLKSV+DDG+P EDL  I AEALVSISSSVAQN +K T   QS Q   E
Subjt:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE

Query:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL
        SLCW AEIVSSM  +PE AE+A+K KD  DS ELL N +DDFE+MTLKLKET  E CSLT SNHQEEA KNVSSPSCQLGKGR RRG+ KNFQTEILPSL
Subjt:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL

Query:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR
         TLSRYEVTEDIQTIGGLMEV SS SI G  KT S    +WTRGKRR C +SSK TE V+ SIMDQVSSDNE+ENKER V+VW NITRRRRG+RYPAC R
Subjt:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR

Query:  KIILGQV
        K ILGQV
Subjt:  KIILGQV

XP_023519940.1 uncharacterized protein LOC111783256 isoform X2 [Cucurbita pepo subsp. pepo]2.1e-26671.71Show/hide
Query:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA
        ++ Q +EDKM+ GGHSS  FLSPLSAGLY   GR+ALKQIML QEA FRDQIYELHRLYKRQ EFIAEMKRES+ HD+CIT T+SDF KS+V ALCAQDA
Subjt:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA

Query:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPE---IARKPF
        H+FP Y++  I GE KS L SS  K++Q  SD+  NGI SK PY SESKSKILG GMFD+ELP D K LN + E+++VPEMSS HL R+P+   I+ KP 
Subjt:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPE---IARKPF

Query:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK
        L K+DLNASLHRD ST DSL E+TK+SVDLNDPPN EEE GCRS+ L DATGHREILFHDL+GK NSKFLVFSE D+HGR NG SNNGYSES SFYDRSK
Subjt:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK

Query:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG
         YQPDKDI NSSLSSST S+ KS Q  IG              NAMLAEDDLCSVKN RS  GS SV+  LEGSFCNGSKS  +KEES    KASANW  
Subjt:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG

Query:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE
        G+IDLNVCINE+FLA PCCS EMKLEVPV+P K KHSST GEF  NQVG+HFLKSV+DDG+P EDL  I AEALVSISSSVAQN +K T   QS Q   E
Subjt:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE

Query:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL
        SLCW AEIVSSM  +PE AE+A+K KD  DS ELL N +DDFE+MTLKLKET  E CSLT SNHQEEA KNVSSPSCQLGKGR RRG+ KNFQTEILPSL
Subjt:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL

Query:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR
         TLSRYEVTEDIQTIGGLMEV SS SI G  KT S    +WTRGKRR C +SSK TE V+ SIMDQVSSDNE+ENKER V+VW NITRRRRG+RYPAC R
Subjt:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR

Query:  KIILGQV
        K ILGQV
Subjt:  KIILGQV

XP_038894350.1 uncharacterized protein LOC120082969 isoform X1 [Benincasa hispida]9.5e-28075.28Show/hide
Query:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA
        ++ QI+EDKMV GG SS GFLSPLSAGLYV  GREALKQIMLKQEAIFRDQIYELHRLY+RQREFIAEMKRES  +D+CIT TQSDF KS+VSALCAQDA
Subjt:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA

Query:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIA----RKP
        H+FPVYN+S ISGE    LLSSL K++Q  SD+ LNGIC KNPYLSE KSKIL  GMFD+ELP DSK+LN K E++ VPEMSS HL R+PEI     RKP
Subjt:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIA----RKP

Query:  FL-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRS
        FL K+DL             LLE+TKVSVDLN+PPNFEEE GCRS+DLVDATGHREILFHDL GK NSK L FSEQ+++GR+NGLSNNG SES SF+DRS
Subjt:  FL-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRS

Query:  KTYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWT
        K YQPD+DIANSSLSSSTTSL KSAQ  +GH +L+ ES KMLIE AM+AEDD CSVKNLRSSIGSDSVS  LEGSFCNG KS ++KEE  P FKASANW 
Subjt:  KTYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWT

Query:  GGQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPC
         GQIDLNVCINE+FL TPCCS EMKL+VPV+P +EKHSST G    NQVG+HFL+SV+DDGKPSEDL +I AE LVSISS VAQN  K T    S Q   
Subjt:  GGQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPC

Query:  ESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPS
        ESLCWFAEIVSSM  DPE  E+ALK KD  DS ELL + +DDFE+MTLKLKET+ EGCSLT SNHQEEAVKNV   SCQLGKGRVRRG+  NFQTEILPS
Subjt:  ESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPS

Query:  LATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        LATLSRYEVTEDIQTIGGLMEVASS SI GVVKT+S  R +WTRGKRR C +SSK TETVI SIMDQVSS NELENKER VIVW NITRRRRGQRYPACN
Subjt:  LATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACN

Query:  RKIILGQV
        RKIILGQV
Subjt:  RKIILGQV

TrEMBL top hitse value%identityAlignment
A0A1S3BE34 uncharacterized protein LOC103488795 isoform X12.4e-26071.77Show/hide
Query:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA
        ++ QI+EDKM+ GG ++  FLSPLSAGLYV  GREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRES  HD+CIT TQSDF KS+VSALCAQDA
Subjt:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA

Query:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKY
         +F VY++  ISGE  + LLSSL K+ Q  SDS LNGI SKNPY SESKSKIL KGMFD+ELP DS +LN + E++ VPEMSS HL R+PE+        
Subjt:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKY

Query:  DLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQP
             +    S  DSLLE+TKVSVDLNDPPN  EE  CRSID V+A+GHREILFHDL GK NSK   FS+QD +GR+NGLS  GYSE  SFYDRS+ YQP
Subjt:  DLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQP

Query:  DKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQID
        DK+IANSSLSSSTTSL KSAQ  IGH IL++ES K LIENAMLA+ DLC VKNLR S GSDSVS  +EGSFCN SKS  +KEE+ P FKASANW  GQID
Subjt:  DKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQID

Query:  LNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSV--QDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESL
        LNVCINE+FLATPCCS EMKLEVPV+P KEKHSST GEF  NQ+ +HFL+SV  +DDGK  EDL TI AEALVSISSSVAQN  K T S  S Q   ESL
Subjt:  LNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSV--QDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESL

Query:  CWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLAT
        CW AEIVSSM  DP+ AE+ALK KD  DS ELL +++D+FE+MTLKLKE + +GCSLT SNHQEEAVKNVSSPSCQ GK R RRG+ KNFQTEILPSLAT
Subjt:  CWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLAT

Query:  LSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKI
        LSRYEVTEDIQTIGGLMEVASS SI  V K +S  R + TRGKRR C +SSK TETVIRS MD+VSSDNE +NKER VIVW NITRRRRGQRYPA NRKI
Subjt:  LSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKI

Query:  ILGQV
        IL QV
Subjt:  ILGQV

A0A6J1C758 uncharacterized protein LOC1110090190.0e+00100Show/hide
Query:  MVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRS
        MVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRS
Subjt:  MVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRS

Query:  SISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRD
        SISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRD
Subjt:  SISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRD

Query:  FSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSL
        FSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSL
Subjt:  FSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSL

Query:  SSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDF
        SSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDF
Subjt:  SSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDF

Query:  LATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMA
        LATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMA
Subjt:  LATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMA

Query:  DDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQ
        DDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQ
Subjt:  DDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQ

Query:  TIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKIILGQV
        TIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKIILGQV
Subjt:  TIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKIILGQV

A0A6J1E7X2 uncharacterized protein LOC111431607 isoform X11.8e-26371Show/hide
Query:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA
        ++ Q +EDKM+ GGHSS  FLSPLSAGLY   GR+ALKQIML QEA FRDQIYELHRLYKRQ EFIAEMKRES+ HD+CIT T+SDF KS+V ALCAQDA
Subjt:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA

Query:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIAR---KPF
        H+FP Y++  ISGE KS L SS  K++Q  S++  NGI SK PY SESKSKILG GMFD+ELP D K LN + E+++VPEMSS HL R+P+I     KPF
Subjt:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIAR---KPF

Query:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK
        L K+DLNASLHRD ST DSL E+TK+SVDLNDPPN EEE GCRS+ L DATGHR ILFHDL+GK NSKFLVFSE D+HGR NG SNNGYSES SFYDRSK
Subjt:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK

Query:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG
         YQPDKDI NSSLSSST S+ KS Q  IG              NAMLAEDDLCSVKN RS  GS SV+  LEGSFCNGSKS  +KEES    KASANW  
Subjt:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG

Query:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE
        G+IDLNVCINE+FLATPCCS EMKLEVP +P K KHSST GEF  NQVG+HFLKSV+DDG+P ED+  I AEALVSISSSVAQN +K T   QS Q   E
Subjt:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE

Query:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL
        SLCW AEIVSSM  +PE  E+A+K KD  DS E L N +DDFE+MTLKLKET  E CSLT SNHQEEA KNVSSPSCQL KGR RRG+ KNFQTEILPSL
Subjt:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL

Query:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR
         TLSRYEVTEDIQTIGGLME+ SS SI GV KT S    +WTRGKRR C +SSK TE +  SIMDQVSS NE+ENKER VIVW NITRRRRG+RYPAC R
Subjt:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR

Query:  KIILGQV
        K ILGQV
Subjt:  KIILGQV

A0A6J1E847 uncharacterized protein LOC111431607 isoform X21.8e-26371Show/hide
Query:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA
        ++ Q +EDKM+ GGHSS  FLSPLSAGLY   GR+ALKQIML QEA FRDQIYELHRLYKRQ EFIAEMKRES+ HD+CIT T+SDF KS+V ALCAQDA
Subjt:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA

Query:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIAR---KPF
        H+FP Y++  ISGE KS L SS  K++Q  S++  NGI SK PY SESKSKILG GMFD+ELP D K LN + E+++VPEMSS HL R+P+I     KPF
Subjt:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIAR---KPF

Query:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK
        L K+DLNASLHRD ST DSL E+TK+SVDLNDPPN EEE GCRS+ L DATGHR ILFHDL+GK NSKFLVFSE D+HGR NG SNNGYSES SFYDRSK
Subjt:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK

Query:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG
         YQPDKDI NSSLSSST S+ KS Q  IG              NAMLAEDDLCSVKN RS  GS SV+  LEGSFCNGSKS  +KEES    KASANW  
Subjt:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG

Query:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE
        G+IDLNVCINE+FLATPCCS EMKLEVP +P K KHSST GEF  NQVG+HFLKSV+DDG+P ED+  I AEALVSISSSVAQN +K T   QS Q   E
Subjt:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE

Query:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL
        SLCW AEIVSSM  +PE  E+A+K KD  DS E L N +DDFE+MTLKLKET  E CSLT SNHQEEA KNVSSPSCQL KGR RRG+ KNFQTEILPSL
Subjt:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL

Query:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR
         TLSRYEVTEDIQTIGGLME+ SS SI GV KT S    +WTRGKRR C +SSK TE +  SIMDQVSS NE+ENKER VIVW NITRRRRG+RYPAC R
Subjt:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR

Query:  KIILGQV
        K ILGQV
Subjt:  KIILGQV

A0A6J1KHA2 uncharacterized protein LOC1114957413.6e-26471.57Show/hide
Query:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA
        ++ Q +ED M+ GGHSS  FLSPLSAGLY   GR+ALKQIML QEA+FRDQIYELHRLYKRQ EFIAEMKRESH HD+CIT T+SDF KS+V+ALCAQDA
Subjt:  LYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDFCKSQVSALCAQDA

Query:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPE---IARKPF
        H+FP Y++  ISGE KS L SS  K++Q  SD+      SK PY SES+SKILG GMFD+ELP D K LN + E+++VPEMS  HL R+P+   I+ KPF
Subjt:  HDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPE---IARKPF

Query:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK
        L K+DLNASLHRD STTDSL E+TK+SVDLNDPPN EEE GCRS+ L DATGHR ILFHDL+GK NSKFLVFSE D+HGR NG SNNGYSES SFYDRSK
Subjt:  L-KYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSK

Query:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG
         YQPDKDI NSSLSSST S+KKS Q +IG              NAMLAEDDLCSVKNLRS  GS SV+  LEGSFCN SKS  +KEES    KASANW  
Subjt:  TYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTG

Query:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE
        G+IDLNVCINE+FLATPCCS EMKLEVPV+P K KHSST GEF  NQVG+HFLKSV+DDG+P EDL  I A+ALVSISSSVAQN +K T S QS Q   E
Subjt:  GQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCE

Query:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL
        SLCW AEIVSSM  DPE AE+ALK KD  DS ELL N +DDFE+MTLKLKET  E CSLT SNHQEEA KNVSSPSCQLGKGR RRG+ KNFQTEILPSL
Subjt:  SLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSL

Query:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR
         TLSRYEVTEDIQTIGGLME+ SS SI GV KT S    +WTRGKRR C +SSK TE VI SIMDQVSS NE+ENKER V VW NITRRRRG+RY AC R
Subjt:  ATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSC-NSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNR

Query:  KIILGQV
        K ILGQ+
Subjt:  KIILGQV

SwissProt top hitse value%identityAlignment
Q32KY6 dCTP pyrophosphatase 11.4e-2053.12Show/hide
Query:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKN
        +L ++R   AEFA  R W++ H PRNLLLALVGEVGEL+E+FQWK +   G   WS  ++  L+EELSD+L+YLV LA  C +DL +A L K+  N
Subjt:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKN

Q91VC0 dCTP pyrophosphatase 11.3e-2154.17Show/hide
Query:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKN
        +L ++R   AEFA  R W++ H PRNLLLALVGEVGEL+E+FQWK +   G   W   ++  L+EELSDVL+YLV LA  C +DL RA +SK+  N
Subjt:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKN

Q9H773 dCTP pyrophosphatase 16.5e-2155.91Show/hide
Query:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKI
        +L ++R   AEFA  R W++ H PRNLLLALVGEVGEL+E+FQWK +   G   WS  ++  L+EELSDVL+YLV LA  C +DL  A LSK+
Subjt:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKI

Q9QY93 dCTP pyrophosphatase 15.0e-2153.12Show/hide
Query:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKN
        +L ++R   AEFA  R W++ H PRNLLLALVGEVGEL+E+FQWK +   G   W   ++  L+EELSDVL+YLV LA  C +DL +A +SK+  N
Subjt:  SLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKN

Arabidopsis top hitse value%identityAlignment
AT1G13940.1 Plant protein of unknown function (DUF863)3.7e-1128.57Show/hide
Query:  QIDLNVCINEDFLATPCCSP---EMKLEVPVTPRKEKHSSTRGEFDGNQVG--THFLKSVQDD--GKPSEDLITIVAEALVSISSSVAQNRQKTTASGQS
        QIDLN+ +  D     C  P    +  + P+   +    S   E DGN  G     +KS+++    KP E    + AE +V+I S+      +  AS + 
Subjt:  QIDLNVCINEDFLATPCCSP---EMKLEVPVTPRKEKHSSTRGEFDGNQVG--THFLKSVQDD--GKPSEDLITIVAEALVSISSSVAQNRQKTTASGQS

Query:  AQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKL---KETKVEGCSLTVSNHQ-EEAVKNVSSPSCQLGKGRVRRGR-G
        ++     L WFAE V++  ++ +  ++   S++   S E     ID FE MTL+L    E +     L   N + EE        S +  +G  R+G+  
Subjt:  AQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKL---KETKVEGCSLTVSNHQ-EEAVKNVSSPSCQLGKGRVRRGR-G

Query:  KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVAS-SQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVI-------RSIMDQVSS-----DNELENKE
        ++FQ +ILP L +LS++EVTEDIQ   G M     S + TG+ +  +GSR    R +R     ++P    +        S+   VS+     + E+  ++
Subjt:  KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVAS-SQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVI-------RSIMDQVSS-----DNELENKE

Query:  RNVIVWENITRRRRGQRYPACN
        R+   W  +TRR R QR P+ +
Subjt:  RNVIVWENITRRRRGQRYPACN

AT1G13940.1 Plant protein of unknown function (DUF863)3.8e-0827.91Show/hide
Query:  SNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHD-------ICITRTQSDFCKSQVSALCAQDAHDFPVYNRS
        +NGF S  +     G+ ++ +++ ML+ EA+F+ Q+ ELHR+Y+ Q++ + E+KR+  N +       +    T  D  K ++ +    ++    VY+R 
Subjt:  SNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHD-------ICITRTQSDFCKSQVSALCAQDAHDFPVYNRS

Query:  SIS-GEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPE
        S+S  ED     S ++ S      S  NG  SK+  +SE +   + + M D+ LP D +++++  EV E+ +
Subjt:  SIS-GEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPE

AT1G26620.1 Plant protein of unknown function (DUF863)1.3e-1931.39Show/hide
Query:  IDLNVCINED-----FLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQ-DDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQ
        IDLN C NED     FL++     + +  + +       S   G+   ++      + +Q  DG    +LI + AEA+V+IS +  Q      AS  +  
Subjt:  IDLNVCINED-----FLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQ-DDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQ

Query:  APCESLCWFAEIVSSMADD--------PETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGR
        A    L WFAEI++S  D+        PE  +     +D   SGE     ID FE MTL ++ETK E           E +K   +   +  +G+ RRGR
Subjt:  APCESLCWFAEIVSSMADD--------PETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGR

Query:  GK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCN-SSKPTETVIRSIMDQVSSDNELENKERNVIVWENIT
         K +FQ + LP L++LSR+EVTEDIQ  GGLM+       +G+    +  R      KR   N +  P    +   M++  S   LE+ +  +  W   T
Subjt:  GK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCN-SSKPTETVIRSIMDQVSSDNELENKERNVIVWENIT

Query:  RRRRGQRYP
        RR R QR P
Subjt:  RRRRGQRYP

AT1G26620.1 Plant protein of unknown function (DUF863)3.9e-0542.55Show/hide
Query:  YVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHN
        Y GY ++ +K  ML+ EA+F++Q++ELHRLY+ Q+  + E+K ++ N
Subjt:  YVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHN

AT1G69360.1 Plant protein of unknown function (DUF863)1.1e-2027.87Show/hide
Query:  DLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLR
        DL+   N +F+         R+ G S+N     +    RS T   D ++ +  +++S     +S +  +G PI  ++ L +  E+  L    +C + N  
Subjt:  DLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLR

Query:  SSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDD
          + +    +L     C  S S  +  +     KA+ +     IDLN C +ED  +  C +P ++ +       E   +   E +G +    F +   + 
Subjt:  SSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDFLATPCCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDD

Query:  GKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPET-AEIALKSKDV-GDSGELLPNYIDDFEIMTLKLKETKVEGC
        G   ++LI   AEA+V+IS S         AS  +     E L WF   ++S  +D E+  +  L+++D  G   E      D FE MTL L +TK E  
Subjt:  GKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPET-AEIALKSKDV-GDSGELLPNYIDDFEIMTLKLKETKVEGC

Query:  ---SLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNS--
            L     + +   ++   S +  +G+ RRGR K +FQ +ILP LA+LSR EVTED+Q  GGLM+       TG    S  +R S  RG++R  ++  
Subjt:  ---SLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNS--

Query:  SKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPA
          P  + +   M+  SS   +  ++R++  W N TRR R  R PA
Subjt:  SKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPA

AT1G69360.1 Plant protein of unknown function (DUF863)7.1e-0744.44Show/hide
Query:  DKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMK
        DK +  G   NGF +         Y R+ LKQ ML+ EA+F++Q+YELHRLY+ Q+  +AE+K
Subjt:  DKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMK

AT3G25400.1 CONTAINS InterPro DOMAIN/s: NTP Pyrophosphohydrolase MazG-related, RS21-C6 (InterPro:IPR011394), EAR (InterPro:IPR009039), NTP pyrophosphohydrolase MazG, putative catalytic core (InterPro:IPR004518); Has 1123 Blast hits to 1121 proteins in 452 species: Archae - 22; Bacteria - 753; Metazoa - 81; Fungi - 3; Plants - 83; Viruses - 0; Other Eukaryotes - 181 (source: NCBI BLink).1.2e-3571Show/hide
Query:  VSLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKNAIK
        VSL  L  ++ +FA AR W+K+H+PRNLLLA+VGEVGELSEIFQWKGEVARG P+W  ++K HL EELSDVLLYLVRL+D CG+DLG+AAL KI  NAIK
Subjt:  VSLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKNAIK

AT5G61300.1 unknown protein5.5e-0722.32Show/hide
Query:  LYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESH---NHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRSSI-SGEDKSLLLSSL
        LY+   +E LKQ ML QE++F  QIYELHRLY+RQ++ + +M+   H   N  +   R    +  S +S             +++S+   ED S  +   
Subjt:  LYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESH---NHDICITRTQSDFCKSQVSALCAQDAHDFPVYNRSSI-SGEDKSLLLSSL

Query:  EKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRDFSTTDSLLEETKVS
                     GI      + +S  K+L     D ELP        + E S+  E          +  R   L+  L     +D     S     K  
Subjt:  EKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPFLKYDLNASLHRDFSTTDSLLEETKVS

Query:  VDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSLSSSTTSLKKSAQVT
        +DLN+P + EE                E+       +T S+    S  +    ENG+  N    S       ++ QP +++ ++SL     S + S  + 
Subjt:  VDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANSSLSSSTTSLKKSAQVT

Query:  IGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSV----------SSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDFLATP
        +  P LE  SL              CS    ++  G+  V           S L+ +F     +  I E +H   + S++            ++      
Subjt:  IGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSV----------SSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDFLATP

Query:  CCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPE
        CC P+++ +  +       S  RG          FL   +++ + S      VA A   +  S++ + +K++        P   L WFA I SS+ +D +
Subjt:  CCSPEMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPE

Query:  TAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQ-TEILPSLATLSRYEVTEDIQTIG
         +E+ L       S    P      E MTL+L E K E             V N ++P  +    R    RGK  Q  E LPSL+T +  + ++ +Q I 
Subjt:  TAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQ-TEILPSLATLSRYEVTEDIQTIG

Query:  GLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPAC
                                            +P+E          +++    +    VI W  +T+RRRG R PAC
Subjt:  GLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPAC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGACTAACTACAAAGACGATGACGTTTCGCTGACAGAACTTAGAGACAGAGTGGCGGAGTTCGCCCTAGCCAGAGGGTGGGACAAGCATCACACCCCCAGAAA
TCTCCTGCTTGCCCTAGTGGGAGAAGTGGGGGAGCTATCGGAGATATTCCAATGGAAGGGAGAGGTGGCGAGGGGGCTGCCCAATTGGAGCGCCGACGACAAAGAGCATC
TAGAAGAAGAGCTTTCAGACGTGTTGCTGTATCTTGTTCGTTTGGCAGACGTGTGTGGGCTCGATCTCGGTCGAGCTGCTCTCTCCAAGATCCTCAAGAATGCCATTAAG
GTTCTAGCAACTATCCAAATTTTTATTTCAGTAGGGATATTAAAGGAGTTGTTAGTGGCTGTGTATGGCCTGTATGTCCAAATCAATGAGGATAAAATGGTGAAGGGTGG
ACATAGCTCCAATGGTTTCTTATCACCATTGTCAGCTGGTCTGTATGTCGGATATGGCCGAGAAGCATTAAAGCAGATAATGCTGAAGCAAGAAGCGATTTTTAGGGATC
AGATTTATGAACTTCATCGCCTATACAAAAGACAGAGGGAGTTTATAGCTGAGATGAAGAGGGAATCGCACAATCATGATATATGCATCACGAGAACTCAGTCTGATTTT
TGTAAGTCTCAGGTCTCGGCATTGTGTGCTCAGGATGCACATGACTTCCCTGTATACAATCGATCATCTATTTCAGGTGAAGATAAGAGCTTGTTACTTTCAAGTCTAGA
GAAGAGTGTTCAAGCAGCTTCTGACTCTGTTCTGAATGGAATCTGTTCAAAGAATCCTTATTTATCAGAATCTAAGAGCAAGATTTTGGGAAAAGGGATGTTTGATATTG
AACTTCCAGTTGACAGTAAACACTTGAACGAAAAACATGAAGTGTCAGAAGTGCCTGAAATGTCAAGCCATCATCTCCATAGAATACCTGAGATTGCTCGAAAACCGTTT
CTCAAGTATGATTTGAATGCTTCATTACATAGGGATTTTTCAACCACAGATTCATTATTAGAGGAAACCAAGGTCTCGGTTGATTTAAATGATCCTCCTAACTTTGAGGA
AGAACACGGTTGTAGATCCATAGATTTAGTGGATGCTACTGGCCACAGGGAGATACTTTTTCATGATCTCTCTGGCAAAACGAATTCAAAGTTTCTTGTTTTCTCAGAGC
AAGATATCCATGGAAGAGAAAATGGACTATCTAACAATGGCTATTCTGAATCGAGTAGCTTTTATGACAGAAGCAAAACATATCAACCTGACAAAGATATTGCGAATTCA
AGTTTGTCATCATCCACTACATCTTTGAAAAAATCTGCACAAGTTACCATAGGACACCCTATTTTAGAGGCTGAGAGTTTGAAAATGTTAATAGAAAATGCAATGCTTGC
TGAAGACGATCTGTGTAGTGTTAAGAACTTGAGATCTAGCATTGGGTCTGATAGTGTAAGCAGCCTCCTTGAGGGAAGCTTCTGCAATGGCTCCAAGTCGGGGAGCATAA
AAGAAGAATCTCATCCCTCGTTTAAGGCTTCTGCAAATTGGACCGGAGGTCAAATTGACCTCAATGTATGCATAAATGAGGATTTTTTGGCAACACCTTGTTGCTCCCCT
GAAATGAAATTGGAGGTACCTGTGACTCCAAGAAAGGAAAAACACTCTTCAACTAGAGGAGAGTTTGATGGCAACCAAGTTGGGACACATTTTCTAAAGTCAGTTCAAGA
TGATGGCAAACCATCAGAGGATCTCATTACGATTGTTGCAGAAGCTTTGGTTTCTATTTCATCATCTGTGGCTCAAAATCGTCAAAAAACCACTGCCAGCGGTCAGTCTG
CTCAAGCTCCATGTGAATCCTTATGCTGGTTTGCTGAAATAGTTTCTTCAATGGCAGATGATCCAGAGACGGCTGAAATAGCCTTGAAAAGTAAGGATGTTGGTGACTCA
GGGGAGCTTTTGCCAAATTACATAGATGACTTTGAGATCATGACATTGAAGTTGAAAGAAACAAAAGTGGAAGGGTGTTCCCTGACTGTAAGTAACCACCAAGAGGAAGC
AGTAAAAAATGTTTCTTCGCCTTCATGTCAACTAGGGAAGGGCCGTGTGAGGCGGGGTCGGGGCAAGAATTTTCAGACAGAAATTCTTCCAAGTCTTGCAACTTTGTCAA
GGTATGAGGTCACTGAAGATATTCAGACAATTGGAGGCCTGATGGAAGTTGCAAGTTCTCAATCGATAACTGGTGTAGTGAAAACTTCAAGTGGAAGTAGAGCGTCATGG
ACAAGAGGGAAGAGACGGTCCTGTAATTCATCAAAACCTACAGAGACTGTGATAAGGTCAATCATGGATCAAGTAAGCAGTGATAATGAACTGGAGAATAAAGAAAGGAA
TGTTATAGTTTGGGAAAATATTACACGGCGACGGAGGGGACAGAGATATCCAGCTTGTAACCGAAAGATTATCTTAGGACAAGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAGACTAACTACAAAGACGATGACGTTTCGCTGACAGAACTTAGAGACAGAGTGGCGGAGTTCGCCCTAGCCAGAGGGTGGGACAAGCATCACACCCCCAGAAA
TCTCCTGCTTGCCCTAGTGGGAGAAGTGGGGGAGCTATCGGAGATATTCCAATGGAAGGGAGAGGTGGCGAGGGGGCTGCCCAATTGGAGCGCCGACGACAAAGAGCATC
TAGAAGAAGAGCTTTCAGACGTGTTGCTGTATCTTGTTCGTTTGGCAGACGTGTGTGGGCTCGATCTCGGTCGAGCTGCTCTCTCCAAGATCCTCAAGAATGCCATTAAG
GTTCTAGCAACTATCCAAATTTTTATTTCAGTAGGGATATTAAAGGAGTTGTTAGTGGCTGTGTATGGCCTGTATGTCCAAATCAATGAGGATAAAATGGTGAAGGGTGG
ACATAGCTCCAATGGTTTCTTATCACCATTGTCAGCTGGTCTGTATGTCGGATATGGCCGAGAAGCATTAAAGCAGATAATGCTGAAGCAAGAAGCGATTTTTAGGGATC
AGATTTATGAACTTCATCGCCTATACAAAAGACAGAGGGAGTTTATAGCTGAGATGAAGAGGGAATCGCACAATCATGATATATGCATCACGAGAACTCAGTCTGATTTT
TGTAAGTCTCAGGTCTCGGCATTGTGTGCTCAGGATGCACATGACTTCCCTGTATACAATCGATCATCTATTTCAGGTGAAGATAAGAGCTTGTTACTTTCAAGTCTAGA
GAAGAGTGTTCAAGCAGCTTCTGACTCTGTTCTGAATGGAATCTGTTCAAAGAATCCTTATTTATCAGAATCTAAGAGCAAGATTTTGGGAAAAGGGATGTTTGATATTG
AACTTCCAGTTGACAGTAAACACTTGAACGAAAAACATGAAGTGTCAGAAGTGCCTGAAATGTCAAGCCATCATCTCCATAGAATACCTGAGATTGCTCGAAAACCGTTT
CTCAAGTATGATTTGAATGCTTCATTACATAGGGATTTTTCAACCACAGATTCATTATTAGAGGAAACCAAGGTCTCGGTTGATTTAAATGATCCTCCTAACTTTGAGGA
AGAACACGGTTGTAGATCCATAGATTTAGTGGATGCTACTGGCCACAGGGAGATACTTTTTCATGATCTCTCTGGCAAAACGAATTCAAAGTTTCTTGTTTTCTCAGAGC
AAGATATCCATGGAAGAGAAAATGGACTATCTAACAATGGCTATTCTGAATCGAGTAGCTTTTATGACAGAAGCAAAACATATCAACCTGACAAAGATATTGCGAATTCA
AGTTTGTCATCATCCACTACATCTTTGAAAAAATCTGCACAAGTTACCATAGGACACCCTATTTTAGAGGCTGAGAGTTTGAAAATGTTAATAGAAAATGCAATGCTTGC
TGAAGACGATCTGTGTAGTGTTAAGAACTTGAGATCTAGCATTGGGTCTGATAGTGTAAGCAGCCTCCTTGAGGGAAGCTTCTGCAATGGCTCCAAGTCGGGGAGCATAA
AAGAAGAATCTCATCCCTCGTTTAAGGCTTCTGCAAATTGGACCGGAGGTCAAATTGACCTCAATGTATGCATAAATGAGGATTTTTTGGCAACACCTTGTTGCTCCCCT
GAAATGAAATTGGAGGTACCTGTGACTCCAAGAAAGGAAAAACACTCTTCAACTAGAGGAGAGTTTGATGGCAACCAAGTTGGGACACATTTTCTAAAGTCAGTTCAAGA
TGATGGCAAACCATCAGAGGATCTCATTACGATTGTTGCAGAAGCTTTGGTTTCTATTTCATCATCTGTGGCTCAAAATCGTCAAAAAACCACTGCCAGCGGTCAGTCTG
CTCAAGCTCCATGTGAATCCTTATGCTGGTTTGCTGAAATAGTTTCTTCAATGGCAGATGATCCAGAGACGGCTGAAATAGCCTTGAAAAGTAAGGATGTTGGTGACTCA
GGGGAGCTTTTGCCAAATTACATAGATGACTTTGAGATCATGACATTGAAGTTGAAAGAAACAAAAGTGGAAGGGTGTTCCCTGACTGTAAGTAACCACCAAGAGGAAGC
AGTAAAAAATGTTTCTTCGCCTTCATGTCAACTAGGGAAGGGCCGTGTGAGGCGGGGTCGGGGCAAGAATTTTCAGACAGAAATTCTTCCAAGTCTTGCAACTTTGTCAA
GGTATGAGGTCACTGAAGATATTCAGACAATTGGAGGCCTGATGGAAGTTGCAAGTTCTCAATCGATAACTGGTGTAGTGAAAACTTCAAGTGGAAGTAGAGCGTCATGG
ACAAGAGGGAAGAGACGGTCCTGTAATTCATCAAAACCTACAGAGACTGTGATAAGGTCAATCATGGATCAAGTAAGCAGTGATAATGAACTGGAGAATAAAGAAAGGAA
TGTTATAGTTTGGGAAAATATTACACGGCGACGGAGGGGACAGAGATATCCAGCTTGTAACCGAAAGATTATCTTAGGACAAGTATAG
Protein sequenceShow/hide protein sequence
MEKTNYKDDDVSLTELRDRVAEFALARGWDKHHTPRNLLLALVGEVGELSEIFQWKGEVARGLPNWSADDKEHLEEELSDVLLYLVRLADVCGLDLGRAALSKILKNAIK
VLATIQIFISVGILKELLVAVYGLYVQINEDKMVKGGHSSNGFLSPLSAGLYVGYGREALKQIMLKQEAIFRDQIYELHRLYKRQREFIAEMKRESHNHDICITRTQSDF
CKSQVSALCAQDAHDFPVYNRSSISGEDKSLLLSSLEKSVQAASDSVLNGICSKNPYLSESKSKILGKGMFDIELPVDSKHLNEKHEVSEVPEMSSHHLHRIPEIARKPF
LKYDLNASLHRDFSTTDSLLEETKVSVDLNDPPNFEEEHGCRSIDLVDATGHREILFHDLSGKTNSKFLVFSEQDIHGRENGLSNNGYSESSSFYDRSKTYQPDKDIANS
SLSSSTTSLKKSAQVTIGHPILEAESLKMLIENAMLAEDDLCSVKNLRSSIGSDSVSSLLEGSFCNGSKSGSIKEESHPSFKASANWTGGQIDLNVCINEDFLATPCCSP
EMKLEVPVTPRKEKHSSTRGEFDGNQVGTHFLKSVQDDGKPSEDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDS
GELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASW
TRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPACNRKIILGQV