CuGenDBv2

HG10018442 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018442
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF620)
Genome location: Chr04:4210559..4212859
RNA-Seq Expression: HG10018442
Synteny: HG10018442
Gene Ontology terms: NA
InterPro domains: IPR006873 - Protein of unknown function DUF620


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041504.1 hypothetical protein E6C27_scaffold6G00960 [Cucumis melo var. makuwa] (e value: 8.8e-224; identity: 90.53%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN  DKSSHVTSI SLFGGRNAEIQLLLGVVGAPLIPLPITFD QQPITR NI
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG
        KDNPIEASMAKYIVQQYVAAVGGEHALNCI+SMYAMGKVKMAASEF SGE    GKA VKGKNNGK      GGGGGE+GGFVVWQKRPELWCLELML G
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
         KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
Subjt:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        LVQLEDSHLLRIK A SRND NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEE
Subjt:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE

Query:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF
         GVG+I TS GKFP T+RCS  SR FSSRVAAIDQS+++ESE  +   DEDEDF
Subjt:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF

XP_004138438.2 uncharacterized protein LOC101207651 [Cucumis sativus] (e value: 1.8e-224; identity: 90.55%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSN TKTHAISWQAMKSWVKSN  DKSSH TSI SLFGGRNAEIQLLLGVVGAPLIPLPITFD QQPI R NI
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK-------GGGGGEMGGFVVWQKRPELWCLELMLS
        KDNPIEASMAKYIVQQYVAAVGGEHALNCI+SMYAMGKVKMAASEF SGE    GKAAVKGKNNGK       GGGGGEMGGFVVWQKRPELWCLELML 
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK-------GGGGGEMGGFVVWQKRPELWCLELMLS

Query:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
        G KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
Subjt:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE
        LLVQLEDSHLLRIK AGSRND NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKE
Subjt:  LLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE

Query:  EGGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF
        E GVG+I TS GKFPLTMRCS  SR FSSRVAAIDQS+++ESE  +   +EDEDF
Subjt:  EGGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF

XP_008441425.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485545 [Cucumis melo] (e value: 2.6e-223; identity: 90.31%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN  DKSSHVTSI SLFGGRNAEIQLLLGVVGAPLIPLPITFD QQPITR NI
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG
        KDNPIEASMAKYIVQQYVAAVGGEHALNCI+SMYAMGKVKMAASEF SGE    GKA VKGKNNGK      GGGGGE+GGFVVW+KRPELWCLELML G
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
         KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
Subjt:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        LVQLEDSHLLRIK A SRND NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEE
Subjt:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE

Query:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF
         GVG+I TS GKFP T+RCS  SR FSSRVAAIDQS+++ESE  +   DEDEDF
Subjt:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia] (e value: 1.1e-218; identity: 88.76%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD
        MRKLCPNFDRE GLDTVLEVPIPEEMFS N  KTHAISWQAMKSWVKS H+D  SHV S+A+LFGGRNAEIQLLLGVVGAPLIP+P+ FD +PITRNIKD
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
        NPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKM ASEF SGEG LNGK  +K K NGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD

Query:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH
        GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH
Subjt:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH

Query:  LLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVITS
        LLRIKA GSRND +IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE GV VITS
Subjt:  LLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVITS

Query:  NGKFPLTMRC-SAGSRIFSSRVAAIDQSDEDESERRSNQSDEDED
        NGKFPLTMRC +A S+I SSRVAAID   ++ SE  SNQSDEDED
Subjt:  NGKFPLTMRC-SAGSRIFSSRVAAIDQSDEDESERRSNQSDEDED

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida] (e value: 2.0e-239; identity: 94.87%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN-HHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSS  TKTH ISWQAMKSWVKSN HHDKSSHV SI+SLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN-HHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK--GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQY+AAVGGEHALN IDSMYAMGKVKMAASEFCSGEGCLNGKA   GKNNGK  GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK--GGGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGV
        DSHLLRIK AGSRND NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEE GVG+
Subjt:  DSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGV

Query:  ITSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF
        ITSNGKFP+TMRCSAGSR+FSSRV AIDQSDEDESE  SNQSDEDEDF
Subjt:  ITSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KBJ0 Uncharacterized protein (e value: 8.6e-225; identity: 90.55%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSN TKTHAISWQAMKSWVKSN  DKSSH TSI SLFGGRNAEIQLLLGVVGAPLIPLPITFD QQPI R NI
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK-------GGGGGEMGGFVVWQKRPELWCLELMLS
        KDNPIEASMAKYIVQQYVAAVGGEHALNCI+SMYAMGKVKMAASEF SGE    GKAAVKGKNNGK       GGGGGEMGGFVVWQKRPELWCLELML 
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK-------GGGGGEMGGFVVWQKRPELWCLELMLS

Query:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
        G KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
Subjt:  GCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE
        LLVQLEDSHLLRIK AGSRND NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKE
Subjt:  LLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKE

Query:  EGGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF
        E GVG+I TS GKFPLTMRCS  SR FSSRVAAIDQS+++ESE  +   +EDEDF
Subjt:  EGGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF

A0A1S3B3D9 LOW QUALITY PROTEIN: uncharacterized protein LOC103485545 (e value: 1.2e-223; identity: 90.31%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN  DKSSHVTSI SLFGGRNAEIQLLLGVVGAPLIPLPITFD QQPITR NI
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG
        KDNPIEASMAKYIVQQYVAAVGGEHALNCI+SMYAMGKVKMAASEF SGE    GKA VKGKNNGK      GGGGGE+GGFVVW+KRPELWCLELML G
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
         KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
Subjt:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        LVQLEDSHLLRIK A SRND NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEE
Subjt:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE

Query:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF
         GVG+I TS GKFP T+RCS  SR FSSRVAAIDQS+++ESE  +   DEDEDF
Subjt:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF

A0A5A7TDK0 Uncharacterized protein (e value: 4.3e-224; identity: 90.53%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN  DKSSHVTSI SLFGGRNAEIQLLLGVVGAPLIPLPITFD QQPITR NI
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-QQPITR-NI

Query:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG
        KDNPIEASMAKYIVQQYVAAVGGEHALNCI+SMYAMGKVKMAASEF SGE    GKA VKGKNNGK      GGGGGE+GGFVVWQKRPELWCLELML G
Subjt:  KDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGK------GGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
         KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL
Subjt:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        LVQLEDSHLLRIK A SRND NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEE
Subjt:  LVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE

Query:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF
         GVG+I TS GKFP T+RCS  SR FSSRVAAIDQS+++ESE  +   DEDEDF
Subjt:  GGVGVI-TSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSDEDEDF

A0A6J1BVR5 uncharacterized protein LOC111006191 (e value: 5.4e-219; identity: 88.76%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD
        MRKLCPNFDRE GLDTVLEVPIPEEMFS N  KTHAISWQAMKSWVKS H+D  SHV S+A+LFGGRNAEIQLLLGVVGAPLIP+P+ FD +PITRNIKD
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
        NPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKM ASEF SGEG LNGK  +K K NGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD

Query:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH
        GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES+VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH
Subjt:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH

Query:  LLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVITS
        LLRIKA GSRND +IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE GV VITS
Subjt:  LLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVITS

Query:  NGKFPLTMRC-SAGSRIFSSRVAAIDQSDEDESERRSNQSDEDED
        NGKFPLTMRC +A S+I SSRVAAID   ++ SE  SNQSDEDED
Subjt:  NGKFPLTMRC-SAGSRIFSSRVAAIDQSDEDESERRSNQSDEDED

A0A6J1GZA9 uncharacterized protein LOC111458877 (e value: 7.8e-218; identity: 88.44%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD
        MRKLCPNFDRE GLDTVLEVPIPEEMFS N  K HAISWQAMK+WVKSNHHDKSSHV SIASLFGGRNAEIQLLLGVVGAPLIPLPI F  QPIT NIKD
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGG--GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALN IDSMYAMGKVKM ASEF SGE CLNGKA       G GG  GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGG--GGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVI
        SHLLRIKA GSRND +IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+ GVG  
Subjt:  SHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVI

Query:  TSNGKFPLTMRC----SAGSRIFSSRVAAIDQSDEDESERRSNQSDEDED
        TSNGK PLTMRC    +AGS+I SSRVAAI   D DESE  SNQSDEDE+
Subjt:  TSNGKFPLTMRC----SAGSRIFSSRVAAIDQSDEDESERRSNQSDEDED

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G75160.1 Protein of unknown function (DUF620) (e value: 2.5e-107; identity: 50.49%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIAS-----------------LFGGRNAEIQLLLGVVGAPLI
        MRKLCPN DRE GL+TVLEVP+PEEMF+   +      W+ M + +K++     + VT++A+                 L    + E   LL +VG+PLI
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIAS-----------------LFGGRNAEIQLLLGVVGAPLI

Query:  P--LPITFDQQPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRP
        P  +P+ F    ++R I D  IEAS AKYIVQQYVAA GG  ALN + SMYA+G+V+M  SE  +GE    G     GK      G  E+GGFV+WQK P
Subjt:  P--LPITFDQQPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRP

Query:  ELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHT
         LW LEL++SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HT
Subjt:  ELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHT

Query:  VWGYFSQRTGLLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDF
        VWGYFSQRTGLLV+  D+ L+R+K+   +ND  +FWET+ME++I DY  +D VNIAH G+T  +L+R+G     + + ++EE W IEEVDFNI GL ++ 
Subjt:  VWGYFSQRTGLLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDF

Query:  FLPPSDLKKE
        FLPPSD+  +
Subjt:  FLPPSDLKKE

AT3G19540.1 Protein of unknown function (DUF620) (e value: 5.2e-97; identity: 44.29%)
Query:  REYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKDNPIEASMAK
        R   L  V+E P P+E     N          +  WVK     + S   S+A+    R  +++LLLGV+GAPL P+ ++        +IK+ PIE S A+
Subjt:  REYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKDNPIEASMAK

Query:  YIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTP
        YI+QQY AA GG+   N I + YAMGK+KM  SE             V+ +N  K     E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR TP
Subjt:  YIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTP

Query:  WHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAAGS
        W  SH ++GP RPLRR LQGLDP++TA +F+ + CIGEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++ G 
Subjt:  WHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAAGS

Query:  RNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVITSNGKFPL---
             +FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A  H++TKMEE W IEEV FN+ GLS+D F+PP+DLK      G +T + ++P    
Subjt:  RNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVITSNGKFPL---

Query:  ----TMRCSAGSRIFSSRVAAIDQSDEDE
            T+  SA  R   ++VAA++    ++
Subjt:  ----TMRCSAGSRIFSSRVAAIDQSDEDE

AT3G55720.1 Protein of unknown function (DUF620) (e value: 9.6e-136; identity: 57.98%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQ----QPITR
        MR LCPNFDRE GL+TVLEVP+PEE+F S+N K+ A  W+++KS +  +  D SS   S+A+LFGGR+++IQ+LLG+VGAP IPLPI+ DQ     PI+ 
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQ----QPITR

Query:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNG---KAAVKGKN-NGKGGGGGEMGGFVVWQKRPELWCLELMLSG
         IK+  IE++MAKYIV+QY AA GGE AL+ ++SMYAMGKVKM  +EFC+ +  LNG   K  V+ +N N   G GGEMGGFV+W+K    W LEL++SG
Subjt:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNG---KAAVKGKN-NGKGGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG
        CK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE + S L++RS S +E ++HTVWG F QRTG
Subjt:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL---
        LLVQLED++L+RIK  G  +++ + WETT ETLIQDY++IDG+ IAH GKT VSL R  E+ E HSKT MEE WEIEEV FN+KGLS DFFLPP DL   
Subjt:  LLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDL---

Query:  KKEEGG--VGVITSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDE
        ++EE G   G  TS    PL +  ++  +I SS+V AI+   E E
Subjt:  KKEEGG--VGVITSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDE

AT5G05840.1 Protein of unknown function (DUF620) (e value: 1.4e-163; identity: 67.42%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKS-WVK-SNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-----QQP
        MRKLCPN++ E GL+TVLEVP+PEE+F+++ TK     W  MKS W K +     ++  T++  LFGGRNAEIQLLLGVVGAPLIPLP+  D     + P
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKS-WVK-SNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFD-----QQP

Query:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGC
        I ++IKD P+E SMA+YIV+QY+AAVGG+ ALN ++SMYAMGKV+M ASEFC+GEG LN K     K      GGGE+GGFV+WQK  ELWCLEL++SGC
Subjt:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGC

Query:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLL
        KISAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE S L+ARSSS+VEIIRHTVWG FSQRTGLL
Subjt:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLL

Query:  VQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK---
        +QLEDSHLLRIKA   ++DN+IFWETTME+LIQDYRT+DG+ +AHAGK++VSLFRFGE ++ HS+T+MEE WEIEE+DFNIKGLSMD FLPPSDLKK   
Subjt:  VQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKK---

Query:  --EEGGVGVITSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESE
          EE   G+  +N K P+ +R SA  RI SS+V AI + +EDESE
Subjt:  --EEGGVGVITSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESE

AT5G66740.1 Protein of unknown function (DUF620) (e value: 4.2e-115; identity: 53.57%)
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD
        MRKLCPN D++ GL+TVLEVPIPEEMFS       A+ WQ M +W+K+   DK S       L   R  E++ LL +VG+PLIPL +      + + +KD
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD
          I+AS AKYIVQQY+AA GG  ALN ++SM   G+VKM ASEF  G+      + V  K+N       EMGGFV+WQK P+LWCLEL++SGCK+  GS+
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSD

Query:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH
        G+++WR +    + AS G PRPLRRFLQGLDP+STA LF ++TCIGEK IN EDCFILKLE   +V  A+S  + EII HT+WGYFSQR+GLL+Q EDS 
Subjt:  GKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSH

Query:  LLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        LLR++   ++ D ++FWET+ E+++ DYR +D VNIAH GKT+V++FR+GE +  H + +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  LLRIKAAGSRNDNNIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEE


Sequences
CDS sequence
ATGAGGAAGCTTTGCCCCAACTTCGACCGCGAATACGGTCTCGACACTGTCCTTGAGGTACCCATCCCCGAAGAGATGTTTTCTTCCAACAACACTAAAACCCATGCTAT
CTCATGGCAAGCTATGAAATCTTGGGTGAAATCGAATCATCATGACAAATCATCACATGTTACTTCAATTGCGTCTTTATTTGGAGGCCGCAATGCCGAGATCCAGCTTT
TACTTGGCGTTGTGGGAGCTCCGTTAATTCCACTCCCCATCACTTTTGATCAACAACCCATTACTCGCAACATCAAAGACAATCCCATTGAGGCGTCTATGGCGAAGTAC
ATAGTGCAACAATATGTAGCTGCAGTGGGAGGAGAACATGCGTTGAATTGTATTGATAGTATGTATGCTATGGGGAAAGTGAAGATGGCTGCGTCGGAGTTTTGTTCAGG
CGAAGGGTGTTTGAATGGAAAAGCGGCGGTGAAGGGGAAGAATAACGGGAAAGGTGGCGGCGGGGGAGAGATGGGGGGATTTGTGGTTTGGCAGAAACGGCCGGAGTTAT
GGTGCTTGGAATTGATGCTGTCAGGCTGTAAAATTAGCGCCGGTAGCGATGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCACTCACATGCTTCTCGTGGCCCTCCT
CGCCCCTTACGCCGATTCTTGCAGGGACTGGATCCGAAATCGACGGCGACTCTGTTCTCAAACTCCACCTGCATCGGCGAGAAAACGATCAACGACGAAGATTGCTTCAT
TCTAAAACTGGAAGCCGAATCTTCAGTCCTGAGAGCAAGGAGCAGTAGCAGCGTCGAAATAATCCGACACACAGTTTGGGGATATTTCAGCCAAAGAACCGGCCTCCTCG
TGCAACTTGAAGATTCGCATCTCCTTCGAATCAAAGCCGCCGGATCTCGAAACGACAACAACATCTTCTGGGAAACGACAATGGAAACCCTAATTCAGGACTATAGAACG
ATCGACGGCGTCAACATTGCACACGCCGGTAAAACAACCGTCTCGCTTTTCCGATTTGGTGAAACTGCGGAAGGGCATTCGAAAACGAAGATGGAAGAGATTTGGGAGAT
CGAGGAAGTTGATTTCAATATCAAGGGTTTGTCGATGGACTTCTTTTTGCCTCCGAGTGATTTGAAGAAGGAGGAAGGAGGAGTTGGTGTGATTACGAGTAATGGAAAGT
TTCCGTTGACGATGCGATGTTCGGCTGGGTCGAGGATTTTCTCGTCCAGAGTGGCGGCCATTGATCAGAGTGATGAAGATGAATCGGAGAGGAGGAGTAATCAGAGTGAT
GAAGATGAAGATTTCTGA
mRNA sequence
ATGAGGAAGCTTTGCCCCAACTTCGACCGCGAATACGGTCTCGACACTGTCCTTGAGGTACCCATCCCCGAAGAGATGTTTTCTTCCAACAACACTAAAACCCATGCTAT
CTCATGGCAAGCTATGAAATCTTGGGTGAAATCGAATCATCATGACAAATCATCACATGTTACTTCAATTGCGTCTTTATTTGGAGGCCGCAATGCCGAGATCCAGCTTT
TACTTGGCGTTGTGGGAGCTCCGTTAATTCCACTCCCCATCACTTTTGATCAACAACCCATTACTCGCAACATCAAAGACAATCCCATTGAGGCGTCTATGGCGAAGTAC
ATAGTGCAACAATATGTAGCTGCAGTGGGAGGAGAACATGCGTTGAATTGTATTGATAGTATGTATGCTATGGGGAAAGTGAAGATGGCTGCGTCGGAGTTTTGTTCAGG
CGAAGGGTGTTTGAATGGAAAAGCGGCGGTGAAGGGGAAGAATAACGGGAAAGGTGGCGGCGGGGGAGAGATGGGGGGATTTGTGGTTTGGCAGAAACGGCCGGAGTTAT
GGTGCTTGGAATTGATGCTGTCAGGCTGTAAAATTAGCGCCGGTAGCGATGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCACTCACATGCTTCTCGTGGCCCTCCT
CGCCCCTTACGCCGATTCTTGCAGGGACTGGATCCGAAATCGACGGCGACTCTGTTCTCAAACTCCACCTGCATCGGCGAGAAAACGATCAACGACGAAGATTGCTTCAT
TCTAAAACTGGAAGCCGAATCTTCAGTCCTGAGAGCAAGGAGCAGTAGCAGCGTCGAAATAATCCGACACACAGTTTGGGGATATTTCAGCCAAAGAACCGGCCTCCTCG
TGCAACTTGAAGATTCGCATCTCCTTCGAATCAAAGCCGCCGGATCTCGAAACGACAACAACATCTTCTGGGAAACGACAATGGAAACCCTAATTCAGGACTATAGAACG
ATCGACGGCGTCAACATTGCACACGCCGGTAAAACAACCGTCTCGCTTTTCCGATTTGGTGAAACTGCGGAAGGGCATTCGAAAACGAAGATGGAAGAGATTTGGGAGAT
CGAGGAAGTTGATTTCAATATCAAGGGTTTGTCGATGGACTTCTTTTTGCCTCCGAGTGATTTGAAGAAGGAGGAAGGAGGAGTTGGTGTGATTACGAGTAATGGAAAGT
TTCCGTTGACGATGCGATGTTCGGCTGGGTCGAGGATTTTCTCGTCCAGAGTGGCGGCCATTGATCAGAGTGATGAAGATGAATCGGAGAGGAGGAGTAATCAGAGTGAT
GAAGATGAAGATTTCTGA
Protein sequence
MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHHDKSSHVTSIASLFGGRNAEIQLLLGVVGAPLIPLPITFDQQPITRNIKDNPIEASMAKY
IVQQYVAAVGGEHALNCIDSMYAMGKVKMAASEFCSGEGCLNGKAAVKGKNNGKGGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPP
RPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESSVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAAGSRNDNNIFWETTMETLIQDYRT
IDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEEIWEIEEVDFNIKGLSMDFFLPPSDLKKEEGGVGVITSNGKFPLTMRCSAGSRIFSSRVAAIDQSDEDESERRSNQSD
EDEDF