; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G06870 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G06870
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF620)
Genome locationClcChr11:6862031..6864649
RNA-Seq ExpressionClc11G06870
SyntenyClc11G06870
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041504.1 hypothetical protein E6C27_scaffold6G00960 [Cucumis melo var. makuwa]8.0e-22290.48Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN DKSSHVTS TS+FGGRNAEIQLLLGVVGAPLIPLPI FD QQPITR NIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC
        DNPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKMAASEFWSGE    GKA VKGKNNGK       GGGGE+GGFVVWQKRPELWCLELML G 
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC

Query:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL
        KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLL
Subjt:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL

Query:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
        VQLEDSHLLRIK   SRN+NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
Subjt:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG

Query:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG
        VG++ TS GKFP T+RCS  SR FSSRV AIDQS+++ESEG
Subjt:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG

XP_004138438.2 uncharacterized protein LOC101207651 [Cucumis sativus]3.2e-22390.72Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSN TKTHAISWQAMKSWVKSN DKSSH TS TS+FGGRNAEIQLLLGVVGAPLIPLPI FD QQPI R NIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK--------GGGGEMGGFVVWQKRPELWCLELMLSG
        DNPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKMAASEFWSGE    GKAAVKGKNNGK        GGGGEMGGFVVWQKRPELWCLELML G
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK--------GGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGL
         KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGL
Subjt:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEE
        LVQLEDSHLLRIK  GSRN+NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEE
Subjt:  LVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEE

Query:  GVGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG
        GVG++ TS GKFPLTMRCS  SR FSSRV AIDQS+++ESEG
Subjt:  GVGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG

XP_008441425.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485545 [Cucumis melo]2.3e-22190.25Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN DKSSHVTS TS+FGGRNAEIQLLLGVVGAPLIPLPI FD QQPITR NIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC
        DNPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKMAASEFWSGE    GKA VKGKNNGK       GGGGE+GGFVVW+KRPELWCLELML G 
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC

Query:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL
        KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLL
Subjt:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL

Query:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
        VQLEDSHLLRIK   SRN+NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
Subjt:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG

Query:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG
        VG++ TS GKFP T+RCS  SR FSSRV AIDQS+++ESEG
Subjt:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG

XP_022133661.1 uncharacterized protein LOC111006191 [Momordica charantia]7.0e-21887.07Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKDN
        MRKLCPNFDRE GLDTVLEVPIPEEMFS N  KTHAISWQAMKSWVKS++D  SHV S  ++FGGRNAEIQLLLGVVGAPLIP+P+ FD +PITRNIKDN
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
        PIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKM ASEF SGEG LNGK  +K KN   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK

Query:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
        VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
Subjt:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL

Query:  RIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVTSNGK
        RIKAGGSRN++IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV V+TSNGK
Subjt:  RIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVTSNGK

Query:  FPLTMRC-SAGSRIFSSRVVAID----------QSDEDESE
        FPLTMRC +A S+I SSRV AID          QSDEDE E
Subjt:  FPLTMRC-SAGSRIFSSRVVAID----------QSDEDESE

XP_038886072.1 uncharacterized protein LOC120076338 [Benincasa hispida]2.8e-23594.02Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN--HDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSS  TKTH ISWQAMKSWVKSN  HDKSSHV S +S+FGGRNAEIQLLLGVVGAPLIPLPI FDQQPITRNIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN--HDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK---GGGGEMGGFVVWQKRPELWCLELMLSGCKISA
        DNPIEASMAKYIVQQY+AAVGGEHALNSIDSMYAMGKVKMAASEF SGEGCLNGKA   GKNNGK   GGGGEMGGFVVWQKRPELWCLELMLSGCKISA
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK---GGGGEMGGFVVWQKRPELWCLELMLSGCKISA

Query:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
        GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE
Subjt:  GSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLE

Query:  DSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVV
        DSHLLRIK  GSRN+NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVG++
Subjt:  DSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVV

Query:  TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESE
        TSNGKFP+TMRCSAGSR+FSSRVVAIDQSDEDESE
Subjt:  TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESE

TrEMBL top hitse value%identityAlignment
A0A0A0KBJ0 Uncharacterized protein1.6e-22390.72Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSN TKTHAISWQAMKSWVKSN DKSSH TS TS+FGGRNAEIQLLLGVVGAPLIPLPI FD QQPI R NIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK--------GGGGEMGGFVVWQKRPELWCLELMLSG
        DNPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKMAASEFWSGE    GKAAVKGKNNGK        GGGGEMGGFVVWQKRPELWCLELML G
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK--------GGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGL
         KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGL
Subjt:  CKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGL

Query:  LVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEE
        LVQLEDSHLLRIK  GSRN+NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEE
Subjt:  LVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEE

Query:  GVGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG
        GVG++ TS GKFPLTMRCS  SR FSSRV AIDQS+++ESEG
Subjt:  GVGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG

A0A1S3B3D9 LOW QUALITY PROTEIN: uncharacterized protein LOC1034855451.1e-22190.25Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN DKSSHVTS TS+FGGRNAEIQLLLGVVGAPLIPLPI FD QQPITR NIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC
        DNPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKMAASEFWSGE    GKA VKGKNNGK       GGGGE+GGFVVW+KRPELWCLELML G 
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC

Query:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL
        KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLL
Subjt:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL

Query:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
        VQLEDSHLLRIK   SRN+NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
Subjt:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG

Query:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG
        VG++ TS GKFP T+RCS  SR FSSRV AIDQS+++ESEG
Subjt:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG

A0A5A7TDK0 Uncharacterized protein3.9e-22290.48Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK
        MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN DKSSHVTS TS+FGGRNAEIQLLLGVVGAPLIPLPI FD QQPITR NIK
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-QQPITR-NIK

Query:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC
        DNPIEASMAKYIVQQYVAAVGGEHALN I+SMYAMGKVKMAASEFWSGE    GKA VKGKNNGK       GGGGE+GGFVVWQKRPELWCLELML G 
Subjt:  DNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGK-------GGGGEMGGFVVWQKRPELWCLELMLSGC

Query:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL
        KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLL
Subjt:  KISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLL

Query:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
        VQLEDSHLLRIK   SRN+NIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEG
Subjt:  VQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEG

Query:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG
        VG++ TS GKFP T+RCS  SR FSSRV AIDQS+++ESEG
Subjt:  VGVV-TSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEG

A0A6J1BVR5 uncharacterized protein LOC1110061913.4e-21887.07Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKDN
        MRKLCPNFDRE GLDTVLEVPIPEEMFS N  KTHAISWQAMKSWVKS++D  SHV S  ++FGGRNAEIQLLLGVVGAPLIP+P+ FD +PITRNIKDN
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKDN

Query:  PIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
        PIEASMAKYIVQQYVAAVGGEHALNSI+SMYAMGKVKM ASEF SGEG LNGK  +K KN   GGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK
Subjt:  PIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGK

Query:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
        VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNS+C+GEKTINDEDCFILKLEAES VLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL
Subjt:  VAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLL

Query:  RIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVTSNGK
        RIKAGGSRN++IFWETTMETLIQDYRTIDGVNIAHAGKT+VSLFRFGE+AEGHS+TKMEE WEIEEVDFNIKGLSMDFFLPPSDLKKEEEGV V+TSNGK
Subjt:  RIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVTSNGK

Query:  FPLTMRC-SAGSRIFSSRVVAID----------QSDEDESE
        FPLTMRC +A S+I SSRV AID          QSDEDE E
Subjt:  FPLTMRC-SAGSRIFSSRVVAID----------QSDEDESE

A0A6J1GZA9 uncharacterized protein LOC1114588774.9e-21788.06Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN-HDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKD
        MRKLCPNFDRE GLDTVLEVPIPEEMFS N  K HAISWQAMK+WVKSN HDKSSHV S  S+FGGRNAEIQLLLGVVGAPLIPLPI F  QPIT NIKD
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSN-HDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAA-VKGKNNGKG--GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
        NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKM ASEF SGE CLNGKA  VK    G G  GGGEMGGFVVWQKRPELWCLELMLSGCKISAG
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAA-VKGKNNGKG--GGGEMGGFVVWQKRPELWCLELMLSGCKISAG

Query:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED
        SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLV+LED
Subjt:  SDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLED

Query:  SHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVT
        SHLLRIKAGGSRN++IFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGE+AEGHSKTKMEE WEIEEVDFNI+GLSMDFFLPPSDLKKE+EGVG  T
Subjt:  SHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVT

Query:  SNGKFPLTMRC----SAGSRIFSSRVVAID--------QSDEDE
        SNGK PLTMRC    +AGS+I SSRV AID        QSDEDE
Subjt:  SNGKFPLTMRC----SAGSRIFSSRVVAID--------QSDEDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G75160.1 Protein of unknown function (DUF620)1.2e-10951.12Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVT-----STTSVFGGRNAEIQ--------LLLGVVGAPLIP--LP
        MRKLCPN DRE GL+TVLEVP+PEEMF+   +      W+ M + +K++   ++  T     +++S     N  +Q         LL +VG+PLIP  +P
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVT-----STTSVFGGRNAEIQ--------LLLGVVGAPLIP--LP

Query:  IAFDQQPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLE
        + F    ++R I D  IEAS AKYIVQQYVAA GG  ALN++ SMYA+G+V+M  SE  +GE    G     GK     G  E+GGFV+WQK P LW LE
Subjt:  IAFDQQPITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLE

Query:  LMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFS
        L++SG KISAGSDGKVAW Q+    S A RGPPRPLRRF QGLDP+ TA+LF ++ CIGE+ +N EDCF+LK+E  S +L+A+ S + E+I HTVWGYFS
Subjt:  LMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFS

Query:  QRTGLLVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDL
        QRTGLLV+  D+ L+R+K+G  +N+ +FWET+ME++I DY  +D VNIAH G+T  +L+R+G     + + ++EE W IEEVDFNI GL ++ FLPPSD+
Subjt:  QRTGLLVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDL

Query:  KKE
          +
Subjt:  KKE

AT3G19540.1 Protein of unknown function (DUF620)4.1e-9946.56Show/hide
Query:  REYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKDNPIEASMAKY
        R   L  V+E P P+E     N          +  WVK    ++  V +T +    R  +++LLLGV+GAPL P+ ++        +IK+ PIE S A+Y
Subjt:  REYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKDNPIEASMAKY

Query:  IVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWH
        I+QQY AA GG+   NSI + YAMGK+KM  SE          + A +   N      E GGFV+WQ  P++W +EL + G K+ AG +GK+ WR TPW 
Subjt:  IVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWH

Query:  HSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRN
         SH ++GP RPLRR LQGLDP++TA +F+ + CIGEK +N EDCFILKL  +   L+ARS    EIIRH ++GYFSQ+TGLLV +EDSHL RI++ G   
Subjt:  HSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRN

Query:  ENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVTSNGKFP
        E +FWETT  + + DYR ++G+ IAH+G + V+LFRFGE A  H++TKMEE+W IEEV FN+ GLS+D F+PP+DLK      G +T + ++P
Subjt:  ENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVTSNGKFP

AT3G55720.1 Protein of unknown function (DUF620)3.2e-13657.43Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMK-SWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQ----QPITR
        MR LCPNFDRE GL+TVLEVP+PEE+F S+N K+ A  W+++K S ++S  D SS   S  ++FGGR+++IQ+LLG+VGAP IPLPI+ DQ     PI+ 
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMK-SWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQ----QPITR

Query:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGK-----AAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSG
         IK+  IE++MAKYIV+QY AA GGE AL++++SMYAMGKVKM  +EF + +  LNGK       ++  NN  G GGEMGGFV+W+K    W LEL++SG
Subjt:  NIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGK-----AAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSG

Query:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTG
        CK+SAG DG V WRQ+PW  HSHAS  P  PLRRFLQGLDPK+TA LF+ S C+GEK +N+E+CF+LKLE +   L++RS S +E ++HTVWG F QRTG
Subjt:  CKISAGSDGKVAWRQTPW-HHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTG

Query:  LLVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDL--KK
        LLVQLED++L+RIK G    + + WETT ETLIQDY++IDG+ IAH GKT VSL R  E+ E HSKT MEE+WEIEEV FN+KGLS DFFLPP DL  K+
Subjt:  LLVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDL--KK

Query:  EEE---GVGVVTSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDE
        EEE     G  TS    PL +  ++  +I SS+V AI+   E E
Subjt:  EEE---GVGVVTSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDE

AT5G05840.1 Protein of unknown function (DUF620)1.4e-16367.72Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKS-WVKSNHDKSSHVTST--TSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-----QQP
        MRKLCPN++ E GL+TVLEVP+PEE+F+++ TK     W  MKS W K     +   T+T  T +FGGRNAEIQLLLGVVGAPLIPLP+  D     + P
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKS-WVKSNHDKSSHVTST--TSVFGGRNAEIQLLLGVVGAPLIPLPIAFD-----QQP

Query:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCK
        I ++IKD P+E SMA+YIV+QY+AAVGG+ ALN+++SMYAMGKV+M ASEF +GEG LN K  VK ++  K GGGE+GGFV+WQK  ELWCLEL++SGCK
Subjt:  ITRNIKDNPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCK

Query:  ISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLV
        ISAGSD KVAWRQTPWH SHASRGPPRPLRRFLQGLDPKSTA LF+ S C+GEK INDEDCFILKL+AE   L+ARSSS+VEIIRHTVWG FSQRTGLL+
Subjt:  ISAGSDGKVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLV

Query:  QLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKK---EE
        QLEDSHLLRIKA    + +IFWETTME+LIQDYRT+DG+ +AHAGK++VSLFRFGE ++ HS+T+MEE WEIEE+DFNIKGLSMD FLPPSDLKK   EE
Subjt:  QLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKK---EE

Query:  EGV--GVVTSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESE
        E +  G+  +N K P+ +R SA  RI SS+V+AI + +EDESE
Subjt:  EGV--GVVTSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESE

AT5G66740.1 Protein of unknown function (DUF620)4.5e-11453.59Show/hide
Query:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKS-NHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKD
        MRKLCPN D++ GL+TVLEVPIPEEMFS       A+ WQ M +W+K+   DK S       +   R  E++ LL +VG+PLIPL +      + + +KD
Subjt:  MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKS-NHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKD

Query:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG
          I+AS AKYIVQQY+AA GG  ALN+++SM   G+VKM ASEF  G+      + V  K+N      EMGGFV+WQK P+LWCLEL++SGCK+  GS+G
Subjt:  NPIEASMAKYIVQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDG

Query:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL
        +++WR +    + AS G PRPLRRFLQGLDP+STA LF ++TCIGEK IN EDCFILKLE    V  A+S  + EII HT+WGYFSQR+GLL+Q EDS L
Subjt:  KVAWRQTPWHHSHASRGPPRPLRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHL

Query:  LRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEE
        LR++     +E++FWET+ E+++ DYR +D VNIAH GKT+V++FR+GE +  H + +M E W IEEVDFN+ GLS+D FLPP++L+ E+
Subjt:  LRIKAGGSRNENIFWETTMETLIQDYRTIDGVNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGCTTTGCCCTAACTTCGACCGCGAATACGGTCTCGACACTGTTCTTGAGGTACCCATTCCGGAAGAGATGTTTTCTTCTAACAACACCAAAACTCATGCAAT
TTCATGGCAAGCCATGAAATCATGGGTTAAATCCAATCATGACAAGTCATCACATGTTACTTCAACTACGTCTGTATTTGGAGGTCGCAATGCCGAGATCCAGCTATTAC
TCGGCGTTGTAGGAGCTCCGTTAATTCCACTCCCCATCGCTTTTGATCAACAACCCATTACGCGCAACATCAAAGACAACCCAATTGAGGCGTCTATGGCCAAGTACATT
GTGCAACAATATGTAGCTGCAGTGGGAGGAGAACATGCGTTGAACTCAATTGATAGTATGTATGCTATGGGGAAAGTGAAGATGGCGGCGTCAGAGTTTTGGTCCGGTGA
AGGGTGTTTGAATGGAAAAGCGGCGGTGAAGGGGAAGAACAACGGGAAAGGCGGCGGCGGAGAGATGGGAGGATTTGTTGTTTGGCAGAAACGGCCGGAGTTATGGTGCT
TGGAATTGATGCTGTCCGGCTGTAAAATTAGCGCCGGAAGCGATGGGAAAGTGGCTTGGAGACAAACTCCATGGCATCACTCACATGCTTCTCGTGGCCCTCCTCGCCCC
TTGCGCCGATTCTTGCAGGGGTTGGATCCAAAATCGACGGCGACTCTATTCTCAAACTCCACCTGCATCGGCGAGAAAACGATCAACGACGAAGATTGCTTCATTCTAAA
ACTGGAAGCCGAATCTCCAGTCCTGAGGGCAAGGAGCAGTAGCAGCGTCGAAATAATCAGGCACACAGTTTGGGGATATTTCAGCCAAAGAACCGGCCTCCTCGTGCAGC
TTGAAGATTCGCATCTCCTCCGAATCAAAGCCGGCGGATCTCGAAACGAAAATATCTTCTGGGAAACGACAATGGAAACCCTAATTCAGGATTACAGAACGATCGACGGC
GTCAATATTGCACACGCCGGTAAAACAACCGTTTCGCTTTTCCGATTTGGGGAAACTGCAGAGGGGCATTCGAAAACGAAGATGGAAGAGAATTGGGAGATCGAAGAAGT
TGATTTCAATATCAAGGGTTTGTCGATGGACTTCTTTTTGCCTCCGAGTGATCTGAAGAAGGAGGAAGAAGGAGTTGGTGTGGTTACGAGTAACGGAAAGTTTCCGTTGA
CGATGCGGTGTTCGGCTGGGTCGAGGATTTTTTCGTCGAGAGTGGTGGCCATTGATCAGAGTGATGAAGATGAATCGGAGGGGGAGTAA
mRNA sequenceShow/hide mRNA sequence
GTTACATTTCCATTCTCCCTTAATTTTCTTCTCCTATTATTATTCTCTTTCTTCTCTCTTTCACTTCCAACTCCTCATTAATAATCATCTCTCTTACTACTACTCTCAAT
CCAACAACAACAAAAATTCATCCTTCCCTTCTCTTCTTTCTCTCCAATGAGGAAGCTTTGCCCTAACTTCGACCGCGAATACGGTCTCGACACTGTTCTTGAGGTACCCA
TTCCGGAAGAGATGTTTTCTTCTAACAACACCAAAACTCATGCAATTTCATGGCAAGCCATGAAATCATGGGTTAAATCCAATCATGACAAGTCATCACATGTTACTTCA
ACTACGTCTGTATTTGGAGGTCGCAATGCCGAGATCCAGCTATTACTCGGCGTTGTAGGAGCTCCGTTAATTCCACTCCCCATCGCTTTTGATCAACAACCCATTACGCG
CAACATCAAAGACAACCCAATTGAGGCGTCTATGGCCAAGTACATTGTGCAACAATATGTAGCTGCAGTGGGAGGAGAACATGCGTTGAACTCAATTGATAGTATGTATG
CTATGGGGAAAGTGAAGATGGCGGCGTCAGAGTTTTGGTCCGGTGAAGGGTGTTTGAATGGAAAAGCGGCGGTGAAGGGGAAGAACAACGGGAAAGGCGGCGGCGGAGAG
ATGGGAGGATTTGTTGTTTGGCAGAAACGGCCGGAGTTATGGTGCTTGGAATTGATGCTGTCCGGCTGTAAAATTAGCGCCGGAAGCGATGGGAAAGTGGCTTGGAGACA
AACTCCATGGCATCACTCACATGCTTCTCGTGGCCCTCCTCGCCCCTTGCGCCGATTCTTGCAGGGGTTGGATCCAAAATCGACGGCGACTCTATTCTCAAACTCCACCT
GCATCGGCGAGAAAACGATCAACGACGAAGATTGCTTCATTCTAAAACTGGAAGCCGAATCTCCAGTCCTGAGGGCAAGGAGCAGTAGCAGCGTCGAAATAATCAGGCAC
ACAGTTTGGGGATATTTCAGCCAAAGAACCGGCCTCCTCGTGCAGCTTGAAGATTCGCATCTCCTCCGAATCAAAGCCGGCGGATCTCGAAACGAAAATATCTTCTGGGA
AACGACAATGGAAACCCTAATTCAGGATTACAGAACGATCGACGGCGTCAATATTGCACACGCCGGTAAAACAACCGTTTCGCTTTTCCGATTTGGGGAAACTGCAGAGG
GGCATTCGAAAACGAAGATGGAAGAGAATTGGGAGATCGAAGAAGTTGATTTCAATATCAAGGGTTTGTCGATGGACTTCTTTTTGCCTCCGAGTGATCTGAAGAAGGAG
GAAGAAGGAGTTGGTGTGGTTACGAGTAACGGAAAGTTTCCGTTGACGATGCGGTGTTCGGCTGGGTCGAGGATTTTTTCGTCGAGAGTGGTGGCCATTGATCAGAGTGA
TGAAGATGAATCGGAGGGGGAGTAATCAGAGTGACGAAGATGAAGATTTCTGATCAATTGTATTAAAAGATTAAACTCAATTTTTGGAAATTTGTACATATAAACCATGG
TTAATATATATAAATATGGGACATTGACTTTTTTTACTAATTAGAAAGGCTAATGTGTTGTTTG
Protein sequenceShow/hide protein sequence
MRKLCPNFDREYGLDTVLEVPIPEEMFSSNNTKTHAISWQAMKSWVKSNHDKSSHVTSTTSVFGGRNAEIQLLLGVVGAPLIPLPIAFDQQPITRNIKDNPIEASMAKYI
VQQYVAAVGGEHALNSIDSMYAMGKVKMAASEFWSGEGCLNGKAAVKGKNNGKGGGGEMGGFVVWQKRPELWCLELMLSGCKISAGSDGKVAWRQTPWHHSHASRGPPRP
LRRFLQGLDPKSTATLFSNSTCIGEKTINDEDCFILKLEAESPVLRARSSSSVEIIRHTVWGYFSQRTGLLVQLEDSHLLRIKAGGSRNENIFWETTMETLIQDYRTIDG
VNIAHAGKTTVSLFRFGETAEGHSKTKMEENWEIEEVDFNIKGLSMDFFLPPSDLKKEEEGVGVVTSNGKFPLTMRCSAGSRIFSSRVVAIDQSDEDESEGE