CuGenDBv2

Lag0024599 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024599
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr10:4291350..4294797
RNA-Seq Expression: Lag0024599
Synteny: Lag0024599
Gene Ontology terms: NA
InterPro domains: NA
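
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span can be parsed and measured in Python. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a 'chr10:4291350..4294797' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr10:4291350..4294797")
# Assumption: 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the `+ 1` would be dropped, so the result depends on the assumption stated above.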


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137536.1 uncharacterized protein LOC111008961 [Momordica charantia]; e value: 1.9e-266; % identity: 84.26
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGA  AS DDS STNRSSIHRRSRSLSRFSHP+PSSPVDK F E  ARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS

Query:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR
        SESSGA N AAASHRRGRSVSRHG +K+S  G++GKGR NYS G    VPE+NSRRRRS+SVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQ     
Subjt:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR

Query:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL
                HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNN +EKTIR+IYARK KQ NGDVVDNGLYEAMRKELR+AVEEIRVEL
Subjt:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL

Query:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS
        EQEMVNRNSSVGTFSDDL+S+D GVL+ TSPF RNYSTKQEQSEKRRDSLAKMV+EEQRGQ+LPKMVK+LP DLKNVVAENS R RKRSNDR+RMSKRLS
Subjt:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS

Query:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ
        EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNF++PA S++VPPGMDGVL PWLQWETSNDA+ YPRKN  EPPMTPQTFPWD NQE++N Q
Subjt:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ

Query:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLE-SSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        DQ NHS SSQGSWSPGV  G+ GKVVE++GSRFKK GNYQNQS LE   +SRFDI+EYLKRPS+E+FLLERWKQQH+ NCSGLLLCNR+FL
Subjt:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLE-SSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

XP_022994153.1 uncharacterized protein LOC111489977 [Cucurbita maxima]; e value: 1.0e-264; % identity: 84.63
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGASV+SNDDS STNRSSIHRRSRSLSRFSHPLPSSPVDKGF EGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS

Query:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR
        SESS A +  A+S RRGRS+SRHGSAK+SGGG+EG+G+A+YS G    VPESNSRRRRS+SVVRYQISDSESDLD+SQ+SGT ++EKS+G GNKQ     
Subjt:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR

Query:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL
               +HKAD+SNRRP LRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNV EKTIR+I ARK KQLNGDVVDNGLYEAMRKELR+AVEEIRVEL
Subjt:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL

Query:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS
        EQEMVNRNSSVGT SDDLHSSD GV +HTSPFKRNYSTKQEQS+KRRDSLAKMV++EQR QELPK VK+  PDL NVVAENSSR RKRSNDRSRMSKRL+
Subjt:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS

Query:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ
        EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPAAS++VPPGMDGVL PWLQWETSNDATPYPRKN I P MTPQ FPWD NQEASNAQ
Subjt:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ

Query:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESR--FDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        D  NHS SSQGSWSPGV   LSGKVVE++GSRFKKTG YQNQS LES ESR  FDIDEYLKRPSNE+FLLERWKQQHKIN  GLLLCN +FL
Subjt:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESR--FDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

XP_023519165.1 uncharacterized protein LOC111782615 [Cucurbita pepo subsp. pepo]; e value: 1.1e-263; % identity: 84.63
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNT-SRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSP DKGF EGSAR+GRFVNT SRGSGFPEISLDDLAVEFFGSGDRGRSAAR
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNT-SRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT
        SSESSGA N A AS+RRGRSVSRH  AK SGG NEGKGRA+YS G    VPESNSRRRRSLSVVRYQISDSESD+DRSQNSG RVKEKS+GIGNKQ    
Subjt:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT

Query:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDN-GLYEAMRKELRNAVEEIRV
                +HKADDSNRRP+L+RSLSQNDFKCHDGYSSQSSVLTDDEGKD YFGN VME+TIRSIYARK KQ NG VVDN GLYEAMRKELR+AVEEIRV
Subjt:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDN-GLYEAMRKELRNAVEEIRV

Query:  ELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKR
        ELEQEMVNRNSSVGTFSDDLHSSD G+ +HTSPF RNYSTKQEQS KRRDSL KMV EEQRGQE PKM+K+LPPDLKN VAEN SRTRKRS DRSRMSKR
Subjt:  ELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKR

Query:  LSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASN
        LSEEAEKYIEDFISNVEDTDISSLDGDRSD SSSL GK KPNF+I AAS+HVPPGMDGVLFPWLQWET NDATPYPRK+MIEPP TPQTFP D NQ+  N
Subjt:  LSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASN

Query:  AQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        AQD CN SASSQGSWSPGV   LSGK V++I SRFK+ GNYQNQS LES ESRFD+DEYLKRPSNE+FLLERWKQQH+ N SGLLLC+RLFL
Subjt:  AQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

XP_023542617.1 uncharacterized protein LOC111802468 [Cucurbita pepo subsp. pepo]; e value: 2.2e-262; % identity: 84.46
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGASVASNDDS STNRSSIHRRSRSLSRFSHPLPSSPVDKGF EGSARRGRFVNTSRGSGFPEISLDDLA+EFFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS

Query:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR
        SESS A +  A+S RRGRSVSRHGSAK+SGGG++GKG+A+YS G    VPESNSRRRRS+SVVRYQISDSESDLD+SQ+SGTR++EKS G GNKQ     
Subjt:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR

Query:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL
               +HKAD+SNRRP LRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNV EKTIR+I ARK KQLNGDVVDNGLYEAMRKELR+AVEEIRVEL
Subjt:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL

Query:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS
        EQEMVNRNSSVGT SDDLHSSD GV +H+SPFKRNYSTKQEQS+KRRDSLAKMV++EQR QELPK VK+  PDL NVVAENSSR RKRSNDRSRMSKRL+
Subjt:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS

Query:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ
        EEAEKYIED+ISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPA S+ VPPGMDGVL PWLQWETSNDATPYPRKN I P +TPQ+FPWDANQEASNAQ
Subjt:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ

Query:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESR--FDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        D  NHS SSQGSWSPGV   LSGK VE+IGSRFKKTG YQNQS LES ESR  FDIDEYLKRPSNEEFLLERWKQQHK+N  GLLLCN +FL
Subjt:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESR--FDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

XP_038893835.1 uncharacterized protein LOC120082652 isoform X1 [Benincasa hispida]; e value: 8.8e-272; % identity: 87.63
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGASV SNDDSASTNRSSIHRRSRSLSRFSHPLPSSP+DKGF E SA RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS

Query:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRA--NYSAG--VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR
         ESSGA N A AS+RRGRSVSRHGSAK++GGG+EGKGRA    S G  VPESNSRRRRSLSVVRYQISDSESDLDRSQ+SGTRVKE SFGIGNKQ     
Subjt:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRA--NYSAG--VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR

Query:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL
               +HKADDSNRRPTLRRSLSQNDFKCHDGYSS SSVLTDDEGKDAYFGN+VMEKTIRSIYARK KQ NG VVDNGLYEAMRKELR+AVEEIRVEL
Subjt:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL

Query:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS
        EQEMVNRNSSV TFSDDLHSSD GV +HTSPF RNYSTKQEQSEKRR+SL KMVMEEQRGQELPKMVK+LPPD+KN VAENSSRTRKRSNDRSRMSKRLS
Subjt:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS

Query:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ
        EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGK KPNF+I AASR VPPGMDGVL PWLQWETSNDATPYPRKNM EPP TPQTFPWD NQ+ SN Q
Subjt:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ

Query:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        D CNHS SSQGSWSPGVT G+S KVVE+IGSRFKK GNYQNQS LES ESRFDIDEYLKRPSNE FLLERWKQQHKINCSGLLLCNR+FL
Subjt:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

TrEMBL top hits (e value, % identity, alignment)
A0A5D3BJY3 Uncharacterized protein; e value: 2.9e-260; % identity: 83.25
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGASV SNDDS STNR S HRRSRSLSRFSHPLPSSP+DK F E SA  RGRFVNTSRGSGFPEISLDDLAVEFFGS DRGRS  R
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSAR-RGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT
        SSE SGA N + AS+RRGRSVSRHG  K+SGGG+E KGR   S      VPESNSRRRRSLSVVRYQISDSESD DRSQ+SGTRV+EKSFGIGNKQ    
Subjt:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT

Query:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVE
                +HKADDSNRRPTLRRSLSQNDFKCHDGYSS SSVLTDDEGKDA+FGN+V+EKT+RSIYARK KQ NG VVD+GLYEAMRKELR+AVEEIRVE
Subjt:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVE

Query:  LEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRL
        LEQEMVNRNSSV TFSDDLHSSD GV  HTSPF RNYS KQEQSEKRRDSL KMVME+QRGQ+LPKMVK+LPPDLKNVVA+NSSR+RKRS DRSRMSKRL
Subjt:  LEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRL

Query:  SEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNA
        SEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGK KPNF+IPAA R+VPPGMDGVL PWLQWETSNDATPYPRKNM EPP TPQT PWD NQ+ SNA
Subjt:  SEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNA

Query:  QDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
         D CNHS SSQGSWSPGVT GLSGKVVE+IGSRFK+ GN Q QS  ES ESRFDIDEYLKRPSNE+FLLERWKQQHKI CSG+LLCNR+FL
Subjt:  QDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

A0A6J1C6X2 uncharacterized protein LOC111008961; e value: 9.1e-267; % identity: 84.26
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGA  AS DDS STNRSSIHRRSRSLSRFSHP+PSSPVDK F E  ARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS

Query:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR
        SESSGA N AAASHRRGRSVSRHG +K+S  G++GKGR NYS G    VPE+NSRRRRS+SVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQ     
Subjt:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR

Query:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL
                HKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNN +EKTIR+IYARK KQ NGDVVDNGLYEAMRKELR+AVEEIRVEL
Subjt:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL

Query:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS
        EQEMVNRNSSVGTFSDDL+S+D GVL+ TSPF RNYSTKQEQSEKRRDSLAKMV+EEQRGQ+LPKMVK+LP DLKNVVAENS R RKRSNDR+RMSKRLS
Subjt:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS

Query:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ
        EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNF++PA S++VPPGMDGVL PWLQWETSNDA+ YPRKN  EPPMTPQTFPWD NQE++N Q
Subjt:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ

Query:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLE-SSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        DQ NHS SSQGSWSPGV  G+ GKVVE++GSRFKK GNYQNQS LE   +SRFDI+EYLKRPS+E+FLLERWKQQH+ NCSGLLLCNR+FL
Subjt:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLE-SSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

A0A6J1EA51 uncharacterized protein LOC111431277; e value: 5.7e-261; % identity: 84.12
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNT-SRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPL SSP DKGF EGSAR+GRFVNT SRGSGFPEISLDDLAVEFFGSGDRGRSAAR
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNT-SRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT
        SSESSGA N A AS+RRGRSVSRH  AK +GG NEGKGRA+YS G    VPESNSRRRRSLSVVRYQISDSESD+DRSQNSG RVKEKS+GIGNKQ    
Subjt:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT

Query:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDN-GLYEAMRKELRNAVEEIRV
                +HKAD+SNRRP+L+RSLSQNDFKCHDGYSSQSSVLTDDEGKD YFGN VME+TIR IYARK KQ N DVVDN GLYEAMRKELR+AVEEIRV
Subjt:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDN-GLYEAMRKELRNAVEEIRV

Query:  ELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKR
        ELEQEMVNRNSSVGTFSDDLHSSD G+ +HTSPF RNYSTKQEQS KRRDSL KMV EEQ GQE PKMVK+LPPDLKN VAEN SRTRKRS DRSRMSKR
Subjt:  ELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKR

Query:  LSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASN
        LSEEAEKYIEDFISNVEDTDISSLDGDRSD SSSL GK KPNF+I AAS+HVPPGMDGVLFPWLQWET NDATPYPRK+MIEPP T QTFP D NQ+ SN
Subjt:  LSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASN

Query:  AQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        AQD CN SASSQGSWSPGV   LSGK V++I SRFK  GNYQNQS LES ESRFD+DEYLKRPSNE+FLLERWKQQH+ NCSGLLLC+RLFL
Subjt:  AQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

A0A6J1K4C9 uncharacterized protein LOC111489977; e value: 5.0e-265; % identity: 84.63
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGASV+SNDDS STNRSSIHRRSRSLSRFSHPLPSSPVDKGF EGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARS

Query:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR
        SESS A +  A+S RRGRS+SRHGSAK+SGGG+EG+G+A+YS G    VPESNSRRRRS+SVVRYQISDSESDLD+SQ+SGT ++EKS+G GNKQ     
Subjt:  SESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTR

Query:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL
               +HKAD+SNRRP LRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNV EKTIR+I ARK KQLNGDVVDNGLYEAMRKELR+AVEEIRVEL
Subjt:  VKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVEL

Query:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS
        EQEMVNRNSSVGT SDDLHSSD GV +HTSPFKRNYSTKQEQS+KRRDSLAKMV++EQR QELPK VK+  PDL NVVAENSSR RKRSNDRSRMSKRL+
Subjt:  EQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLS

Query:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ
        EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPAAS++VPPGMDGVL PWLQWETSNDATPYPRKN I P MTPQ FPWD NQEASNAQ
Subjt:  EEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQ

Query:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESR--FDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        D  NHS SSQGSWSPGV   LSGKVVE++GSRFKKTG YQNQS LES ESR  FDIDEYLKRPSNE+FLLERWKQQHKIN  GLLLCN +FL
Subjt:  DQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESR--FDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

A0A6J1KGU0 uncharacterized protein LOC111495137; e value: 1.4e-262; % identity: 84.12
Query:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNT-SRGSGFPEISLDDLAVEFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSP DKGF EGS R+GRFVNT SRGSGFPEISLDDLAVEFFGSGDRGRSAAR
Subjt:  MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNT-SRGSGFPEISLDDLAVEFFGSGDRGRSAAR

Query:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT
        SSESSGA N A AS+RRGRSVSRH  AK SGG NEGKGRA+YS G    VPESNSRRRRSLSVVRYQISDSESD+DRSQNSG RVKEKS+GIGNKQ    
Subjt:  SSESSGAANDAAASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAG----VPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGT

Query:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVD-NGLYEAMRKELRNAVEEIRV
                +HKADDSNRRP+L+RSLSQNDFKCHDGYSSQSSVLTDDEGKD YFGN VME+TIRSIYARK KQ NGDVVD  GLYEAMRKELR+AVEEIRV
Subjt:  RVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVD-NGLYEAMRKELRNAVEEIRV

Query:  ELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKR
        ELEQEMVNRNSSV TFSDDLHSS+ G+ +HTSPF RNYSTKQEQS KRRDS  KMV EEQRGQE PKMVK+LPPDLKN VAEN SRTRKRS DRSRMSKR
Subjt:  ELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKR

Query:  LSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASN
        LSEEAEKYIEDFISNVEDTDISSLDGDRSD SSSL GK KPNF+I AAS+H+PPGMDGVLFPWLQWET NDATPYPRK+MIEPP TPQTFP D NQ+ SN
Subjt:  LSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASN

Query:  AQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
        AQD CN SASSQGSWSPGV   LSGK V++I SRFK+ GNYQNQS LES ESRFD+DEYLKRPSNE+FLLERWKQQH+ NCSGLLLC+R+FL
Subjt:  AQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G50350.1 unknown protein; e value: 2.9e-79; % identity: 37.88
Query:  MASSAFKSTTKR-TPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFF---------GS
        MA+SAF ST KR T +  S  S DDS+ + R S  RR RSLSRFSH +P   ++        R+G+FVNT RGSGF EISLDDLAVEFF          S
Subjt:  MASSAFKSTTKR-TPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFF---------GS

Query:  GDRGRSAARSSESSGAANDAAASHRRGRSVSRHGSAKSSGG-----------------------GNEG------------------------KGRANYSA
        G+RGRS  R+S   G   +   S RRGRSVSR GS   +GG                        N G                        + +   S 
Subjt:  GDRGRSAARSSESSGAANDAAASHRRGRSVSRHGSAKSSGG-----------------------GNEG------------------------KGRANYSA

Query:  GVPE-----SNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTRVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVL
        GV E     +NSRRRRSLSVVR +I +SESD+D+ Q S +    KSF  G         K +NSG+ K+  S+ R  LRRS SQN  K HDGYSSQSS +
Subjt:  GVPE-----SNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTRVKEKNSGTHKADDSNRRPTLRRSLSQNDFKCHDGYSSQSSVL

Query:  TDDEGKDAYFGNNVMEKTIRSIYAR-KGKQLNGDVVDNGLYEAMRKELRNAVEEIRVELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTK-Q
        TDDEGKD+    +  E+ IR++YA+ K      + + N  Y + RK L +              NR  S  T                    + Y+TK Q
Subjt:  TDDEGKDAYFGNNVMEKTIRSIYAR-KGKQLNGDVVDNGLYEAMRKELRNAVEEIRVELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKRNYSTK-Q

Query:  EQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRS-RMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKP
        E  E++R+ LA++++EEQRG+EL   +K +  +  +   E   RTRKRS DRS RMS  L++EAE++I++FISN+EDTD SSL+ +RS++SSS G     
Subjt:  EQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRS-RMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKP

Query:  NFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTP--QTFPW--DANQEASNAQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKK
        + Q  +A +     MDGV+ PWLQWET + +       +I+ P TP  ++  W  D  Q+AS+ +     + SS+GSWSP                    
Subjt:  NFQIPAASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTP--QTFPW--DANQEASNAQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKK

Query:  TGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNR
          +   +  + +   + D+ EYLKRP++ + L E WK +H+I+   L+LC+R
Subjt:  TGNYQNQSCLESSESRFDIDEYLKRPSNEEFLLERWKQQHKINCSGLLLCNR
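
In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST-style convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. That reading is an assumption, since the page does not define the notation. A minimal Python sketch with a hypothetical helper `midline_stats` tallies identities and positives for one stretch of alignment:

```python
def midline_stats(midline, length):
    """Count identities and positives in a BLAST-style midline.

    `length` is the number of aligned columns; it is passed explicitly
    because midlines copied from a page often lose trailing spaces.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent_identity = 100.0 * identities / length
    return identities, positives, percent_identity

# Last six columns of the first GenBank alignment: query 'CNRLFL', midline 'CNR+FL'.
query = "CNRLFL"
identities, positives, pct = midline_stats("CNR+FL", len(query))
print(identities, positives, round(pct, 2))
```

Applied to a whole alignment rather than a six-column excerpt, the same count gives a figure comparable to the % identity reported for each hit, provided the full alignment length (including gap columns) is used as the denominator.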


Sequences
CDS sequence
ATGGCGTCCTCTGCTTTCAAATCGACCACGAAACGGACGCCGATCGGAGCATCGGTGGCTTCAAACGACGACTCCGCTTCCACCAATCGGAGCTCGATTCACCGCCGCTC
TCGAAGTCTGAGCCGGTTTTCTCACCCTCTGCCGTCGTCTCCCGTCGATAAGGGCTTTGTTGAGGGTTCGGCTCGGCGGGGTAGGTTCGTCAACACGTCGAGAGGCTCCG
GATTCCCTGAGATCAGTCTCGATGACTTAGCGGTTGAATTCTTTGGTTCTGGTGATCGAGGGCGCTCCGCTGCGCGGAGCTCCGAGTCGAGTGGTGCTGCGAATGATGCT
GCGGCTTCGCATAGACGTGGGAGGTCGGTGTCGAGACACGGTAGCGCTAAAAGTAGTGGCGGTGGTAATGAAGGCAAAGGAAGAGCCAATTATAGCGCTGGTGTTCCTGA
AAGTAATTCGAGAAGAAGACGCTCTCTCTCGGTGGTTCGCTACCAGATTAGCGATTCGGAGAGTGATCTTGATCGATCTCAGAATTCTGGTACTCGTGTCAAAGAAAAGA
GCTTCGGTATTGGAAATAAGCAGAATTCTGGTACTCGTGTCAAAGAAAAGAATTCTGGTACTCATAAGGCTGATGATTCAAACCGTAGGCCAACATTGCGAAGATCTCTT
AGCCAGAATGATTTTAAGTGCCATGATGGCTATTCAAGCCAGTCTTCAGTTCTAACTGATGATGAAGGGAAGGACGCTTACTTTGGTAATAATGTAATGGAGAAGACTAT
TCGATCAATTTATGCAAGAAAGGGAAAGCAGCTCAACGGGGATGTTGTTGACAATGGCTTGTATGAAGCAATGCGGAAAGAACTTAGAAATGCTGTTGAAGAGATAAGGG
TGGAACTTGAGCAGGAAATGGTCAACAGAAATTCATCTGTTGGAACTTTCAGTGATGACTTGCATTCAAGTGATTTGGGTGTTCTTAAGCATACATCTCCATTTAAAAGA
AATTATTCAACAAAACAAGAACAGTCGGAGAAGCGCAGAGATTCGTTGGCTAAGATGGTGATGGAGGAGCAACGTGGTCAAGAACTTCCAAAGATGGTTAAACATTTGCC
TCCTGATCTGAAGAACGTTGTTGCAGAAAACTCCTCACGAACCAGAAAGAGGAGCAATGATCGAAGTAGGATGTCTAAAAGATTGAGTGAGGAGGCAGAAAAATACATCG
AGGACTTCATTTCCAATGTTGAAGATACAGACATTTCATCTCTTGATGGCGATAGGAGTGACACAAGTTCGTCTTTAGGGGGAAAAACAAAACCAAATTTCCAAATTCCA
GCAGCCAGCAGACATGTTCCTCCTGGAATGGATGGTGTCCTATTTCCGTGGTTGCAATGGGAAACCAGTAACGATGCTACTCCTTATCCTCGAAAGAATATGATCGAACC
ACCTATGACTCCACAAACTTTTCCATGGGATGCAAATCAGGAAGCAAGCAATGCACAAGATCAATGTAACCATTCTGCTAGCAGCCAAGGGAGTTGGAGCCCTGGAGTTA
CCTCTGGCCTTTCTGGGAAAGTTGTTGAAGAAATAGGAAGCAGATTCAAGAAAACTGGTAATTATCAGAACCAGTCCTGTTTGGAATCAAGCGAATCTCGGTTCGATATA
GACGAGTATCTAAAGCGTCCGAGCAATGAAGAGTTTCTCCTAGAAAGATGGAAGCAGCAACACAAAATCAACTGCAGTGGTCTTTTGCTCTGTAATCGTTTATTTTTATA
G
mRNA sequence
ATGGCGTCCTCTGCTTTCAAATCGACCACGAAACGGACGCCGATCGGAGCATCGGTGGCTTCAAACGACGACTCCGCTTCCACCAATCGGAGCTCGATTCACCGCCGCTC
TCGAAGTCTGAGCCGGTTTTCTCACCCTCTGCCGTCGTCTCCCGTCGATAAGGGCTTTGTTGAGGGTTCGGCTCGGCGGGGTAGGTTCGTCAACACGTCGAGAGGCTCCG
GATTCCCTGAGATCAGTCTCGATGACTTAGCGGTTGAATTCTTTGGTTCTGGTGATCGAGGGCGCTCCGCTGCGCGGAGCTCCGAGTCGAGTGGTGCTGCGAATGATGCT
GCGGCTTCGCATAGACGTGGGAGGTCGGTGTCGAGACACGGTAGCGCTAAAAGTAGTGGCGGTGGTAATGAAGGCAAAGGAAGAGCCAATTATAGCGCTGGTGTTCCTGA
AAGTAATTCGAGAAGAAGACGCTCTCTCTCGGTGGTTCGCTACCAGATTAGCGATTCGGAGAGTGATCTTGATCGATCTCAGAATTCTGGTACTCGTGTCAAAGAAAAGA
GCTTCGGTATTGGAAATAAGCAGAATTCTGGTACTCGTGTCAAAGAAAAGAATTCTGGTACTCATAAGGCTGATGATTCAAACCGTAGGCCAACATTGCGAAGATCTCTT
AGCCAGAATGATTTTAAGTGCCATGATGGCTATTCAAGCCAGTCTTCAGTTCTAACTGATGATGAAGGGAAGGACGCTTACTTTGGTAATAATGTAATGGAGAAGACTAT
TCGATCAATTTATGCAAGAAAGGGAAAGCAGCTCAACGGGGATGTTGTTGACAATGGCTTGTATGAAGCAATGCGGAAAGAACTTAGAAATGCTGTTGAAGAGATAAGGG
TGGAACTTGAGCAGGAAATGGTCAACAGAAATTCATCTGTTGGAACTTTCAGTGATGACTTGCATTCAAGTGATTTGGGTGTTCTTAAGCATACATCTCCATTTAAAAGA
AATTATTCAACAAAACAAGAACAGTCGGAGAAGCGCAGAGATTCGTTGGCTAAGATGGTGATGGAGGAGCAACGTGGTCAAGAACTTCCAAAGATGGTTAAACATTTGCC
TCCTGATCTGAAGAACGTTGTTGCAGAAAACTCCTCACGAACCAGAAAGAGGAGCAATGATCGAAGTAGGATGTCTAAAAGATTGAGTGAGGAGGCAGAAAAATACATCG
AGGACTTCATTTCCAATGTTGAAGATACAGACATTTCATCTCTTGATGGCGATAGGAGTGACACAAGTTCGTCTTTAGGGGGAAAAACAAAACCAAATTTCCAAATTCCA
GCAGCCAGCAGACATGTTCCTCCTGGAATGGATGGTGTCCTATTTCCGTGGTTGCAATGGGAAACCAGTAACGATGCTACTCCTTATCCTCGAAAGAATATGATCGAACC
ACCTATGACTCCACAAACTTTTCCATGGGATGCAAATCAGGAAGCAAGCAATGCACAAGATCAATGTAACCATTCTGCTAGCAGCCAAGGGAGTTGGAGCCCTGGAGTTA
CCTCTGGCCTTTCTGGGAAAGTTGTTGAAGAAATAGGAAGCAGATTCAAGAAAACTGGTAATTATCAGAACCAGTCCTGTTTGGAATCAAGCGAATCTCGGTTCGATATA
GACGAGTATCTAAAGCGTCCGAGCAATGAAGAGTTTCTCCTAGAAAGATGGAAGCAGCAACACAAAATCAACTGCAGTGGTCTTTTGCTCTGTAATCGTTTATTTTTATA
G
Protein sequence
MASSAFKSTTKRTPIGASVASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPVDKGFVEGSARRGRFVNTSRGSGFPEISLDDLAVEFFGSGDRGRSAARSSESSGAANDA
AASHRRGRSVSRHGSAKSSGGGNEGKGRANYSAGVPESNSRRRRSLSVVRYQISDSESDLDRSQNSGTRVKEKSFGIGNKQNSGTRVKEKNSGTHKADDSNRRPTLRRSL
SQNDFKCHDGYSSQSSVLTDDEGKDAYFGNNVMEKTIRSIYARKGKQLNGDVVDNGLYEAMRKELRNAVEEIRVELEQEMVNRNSSVGTFSDDLHSSDLGVLKHTSPFKR
NYSTKQEQSEKRRDSLAKMVMEEQRGQELPKMVKHLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFQIP
AASRHVPPGMDGVLFPWLQWETSNDATPYPRKNMIEPPMTPQTFPWDANQEASNAQDQCNHSASSQGSWSPGVTSGLSGKVVEEIGSRFKKTGNYQNQSCLESSESRFDI
DEYLKRPSNEEFLLERWKQQHKINCSGLLLCNRLFL
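
The protein sequence above should correspond to a translation of the CDS. A minimal Python sketch, assuming the standard genetic code (the page does not say which translation table was used) and a hypothetical `translate` helper, checks the first ten codons and the final four codons of the CDS against the two ends of the protein:

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code built in TCAG order; '*' marks a stop codon.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS, then its last 12 (three codons plus the stop).
print(translate("ATGGCGTCCTCTGCTTTCAAATCGACCACG"))
print(translate("TTATTTTTATAG"))
```

Passing the full CDS, joined into a single string without line breaks, to `translate` would allow a comparison against the complete protein sequence.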