; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026831 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026831
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGermin-like protein 5-1
Genome locationtig00153047:1430434..1440747
RNA-Seq ExpressionSgr026831
SyntenySgr026831
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006952 - defense response (biological process)
GO:0009396 - folic acid-containing compound biosynthetic process (biological process)
GO:0035999 - tetrahydrofolate interconversion (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0048046 - apoplast (cellular component)
GO:0031213 - RSF complex (cellular component)
GO:0030272 - 5-formyltetrahydrofolate cyclo-ligase activity (molecular function)
GO:0030145 - manganese ion binding (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR001929 - Germin
IPR002698 - 5-formyltetrahydrofolate cyclo-ligase
IPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR024185 - 5-formyltetrahydrofolate cyclo-ligase-like domain superfamily
IPR037171 - NagB/RpiA transferase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH03880.1 5-formyltetrahydrofolate cycloligase [Prunus dulcis]1.1e-16560.23Show/hide
Query:  NGSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYV
        N + + +  D L  IF+QKR LRS VRKAL+AMDP+LRSHEDN IQSI+LEA WF+S QRLCAY+ CSALREVDTS +LS ILQ P K     + +K+YV
Subjt:  NGSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYV

Query:  PRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALS
        PRVEDKN HMRM NIS +DDL+ANSMNILEPAP+D+DGNEREDV+Q +DPVDLFLLPGLAFD+SGRRLGRGGGYYDTFLKNY ELAK RNWKQPLLVALS
Subjt:  PRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALS

Query:  YSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQN-----------------------------------------------------------
        YSVQI+DEGV P+TP+D+ VDALVSP+G IPIS A LD  +                                                           
Subjt:  YSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQN-----------------------------------------------------------

Query:  CSATMIAFTNSPVNM--LGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLN
        CS  M+  + S + +  L   ++LLLLP PS  ADPDPL D CVA+L A+ S + FPCK  SEVT+DDF FDGLSK+GN  N FG  +T GNVL+FPGLN
Subjt:  CSATMIAFTNSPVNM--LGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLN

Query:  TLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASN
        TLGLSMNRVD  PGGIN PHSHPRASE  +VI+G +L G VTT NVYY KV TAGQ+F +PRGLVHF+ N+G +KA+  TAFNS LPG+ ++  +LFA+ 
Subjt:  TLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASN

Query:  PSIPVEVLTKTFQVDDGVINSIK
        PSIP+EVLTKT+ VD+  IN++K
Subjt:  PSIPVEVLTKTFQVDDGVINSIK

KAA8540894.1 hypothetical protein F0562_024968 [Nyssa sinensis]5.7e-14362.04Show/hide
Query:  RAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKG--VWLFEKIYVPRVEDKNSHMRMFNISRMDDLIANSMNILE
        R   P+ RS EDN IQS +LEA WFKSS+RLCAY+SCSALREVDTS++LS+ILQ+P       + + +YVPRV+D+NSHMRM NIS +DDLIANSMNILE
Subjt:  RAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKG--VWLFEKIYVPRVEDKNSHMRMFNISRMDDLIANSMNILE

Query:  PAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGGI
        PAPVD+DG EREDVM  ++PVDL LLPGLAFDKSGRRLGRGGGYYDTFL  Y ELA  R WKQPLLVALSYS+QIMD+G IP+TPNDV VDALV+PS   
Subjt:  PAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGGI

Query:  PISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQG
                                                              DL A++S+NGFPCKP + VTS+DFFFDGLS+E NT N FG  +T G
Subjt:  PISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQG

Query:  NVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVI
        +VL FPGLNTLG+SMNRVD APGG+N PH HPR++ES VVI+GK+LVGFV+T NV+Y K++TAGQMF+IP+GLVHFQ NVG  KA+  TAFNSQLPG  +
Subjt:  NVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVI

Query:  VSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        +  TLFAS P IP +VLTK FQV D VINSIK
Subjt:  VSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

KAA8540899.1 hypothetical protein F0562_024963 [Nyssa sinensis]1.7e-13961.85Show/hide
Query:  EDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKG--VWLFEKIYVPRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNE
        ++N IQS +LEA WFKSS+RLCAY+SCSALREVDTS++LSEILQ+        + + +YVPRVED+NSHMRM NIS +DDLIANSMNILEPAPVD+DG E
Subjt:  EDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKG--VWLFEKIYVPRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNE

Query:  REDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQ
        REDVM  ++PVDL LLPGLAFDKSGRRLGRGGGYYDTFL  Y ELA  R WKQPLLVALSYS+QIMD+G IP+TPNDV VDAL++PS             
Subjt:  REDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQ

Query:  NCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNT
                                                    DL A++S+NGFPCKP + VTS+DFFFDGLS+E NT N FG  +T G+VL FPGLNT
Subjt:  NCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNT

Query:  LGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNP
        LG+S+NRVD APGG+N PH HPR++ES VVI+GK+LVGFV+T NV+Y K++TAG+MF+IP+GLVHFQ NVG  KA+  TAFNSQLPG  ++  TLFAS P
Subjt:  LGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNP

Query:  SIPVEVLTKTFQVDDGVINSIK
        SIP +VLTK FQV D VINSIK
Subjt:  SIPVEVLTKTFQVDDGVINSIK

KAF4352540.1 hypothetical protein F8388_012236 [Cannabis sativa]1.3e-17967.7Show/hide
Query:  DPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYVPRVEDKNSH
        D L AIF+QK+ +RS VRK+L+AMDPSLRSHED  +Q ++L A WFKS QRLCAY+SC ALREVDTS+LLSEIL++P K   + L +K+YVPRVEDKNSH
Subjt:  DPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYVPRVEDKNSH

Query:  MRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEG
        MRM NISR+DDLIANSM+ILEPA VDSDGNEREDVMQ NDPVDLF+LPGLAFD++GRRLGRGGGYYDTF++NY ELAK +NWKQPLLVALSYS QIM+EG
Subjt:  MRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEG

Query:  VIPLTPNDVSVDALVSPSGGIPISSAGLDN-----------------------------QNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQD
        VIP+T ND+ VDALVSPSG IPISSA LD+                             +N    M++ ++S   +L C + LLL   PS SADPDPLQD
Subjt:  VIPLTPNDVSVDALVSPSGGIPISSAGLDN-----------------------------QNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQD

Query:  FCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTD-NAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGF
        FCVA+LNAT S+NG+PCK VSE TS+DFFF GLSKEG+T+ N FGF +T GNV  FPGLNTLGLSMNRVD APGG+N PHSHPRA+E+ VVIKGK+LVGF
Subjt:  FCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTD-NAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGF

Query:  VTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        +TT+NV+Y KVLT G+MF+IPRGLVHFQ NVG  KA+ +TAFNSQLPGAV++   LFA+ P IP +VLTK FQV D V+NSIK
Subjt:  VTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

KGN63636.2 hypothetical protein Csa_013278 [Cucumis sativus]1.2e-20981.43Show/hide
Query:  MASNGSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIY
        MA+N S KPEN+D L+ IF+QKR LRSSVRKAL+AMDPS RS EDNVIQSI+LEA WFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK     +KIY
Subjt:  MASNGSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIY

Query:  VPRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVAL
        VPRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVD DGNEREDVMQT DP+DLFLLPGLAFDKSGRRLGRGGGYYDTFLKNY ELAKARNWKQPLLVAL
Subjt:  VPRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVAL

Query:  SYSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKP
        SYSVQIMDEG+IPLTPNDV VDALVSPSG IPISSAGLD    +  +  F         C   + +    S  ADPDPLQDFCVADLNAT+SLNGFPCKP
Subjt:  SYSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKP

Query:  VSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFII
         SEVT+DDFFFDGLSKEGNTDN FGF +TQGNVL FPGLNTLGLSMNRVDLA GGINAPHSHPRA+ES+VVIKGKVLVGFV+T++VYYYKVLT GQMF++
Subjt:  VSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFII

Query:  PRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        PRGLVHFQ+NVG  KA L+TAFNSQLPGAV+VSRTLFASNP +PVE+LTK FQVD  VINSIK
Subjt:  PRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

TrEMBL top hitse value%identityAlignment
A0A4Y1RHT9 5-formyltetrahydrofolate cycloligase5.2e-16660.23Show/hide
Query:  NGSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYV
        N + + +  D L  IF+QKR LRS VRKAL+AMDP+LRSHEDN IQSI+LEA WF+S QRLCAY+ CSALREVDTS +LS ILQ P K     + +K+YV
Subjt:  NGSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYV

Query:  PRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALS
        PRVEDKN HMRM NIS +DDL+ANSMNILEPAP+D+DGNEREDV+Q +DPVDLFLLPGLAFD+SGRRLGRGGGYYDTFLKNY ELAK RNWKQPLLVALS
Subjt:  PRVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALS

Query:  YSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQN-----------------------------------------------------------
        YSVQI+DEGV P+TP+D+ VDALVSP+G IPIS A LD  +                                                           
Subjt:  YSVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQN-----------------------------------------------------------

Query:  CSATMIAFTNSPVNM--LGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLN
        CS  M+  + S + +  L   ++LLLLP PS  ADPDPL D CVA+L A+ S + FPCK  SEVT+DDF FDGLSK+GN  N FG  +T GNVL+FPGLN
Subjt:  CSATMIAFTNSPVNM--LGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLN

Query:  TLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASN
        TLGLSMNRVD  PGGIN PHSHPRASE  +VI+G +L G VTT NVYY KV TAGQ+F +PRGLVHF+ N+G +KA+  TAFNS LPG+ ++  +LFA+ 
Subjt:  TLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASN

Query:  PSIPVEVLTKTFQVDDGVINSIK
        PSIP+EVLTKT+ VD+  IN++K
Subjt:  PSIPVEVLTKTFQVDDGVINSIK

A0A5J5BDZ0 Cupin type-1 domain-containing protein2.8e-14362.04Show/hide
Query:  RAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKG--VWLFEKIYVPRVEDKNSHMRMFNISRMDDLIANSMNILE
        R   P+ RS EDN IQS +LEA WFKSS+RLCAY+SCSALREVDTS++LS+ILQ+P       + + +YVPRV+D+NSHMRM NIS +DDLIANSMNILE
Subjt:  RAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKG--VWLFEKIYVPRVEDKNSHMRMFNISRMDDLIANSMNILE

Query:  PAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGGI
        PAPVD+DG EREDVM  ++PVDL LLPGLAFDKSGRRLGRGGGYYDTFL  Y ELA  R WKQPLLVALSYS+QIMD+G IP+TPNDV VDALV+PS   
Subjt:  PAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGGI

Query:  PISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQG
                                                              DL A++S+NGFPCKP + VTS+DFFFDGLS+E NT N FG  +T G
Subjt:  PISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQG

Query:  NVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVI
        +VL FPGLNTLG+SMNRVD APGG+N PH HPR++ES VVI+GK+LVGFV+T NV+Y K++TAGQMF+IP+GLVHFQ NVG  KA+  TAFNSQLPG  +
Subjt:  NVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVI

Query:  VSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        +  TLFAS P IP +VLTK FQV D VINSIK
Subjt:  VSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

A0A7J6E3V5 Cupin type-1 domain-containing protein6.3e-18067.7Show/hide
Query:  DPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYVPRVEDKNSH
        D L AIF+QK+ +RS VRK+L+AMDPSLRSHED  +Q ++L A WFKS QRLCAY+SC ALREVDTS+LLSEIL++P K   + L +K+YVPRVEDKNSH
Subjt:  DPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPK--GVWLFEKIYVPRVEDKNSH

Query:  MRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEG
        MRM NISR+DDLIANSM+ILEPA VDSDGNEREDVMQ NDPVDLF+LPGLAFD++GRRLGRGGGYYDTF++NY ELAK +NWKQPLLVALSYS QIM+EG
Subjt:  MRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEG

Query:  VIPLTPNDVSVDALVSPSGGIPISSAGLDN-----------------------------QNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQD
        VIP+T ND+ VDALVSPSG IPISSA LD+                             +N    M++ ++S   +L C + LLL   PS SADPDPLQD
Subjt:  VIPLTPNDVSVDALVSPSGGIPISSAGLDN-----------------------------QNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQD

Query:  FCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTD-NAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGF
        FCVA+LNAT S+NG+PCK VSE TS+DFFF GLSKEG+T+ N FGF +T GNV  FPGLNTLGLSMNRVD APGG+N PHSHPRA+E+ VVIKGK+LVGF
Subjt:  FCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTD-NAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGF

Query:  VTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        +TT+NV+Y KVLT G+MF+IPRGLVHFQ NVG  KA+ +TAFNSQLPGAV++   LFA+ P IP +VLTK FQV D V+NSIK
Subjt:  VTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

A0A803M1E7 Uncharacterized protein7.8e-14661.66Show/hide
Query:  MDPSLR---SHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFE--KIYVPRVEDKNSHMRMFNISRMDDLIANSMNIL
        +DP  +   S +++ IQ ++L+A WFKSS+RLCAY+SC ALREVDTS++L++ILQ   K     E   +YVPRVEDKNS++RM NIS MDDLIANSM+IL
Subjt:  MDPSLR---SHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFE--KIYVPRVEDKNSHMRMFNISRMDDLIANSMNIL

Query:  EPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGG
        EPAPVDS GN+RE+VM  ++ +DL LLPGLAFDKSGRRLGRGGGYYDTFL+NY +LA  +NWKQPLLVALSYSVQI+D+GVI +TPNDV VDALVSP+  
Subjt:  EPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVSVDALVSPSGG

Query:  IPISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQ
                                        + LL P+ S +ADPDPL DFC+ADLN++ +   FPCKP S VTS+DFFFDGL KEGNT N FG ++T 
Subjt:  IPISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQ

Query:  GNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAV
        GNVL FP LN LGLSMNR+D+AP G+N PH+HPR++ES VVI GKVLVGFVTT NV+Y KVL  GQMF+IPRGLVHF+ NVG  KAV++TAFNSQ PG V
Subjt:  GNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAV

Query:  IVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        +V  TLF + PSIP +VLT+TFQ+D  V++ I+
Subjt:  IVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

A0A803M1E8 Uncharacterized protein2.4e-16364.21Show/hide
Query:  GSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFE--KIYVP
        GS    +   L  IF+QKR +RS VRK+L+ MDPS R+ ED+ IQ ++L+A WFKSS+RLCAY+SC ALREVDTS++L++ILQ   K     E   +YVP
Subjt:  GSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFE--KIYVP

Query:  RVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSY
        RVEDKNS+MRM NIS MDDLIANSM+ILEPAPVDS GNERE+VM  ++ +DL LLPGLAFDKSGRRLGRGGGYYDTFL+NY +LA  +NWKQPLLVALSY
Subjt:  RVEDKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSY

Query:  SVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVS
        SVQIMD+GVIP+TPNDV VDALVSP+G IPISSA +       +        +  L C ++ LL P+ S +ADPDPL DFC+ADLN++ +   FPCKP S
Subjt:  SVQIMDEGVIPLTPNDVSVDALVSPSGGIPISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVS

Query:  EVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPR
         VTS+DFFFDGL KEGNT N FG ++T GNVL FP LN LGLSMNR+D+AP G+N PH+HPR++ES VVI GKVLVGFVTT NV+Y KVL  GQMF+IPR
Subjt:  EVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPR

Query:  GLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        GLVHF+ NVG  KAV++TAFNSQ PG V+V  TLF + PSI  +VL +TFQ+D  V++ I+
Subjt:  GLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

SwissProt top hitse value%identityAlignment
P92995 Germin-like protein subfamily T member 14.0e-6256.87Show/hide
Query:  PVNMLGCFIILL-LLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLA
        P  +L  F+++  L   PSLS+D DPLQDFCV DL A+ S+NGFPCK  S V++ DFF+ GL    +T N  G  +   NVL FPGLNTLG+SMN V+LA
Subjt:  PVNMLGCFIILL-LLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLA

Query:  PGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTF
        PGG+N PH HPRA+E   VI+G V VGF++TNN  + KVL AG+ F+IPRGLVHFQ+NVG  KA ++TAFNSQLPGAV++  TLF S P IP  VLT+ F
Subjt:  PGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTF

Query:  QVDDGVINSIK
        + DD  + ++K
Subjt:  QVDDGVINSIK

Q6I544 Germin-like protein 5-13.8e-5753.47Show/hide
Query:  IILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHS
        ++L LLP PS + DPD LQD CVADL + + +NGF CK  + VT DDF+F GL+  GNT+N +G  +T  NV   PGLNTLG+SM+R+D APGG+N PH+
Subjt:  IILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHS

Query:  HPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINS
        HPRA+E V V++G + VGF+TT N  Y K ++AG +F+ PRGL+HFQ N G + A +++AFNSQLPG   ++ TLFA++P +P  VLTK FQV    +  
Subjt:  HPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINS

Query:  IK
        IK
Subjt:  IK

Q75HJ4 Germin-like protein 3-81.5e-5655.05Show/hide
Query:  ADPDPLQDFCVADL--------NATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRA
        ADP+P+QDFCVA +         A  +  GFPCKP S V SDDFFF GL+   +TDN FGF +T  N   FPGLNTLG+S+ RVDLAPGG+N  HSHPRA
Subjt:  ADPDPLQDFCVADL--------NATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRA

Query:  SESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK
        +E + V+ G+VL GFV+T   +Y KVL  G+ F++PRG++HFQ+NVG   A ++TAFNSQ+PG V    TLF S+P IP  VL K+FQVD  +I  +K
Subjt:  SESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINSIK

Q8L539 5-formyltetrahydrofolate cyclo-ligase, mitochondrial3.1e-8364.19Show/hide
Query:  SRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIYVPRVE
        S   +N + L +IF+QKR +RS+VRK+L+AMDPSLR+ +D  IQ  +LEA WFKS + LCAY+SC +L EVDTS++LSEILQHP       +K+YVP VE
Subjt:  SRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIYVPRVE

Query:  DKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQ
        DKNS+MRM +IS M+DL+ANSMNILEPAPVD+ GN+REDV+Q ++P+DLF+LPGLAFD+ GRRLGRGGGYYDTFLK Y + AK + W+ PL+VALSYS Q
Subjt:  DKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQ

Query:  IMDEGVIPLTPNDVSVDALVSPSGGIPIS
        I+++G IP+TPNDV +DALV+PSG +PI+
Subjt:  IMDEGVIPLTPNDVSVDALVSPSGGIPIS

Q9LMC9 Germin-like protein subfamily T member 21.3e-6560.4Show/hide
Query:  IILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHS
        +++ +   PSLS+D DPLQDFCV DL A+ S+NGFPCK  S V++ DFFF GL    NT    G  ++  NVL FPGLNTLGLSMN V+ APGG+N PHS
Subjt:  IILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHS

Query:  HPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINS
        HPRA+E+ VVI+G V VGF+TTNN  + KVL AG+MF++PRGLVHFQ+NVG  KA L+T+FNSQLPG+ ++  TLF SNP+IP  VLTKTF+ DD  +N 
Subjt:  HPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINS

Query:  IK
        +K
Subjt:  IK

Arabidopsis top hitse value%identityAlignment
AT1G02335.1 germin-like protein subfamily 2 member 2 precursor5.3e-5450.49Show/hide
Query:  LGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGIN
        L C +I  +  Y     DPD LQD CVAD +    LNGFPCK    +T  DFFF G+SK    ++  G  +T  NV   PGLNTL +S+ R+D APGG+N
Subjt:  LGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGIN

Query:  APHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDG
         PH+HPRA+E V V++G++ VGF+TT N  + K +  G++F+ PRGLVHFQ N G S A +L+AFNSQLPG   V+ TLFA+ P++P +VLTKTFQV   
Subjt:  APHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDG

Query:  VINSIK
        +++ IK
Subjt:  VINSIK

AT1G18970.1 germin-like protein 42.8e-6356.87Show/hide
Query:  PVNMLGCFIILL-LLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLA
        P  +L  F+++  L   PSLS+D DPLQDFCV DL A+ S+NGFPCK  S V++ DFF+ GL    +T N  G  +   NVL FPGLNTLG+SMN V+LA
Subjt:  PVNMLGCFIILL-LLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLA

Query:  PGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTF
        PGG+N PH HPRA+E   VI+G V VGF++TNN  + KVL AG+ F+IPRGLVHFQ+NVG  KA ++TAFNSQLPGAV++  TLF S P IP  VLT+ F
Subjt:  PGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTF

Query:  QVDDGVINSIK
        + DD  + ++K
Subjt:  QVDDGVINSIK

AT1G18980.1 RmlC-like cupins superfamily protein9.3e-6760.4Show/hide
Query:  IILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHS
        +++ +   PSLS+D DPLQDFCV DL A+ S+NGFPCK  S V++ DFFF GL    NT    G  ++  NVL FPGLNTLGLSMN V+ APGG+N PHS
Subjt:  IILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQGNVLNFPGLNTLGLSMNRVDLAPGGINAPHS

Query:  HPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINS
        HPRA+E+ VVI+G V VGF+TTNN  + KVL AG+MF++PRGLVHFQ+NVG  KA L+T+FNSQLPG+ ++  TLF SNP+IP  VLTKTF+ DD  +N 
Subjt:  HPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASNPSIPVEVLTKTFQVDDGVINS

Query:  IK
        +K
Subjt:  IK

AT5G13050.1 5-formyltetrahydrofolate cycloligase2.2e-8464.19Show/hide
Query:  SRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIYVPRVE
        S   +N + L +IF+QKR +RS+VRK+L+AMDPSLR+ +D  IQ  +LEA WFKS + LCAY+SC +L EVDTS++LSEILQHP       +K+YVP VE
Subjt:  SRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIYVPRVE

Query:  DKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQ
        DKNS+MRM +IS M+DL+ANSMNILEPAPVD+ GN+REDV+Q ++P+DLF+LPGLAFD+ GRRLGRGGGYYDTFLK Y + AK + W+ PL+VALSYS Q
Subjt:  DKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQ

Query:  IMDEGVIPLTPNDVSVDALVSPSGGIPIS
        I+++G IP+TPNDV +DALV+PSG +PI+
Subjt:  IMDEGVIPLTPNDVSVDALVSPSGGIPIS

AT5G13050.2 5-formyltetrahydrofolate cycloligase1.2e-5864.07Show/hide
Query:  SRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIYVPRVE
        S   +N + L +IF+QKR +RS+VRK+L+AMDPSLR+ +D  IQ  +LEA WFKS + LCAY+SC +L EVDTS++LSEILQHP       +K+YVP VE
Subjt:  SRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIYVPRVE

Query:  DKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRG
        DKNS+MRM +IS M+DL+ANSMNILEPAPVD+ GN+REDV+Q ++P+DLF+LPGLAFD+ GRRLGRG
Subjt:  DKNSHMRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGTAACGGGAGTAGGAAACCAGAAAATGATGACCCATTGCAGGCCATATTCGAGCAGAAACGGACCCTTCGATCCAGCGTTCGAAAAGCCCTCAGAGCCATGGA
CCCCTCCCTCAGATCCCACGAAGATAATGTAATTCAGAGTATCATTTTGGAAGCTTCGTGGTTTAAATCTAGTCAGAGATTATGTGCTTATGTGAGTTGCAGTGCCTTAA
GAGAAGTTGATACGTCGAGACTGTTATCAGAGATTCTGCAACACCCACCAAAAGGTGTATGGTTATTCGAGAAGATTTATGTCCCGCGTGTGGAGGACAAGAATAGTCAC
ATGCGGATGTTCAATATTTCACGTATGGATGATCTCATCGCAAATTCAATGAATATCTTAGAACCAGCTCCTGTGGATAGTGATGGAAATGAACGTGAAGATGTTATGCA
GACAAATGACCCCGTTGATTTATTCCTCTTACCAGGACTAGCATTTGATAAATCAGGAAGACGACTTGGCCGAGGTGGAGGTTATTATGATACCTTCCTAAAGAACTACC
TAGAGCTTGCAAAGGCTCGGAATTGGAAGCAGCCCCTGCTTGTGGCACTGTCCTATTCAGTGCAGATAATGGATGAAGGAGTTATACCACTCACTCCAAATGACGTTTCG
GTCGATGCTCTTGTATCACCATCTGGAGGGATTCCCATCAGCTCGGCTGGATTAGACAACCAAAACTGCTCGGCCACCATGATTGCTTTCACAAACTCACCTGTAAACAT
GCTTGGTTGCTTCATTATATTGCTGCTTCTTCCTTATCCTTCTCTGTCAGCCGACCCCGATCCATTGCAGGATTTCTGCGTTGCTGATTTAAATGCTACCATATCACTCA
ATGGCTTCCCTTGCAAGCCAGTGTCAGAAGTCACTTCAGATGATTTCTTCTTTGATGGTTTGAGCAAAGAGGGCAACACAGATAATGCTTTCGGTTTCCAGATCACGCAA
GGGAATGTTCTGAACTTTCCAGGACTCAACACACTTGGGCTATCCATGAACCGTGTCGACCTTGCTCCTGGAGGAATAAATGCTCCTCACTCGCATCCTCGTGCCTCAGA
AAGCGTCGTCGTTATTAAGGGGAAAGTTCTTGTCGGGTTTGTGACAACAAATAATGTGTATTATTATAAGGTTTTGACTGCAGGGCAGATGTTTATCATTCCCAGAGGAC
TTGTTCATTTCCAGTTTAATGTTGGAACAAGCAAAGCAGTTCTGCTCACTGCTTTCAACAGTCAGTTGCCCGGGGCTGTGATCGTCTCCCGAACCTTGTTTGCTTCAAAT
CCTTCGATCCCTGTCGAAGTTCTCACCAAGACCTTCCAAGTAGATGATGGTGTTATCAACAGCATAAAGCTTTGCTCAGCAGGCTTCTTACCATCTGCTGAGCTAGGTTC
GACTTCAGCTTTTAAACCACCATTTTCATCTCTGGATTCCACTACTTTCATCCGTTTACCATTTGAATTACTCTGCCAAATCGCCGCCAAGCTGTACGCAGATGGATTCG
CAGCGGAGAAGGACGATTCATTGACAAGTGGAGGATCCATTTTCACCGACCCGAATCACCCAGCAGCAGGAATTCGCAAGGGATCAAAATCCGATATTAATGTCGGAACC
CAGAGAGAGAAAACGAAGAGCTTTCAGCTATCTATCCCTAGTTACCAAATTTCAGTGGAAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGTAACGGGAGTAGGAAACCAGAAAATGATGACCCATTGCAGGCCATATTCGAGCAGAAACGGACCCTTCGATCCAGCGTTCGAAAAGCCCTCAGAGCCATGGA
CCCCTCCCTCAGATCCCACGAAGATAATGTAATTCAGAGTATCATTTTGGAAGCTTCGTGGTTTAAATCTAGTCAGAGATTATGTGCTTATGTGAGTTGCAGTGCCTTAA
GAGAAGTTGATACGTCGAGACTGTTATCAGAGATTCTGCAACACCCACCAAAAGGTGTATGGTTATTCGAGAAGATTTATGTCCCGCGTGTGGAGGACAAGAATAGTCAC
ATGCGGATGTTCAATATTTCACGTATGGATGATCTCATCGCAAATTCAATGAATATCTTAGAACCAGCTCCTGTGGATAGTGATGGAAATGAACGTGAAGATGTTATGCA
GACAAATGACCCCGTTGATTTATTCCTCTTACCAGGACTAGCATTTGATAAATCAGGAAGACGACTTGGCCGAGGTGGAGGTTATTATGATACCTTCCTAAAGAACTACC
TAGAGCTTGCAAAGGCTCGGAATTGGAAGCAGCCCCTGCTTGTGGCACTGTCCTATTCAGTGCAGATAATGGATGAAGGAGTTATACCACTCACTCCAAATGACGTTTCG
GTCGATGCTCTTGTATCACCATCTGGAGGGATTCCCATCAGCTCGGCTGGATTAGACAACCAAAACTGCTCGGCCACCATGATTGCTTTCACAAACTCACCTGTAAACAT
GCTTGGTTGCTTCATTATATTGCTGCTTCTTCCTTATCCTTCTCTGTCAGCCGACCCCGATCCATTGCAGGATTTCTGCGTTGCTGATTTAAATGCTACCATATCACTCA
ATGGCTTCCCTTGCAAGCCAGTGTCAGAAGTCACTTCAGATGATTTCTTCTTTGATGGTTTGAGCAAAGAGGGCAACACAGATAATGCTTTCGGTTTCCAGATCACGCAA
GGGAATGTTCTGAACTTTCCAGGACTCAACACACTTGGGCTATCCATGAACCGTGTCGACCTTGCTCCTGGAGGAATAAATGCTCCTCACTCGCATCCTCGTGCCTCAGA
AAGCGTCGTCGTTATTAAGGGGAAAGTTCTTGTCGGGTTTGTGACAACAAATAATGTGTATTATTATAAGGTTTTGACTGCAGGGCAGATGTTTATCATTCCCAGAGGAC
TTGTTCATTTCCAGTTTAATGTTGGAACAAGCAAAGCAGTTCTGCTCACTGCTTTCAACAGTCAGTTGCCCGGGGCTGTGATCGTCTCCCGAACCTTGTTTGCTTCAAAT
CCTTCGATCCCTGTCGAAGTTCTCACCAAGACCTTCCAAGTAGATGATGGTGTTATCAACAGCATAAAGCTTTGCTCAGCAGGCTTCTTACCATCTGCTGAGCTAGGTTC
GACTTCAGCTTTTAAACCACCATTTTCATCTCTGGATTCCACTACTTTCATCCGTTTACCATTTGAATTACTCTGCCAAATCGCCGCCAAGCTGTACGCAGATGGATTCG
CAGCGGAGAAGGACGATTCATTGACAAGTGGAGGATCCATTTTCACCGACCCGAATCACCCAGCAGCAGGAATTCGCAAGGGATCAAAATCCGATATTAATGTCGGAACC
CAGAGAGAGAAAACGAAGAGCTTTCAGCTATCTATCCCTAGTTACCAAATTTCAGTGGAAGTTTAG
Protein sequenceShow/hide protein sequence
MASNGSRKPENDDPLQAIFEQKRTLRSSVRKALRAMDPSLRSHEDNVIQSIILEASWFKSSQRLCAYVSCSALREVDTSRLLSEILQHPPKGVWLFEKIYVPRVEDKNSH
MRMFNISRMDDLIANSMNILEPAPVDSDGNEREDVMQTNDPVDLFLLPGLAFDKSGRRLGRGGGYYDTFLKNYLELAKARNWKQPLLVALSYSVQIMDEGVIPLTPNDVS
VDALVSPSGGIPISSAGLDNQNCSATMIAFTNSPVNMLGCFIILLLLPYPSLSADPDPLQDFCVADLNATISLNGFPCKPVSEVTSDDFFFDGLSKEGNTDNAFGFQITQ
GNVLNFPGLNTLGLSMNRVDLAPGGINAPHSHPRASESVVVIKGKVLVGFVTTNNVYYYKVLTAGQMFIIPRGLVHFQFNVGTSKAVLLTAFNSQLPGAVIVSRTLFASN
PSIPVEVLTKTFQVDDGVINSIKLCSAGFLPSAELGSTSAFKPPFSSLDSTTFIRLPFELLCQIAAKLYADGFAAEKDDSLTSGGSIFTDPNHPAAGIRKGSKSDINVGT
QREKTKSFQLSIPSYQISVEV