; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024009 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024009
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionT4P13.26 protein isoform 1
Genome locationtig00001047:2415258..2417401
RNA-Seq ExpressionSgr024009
SyntenySgr024009
Gene Ontology termsGO:0016874 - ligase activity (molecular function)
InterPro domainsIPR025638 - Protein of unknown function DUF4336


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602525.1 hypothetical protein SDJN03_07758, partial [Cucurbita argyrosperma subsp. sororia]1.0e-23888.73Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAVPSPKSAILG+NP SRKDPA  FLGRPLKGFGF PK R K D + LIVASATP    +SSS+ N GERFYFNFTGFPFPLGPFLNRRTIRTE  
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
                               AVKG IWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLL ELEAPVEYIVLPTFAYEHKIFVGP
Subjt:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP

Query:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
        FSRKFPRAQIWVAPRQWSWPLNLPLEF GIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
Subjt:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL

Query:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA
        ASAKNGLAVKLLSKGKEVPEEPVVDN +NRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK IIPA
Subjt:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA

Query:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        HFAAPVNASR DFLTAFGFLDDLLGERY+NRPSLSLLF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

TYK00636.1 T4P13.26 protein isoform 1 [Cucumis melo var. makuwa]7.7e-24290.25Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAV SPKSAIL QNP SRKDP P FLGR  KGF    K R K  P+GLIVASAT    S+SSS  NV ERFYFNFTGFPFPLGPFLNRRTIRTEVK
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HL---PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIF
         L   PFLSF   GLFDL+E    + AVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKEL+APVEYI+LPTFAYEHKIF
Subjt:  HL---PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIF

Query:  VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKE
        VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRS+TLLVTDAVIFVPRQPPECISKE
Subjt:  VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKE

Query:  SLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSI
        SLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK I
Subjt:  SLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSI

Query:  IPAHFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        IPAHFAAPVNAS  DFLTAFGFLDDLLGERYVNRPSLS+LF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  IPAHFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

XP_022133514.1 uncharacterized protein LOC111006077 [Momordica charantia]5.0e-24189.35Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        M+AAIAVPSPKSAILG+NPISRKDPA +FL RPLKGFGF PK R K DP+GLIVASAT    ++SSS+RNVGERFYFNFTGFPFPLGPFLNRRTIRTE  
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
                               AVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
Subjt:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP

Query:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
        FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLS PWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLL
Subjt:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL

Query:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA
        ASAKNGLAVKLLSKGKEVPEEPVVDNK NRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK IIPA
Subjt:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA

Query:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        HFAAPV ASR DFLTAFGFLDDLLGERYVNRPSLSLLF+++MGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

XP_022957685.1 uncharacterized protein LOC111459147 [Cucurbita moschata]5.2e-23888.52Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAVPSPKSAILG+NP SRKDPA  FLGRPLKGFGF PK R K D + LIVASATP    +SSS+ N GERFYFNFTGFPFPLGPFLNRRTIRTE  
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
                               AVKG IWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLL ELEAPVEYIVLPTFAYEHKIFVGP
Subjt:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP

Query:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
        FSRKFPRAQIWVAPRQWSWPLNLPLEF GIFGAKTLKDEDL APWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
Subjt:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL

Query:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA
        ASAKNGLAVKLLSKGKEVPEEPVVDN +NRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK IIPA
Subjt:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA

Query:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        HFAAPVNASR DFLTAFGFLDDLLGERY+NRPSLSLLF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

XP_038889479.1 uncharacterized protein LOC120079388 [Benincasa hispida]8.5e-24189.77Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAVPSPKSAILGQ P  RKDPA  FLGR LKGFGFQPK R K DP+GLIVASATP    +SSS+ NV ERFYFNFTGFPFPLGPFLNRRTIRTE  
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
                               AVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYI+LPTFAYEHKIFVGP
Subjt:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP

Query:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
        FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
Subjt:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL

Query:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA
        ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK IIPA
Subjt:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA

Query:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        HFAAPVNAS  DFLTAFGFLDDLLGERYVNRPSLSLLF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

TrEMBL top hitse value%identityAlignment
A0A5A7TD84 T4P13.26 protein isoform 17.3e-23885.63Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAV SPKSAIL QNP SRKDP P FLGR  KGF    K R K  P+GLIVASAT    S+SSS  NV ERFYFNFTGFPFPLGPFLNRRTIRTEVK
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HL---PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIF
         L   PFLSF   GLFDL+E    + AVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKEL+APVEYI+LPTFAYEHKIF
Subjt:  HL---PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIF

Query:  VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEV--------------------------GIGPYVEVAFYH
        VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEV                          GIGPYVEVAFYH
Subjt:  VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEV--------------------------GIGPYVEVAFYH

Query:  KRSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKT
        KRS+TLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKT
Subjt:  KRSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKT

Query:  LVFSKVPEKVRDWIDRIVRDWKFKSIIPAHFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVK
        LVFSKVPEKVRDWIDRIVRDWKFK IIPAHFAAPVNAS  DFLTAFGFLDDLLGERYVNRPSLS+LF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVK
Subjt:  LVFSKVPEKVRDWIDRIVRDWKFKSIIPAHFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVK

Query:  KTVSGRKR
        KTVSGRKR
Subjt:  KTVSGRKR

A0A5D3BQU2 T4P13.26 protein isoform 13.7e-24290.25Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAV SPKSAIL QNP SRKDP P FLGR  KGF    K R K  P+GLIVASAT    S+SSS  NV ERFYFNFTGFPFPLGPFLNRRTIRTEVK
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HL---PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIF
         L   PFLSF   GLFDL+E    + AVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKEL+APVEYI+LPTFAYEHKIF
Subjt:  HL---PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIF

Query:  VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKE
        VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRS+TLLVTDAVIFVPRQPPECISKE
Subjt:  VGPFSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKE

Query:  SLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSI
        SLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK I
Subjt:  SLLASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSI

Query:  IPAHFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        IPAHFAAPVNAS  DFLTAFGFLDDLLGERYVNRPSLS+LF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  IPAHFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

A0A6J1BW70 uncharacterized protein LOC1110060772.4e-24189.35Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        M+AAIAVPSPKSAILG+NPISRKDPA +FL RPLKGFGF PK R K DP+GLIVASAT    ++SSS+RNVGERFYFNFTGFPFPLGPFLNRRTIRTE  
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
                               AVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
Subjt:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP

Query:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
        FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLS PWADEIEQKVLSSPEVGIGPYVEVAFYHK+SRTLLVTDAVIFVPRQPPECISKESLL
Subjt:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL

Query:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA
        ASAKNGLAVKLLSKGKEVPEEPVVDNK NRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK IIPA
Subjt:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA

Query:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        HFAAPV ASR DFLTAFGFLDDLLGERYVNRPSLSLLF+++MGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

A0A6J1H184 uncharacterized protein LOC1114591472.5e-23888.52Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAVPSPKSAILG+NP SRKDPA  FLGRPLKGFGF PK R K D + LIVASATP    +SSS+ N GERFYFNFTGFPFPLGPFLNRRTIRTE  
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
                               AVKG IWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLL ELEAPVEYIVLPTFAYEHKIFVGP
Subjt:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP

Query:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
        FSRKFPRAQIWVAPRQWSWPLNLPLEF GIFGAKTLKDEDL APWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
Subjt:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL

Query:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA
        ASAKNGLAVKLLSKGKEVPEEPVVDN +NRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK IIPA
Subjt:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA

Query:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        HFAAPVNASR DFLTAFGFLDDLLGERY+NRPSLSLLF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

A0A6J1JPB2 uncharacterized protein LOC1114876733.3e-23888.52Show/hide
Query:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK
        MTAAIAVPSPKSAILG+NP SRKDPA  FLGRPLKGFGF PK R K D + LIVASATP    +SSS+ N GERFYFNFTGFPFPLGPFLNRRTIRTE  
Subjt:  MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVK

Query:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP
                               AVKG IWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLL ELEAPVEYIVLPTFAYEHKIFVGP
Subjt:  HLPFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGP

Query:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
        FSRKFPRAQIWVAPRQWSWPLNLPLEF GIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL
Subjt:  FSRKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLL

Query:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA
        ASAKNGLAVKLLSKGKEVPEEPVVDN INR+KGWERMVLQILFLGPSNLLEPNASFA+MSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFK IIPA
Subjt:  ASAKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPA

Query:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
        HFAAPVNASR DFLTAFGFLDDLLGERY+NRPSLSLLF++LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  HFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01060.1 unknown protein9.2e-20174.84Show/hide
Query:  AAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVKHL
        AA+AV  PK +     P    D    FLG            R+   PV ++ AS+T  T      + +  +RFY NFTGFPFPLGPFLNRRTIRTE    
Subjt:  AAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVKHL

Query:  PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGPFS
                             AVKG IW+FEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKEC+QL+KEL APVEYIVLPTFAYEHKIFVGPFS
Subjt:  PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGPFS

Query:  RKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLLAS
        RKFP+AQ+WVAPRQWSWPLNLPLEF GIF AK +KD DLS PWA+EIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPR+PP  IS ESLLAS
Subjt:  RKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLLAS

Query:  AKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPAHF
        AKNGLAVK+LSKGK++P +PVVDN   RQKGWERMVLQILFLGPSNLLEPNASFA+MSQKLIVSPIVKTLVFSKVPEKVRDWID I RDW+FK IIPAHF
Subjt:  AKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPAHF

Query:  AAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
         AP+NA R DFL AFGFL+DLLGERYV RPSLSLLF++LMGKAASYFPPDDM+TLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  AAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

AT3G01060.2 unknown protein1.4e-19373.17Show/hide
Query:  AAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVKHL
        AA+AV  PK +     P    D    FLG            R+   PV ++ AS+T  T      + +  +RFY NFTGFPFPLGPFLNRRTIRTE    
Subjt:  AAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVKHL

Query:  PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGPFS
                             AVKG IW+FEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKEC+QL+KEL APVEYIVLPTFAYEHKIFVGPFS
Subjt:  PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGPFS

Query:  RKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLLAS
        RKFP+AQ+WVAPRQWSWPLNLPLEF GIF AK +KD DLS PWA+EIEQKVLSSP        EVAFYHKRSRTLLVTDAVIFVPR+PP  IS ESLLAS
Subjt:  RKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLLAS

Query:  AKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPAHF
        AKNGLAVK+LSKGK++P +PVVDN   RQKGWERMVLQILFLGPSNLLEPNASFA+MSQKLIVSPIVKTLVFSKVPEKVRDWID I RDW+FK IIPAHF
Subjt:  AKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPAHF

Query:  AAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR
         AP+NA R DFL AFGFL+DLLGERYV RPSLSLLF++LMGKAASYFPPDDM+TLSSLDQFLVSVGAVKKTVSGRKR
Subjt:  AAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFSTLMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR

AT3G01060.3 unknown protein3.1e-14871.96Show/hide
Query:  AAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVKHL
        AA+AV  PK +     P    D    FLG            R+   PV ++ AS+T  T      + +  +RFY NFTGFPFPLGPFLNRRTIRTE    
Subjt:  AAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVKHL

Query:  PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGPFS
                             AVKG IW+FEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKEC+QL+KEL APVEYIVLPTFAYEHKIFVGPFS
Subjt:  PFLSFFLSGLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGPFS

Query:  RKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLLAS
        RKFP+AQ+WVAPRQWSWPLNLPLEF GIF AK +KD DLS PWA+EIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPR+PP  IS ESLLAS
Subjt:  RKFPRAQIWVAPRQWSWPLNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLLAS

Query:  AKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEK
        AKNGLAVK+LSKGK++P +PVVDN   RQKGWERMVLQILFLGPSNLLEPNASFA+MSQKLIVSPIVKTLVFSKVPEK
Subjt:  AKNGLAVKLLSKGKEVPEEPVVDNKINRQKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCAGCCATTGCTGTTCCTTCCCCCAAATCAGCCATTTTAGGGCAAAATCCAATCTCTCGCAAAGACCCTGCTCCGTTTTTTCTCGGTCGGCCGCTGAAGGGTTT
CGGTTTTCAGCCGAAACGGAGGAAGAAAGCTGATCCTGTAGGTTTAATTGTTGCTTCGGCCACTCCACCCACAGCTTCAAATAGCAGCAGCAACAGAAATGTCGGTGAAA
GATTCTACTTCAACTTCACTGGATTTCCTTTTCCCCTCGGCCCTTTTCTGAATAGGCGCACTATAAGAACTGAGGTTAAACATTTACCCTTTCTTTCTTTCTTTCTTTCG
GGTCTGTTTGATTTGATAGAATTTGCTGCAAAGTTGTGTGCTGTCAAAGGTTCTATATGGCTTTTTGAGCAAGAACAAGCGTTAGGCTTCAGCAGTGTCTCAACAAACAT
TAGGATGACAGTCATCAAACTCAAATCTGGAGGATTATGGGTCCATGCACCCATTGCTCCAACCAAGGAGTGTATTCAGCTTCTGAAGGAGTTGGAGGCTCCTGTAGAAT
ACATTGTTTTACCAACGTTTGCTTATGAGCACAAAATTTTTGTTGGGCCATTTTCGAGGAAGTTCCCGCGAGCTCAGATATGGGTGGCACCGAGGCAGTGGAGTTGGCCT
TTGAACTTGCCATTGGAATTCTTGGGAATTTTTGGAGCTAAAACATTGAAAGATGAGGATTTATCTGCCCCATGGGCAGATGAGATCGAGCAGAAAGTTTTAAGCTCACC
AGAAGTTGGGATTGGGCCGTACGTGGAGGTTGCTTTTTATCATAAGCGGTCGAGAACACTTCTGGTGACAGATGCTGTAATCTTTGTCCCTAGACAACCACCTGAATGCA
TTAGCAAAGAATCCTTGTTGGCATCAGCAAAGAATGGTTTGGCAGTAAAATTATTGAGTAAAGGAAAGGAAGTCCCTGAAGAGCCAGTTGTTGACAATAAGATCAACCGT
CAAAAGGGGTGGGAAAGAATGGTTCTCCAGATATTGTTTCTTGGCCCTTCAAATCTCTTGGAGCCTAATGCTAGTTTTGCTCAAATGTCACAGAAACTTATTGTTTCACC
CATTGTAAAGACTCTTGTCTTCAGCAAAGTTCCTGAAAAGGTCAGGGACTGGATTGATAGAATCGTTCGAGATTGGAAGTTCAAGAGCATCATCCCTGCTCACTTTGCAG
CTCCAGTAAATGCAAGTAGGTTCGATTTCCTAACTGCGTTCGGGTTTCTCGATGACCTTTTGGGAGAGCGCTATGTCAACCGACCTTCACTCTCTCTTCTCTTTTCAACA
CTCATGGGGAAGGCTGCTAGTTACTTTCCACCAGATGATATGAAGACCTTATCATCCCTTGACCAGTTTTTAGTATCGGTTGGAGCCGTGAAGAAGACCGTCTCAGGCAG
AAAAAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGGCAGCCATTGCTGTTCCTTCCCCCAAATCAGCCATTTTAGGGCAAAATCCAATCTCTCGCAAAGACCCTGCTCCGTTTTTTCTCGGTCGGCCGCTGAAGGGTTT
CGGTTTTCAGCCGAAACGGAGGAAGAAAGCTGATCCTGTAGGTTTAATTGTTGCTTCGGCCACTCCACCCACAGCTTCAAATAGCAGCAGCAACAGAAATGTCGGTGAAA
GATTCTACTTCAACTTCACTGGATTTCCTTTTCCCCTCGGCCCTTTTCTGAATAGGCGCACTATAAGAACTGAGGTTAAACATTTACCCTTTCTTTCTTTCTTTCTTTCG
GGTCTGTTTGATTTGATAGAATTTGCTGCAAAGTTGTGTGCTGTCAAAGGTTCTATATGGCTTTTTGAGCAAGAACAAGCGTTAGGCTTCAGCAGTGTCTCAACAAACAT
TAGGATGACAGTCATCAAACTCAAATCTGGAGGATTATGGGTCCATGCACCCATTGCTCCAACCAAGGAGTGTATTCAGCTTCTGAAGGAGTTGGAGGCTCCTGTAGAAT
ACATTGTTTTACCAACGTTTGCTTATGAGCACAAAATTTTTGTTGGGCCATTTTCGAGGAAGTTCCCGCGAGCTCAGATATGGGTGGCACCGAGGCAGTGGAGTTGGCCT
TTGAACTTGCCATTGGAATTCTTGGGAATTTTTGGAGCTAAAACATTGAAAGATGAGGATTTATCTGCCCCATGGGCAGATGAGATCGAGCAGAAAGTTTTAAGCTCACC
AGAAGTTGGGATTGGGCCGTACGTGGAGGTTGCTTTTTATCATAAGCGGTCGAGAACACTTCTGGTGACAGATGCTGTAATCTTTGTCCCTAGACAACCACCTGAATGCA
TTAGCAAAGAATCCTTGTTGGCATCAGCAAAGAATGGTTTGGCAGTAAAATTATTGAGTAAAGGAAAGGAAGTCCCTGAAGAGCCAGTTGTTGACAATAAGATCAACCGT
CAAAAGGGGTGGGAAAGAATGGTTCTCCAGATATTGTTTCTTGGCCCTTCAAATCTCTTGGAGCCTAATGCTAGTTTTGCTCAAATGTCACAGAAACTTATTGTTTCACC
CATTGTAAAGACTCTTGTCTTCAGCAAAGTTCCTGAAAAGGTCAGGGACTGGATTGATAGAATCGTTCGAGATTGGAAGTTCAAGAGCATCATCCCTGCTCACTTTGCAG
CTCCAGTAAATGCAAGTAGGTTCGATTTCCTAACTGCGTTCGGGTTTCTCGATGACCTTTTGGGAGAGCGCTATGTCAACCGACCTTCACTCTCTCTTCTCTTTTCAACA
CTCATGGGGAAGGCTGCTAGTTACTTTCCACCAGATGATATGAAGACCTTATCATCCCTTGACCAGTTTTTAGTATCGGTTGGAGCCGTGAAGAAGACCGTCTCAGGCAG
AAAAAGGTAA
Protein sequenceShow/hide protein sequence
MTAAIAVPSPKSAILGQNPISRKDPAPFFLGRPLKGFGFQPKRRKKADPVGLIVASATPPTASNSSSNRNVGERFYFNFTGFPFPLGPFLNRRTIRTEVKHLPFLSFFLS
GLFDLIEFAAKLCAVKGSIWLFEQEQALGFSSVSTNIRMTVIKLKSGGLWVHAPIAPTKECIQLLKELEAPVEYIVLPTFAYEHKIFVGPFSRKFPRAQIWVAPRQWSWP
LNLPLEFLGIFGAKTLKDEDLSAPWADEIEQKVLSSPEVGIGPYVEVAFYHKRSRTLLVTDAVIFVPRQPPECISKESLLASAKNGLAVKLLSKGKEVPEEPVVDNKINR
QKGWERMVLQILFLGPSNLLEPNASFAQMSQKLIVSPIVKTLVFSKVPEKVRDWIDRIVRDWKFKSIIPAHFAAPVNASRFDFLTAFGFLDDLLGERYVNRPSLSLLFST
LMGKAASYFPPDDMKTLSSLDQFLVSVGAVKKTVSGRKR