; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022691 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022691
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTIP41-like protein
Genome locationtig00000589:938886..942023
RNA-Seq ExpressionSgr022691
SyntenySgr022691
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580509.1 hypothetical protein SDJN03_20511, partial [Cucurbita argyrosperma subsp. sororia]2.0e-11666.48Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MA+SLDDGEFWLPPQFLADDDN+    +  TNK SD N                            SPGESLKGSSE ESDEED +AG+TLRMA+STIDD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG
        GFD  +SKT ++SGSPQSTLCDMGSGSGCSQ SSRGSPK NCKV SPPAT DLLHAAAGEVARMRMNE+ GV   NRG       SQVSVPV NS+ GTG
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC
        FY+QLQA+Q  HL Q+    EIMQRQN  VG+Q+N+PAGYQ+QQ+H MVQ+GVR            RGLSSS W+PP   PQGSG RALFLGT G KREC
Subjt:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC

Query:  AGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        AGTGVF PRH+  QSE R+KPACSTVLVP RVMQALNLNL+DICSQPH++P+  GRLNSEN
Subjt:  AGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

KAG7027928.1 hypothetical protein SDJN02_09107 [Cucurbita argyrosperma subsp. argyrosperma]2.9e-11566.67Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MAESLDDGEFWLPPQFL+DDD+  DQ+SS TNK + +N                            S  ESLKGSSE ESD ED VAGLTL MARST+DD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG
        G D +++K RV++GSPQSTLCDMGSGSGCSQGSSRGSPKGNCKV SPPATWDLLHAAAG+VARMRM E HG VH NRG       SQVSVPV NS++ TG
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTA-GGKRECA
        FY+QLQA+Q   Q  QQ EIMQRQN  V +QIN  AGYQQQQIHQ+VQ+G+RSG          RGLSSSPW   +Q PQG+G RALF+GT+ GGKRECA
Subjt:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTA-GGKRECA

Query:  GTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        GTGVFLPRHT   S+PR+KP C TVLVP RVMQALNLNL+DICSQPHLQP+AA R NS+N
Subjt:  GTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

XP_022145688.1 uncharacterized protein LOC111015081 [Momordica charantia]3.4e-14075.7Show/hide
Query:  MMAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN-----------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFD
        MMAESLDDGEFWLPPQFLADDDN+LD KS  T K+SD+N                        SPGESLKGSSE ESDEEDY+AGLTLRMA S++DD FD
Subjt:  MMAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN-----------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFD

Query:  PEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNE-THGV-HHNRGLADPSRISQVSVPVNNSNAGTGFY
        P+++K RVVSGSPQSTLC MGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMR N+  HGV HHNRG        QVSVPVNNSNAGTGFY
Subjt:  PEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNE-THGV-HHNRGLADPSRISQVSVPVNNSNAGTGFY

Query:  RQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGT-AGGKRECAGT
        +QL A+Q       Q EIMQRQNLA G+Q+N+ AGYQQQQIHQMV +GVR G+FVGNRD+RSRGLSSS WLPPQQ PQGSGMRALFLGT   GKRECAGT
Subjt:  RQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGT-AGGKRECAGT

Query:  GVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        GVFLPRHTG QSEPRKKP CSTVLVP RVMQALNLNLDDICSQPHLQP+AAGR N+EN
Subjt:  GVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

XP_022941783.1 uncharacterized protein LOC111447045 [Cucurbita moschata]2.9e-11566.67Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MAESLDDGEFWLPPQFL+DDD+  DQ+SS TNK + +N                            S  ESLKGSSE ESD ED VAGLTL MARST+DD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG
        G D +++K RV++GSPQSTLCDMGSGSGCSQGSSRGSPKGNCKV SPPATWDLLHAAAG+VARMRM E HG VH NRG       SQVSVPV NS++ TG
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTA-GGKRECA
        FY+QLQA+Q   Q  QQ EIMQRQN  V +QIN  AGYQQQQIHQ+VQ+G+RSG          RGLSSSPW   +Q PQG+G RALF+GT+ GGKRECA
Subjt:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTA-GGKRECA

Query:  GTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        GTGVFLPRHT   S+PR+KP C TVLVP RVMQALNLNL+DICSQPHLQP+AA R NS+N
Subjt:  GTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

XP_023526837.1 uncharacterized protein LOC111790220 [Cucurbita pepo subsp. pepo]7.0e-11766.76Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MA+SLDDGEFWLPPQFLADDDN+    +  TNK SD N                            SPGESLKGSSE ESDEED +AG+TLRMA+STIDD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG
        GFD  +SKT ++SGSPQSTLCD+GSGSGCSQ SSRGSPK NCKV SPPAT DLLHAAAGEVARMRMNE+ GV   NRG       SQVSVPV NS+ GTG
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC
        FY+QLQA+Q  HL Q+    EIMQRQNL VG+Q+N+P GYQ+QQ+H MVQ+GVR            RG SSS W+PP   PQGSG RALFLGT GGKREC
Subjt:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC

Query:  AGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        AGTGVFLPRH+  QSE R+KPACSTVLVP RVMQALNLNL+DICSQPH+QP+  GRLNSEN
Subjt:  AGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

TrEMBL top hitse value%identityAlignment
A0A6J1CWM5 uncharacterized protein LOC1110150811.7e-14075.7Show/hide
Query:  MMAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN-----------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFD
        MMAESLDDGEFWLPPQFLADDDN+LD KS  T K+SD+N                        SPGESLKGSSE ESDEEDY+AGLTLRMA S++DD FD
Subjt:  MMAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN-----------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFD

Query:  PEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNE-THGV-HHNRGLADPSRISQVSVPVNNSNAGTGFY
        P+++K RVVSGSPQSTLC MGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMR N+  HGV HHNRG        QVSVPVNNSNAGTGFY
Subjt:  PEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNE-THGV-HHNRGLADPSRISQVSVPVNNSNAGTGFY

Query:  RQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGT-AGGKRECAGT
        +QL A+Q       Q EIMQRQNLA G+Q+N+ AGYQQQQIHQMV +GVR G+FVGNRD+RSRGLSSS WLPPQQ PQGSGMRALFLGT   GKRECAGT
Subjt:  RQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGT-AGGKRECAGT

Query:  GVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        GVFLPRHTG QSEPRKKP CSTVLVP RVMQALNLNLDDICSQPHLQP+AAGR N+EN
Subjt:  GVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

A0A6J1F0R5 uncharacterized protein LOC1114413331.9e-11566.85Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MA+SLDDGEFWLPPQFLADDDN+    +  TNK S  N                            SPGESLKGSSE ESDEED +AG TLRMA+STIDD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG
        GFD  +SKT ++SGSPQSTLCDMGSGSGCSQ SSRGSPK NCKV SPPAT DLLHAAAGEVARMRMNE+ GV   NRG       SQVSVPV NS+ GTG
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC
        FY+QLQA+Q  HL Q+    EIMQRQNL VG+Q+N+PAGYQ+QQ+H MVQ+GVR            RGLSSS W+PP   PQGSG RALFLGT G KREC
Subjt:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC

Query:  AGTGVFLPRHTGNQS-EPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        AGTGVFLPRH+  QS E R+KPACSTVLVP RVMQALNLNL+DICSQPH+QP+  GRLNSEN
Subjt:  AGTGVFLPRHTGNQS-EPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

A0A6J1FT27 uncharacterized protein LOC1114470451.4e-11566.67Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MAESLDDGEFWLPPQFL+DDD+  DQ+SS TNK + +N                            S  ESLKGSSE ESD ED VAGLTL MARST+DD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG
        G D +++K RV++GSPQSTLCDMGSGSGCSQGSSRGSPKGNCKV SPPATWDLLHAAAG+VARMRM E HG VH NRG       SQVSVPV NS++ TG
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTA-GGKRECA
        FY+QLQA+Q   Q  QQ EIMQRQN  V +QIN  AGYQQQQIHQ+VQ+G+RSG          RGLSSSPW   +Q PQG+G RALF+GT+ GGKRECA
Subjt:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTA-GGKRECA

Query:  GTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        GTGVFLPRHT   S+PR+KP C TVLVP RVMQALNLNL+DICSQPHLQP+AA R NS+N
Subjt:  GTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

A0A6J1I5F7 uncharacterized protein LOC1114700845.6e-11264.9Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MAESLDDGEFWLPPQFL+DD +  DQ+S  TNK + +N                            S  ESLKGSSE ESD ED VAGLTL MARST+DD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG
        G D +++K  V++GSPQSTLCDMGSGSGCSQGSSRGSPKGNCKV SPPATWDLLHAAAG+VARMRM E HG VH NRG        QVS+PV NS++  G
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHG-VHHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAG
        FY+QLQA+Q   Q  QQ EIMQRQNL V +QIN  AGYQQQQIHQ+V +G+ SG          RGLSSSPW   +Q PQGSG R+LF+GT GGKRECAG
Subjt:  FYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAG

Query:  TGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN
        TGVFLPRHT   S+ R+KP C TVLVP RVMQALNLNL+DICSQPHLQP+AAGR NS+N
Subjt:  TGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSEN

A0A6J1J2V3 uncharacterized protein LOC1114821807.1e-11566.11Show/hide
Query:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD
        MA+SLDDGEFWLPPQFLADDDN+  Q +  TNK S  N                            SPGESLKGSSE ESDEED +A +TLRMA+STIDD
Subjt:  MAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQN---------------------------RSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDD

Query:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG
        GFD  +SK  ++SGSPQSTLCDMGSGSGCSQ SSRGSPK NCKV SPPAT DLLHAAAGEVARM+MNE+ GV   NRG       SQVSVPV NS+ GTG
Subjt:  GFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGV-HHNRGLADPSRISQVSVPVNNSNAGTG

Query:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC
        FY+QLQA+Q  HL Q+    EIMQRQNL VG+Q+N+PA YQ+QQ+H MVQ+GVR            RGLSSS W+PP   PQGSG RALFLG  GGKREC
Subjt:  FYRQLQAIQ--HLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKREC

Query:  AGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSE
        AGTGVFLPRH+  QSE R+KPACSTVLVP RVMQALNLNL+DICSQPH+QP+  GRLNSE
Subjt:  AGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39870.1 unknown protein1.3e-1228.22Show/hide
Query:  SPGESLKGSSEAESDEEDYVAGLTLRMARST--IDDGFDPEHSKTRVVSGSPQSTLCDMGSGSGC-SQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVAR
        SPG+S +   E+  DEED++AGLT R+A ST  +         K +V + SPQSTL  +GS S   S+     SP            WD++ AAAGEVAR
Subjt:  SPGESLKGSSEAESDEEDYVAGLTLRMARST--IDDGFDPEHSKTRVVSGSPQSTLCDMGSGSGC-SQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVAR

Query:  MRMNETHGVHHNRGLADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRS
        +++  ++  HH              +P+                      Q    +++RQN A+  ++      +Q  +         S + +  R    
Subjt:  MRMNETHGVHHNRGLADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRS

Query:  RGLSSSP---------WLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPRH--TGNQSEPRKKPACSTVLVPGRVMQALNLNLDD
         GL  +P         WLPPQQ              A  KR  AGTGVFLPR   +   S+  K P  +  ++  +V +  NLN D+
Subjt:  RGLSSSP---------WLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPRH--TGNQSEPRKKPACSTVLVPGRVMQALNLNLDD

AT3G54000.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.9e-2933.91Show/hide
Query:  LDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQNRSP----------GESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKTRVVSGSPQSTLC
        +DD EFWLP +FL DDD L++++++      D +  P          G ++K ++  E DEE ++AGLT +M  S++ D F                   
Subjt:  LDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQNRSP----------GESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKTRVVSGSPQSTLC

Query:  DMG------SGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGVHHNRG-LADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQ
        +M       +G+GC   + R +   N +VSS    WDL  AA     RM +N+    H  RG L  P+++S  +   N+SN GTG+Y   Q++Q+ + Q 
Subjt:  DMG------SGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGVHHNRG-LADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQ

Query:  QQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPR---HTGNQ
         Q + +++Q L           +++Q + Q    GVR     GN++     LSSS W    Q P+   MRA+F+G   GKR   GTGVFLPR   HT ++
Subjt:  QQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPR---HTGNQ

Query:  SEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSENG
        +E R+KP  STVLVP R+ Q LNLNL +           + R  S NG
Subjt:  SEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNSENG

AT3G54000.2 unknown protein2.6e-2132.79Show/hide
Query:  LDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQNRSP----------GESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKTRVVSGSPQSTLC
        +DD EFWLP +FL DDD L++++++      D +  P          G ++K ++  E DEE ++AGLT +M  S++ D F                   
Subjt:  LDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQNRSP----------GESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKTRVVSGSPQSTLC

Query:  DMG------SGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGVHHNRG-LADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQ
        +M       +G+GC   + R +   N +VSS    WDL  AA     RM +N+    H  RG L  P+++S  +   N+SN GTG+Y   Q++Q+ + Q 
Subjt:  DMG------SGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGVHHNRG-LADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQ

Query:  QQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPR---HTGNQ
         Q + +++Q L           +++Q + Q    GVR     GN++     LSSS W    Q P+   MRA+F+G   GKR   GTGVFLPR   HT ++
Subjt:  QQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPR---HTGNQ

Query:  SEPRKKPA
        +E R+KP+
Subjt:  SEPRKKPA

AT3G54000.3 unknown protein2.6e-2132.79Show/hide
Query:  LDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQNRSP----------GESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKTRVVSGSPQSTLC
        +DD EFWLP +FL DDD L++++++      D +  P          G ++K ++  E DEE ++AGLT +M  S++ D F                   
Subjt:  LDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQNRSP----------GESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKTRVVSGSPQSTLC

Query:  DMG------SGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGVHHNRG-LADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQ
        +M       +G+GC   + R +   N +VSS    WDL  AA     RM +N+    H  RG L  P+++S  +   N+SN GTG+Y   Q++Q+ + Q 
Subjt:  DMG------SGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGVHHNRG-LADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQ

Query:  QQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPR---HTGNQ
         Q + +++Q L           +++Q + Q    GVR     GN++     LSSS W    Q P+   MRA+F+G   GKR   GTGVFLPR   HT ++
Subjt:  QQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPR---HTGNQ

Query:  SEPRKKPA
        +E R+KP+
Subjt:  SEPRKKPA

AT5G59050.1 unknown protein1.1e-0827.14Show/hide
Query:  SDQNRSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKT-RVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGE
        SD+  SP       ++ E +E++Y+  LT +M    + D  D +H K+    SGSPQSTL            S   SP G  +  SPP T     A    
Subjt:  SDQNRSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKT-RVVSGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGE

Query:  VARMRMNETHGVHHNRGLADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRD
           M   +T                 V++P  +  A      Q+++IQ   Q+ ++ +  +RQ                           R+ D +G++ 
Subjt:  VARMRMNETHGVHHNRGLADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMVQSGVRSGDFVGNRD

Query:  NRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDI
             L        Q Q   SG++A+F+  +G +    GTGVFLPR  G   E RKK  CSTV++P RV++AL ++ D +
Subjt:  NRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCTGAGAGTTTGGACGACGGAGAGTTTTGGCTTCCTCCTCAGTTTCTTGCTGACGATGATAATTTGTTGGATCAAAAGAGTAGTGGTACCAACAAAACCAGTGA
TCAAAACCGTTCACCTGGCGAATCACTGAAGGGTTCCAGCGAGGCGGAGAGTGATGAGGAGGATTATGTGGCTGGCTTGACTCTTCGTATGGCTCGCTCCACCATTGATG
ACGGGTTCGACCCGGAGCATTCAAAGACAAGGGTTGTGTCCGGATCTCCACAGTCGACTCTGTGCGATATGGGGAGTGGCAGTGGCTGCAGTCAAGGTTCCAGTCGCGGA
AGTCCAAAAGGCAATTGTAAGGTTTCATCCCCTCCGGCTACATGGGATCTGCTGCATGCGGCTGCAGGGGAAGTTGCTAGGATGCGAATGAATGAGACCCATGGGGTCCA
TCATAACAGAGGACTCGCTGATCCATCCAGAATATCTCAAGTCTCTGTTCCGGTGAACAACTCGAACGCCGGAACCGGATTTTACCGGCAGTTGCAGGCCATCCAACACC
TGCAACAGCAACAGCAACAGCATGAAATTATGCAGCGACAAAACTTGGCTGTGGGAGATCAGATAAATGCTCCTGCAGGGTATCAACAACAGCAGATCCATCAAATGGTC
CAGAGTGGAGTGAGGAGCGGCGACTTCGTTGGTAACAGAGACAATCGCAGTCGGGGATTGTCGTCGTCTCCATGGCTTCCACCGCAGCAGCAGCCTCAGGGTTCTGGAAT
GAGGGCTCTGTTTCTCGGAACTGCAGGCGGCAAAAGGGAATGCGCTGGAACTGGTGTCTTTTTGCCTCGACATACGGGCAATCAATCTGAGCCACGCAAAAAGCCAGCTT
GTTCTACGGTGTTGGTTCCTGGAAGAGTGATGCAAGCCCTGAATCTGAACCTGGACGACATTTGCAGTCAGCCCCATCTTCAACCCATCGCTGCCGGACGATTGAATTCA
GAGAACGGCTGCAGATCAACCGCGGCGGCGCGAATCATCAAAAACGCAACAGCCGTCGGCCGCCGGAAACGGACCATGAAATCAAGCTTCCCCAAGAATGGACTTACTGA
AACTGCCCCTCGCATGGAAGAAGTGGAGGTTAGGAAGGTAGTTATTGGTGTTACGTCGACCGACGTCAACTGTCATCTGTCACTTGGGAAGCCGCACATGGCGCGCACTA
TGGATGTTGGCCATTTACGCGCACGGTGGGCGCTTCCTCCTTCTCCTTTAAGATTCCATCCCATCCTTAATTCCCGCCGATCCTCAATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGCTGAGAGTTTGGACGACGGAGAGTTTTGGCTTCCTCCTCAGTTTCTTGCTGACGATGATAATTTGTTGGATCAAAAGAGTAGTGGTACCAACAAAACCAGTGA
TCAAAACCGTTCACCTGGCGAATCACTGAAGGGTTCCAGCGAGGCGGAGAGTGATGAGGAGGATTATGTGGCTGGCTTGACTCTTCGTATGGCTCGCTCCACCATTGATG
ACGGGTTCGACCCGGAGCATTCAAAGACAAGGGTTGTGTCCGGATCTCCACAGTCGACTCTGTGCGATATGGGGAGTGGCAGTGGCTGCAGTCAAGGTTCCAGTCGCGGA
AGTCCAAAAGGCAATTGTAAGGTTTCATCCCCTCCGGCTACATGGGATCTGCTGCATGCGGCTGCAGGGGAAGTTGCTAGGATGCGAATGAATGAGACCCATGGGGTCCA
TCATAACAGAGGACTCGCTGATCCATCCAGAATATCTCAAGTCTCTGTTCCGGTGAACAACTCGAACGCCGGAACCGGATTTTACCGGCAGTTGCAGGCCATCCAACACC
TGCAACAGCAACAGCAACAGCATGAAATTATGCAGCGACAAAACTTGGCTGTGGGAGATCAGATAAATGCTCCTGCAGGGTATCAACAACAGCAGATCCATCAAATGGTC
CAGAGTGGAGTGAGGAGCGGCGACTTCGTTGGTAACAGAGACAATCGCAGTCGGGGATTGTCGTCGTCTCCATGGCTTCCACCGCAGCAGCAGCCTCAGGGTTCTGGAAT
GAGGGCTCTGTTTCTCGGAACTGCAGGCGGCAAAAGGGAATGCGCTGGAACTGGTGTCTTTTTGCCTCGACATACGGGCAATCAATCTGAGCCACGCAAAAAGCCAGCTT
GTTCTACGGTGTTGGTTCCTGGAAGAGTGATGCAAGCCCTGAATCTGAACCTGGACGACATTTGCAGTCAGCCCCATCTTCAACCCATCGCTGCCGGACGATTGAATTCA
GAGAACGGCTGCAGATCAACCGCGGCGGCGCGAATCATCAAAAACGCAACAGCCGTCGGCCGCCGGAAACGGACCATGAAATCAAGCTTCCCCAAGAATGGACTTACTGA
AACTGCCCCTCGCATGGAAGAAGTGGAGGTTAGGAAGGTAGTTATTGGTGTTACGTCGACCGACGTCAACTGTCATCTGTCACTTGGGAAGCCGCACATGGCGCGCACTA
TGGATGTTGGCCATTTACGCGCACGGTGGGCGCTTCCTCCTTCTCCTTTAAGATTCCATCCCATCCTTAATTCCCGCCGATCCTCAATCTGA
Protein sequenceShow/hide protein sequence
MMAESLDDGEFWLPPQFLADDDNLLDQKSSGTNKTSDQNRSPGESLKGSSEAESDEEDYVAGLTLRMARSTIDDGFDPEHSKTRVVSGSPQSTLCDMGSGSGCSQGSSRG
SPKGNCKVSSPPATWDLLHAAAGEVARMRMNETHGVHHNRGLADPSRISQVSVPVNNSNAGTGFYRQLQAIQHLQQQQQQHEIMQRQNLAVGDQINAPAGYQQQQIHQMV
QSGVRSGDFVGNRDNRSRGLSSSPWLPPQQQPQGSGMRALFLGTAGGKRECAGTGVFLPRHTGNQSEPRKKPACSTVLVPGRVMQALNLNLDDICSQPHLQPIAAGRLNS
ENGCRSTAAARIIKNATAVGRRKRTMKSSFPKNGLTETAPRMEEVEVRKVVIGVTSTDVNCHLSLGKPHMARTMDVGHLRARWALPPSPLRFHPILNSRRSSI