; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028601 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028601
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLEA_2 domain-containing protein
Genome locationtig00153204:2694326..2700310
RNA-Seq ExpressionSgr028601
SyntenySgr028601
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060912.1 Late embryogenesis abundant protein, LEA-14 [Cucumis melo var. makuwa]2.5e-16785.48Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTS+APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKI+PNDVSRG HRK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        GQKPWKECDVIEEEGLLEDEDRGK+LPRRCYVLAFILGF +LFSMFALILWGAS+PMKP++TMKSITFEQFKIQAGSDFTGVATDMASVNSTVKL FRNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
        GSFFGVHV+STPVDLTYSEI+VASG+VK+FYQSRKS RS+TINVIGTR+PLYGSGASLSSSTGTP TP+PLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI

Query:  TFDPKKLNVP-------IKKADCDSSKEAAAAAAYGRRR-TGGEDPADRNGVSAGCPFARVVHNR
         FDPKKLNVP       +  ADCD+S         GR+   G    +DRNGVS GCPFAR+V  R
Subjt:  TFDPKKLNVP-------IKKADCDSSKEAAAAAAYGRRR-TGGEDPADRNGVSAGCPFARVVHNR

KAG6585673.1 putative serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia]1.0e-16079.63Show/hide
Query:  KRVAVSNGVTKLSSHNQLCIFTRLNSLEWLLSLHTHIAKSPSETATMHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSP
        +RV +SN V        L IFT  NSL    +      ++PS+T TMHAKTDSEVTS+A SSPTRSPRRPVYYVQSPSRDS+DGEKTATSFHSTPVLTSP
Subjt:  KRVAVSNGVTKLSSHNQLCIFTRLNSLEWLLSLHTHIAKSPSETATMHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSP

Query:  MDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGAS
        MDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRKGQKPWKECDVIEEEGLLEDEDR ++L RRCY+LAF+LGF +LFS+FALILWGAS
Subjt:  MDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGAS

Query:  KPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGS
        +PMKPK+TMKSI F QFK+QAGSDFTGVATDMASVNSTVKLTFRNTG+FFGVHV+STPVDLTY EIS+ASG+VK FYQSRKSQRS+TINVIGTRIPLYGS
Subjt:  KPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGS

Query:  GASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPITFDPKKLNVPIKKADCDSSKEAAAAAAYGRRR
        G SLSS  GTP TPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI FDPKKLNVP+   +C   + A A   +  R+
Subjt:  GASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPITFDPKKLNVPIKKADCDSSKEAAAAAAYGRRR

KAG6598363.1 hypothetical protein SDJN03_08141, partial [Cucurbita argyrosperma subsp. sororia]2.9e-16382.51Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTS A SSPTRSPRRP YYVQSPSRDSHDG+KTATSFHSTPVLTSPMDSPPHSRSS+GRHSRESSS+RFSGSLKPGSRKISPNDVSR  HRK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        GQKPW +CD I+EEGLLEDED+GK+LPRRCY+LAFILGF+LLFS FAL+LWGAS+PMKPKITMKSITFEQF+IQAGSDFTGVATDMASVNS+VKL FRNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
        GSFFGVHV+STPVDLTYSEISVASG+VK+FYQSRKSQRS+TI+VIGTR+PLYGSGASLSSSTGT ATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI

Query:  TFDPKKLNVPIKKADCDSSKEAAAAAAYGRRRTGGEDPADRNGVSAGCPFARVVHNRLVSGQRQVN
         FDPKKLNVP+   +C   + + ++      R       D NG SA CP ARVV +RLVSGQR VN
Subjt:  TFDPKKLNVPIKKADCDSSKEAAAAAAYGRRRTGGEDPADRNGVSAGCPFARVVHNRLVSGQRQVN

XP_022132311.1 uncharacterized protein LOC111005194 [Momordica charantia]1.0e-16094.03Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSP-RRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRG-GH
        MHAKTDSEVTSLAPSSPTRSP RRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSS+RFSGSLKPGSRKISPNDVSRG G+
Subjt:  MHAKTDSEVTSLAPSSPTRSP-RRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRG-GH

Query:  RKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR
        RKGQKPWKECDVIEEEGLLEDEDR  +LPRRCYVLAFILGF +LFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR
Subjt:  RKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR

Query:  NTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDC
        NTGSFFGVHVTSTPVDLTYSEISVASGSVK+FYQSRKSQRS+TINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFV+RSRAYVLGQLVKPKFYRHIDC
Subjt:  NTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDC

Query:  PITFDPKKLNVPIKKADC
        PI FDPKKLNVP+   +C
Subjt:  PITFDPKKLNVPIKKADC

XP_038884165.1 uncharacterized protein LOC120075075 [Benincasa hispida]7.9e-16192.41Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTS+APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRG HRK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        GQKPWKECDVIEEEGLLEDEDRGK+LPRRCYVLAFILGFV+LFSMFALILWGAS+PMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKL F+NT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
        GSFFGVHV+ TPV+LTYSEI+VASG+VK+FYQSRKS RS+TINVIGTR+PLYGSGAS SSSTGTP TP+PLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI

Query:  TFDPKKLNVPIKKADC
         FDPKKLNVPI   +C
Subjt:  TFDPKKLNVPIKKADC

TrEMBL top hitse value%identityAlignment
A0A0A0LNF3 Uncharacterized protein8.0e-15991.14Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTS+APSSPTRSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKI+PNDVSRG HRK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        GQKPWKECDVIEEEGLLEDEDRGK+LPRRCYVLAFILGFV+LFSMFALILWGAS+PMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKL FRNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
        GSFFGVHV+ TPVDL+YSEI+VASG+VK+FYQSRKS RS+TINVIGTR+PLYGSGASLS STGTP TP+PLKL FVIRSRAYVLGQLVKPKFYRHIDCPI
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI

Query:  TFDPKKLNVPIKKADC
         FD KKLNVP+   +C
Subjt:  TFDPKKLNVPIKKADC

A0A1S3BBJ5 uncharacterized protein LOC1034878792.5e-16091.77Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTS+APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKI+PNDVSRG HRK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        GQKPWKECDVIEEEGLLEDEDRGK+LPRRCYVLAFILGF +LFSMFALILWGAS+PMKP++TMKSITFEQFKIQAGSDFTGVATDMASVNSTVKL FRNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
        GSFFGVHV+STPVDLTYSEI+VASG+VK+FYQSRKS RS+TINVIGTR+PLYGSGASLSSSTGTP TP+PLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI

Query:  TFDPKKLNVPIKKADC
         FD KKLNVP+   +C
Subjt:  TFDPKKLNVPIKKADC

A0A5A7UYD8 Late embryogenesis abundant protein, LEA-141.2e-16785.48Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTS+APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKI+PNDVSRG HRK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        GQKPWKECDVIEEEGLLEDEDRGK+LPRRCYVLAFILGF +LFSMFALILWGAS+PMKP++TMKSITFEQFKIQAGSDFTGVATDMASVNSTVKL FRNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
        GSFFGVHV+STPVDLTYSEI+VASG+VK+FYQSRKS RS+TINVIGTR+PLYGSGASLSSSTGTP TP+PLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI

Query:  TFDPKKLNVP-------IKKADCDSSKEAAAAAAYGRRR-TGGEDPADRNGVSAGCPFARVVHNR
         FDPKKLNVP       +  ADCD+S         GR+   G    +DRNGVS GCPFAR+V  R
Subjt:  TFDPKKLNVP-------IKKADCDSSKEAAAAAAYGRRR-TGGEDPADRNGVSAGCPFARVVHNR

A0A6J1BRX1 uncharacterized protein LOC1110051945.0e-16194.03Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSP-RRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRG-GH
        MHAKTDSEVTSLAPSSPTRSP RRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSS+RFSGSLKPGSRKISPNDVSRG G+
Subjt:  MHAKTDSEVTSLAPSSPTRSP-RRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRG-GH

Query:  RKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR
        RKGQKPWKECDVIEEEGLLEDEDR  +LPRRCYVLAFILGF +LFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR
Subjt:  RKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR

Query:  NTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDC
        NTGSFFGVHVTSTPVDLTYSEISVASGSVK+FYQSRKSQRS+TINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFV+RSRAYVLGQLVKPKFYRHIDC
Subjt:  NTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDC

Query:  PITFDPKKLNVPIKKADC
        PI FDPKKLNVP+   +C
Subjt:  PITFDPKKLNVPIKKADC

A0A6J1GJ34 uncharacterized protein LOC1114546651.4e-15588.71Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTS+A SSPTRSPRRPVYYVQSPSRDS+DGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        GQKPWKECDVIEEEGLLEDEDR ++L RRCY+LAF+LGF +LFS+FALILWGAS+PMKPK+TMKSI F QFK+QAGSDFTGVATDMASVNSTVKLTFRNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
        G+FFGVHV+STPVDLTY EIS+ASG+VK FYQSRKSQRS+TINVIGTRIPLYGSG SLSS  GTP TPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI

Query:  TFDPKKLNVPIKKADCDSS
         FDPKKLNVP+   +C  S
Subjt:  TFDPKKLNVPIKKADCDSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45688.1 unknown protein3.0e-11867.17Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTSLA SSP RSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVL SPM SPPHS SS+GRHSRESSSSRFSGSLKPGSRK++PND S+     
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        G+K WKEC VIEEEGLL+D DR   +PRRCYVLAFI+GF +LF  F+LIL+GA+KPMKPKIT+KSITFE  KIQAG D  GV TDM ++N+T+++ +RNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASL------------SSSTGTPA---------TPVPLKLSFVIRS
        G+FFGVHVTSTP+DL++S+I + SGSVK+FYQ RKS+R+V ++VIG +IPLYGSG++L                G P           PVP+ LSFV+RS
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASL------------SSSTGTPA---------TPVPLKLSFVIRS

Query:  RAYVLGQLVKPKFYRHIDCPITFDPKKLN
        RAYVLG+LV+PKFY+ I+C I F+ K LN
Subjt:  RAYVLGQLVKPKFYRHIDCPITFDPKKLN

AT1G45688.2 unknown protein2.4e-9175.33Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTSLA SSP RSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVL SPM SPPHS SS+GRHSRESSSSRFSGSLKPGSRK++PND S+     
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT
        G+K WKEC VIEEEGLL+D DR   +PRRCYVLAFI+GF +LF  F+LIL+GA+KPMKPKIT+KSITFE  KIQAG D  GV TDM ++N+T+++ +RNT
Subjt:  GQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNT

Query:  GSFFGVHVTSTPVDLTYSEISVASGSV
        G+FFGVHVTSTP+DL++S+I + SGSV
Subjt:  GSFFGVHVTSTPVDLTYSEISVASGSV

AT2G41990.1 CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864)5.8e-4541.75Show/hide
Query:  MHAKTDSEVTSL--APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGH
        MHAKTDSE TS+  A  SP RS  RP+YYVQSPS  +HD EK   SF S   L      P +   S   HSRESS+SRFS       + I          
Subjt:  MHAKTDSEVTSL--APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGH

Query:  RKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR
        R+ ++   + D   + G  +D+D  + +  R YV   +L  + LF++F+LILWGASK   PK+T+K +      +QAG+D +GV TDM S+NSTV++ +R
Subjt:  RKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFR

Query:  NTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDC
        N  +FF VHVT++P+ L YS + ++SG + +F   R  + +V   V G +IPLYG G S    T      +PL L+ V+ S+AY+LG+LV  KFY  I C
Subjt:  NTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDC

Query:  PITFDPKKL
          T D   L
Subjt:  PITFDPKKL

AT4G35170.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.0e-4139.53Show/hide
Query:  APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVG---RHSRESSSS--RFSGSLKPGSRKISPNDVSRGGHRKGQKPWKE
        A SSP ++ R+PVY V SP     D   T + F       SP  SP + +  V     HS   SSS  R SG L+     +  +D+ R  H       ++
Subjt:  APSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVG---RHSRESSSS--RFSGSLKPGSRKISPNDVSRGGHRKGQKPWKE

Query:  CDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNTGSFFGVH
         D  E +G    +++ + + R    L F L  VL F++F LILWG SK   P  T+K +  E   +Q+G+D +GV TDM ++NSTV++ +RN  +FF VH
Subjt:  CDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNTGSFFGVH

Query:  VTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPV-PLKLSFVIRSRAYVLGQLVKPKFYRHIDCPITFDPKK
        VTS P+ L+YS++ +ASG + +F Q RKS+R +   V G +IPLYG   +L      P   V PL L+F +R+RAYVLG+LVK  F+ +I C ITF   K
Subjt:  VTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSSSTGTPATPV-PLKLSFVIRSRAYVLGQLVKPKFYRHIDCPITFDPKK

Query:  L
        L
Subjt:  L

AT5G42860.1 unknown protein5.9e-9858.68Show/hide
Query:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK
        MHAKTDSEVTSL+ SSPTRSPRRP Y+VQSPSRDSHDGEKTATSFHSTPVLTSPM SPPHS          SSSSRFS        KI+      G  RK
Subjt:  MHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRK

Query:  GQKPWKECDVIEEEGLLEDEDR-GKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRN
        G    K+  +IEEEGLL+D DR  + LPRRCYVLAFI+GF LLF+ F+LIL+ A+KP KPKI++KSITFEQ K+QAG D  G+ TDM ++N+T+++ +RN
Subjt:  GQKPWKECDVIEEEGLLEDEDR-GKTLPRRCYVLAFILGFVLLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRN

Query:  TGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSS--------------------STGTPATPVPLKLSFVIRS
        TG+FFGVHVTS+P+DL++S+I++ SGS+K+FYQSRKSQR+V +NV+G +IPLYGSG++L                          P  PVP++L+F +RS
Subjt:  TGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGSGASLSS--------------------STGTPATPVPLKLSFVIRS

Query:  RAYVLGQLVKPKFYRHIDCPITFDPKKL--NVPI
        RAYVLG+LV+PKFY+ I C I F+ KKL  ++PI
Subjt:  RAYVLGQLVKPKFYRHIDCPITFDPKKL--NVPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGGTGGATTGAAGTCAGCAGTGAAATAAGAACACTGGACGGAAAGATTCGTGGTGGCGGTGCCAACTGTAACATAGCTGTCACAAAACGCGTCGCCGTCTCTAA
TGGCGTTACCAAGCTCTCTTCCCACAATCAACTCTGCATTTTCACTCGCTTGAACTCACTGGAGTGGCTTCTATCTCTACACACACACATAGCAAAATCTCCATCGGAAA
CTGCCACAATGCACGCAAAAACAGACTCCGAGGTCACCAGTCTCGCGCCGTCTTCCCCGACCAGATCTCCCCGCCGACCGGTCTACTACGTTCAGAGCCCATCCAGGGAT
TCTCACGATGGGGAGAAGACCGCCACTTCCTTTCACTCTACTCCTGTTCTCACTAGCCCCATGGATTCCCCTCCCCACTCTCGCTCCTCCGTCGGCCGTCACTCCAGAGA
ATCTTCCTCCAGCAGATTTTCTGGGTCATTGAAACCCGGATCGCGCAAGATCTCTCCCAATGATGTCTCGCGGGGTGGGCATCGGAAGGGCCAGAAGCCATGGAAGGAGT
GCGATGTAATCGAAGAGGAAGGTCTTCTTGAAGATGAAGATCGTGGAAAGACTCTTCCTCGTCGCTGCTATGTCCTCGCTTTCATTTTGGGATTTGTTCTTCTGTTCTCC
ATGTTCGCTTTGATTCTCTGGGGTGCAAGCAAGCCAATGAAGCCCAAGATCACAATGAAGAGCATAACATTCGAGCAATTCAAAATCCAAGCCGGTTCAGATTTCACAGG
CGTGGCCACTGACATGGCCTCTGTAAATTCCACTGTGAAACTCACGTTTCGAAACACCGGATCATTCTTTGGCGTCCATGTCACGTCAACTCCCGTCGATTTGACATACT
CTGAGATTTCAGTCGCATCAGGAAGCGTAAAGCAGTTCTATCAATCCCGAAAGAGTCAGAGATCTGTCACGATCAATGTAATAGGGACCAGAATCCCACTGTATGGCAGC
GGAGCAAGCCTGAGCAGCTCCACGGGAACGCCGGCCACGCCAGTGCCTCTGAAACTGAGTTTCGTGATCAGATCCAGAGCCTACGTGCTGGGGCAATTAGTGAAGCCAAA
ATTCTACAGGCACATCGACTGCCCCATAACTTTCGATCCCAAAAAACTCAACGTCCCCATAAAGAAGGCGGACTGCGACAGCTCGAAAGAGGCAGCAGCAGCAGCAGCTT
ACGGGCGACGCCGAACGGGAGGAGAGGACCCAGCAGACAGAAACGGGGTCTCGGCTGGCTGCCCGTTTGCCCGAGTAGTGCATAACAGGTTGGTTTCCGGGCAGCGACAA
GTGAATGGACGGTTAGAGGGATTACTTCATTTTGAAGTTATGAAATATCGAATTCTAACCCCGGGCTTCCTCGACAGCATTTCCATCAAGCCCTCCAGGTTCCAAGAAGA
AAAGATTGAAGGAACTACAATTTTGGAAAATAATTCACATATATTGTCCTGTGGATTTCCTACAACCAAATTCGGCGCTACGGGGTTGGGTCTGTCCCTTGATGATACCC
TTAATGTTTGTAATGAAGATTCCCCTCTTCTTACACTCATGGGCGATGTCAAAGATAGTGAGTTGCAGCCAGAGGATGCAGAGATTGATAGATTTCTAAAAGCTCAGGTC
TGGGTTTTTGCACAGTTCACATTTGGCTTCTTGATCCTACTGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGGTGGATTGAAGTCAGCAGTGAAATAAGAACACTGGACGGAAAGATTCGTGGTGGCGGTGCCAACTGTAACATAGCTGTCACAAAACGCGTCGCCGTCTCTAA
TGGCGTTACCAAGCTCTCTTCCCACAATCAACTCTGCATTTTCACTCGCTTGAACTCACTGGAGTGGCTTCTATCTCTACACACACACATAGCAAAATCTCCATCGGAAA
CTGCCACAATGCACGCAAAAACAGACTCCGAGGTCACCAGTCTCGCGCCGTCTTCCCCGACCAGATCTCCCCGCCGACCGGTCTACTACGTTCAGAGCCCATCCAGGGAT
TCTCACGATGGGGAGAAGACCGCCACTTCCTTTCACTCTACTCCTGTTCTCACTAGCCCCATGGATTCCCCTCCCCACTCTCGCTCCTCCGTCGGCCGTCACTCCAGAGA
ATCTTCCTCCAGCAGATTTTCTGGGTCATTGAAACCCGGATCGCGCAAGATCTCTCCCAATGATGTCTCGCGGGGTGGGCATCGGAAGGGCCAGAAGCCATGGAAGGAGT
GCGATGTAATCGAAGAGGAAGGTCTTCTTGAAGATGAAGATCGTGGAAAGACTCTTCCTCGTCGCTGCTATGTCCTCGCTTTCATTTTGGGATTTGTTCTTCTGTTCTCC
ATGTTCGCTTTGATTCTCTGGGGTGCAAGCAAGCCAATGAAGCCCAAGATCACAATGAAGAGCATAACATTCGAGCAATTCAAAATCCAAGCCGGTTCAGATTTCACAGG
CGTGGCCACTGACATGGCCTCTGTAAATTCCACTGTGAAACTCACGTTTCGAAACACCGGATCATTCTTTGGCGTCCATGTCACGTCAACTCCCGTCGATTTGACATACT
CTGAGATTTCAGTCGCATCAGGAAGCGTAAAGCAGTTCTATCAATCCCGAAAGAGTCAGAGATCTGTCACGATCAATGTAATAGGGACCAGAATCCCACTGTATGGCAGC
GGAGCAAGCCTGAGCAGCTCCACGGGAACGCCGGCCACGCCAGTGCCTCTGAAACTGAGTTTCGTGATCAGATCCAGAGCCTACGTGCTGGGGCAATTAGTGAAGCCAAA
ATTCTACAGGCACATCGACTGCCCCATAACTTTCGATCCCAAAAAACTCAACGTCCCCATAAAGAAGGCGGACTGCGACAGCTCGAAAGAGGCAGCAGCAGCAGCAGCTT
ACGGGCGACGCCGAACGGGAGGAGAGGACCCAGCAGACAGAAACGGGGTCTCGGCTGGCTGCCCGTTTGCCCGAGTAGTGCATAACAGGTTGGTTTCCGGGCAGCGACAA
GTGAATGGACGGTTAGAGGGATTACTTCATTTTGAAGTTATGAAATATCGAATTCTAACCCCGGGCTTCCTCGACAGCATTTCCATCAAGCCCTCCAGGTTCCAAGAAGA
AAAGATTGAAGGAACTACAATTTTGGAAAATAATTCACATATATTGTCCTGTGGATTTCCTACAACCAAATTCGGCGCTACGGGGTTGGGTCTGTCCCTTGATGATACCC
TTAATGTTTGTAATGAAGATTCCCCTCTTCTTACACTCATGGGCGATGTCAAAGATAGTGAGTTGCAGCCAGAGGATGCAGAGATTGATAGATTTCTAAAAGCTCAGGTC
TGGGTTTTTGCACAGTTCACATTTGGCTTCTTGATCCTACTGATTTAG
Protein sequenceShow/hide protein sequence
MVRWIEVSSEIRTLDGKIRGGGANCNIAVTKRVAVSNGVTKLSSHNQLCIFTRLNSLEWLLSLHTHIAKSPSETATMHAKTDSEVTSLAPSSPTRSPRRPVYYVQSPSRD
SHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGGHRKGQKPWKECDVIEEEGLLEDEDRGKTLPRRCYVLAFILGFVLLFS
MFALILWGASKPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLTFRNTGSFFGVHVTSTPVDLTYSEISVASGSVKQFYQSRKSQRSVTINVIGTRIPLYGS
GASLSSSTGTPATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPITFDPKKLNVPIKKADCDSSKEAAAAAAYGRRRTGGEDPADRNGVSAGCPFARVVHNRLVSGQRQ
VNGRLEGLLHFEVMKYRILTPGFLDSISIKPSRFQEEKIEGTTILENNSHILSCGFPTTKFGATGLGLSLDDTLNVCNEDSPLLTLMGDVKDSELQPEDAEIDRFLKAQV
WVFAQFTFGFLILLI