CuGenDBv2

ClCG09G014530 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G014530
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome location: CG_Chr09:26498677..26501698
RNA-Seq Expression: ClCG09G014530
Synteny: ClCG09G014530
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
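
The genome location field in this record uses a `chromosome:start..end` form. As a minimal sketch, the snippet below splits such a string into its parts and reports the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chromosome:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'CG_Chr09:26498677..26501698'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["chrom"], int(match["start"]), int(match["end"]))


loc = parse_location("CG_Chr09:26498677..26501698")
print(loc.chrom, loc.start, loc.end, loc.length)
# CG_Chr09 26498677 26501698 3022
```

Under the inclusive-coordinate assumption the gene spans 3022 bases on CG_Chr09.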


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571818.1 NDR1/HIN1-like protein 13, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.6e-99; % identity: 91.67)
Query:  MSSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKV
        M SSR       P Y PR SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVG+QYM IT PNPT ASLSLNIRMIFTAVNPNKV
Subjt:  MSSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKV

Query:  GIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPR
        GIKYGESRFTVMYRGIPLGRAS+PGFFQDPHSQRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQVSVDCAIVISPR
Subjt:  GIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPR

Query:  KQSLTYKQCGFDGLNV
        KQSLTYKQCGFDGLNV
Subjt:  KQSLTYKQCGFDGLNV

XP_004139805.1 uncharacterized protein LOC101207234 [Cucumis sativus] (e value: 1.8e-106; % identity: 95.35)
Query:  SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG
        SSSR  PNG+HPRYNPR SSSSA+FKGCCCCLFLLFSFLALL+LA+VLVVVLALKPKKPQFDLQQV VQY+GITNPNPTTASLSLNIRMIFTAVNPNKVG
Subjt:  SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG

Query:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
        IKY ESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
Subjt:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK

Query:  QSLTYKQCGFDGLNV
        QSLTYKQCGFDGLNV
Subjt:  QSLTYKQCGFDGLNV

XP_008447191.1 PREDICTED: uncharacterized protein LOC103489700 [Cucumis melo] (e value: 2.5e-108; % identity: 93.75)
Query:  RLHNQKPM-SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIF
        +  NQ  M SSSR  PNG+HPRYNPR SSSSA+FKGCCCCLFLLFSFLALL+LA+VLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIF
Subjt:  RLHNQKPM-SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIF

Query:  TAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVD
        TAVNPNKVGIKY ESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVD
Subjt:  TAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVD

Query:  CAIVISPRKQSLTYKQCGFDGLNV
        CAIVISPRKQSLTYKQCGFDGLNV
Subjt:  CAIVISPRKQSLTYKQCGFDGLNV

XP_022971998.1 uncharacterized protein LOC111470648 [Cucurbita maxima] (e value: 5.1e-101; % identity: 92.09)
Query:  MSSSRAPPNGLHPRYNPRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG
        M SSR   +   P Y PRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVG+QYM IT PNPT ASLSLNIRMIFTAVNPNKVG
Subjt:  MSSSRAPPNGLHPRYNPRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG

Query:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
        IKYGESRFTVMYRGIPLGRAS+PGFFQDPHSQRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQVSVDCAIVISPRK
Subjt:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK

Query:  QSLTYKQCGFDGLNV
        QSLTYKQCGFDGLNV
Subjt:  QSLTYKQCGFDGLNV

XP_038887420.1 uncharacterized protein LOC120077562 [Benincasa hispida] (e value: 1.9e-111; % identity: 98.14)
Query:  MSSSRAPPNGLHPRYNPRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG
        MSSSR  PNG+HPRYNPRSSSSASFKGCCCCLFLLFSFLALLILA+VLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG
Subjt:  MSSSRAPPNGLHPRYNPRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG

Query:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
        IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
Subjt:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK

Query:  QSLTYKQCGFDGLNV
        QSLTYKQCGFDGLNV
Subjt:  QSLTYKQCGFDGLNV
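
Each alignment block above pairs a query line with a match line. In the usual BLAST convention, the match line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities for one fragment under that convention. The function name `alignment_identity` is hypothetical, and the % identity values reported for each hit are presumably computed over the full alignment by the database, so a fragment will not reproduce them.

```python
def alignment_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identical positions, aligned length, percent identity).

    Assumes BLAST-style match lines: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    length = len(query)
    return identical, length, 100.0 * identical / length


# First 18 columns of the Benincasa hispida hit (XP_038887420.1).
query_fragment = "MSSSRAPPNGLHPRYNPR"
match_fragment = "MSSSR  PNG+HPRYNPR"

identical, length, percent = alignment_identity(query_fragment, match_fragment)
print(identical, length, f"{percent:.2f}")
# 15 18 83.33
```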

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5H7 LEA_2 domain-containing protein (e value: 8.7e-107; % identity: 95.35)
Query:  SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG
        SSSR  PNG+HPRYNPR SSSSA+FKGCCCCLFLLFSFLALL+LA+VLVVVLALKPKKPQFDLQQV VQY+GITNPNPTTASLSLNIRMIFTAVNPNKVG
Subjt:  SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG

Query:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
        IKY ESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
Subjt:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK

Query:  QSLTYKQCGFDGLNV
        QSLTYKQCGFDGLNV
Subjt:  QSLTYKQCGFDGLNV

A0A1S3BGU8 uncharacterized protein LOC103489700 (e value: 1.2e-108; % identity: 93.75)
Query:  RLHNQKPM-SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIF
        +  NQ  M SSSR  PNG+HPRYNPR SSSSA+FKGCCCCLFLLFSFLALL+LA+VLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIF
Subjt:  RLHNQKPM-SSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIF

Query:  TAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVD
        TAVNPNKVGIKY ESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVD
Subjt:  TAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVD

Query:  CAIVISPRKQSLTYKQCGFDGLNV
        CAIVISPRKQSLTYKQCGFDGLNV
Subjt:  CAIVISPRKQSLTYKQCGFDGLNV

A0A6J1EUX4 uncharacterized protein LOC111437836 (e value: 1.6e-97; % identity: 87.22)
Query:  SSSRAPPNGL--------HPRYNPRSS--SSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPN-PTT--ASLSLNIR
        S+SR  PNG         HP Y+PRSS  SSASFKGCCCCLFLL SFLALL+LAVVLVVVLALKPKKPQFDLQQVGVQYMGIT PN PTT  ASLSLNIR
Subjt:  SSSRAPPNGL--------HPRYNPRSS--SSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPN-PTT--ASLSLNIR

Query:  MIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQV
        M+FTAVNPNKVGIKY ESRFTVMYRGIPLGRASVPGF Q+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQV
Subjt:  MIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQV

Query:  SVDCAIVISPRKQSLTYKQCGFDGLNV
        SVDCAIVISPRKQSLTYKQCGFDGL+V
Subjt:  SVDCAIVISPRKQSLTYKQCGFDGLNV

A0A6J1GKY5 uncharacterized protein LOC111455278 (e value: 3.3e-98; % identity: 89.5)
Query:  QKPMSSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNP
        +K M  SR       P Y PR SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVG+QYM IT PNPT ASLSL+IRMIFTAVNP
Subjt:  QKPMSSSRAPPNGLHPRYNPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNP

Query:  NKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVI
        NKVGIKYGESRFTVMYRGIPLGRAS+PGFFQDPHSQRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQV VDCAIVI
Subjt:  NKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVI

Query:  SPRKQSLTYKQCGFDGLNV
        SPRKQSLTYKQCGFDGLNV
Subjt:  SPRKQSLTYKQCGFDGLNV

A0A6J1I8L5 uncharacterized protein LOC111470648 (e value: 2.5e-101; % identity: 92.09)
Query:  MSSSRAPPNGLHPRYNPRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG
        M SSR   +   P Y PRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVG+QYM IT PNPT ASLSLNIRMIFTAVNPNKVG
Subjt:  MSSSRAPPNGLHPRYNPRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNKVG

Query:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK
        IKYGESRFTVMYRGIPLGRAS+PGFFQDPHSQRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQVSVDCAIVISPRK
Subjt:  IKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRK

Query:  QSLTYKQCGFDGLNV
        QSLTYKQCGFDGLNV
Subjt:  QSLTYKQCGFDGLNV

SwissProt top hits (e value, % identity, alignment)
Q9FI03 NDR1/HIN1-like protein 26 (e value: 7.8e-04; % identity: 23.21)
Query:  LFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPN-KVGIKYGESRFTVMYRGIPL-GRASVPGFFQDP
        LF  FS     +L ++ +V L L P++P+F L +  +  + +T    +T  L+ ++++   + NPN KVGI Y +      YRG  +   AS+P F+Q  
Subjt:  LFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPN-KVGIKYGESRFTVMYRGIPL-GRASVPGFFQDP

Query:  HSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVIS
             + A +    + + Q+    + R+ S   ++ + +  D   + ++ ++ S   + +V+C  +++
Subjt:  HSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVIS

Arabidopsis top hits (e value, % identity, alignment)
AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.6e-07; % identity: 23.58)
Query:  NGLHPRYNPRSS--SSASFKGCC--CCLFLLFSFLALLIL--AVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNK-VGI
        N   P Y P +    ++  +GCC  CC + +F  + LL++  A   VV L  +P++P F + ++ +  +  T+    T ++SL++     A NPNK VG 
Subjt:  NGLHPRYNPRSS--SSASFKGCC--CCLFLLFSFLALLIL--AVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPNK-VGI

Query:  KYGESRFTVMYRG-------IPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSP--GVQVSVDC
         Y  +  T +Y+        + +G+ ++  F     +   + +TI      L +  A  L  D      V ++I+ +   K+++ +  +P  G++V+ + 
Subjt:  KYGESRFTVMYRG-------IPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSP--GVQVSVDC

Query:  AIVISPRKQSLT
          V++P  +  T
Subjt:  AIVISPRKQSLT

AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 6.4e-86; % identity: 72.81)
Query:  PMSSSRAPPNG-------LHPRYNP-RSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNP----NPTTASLSLNI
        P SSSRA  NG         P Y    SSSSAS KGCCCCLFLLF+FLALL+LAVVL+V+LA+KPKKPQFDLQQV V YMGI+NP    +PTTASLSL I
Subjt:  PMSSSRAPPNG-------LHPRYNP-RSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNP----NPTTASLSLNI

Query:  RMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQ
        RM+FTAVNPNKVGI+YGES FTVMY+G+PLGRA+VPGF+QD HS + V+ATI+VDRVNL+QA AADL+RDASLNDRVEL + GDV AKIR+++F+SPGVQ
Subjt:  RMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQ

Query:  VSVDCAIVISPRKQSLTYKQCGFDGLNV
        VSV+C I ISPRKQ+L YKQCGFDGL+V
Subjt:  VSVDCAIVISPRKQSLTYKQCGFDGLNV

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.2e-07; % identity: 23.64)
Query:  QKPMSSSRAPPNGLHPRYNPRSSSSASFKGC----CCCLFLLFSFLALLILAVVLVVV--LALKPKKPQFDLQQVGV-QYMGITNPNPTTASLSLNIRMI
        +KP ++   PP         +S+++ + K       C + + F+ L +L++A+V+V++     KPK+P   +  V V +     NP      L+L + + 
Subjt:  QKPMSSSRAPPNGLHPRYNPRSSSSASFKGC----CCCLFLLFSFLALLILAVVLVVV--LALKPKKPQFDLQQVGV-QYMGITNPNPTTASLSLNIRMI

Query:  FTAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSV
         +  NPN++G  Y  S   + YRG  +G A +P           ++ T+ +    LL      L+ D  +   + L     V  K+ +L      VQ S 
Subjt:  FTAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSV

Query:  DCAIVISPRKQSLTYKQCGF
         C + IS   +++T + C +
Subjt:  DCAIVISPRKQSLTYKQCGF

AT4G26490.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 7.3e-05; % identity: 25.62)
Query:  PRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPN-KVGIKYGESRFTVMYRGI
        PRSS ++ +  C      +FS L +      L+V LA++P+ P FD+    +  +    P      LS    M+    NPN K+ +K+ + R  + +   
Subjt:  PRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPN-KVGIKYGESRFTVMYRGI

Query:  PLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAK
         +    V  F Q  H  R     +    V L    A +L R    N+++E  I G    K
Subjt:  PLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAK

AT5G53730.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 5.6e-05; % identity: 23.21)
Query:  LFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPN-KVGIKYGESRFTVMYRGIPL-GRASVPGFFQDP
        LF  FS     +L ++ +V L L P++P+F L +  +  + +T    +T  L+ ++++   + NPN KVGI Y +      YRG  +   AS+P F+Q  
Subjt:  LFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLNIRMIFTAVNPN-KVGIKYGESRFTVMYRGIPL-GRASVPGFFQDP

Query:  HSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVIS
             + A +    + + Q+    + R+ S   ++ + +  D   + ++ ++ S   + +V+C  +++
Subjt:  HSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVIS


Sequences
CDS sequence
ATGCGAATAGTAGCGGAGGCAGTGAGAGGAAAGGAAGGCGTTGTTGTTGGTAGGCTACACAATCAAAAACCAATGTCTTCCTCAAGAGCTCCCCCCAACGGCCTCCACCC
TCGCTACAACCCCAGATCCTCCTCCTCCGCCTCCTTCAAAGGCTGCTGCTGCTGCCTCTTCCTCCTCTTCTCCTTTCTCGCCCTCTTAATCCTCGCTGTCGTCCTCGTCG
TCGTACTCGCTCTCAAACCCAAAAAACCCCAGTTCGATCTCCAGCAGGTCGGCGTCCAGTATATGGGCATAACCAATCCCAATCCCACCACTGCCTCTCTTTCCCTCAAC
ATTCGGATGATTTTCACCGCCGTTAACCCTAACAAAGTCGGAATCAAGTACGGCGAGTCCCGCTTCACGGTTATGTACCGAGGAATTCCGCTTGGGAGAGCCTCCGTTCC
TGGATTCTTTCAAGACCCTCACAGCCAGCGCCAGGTCGATGCTACCATCGCCGTCGATCGCGTTAATCTCCTTCAGGCTGACGCTGCCGATTTGATCCGTGATGCCTCGT
TGAACGACCGCGTCGAGCTTAGAATACTCGGCGATGTTGCTGCTAAGATCCGCCTCTTGTCCTTCAATTCCCCCGGCGTTCAGGTTTCAGTAGATTGTGCAATTGTGATT
AGTCCAAGAAAGCAGTCTCTCACATACAAGCAATGTGGTTTTGATGGCTTAAATGTATGA
mRNA sequence
ATGCGAATAGTAGCGGAGGCAGTGAGAGGAAAGGAAGGCGTTGTTGTTGGTAGGCTACACAATCAAAAACCAATGTCTTCCTCAAGAGCTCCCCCCAACGGCCTCCACCC
TCGCTACAACCCCAGATCCTCCTCCTCCGCCTCCTTCAAAGGCTGCTGCTGCTGCCTCTTCCTCCTCTTCTCCTTTCTCGCCCTCTTAATCCTCGCTGTCGTCCTCGTCG
TCGTACTCGCTCTCAAACCCAAAAAACCCCAGTTCGATCTCCAGCAGGTCGGCGTCCAGTATATGGGCATAACCAATCCCAATCCCACCACTGCCTCTCTTTCCCTCAAC
ATTCGGATGATTTTCACCGCCGTTAACCCTAACAAAGTCGGAATCAAGTACGGCGAGTCCCGCTTCACGGTTATGTACCGAGGAATTCCGCTTGGGAGAGCCTCCGTTCC
TGGATTCTTTCAAGACCCTCACAGCCAGCGCCAGGTCGATGCTACCATCGCCGTCGATCGCGTTAATCTCCTTCAGGCTGACGCTGCCGATTTGATCCGTGATGCCTCGT
TGAACGACCGCGTCGAGCTTAGAATACTCGGCGATGTTGCTGCTAAGATCCGCCTCTTGTCCTTCAATTCCCCCGGCGTTCAGGTTTCAGTAGATTGTGCAATTGTGATT
AGTCCAAGAAAGCAGTCTCTCACATACAAGCAATGTGGTTTTGATGGCTTAAATGTATGACTCTCATCCCTTCATTCAGCTTCTATCTCTCTCCCCTTTTAGATTCAAAA
TCATTATTCATTCATTAGTTTCGTCTTTCGATTATTTCATTTGGTCCCTAACTAATTAATCTAGTCCTTAAGATGAGGATTGAAATGGTTGTGGAATTGGGGATTAAATT
GAATTCATCAAAATTTTTTGGAAAGACGAAATTAGTAAATGGATAACAATTTGTAGAGAGTGTAATTTCTTGAGAGAGATTAGAAGAAT
Protein sequence
MRIVAEAVRGKEGVVVGRLHNQKPMSSSRAPPNGLHPRYNPRSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGVQYMGITNPNPTTASLSLN
IRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASVPGFFQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVI
SPRKQSLTYKQCGFDGLNV