CuGenDBv2

Sgr021967 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021967
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: tig00153870:210614..211669
RNA-Seq Expression: Sgr021967
Synteny: Sgr021967
Gene Ontology terms: NA
InterPro domains: NA
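The Genome location field above uses a `contig:start..end` notation. The following is a minimal sketch of how such a string could be parsed and its span computed. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return contig, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153870:210614..211669")
print(contig, start, end, span_length(start, end))
# prints: tig00153870 210614 211669 1056
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, so the convention should be checked against the database documentation before relying on the result.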


Homology
GenBank top hits (columns: e value, % identity, alignment)
PPD69681.1 hypothetical protein GOBAR_DD33434 [Gossypium barbadense] (e value: 1.9e-59; % identity: 60.99)
Query:  RSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILP
        R    R N  ++    ++  + PPYPW+T  RA VH+L+YL S  I TITGDV+C RCERQYE+ YDL TKF EIA++I +NK+S+HDRAP  W NP+LP
Subjt:  RSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILP

Query:  ACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR
         CKFC QEN  +PVI  ++    +INWLFLLLGQ LGC  L  LKYFC +T NHRTGAK+R+LYLTYL LCKQL+PNGPFDR
Subjt:  ACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR

PPS12530.1 hypothetical protein GOBAR_AA08089 [Gossypium barbadense] (e value: 1.1e-59; % identity: 61.67)
Query:  SSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPAC
        SS+  N  ++    ++  + PPYPW+T  RA VH+L+YL S  I TITGDV+C RCERQYE+ YDL TKF EIA++I +NK+S+HDRAP  W NP+LP C
Subjt:  SSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPAC

Query:  KFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR
        KFC QEN  +PVI  ++    +INWLFLLLGQ LGC  L  LKYFC +T NHRTGAK+R+LYLTYL LCKQL+PNGPFDR
Subjt:  KFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR

XP_008439384.1 PREDICTED: protein PAF1 homolog [Cucumis melo] (e value: 3.2e-59; % identity: 42.82)
Query:  DDRHSLDLELSLRPP--------ADNTAARHQLFLLPPPADNTAVPQQLFLL-------SPPADNNAVQHQLFLQPPPV---DNINAVQHQLFFRPPPVD
        +  +  +L+LSLRPP          +    H  F  PPP+  +   Q L  L       +PP  ++  QH L LQPPP    + +  V  Q    PPP +
Subjt:  DDRHSLDLELSLRPP--------ADNTAARHQLFLLPPPADNTAVPQQLFLL-------SPPADNNAVQHQLFLQPPPV---DNINAVQHQLFFRPPPVD

Query:  DTAA-----QQPEPLPSPPQ-PVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTL-RPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSR
           +     + P P PSP + P  LP+P   P      P P       YL P P++  Q     + +P+R + +                   P+R R++
Subjt:  DTAA-----QQPEPLPSPPQ-PVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTL-RPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSR

Query:  PENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVI
         +NS IEPPYPWST+  AV+H L YL++N ILTI G+VKC RC+R+ EI Y+L++KF+EI  FI + KD++HDRAP  W NPIL  C FC +E C  P+I
Subjt:  PENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVI

Query:  PGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN
              + NINWLFLLLG  LGCLKL  LKYFCT TN HRTGAK+RL+YLTYLALCKQL+PN
Subjt:  PGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia] (e value: 3.7e-68; % identity: 55.17)
Query:  PPVDNINAVQHQLFFRPPPVDDTAA-----QQPE---PLPSPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPR--RSSTSTS
        PP ++ N +  +L  RPP   D +A     QQ E   P P  PQP  L + T L   +NQ+   H  TSS            RNS++LR R  R+     
Subjt:  PPVDNINAVQHQLFFRPPPVDDTAA-----QQPE---PLPSPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPR--RSSTSTS

Query:  TSARNTVRYRSSSARSNP----RRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLH
          AR +   R +   ++P    RRSR   +N+PI+PPYPWST+ +AVVH L+YL+ NQILTITGDV+CDRCE+QY I YDL+TKF EIASFI KNK +LH
Subjt:  TSARNTVRYRSSSARSNP----RRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLH

Query:  DRAPLSWTNPILPACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN
        DRAP SWTNP    CK C +ENC RP IP  EGD+ NINWLFLLLGQ +G LKL HLKYFC YTNNHRTGAKNRL+YLTYL LCKQL+P+
Subjt:  DRAPLSWTNPILPACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN

XP_022135938.1 probable serine/threonine-protein kinase samkC [Momordica charantia] (e value: 1.3e-76; % identity: 51.02)
Query:  DDRHSLDLELSLRPPADNTAARHQLFLLPPPADNTAVPQQ--LFLLSPPADNNAVQHQLFLQPPPVDNINAVQHQLFFRPPPVDDTAAQQPEPLPSPPQP
        DD + LDLELSLR P      + + F  P P +NT + QQ   ++   P +N     Q +L   P +   +          P+   +   P PL S PQP
Subjt:  DDRHSLDLELSLRPPADNTAARHQLFLLPPPADNTAVPQQ--LFLLSPPADNNAVQHQLFLQPPPVDNINAVQHQLFFRPPPVDDTAAQQPEPLPSPPQP

Query:  VVLPI--PTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVH
          L +    L P+   + P PHP TSS   S S      + SR  R  + S ST  S+    + + + +    RR R +P+++ IEPPYPWST  RAVVH
Subjt:  VVLPI--PTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVH

Query:  SLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIP--GEEGDHMNINWLFLLLGQ
         L YLQ NQILTITGDVKC +C++QY+I YDL+TKF+EIASFI KNKD+LHDRAP SWTNP LP CKFC QE+C RPVIP   E+ D+ NINWLFLLLGQ
Subjt:  SLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIP--GEEGDHMNINWLFLLLGQ

Query:  TLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN
         +GCL L HLKYFCTYTNNHRT AK+RL+YLTYL+LCKQL+P+
Subjt:  TLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3AZB1 protein PAF1 homolog (e value: 1.5e-59; % identity: 42.82)
Query:  DDRHSLDLELSLRPP--------ADNTAARHQLFLLPPPADNTAVPQQLFLL-------SPPADNNAVQHQLFLQPPPV---DNINAVQHQLFFRPPPVD
        +  +  +L+LSLRPP          +    H  F  PPP+  +   Q L  L       +PP  ++  QH L LQPPP    + +  V  Q    PPP +
Subjt:  DDRHSLDLELSLRPP--------ADNTAARHQLFLLPPPADNTAVPQQLFLL-------SPPADNNAVQHQLFLQPPPV---DNINAVQHQLFFRPPPVD

Query:  DTAA-----QQPEPLPSPPQ-PVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTL-RPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSR
           +     + P P PSP + P  LP+P   P      P P       YL P P++  Q     + +P+R + +                   P+R R++
Subjt:  DTAA-----QQPEPLPSPPQ-PVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTL-RPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSR

Query:  PENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVI
         +NS IEPPYPWST+  AV+H L YL++N ILTI G+VKC RC+R+ EI Y+L++KF+EI  FI + KD++HDRAP  W NPIL  C FC +E C  P+I
Subjt:  PENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVI

Query:  PGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN
              + NINWLFLLLG  LGCLKL  LKYFCT TN HRTGAK+RL+YLTYLALCKQL+PN
Subjt:  PGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN

A0A2P5YAB0 Uncharacterized protein (e value: 5.3e-60; % identity: 61.67)
Query:  SSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPAC
        SS+  N  ++    ++  + PPYPW+T  RA VH+L+YL S  I TITGDV+C RCERQYE+ YDL TKF EIA++I +NK+S+HDRAP  W NP+LP C
Subjt:  SSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPAC

Query:  KFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR
        KFC QEN  +PVI  ++    +INWLFLLLGQ LGC  L  LKYFC +T NHRTGAK+R+LYLTYL LCKQL+PNGPFDR
Subjt:  KFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR

A0A6J1C462 uncharacterized protein LOC111007768 (e value: 3.1e-68; % identity: 55.17)
Query:  PPVDNINAVQHQLFFRPPPVDDTAA-----QQPE---PLPSPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPR--RSSTSTS
        PP ++ N +  +L  RPP   D +A     QQ E   P P  PQP  L + T L   +NQ+   H  TSS            RNS++LR R  R+     
Subjt:  PPVDNINAVQHQLFFRPPPVDDTAA-----QQPE---PLPSPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPR--RSSTSTS

Query:  TSARNTVRYRSSSARSNP----RRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLH
          AR +   R +   ++P    RRSR   +N+PI+PPYPWST+ +AVVH L+YL+ NQILTITGDV+CDRCE+QY I YDL+TKF EIASFI KNK +LH
Subjt:  TSARNTVRYRSSSARSNP----RRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLH

Query:  DRAPLSWTNPILPACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN
        DRAP SWTNP    CK C +ENC RP IP  EGD+ NINWLFLLLGQ +G LKL HLKYFC YTNNHRTGAKNRL+YLTYL LCKQL+P+
Subjt:  DRAPLSWTNPILPACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN

A0A6J1C690 probable serine/threonine-protein kinase samkC (e value: 6.2e-77; % identity: 51.02)
Query:  DDRHSLDLELSLRPPADNTAARHQLFLLPPPADNTAVPQQ--LFLLSPPADNNAVQHQLFLQPPPVDNINAVQHQLFFRPPPVDDTAAQQPEPLPSPPQP
        DD + LDLELSLR P      + + F  P P +NT + QQ   ++   P +N     Q +L   P +   +          P+   +   P PL S PQP
Subjt:  DDRHSLDLELSLRPPADNTAARHQLFLLPPPADNTAVPQQ--LFLLSPPADNNAVQHQLFLQPPPVDNINAVQHQLFFRPPPVDDTAAQQPEPLPSPPQP

Query:  VVLPI--PTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVH
          L +    L P+   + P PHP TSS   S S      + SR  R  + S ST  S+    + + + +    RR R +P+++ IEPPYPWST  RAVVH
Subjt:  VVLPI--PTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVH

Query:  SLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIP--GEEGDHMNINWLFLLLGQ
         L YLQ NQILTITGDVKC +C++QY+I YDL+TKF+EIASFI KNKD+LHDRAP SWTNP LP CKFC QE+C RPVIP   E+ D+ NINWLFLLLGQ
Subjt:  SLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIP--GEEGDHMNINWLFLLLGQ

Query:  TLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN
         +GCL L HLKYFCTYTNNHRT AK+RL+YLTYL+LCKQL+P+
Subjt:  TLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPN

A0A7J9MQZ4 Uncharacterized protein (e value: 2.9e-58; % identity: 47.55)
Query:  PVDNINAVQHQLFFRPPPVDDTAAQQPEPLP---SPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLS---PSPSRIEQRNSRTLRPRRSSTSTSTSARN
        PV      Q Q+  +P  ++    Q P  LP   SP   ++ P   L+  +S+   +P   +    ++   PSPS      SR +R RR+ST       N
Subjt:  PVDNINAVQHQLFFRPPPVDDTAAQQPEPLP---SPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLS---PSPSRIEQRNSRTLRPRRSSTSTSTSARN

Query:  TVRYRSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTN
         +R                 ++  + PPYPW+T  RA VH+L+YL S  I TITGDV+C RCERQYE+ YDL TKF EIA++I +NK+S+HDRAP  W N
Subjt:  TVRYRSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTN

Query:  PILPACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR
        P+LP CKFC QEN  +PVI  ++    +INWLFLLLGQ LGC  L  LKYFC +T NHRTGAK+R+LYLTYL LCKQL+PNGPFDR
Subjt:  PILPACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFDR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G49330.1 hydroxyproline-rich glycoprotein family protein (e value: 7.8e-40; % identity: 39.77)
Query:  PPPVDDTAAQQPEPLPSPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPE
        P P D  A +   P P PP      IP  +     Q P P P   + +  PS           L P  SS  T    +  V   + S R    RS    +
Subjt:  PPPVDDTAAQQPEPLPSPPQPVVLPIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPE

Query:  NSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIPG
        +  I PP+PW+T+ R  + SL YL+SNQI TITG+V+C  CE+ Y+++Y+L  +F E+  F +  K  + DRA   W  P    C+ C +E   +PVI  
Subjt:  NSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIPG

Query:  EEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEP
         +     INWLFLLLGQTLG   L  LK FC ++ NHRTGAK+R+LYLTY+ LCK L+P
Subjt:  EEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1) (e value: 8.6e-39; % identity: 37.88)
Query:  PSPPQP------------VVLPIPTL---LPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPE
        PSPPQP            V++P   L   +P  +  + TP P   S  + P P   +        PRR       + RN+ R  +   R+   R      
Subjt:  PSPPQP------------VVLPIPTL---LPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPE

Query:  NSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIPG
           I PPYPW+T     + S   L SN I  I+G V C  C+R   + Y+L  KF+E+  +I  NK+ +  RAP SW+ P L  C+ CK E   +PV+  
Subjt:  NSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIPG

Query:  EEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFD
         + +   INWLFLLLGQ LGC  L  L+YFC   + HRTG+K+R++Y+TYL+LCKQL+P GPF+
Subjt:  EEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNRLLYLTYLALCKQLEPNGPFD

AT2G16190.2 FUNCTIONS IN: molecular_function unknown (e value: 7.3e-22; % identity: 34.36)
Query:  PSPPQP------------VVLPIPTL---LPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPE
        PSPPQP            V++P   L   +P  +  + TP P   S  + P P   +        PRR       + RN+ R  +   R+   R      
Subjt:  PSPPQP------------VVLPIPTL---LPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPE

Query:  NSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIPG
           I PPYPW+T     + S   L SN I  I+G V C  C+R   + Y+L  KF+E+  +I  NK+ +  RAP SW+ P L  C+ CK E   +PV+  
Subjt:  NSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITGDVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIPG

Query:  EEGDHMNINWLFLLLGQTLGCLKLHHL
         + +   INWLFLLLGQ LGC  L  L
Subjt:  EEGDHMNINWLFLLLGQTLGCLKLHHL


Sequences
CDS sequence
ATGCCAATGGAATTTTCTAACCGAAGCGACGATCGCCACAGTCTCGACCTGGAACTCTCTCTCCGTCCACCGGCAGACAACACTGCAGCGCGGCACCAACTCTTTCTCCT
TCCGCCGCCGGCAGACAACACTGCAGTGCCGCAGCAACTCTTTCTCCTTTCGCCGCCGGCAGACAATAATGCAGTGCAGCACCAACTCTTTCTCCAACCGCCGCCAGTAG
ACAACATTAATGCAGTGCAGCACCAACTCTTTTTCCGTCCGCCGCCGGTAGACGACACTGCAGCGCAGCAGCCGGAGCCTCTGCCTTCTCCTCCGCAACCGGTGGTCCTT
CCGATTCCGACATTGCTTCCGGAAGCTTCGAACCAAATGCCGACTCCGCATCCTTGCACTTCCTCTCGCTATCTGTCTCCAAGCCCTAGCCGAATCGAGCAGCGTAATTC
ACGTACCTTGAGACCTAGGCGATCTTCTACTTCTACATCCACTTCTGCTCGAAACACCGTAAGGTATCGATCTTCAAGTGCGCGAAGCAATCCGAGACGATCTAGAAGCA
GACCAGAGAACTCGCCGATCGAGCCTCCGTATCCATGGTCGACGGATCTTCGAGCGGTGGTGCACTCCCTGAGTTACCTCCAATCGAACCAGATCCTGACCATCACCGGC
GACGTCAAATGCGATCGATGTGAGAGGCAGTACGAGATCGCGTACGATCTGCTCACGAAGTTCAATGAGATTGCAAGTTTCATAGTGAAAAACAAGGATAGTTTGCACGA
CAGAGCTCCGCTTTCGTGGACGAACCCTATCTTACCGGCGTGCAAGTTCTGCAAGCAAGAAAACTGCGAGAGGCCGGTGATACCGGGGGAAGAGGGCGACCATATGAACA
TCAATTGGCTTTTCTTGCTTTTAGGACAAACGCTTGGATGTTTGAAGCTCCATCATCTGAAATACTTCTGCACTTACACCAACAATCATCGAACAGGTGCCAAGAATCGC
CTTCTTTATCTCACTTATCTTGCTCTCTGCAAGCAACTTGAACCAAATGGACCTTTCGATCGCTGA
mRNA sequence
ATGCCAATGGAATTTTCTAACCGAAGCGACGATCGCCACAGTCTCGACCTGGAACTCTCTCTCCGTCCACCGGCAGACAACACTGCAGCGCGGCACCAACTCTTTCTCCT
TCCGCCGCCGGCAGACAACACTGCAGTGCCGCAGCAACTCTTTCTCCTTTCGCCGCCGGCAGACAATAATGCAGTGCAGCACCAACTCTTTCTCCAACCGCCGCCAGTAG
ACAACATTAATGCAGTGCAGCACCAACTCTTTTTCCGTCCGCCGCCGGTAGACGACACTGCAGCGCAGCAGCCGGAGCCTCTGCCTTCTCCTCCGCAACCGGTGGTCCTT
CCGATTCCGACATTGCTTCCGGAAGCTTCGAACCAAATGCCGACTCCGCATCCTTGCACTTCCTCTCGCTATCTGTCTCCAAGCCCTAGCCGAATCGAGCAGCGTAATTC
ACGTACCTTGAGACCTAGGCGATCTTCTACTTCTACATCCACTTCTGCTCGAAACACCGTAAGGTATCGATCTTCAAGTGCGCGAAGCAATCCGAGACGATCTAGAAGCA
GACCAGAGAACTCGCCGATCGAGCCTCCGTATCCATGGTCGACGGATCTTCGAGCGGTGGTGCACTCCCTGAGTTACCTCCAATCGAACCAGATCCTGACCATCACCGGC
GACGTCAAATGCGATCGATGTGAGAGGCAGTACGAGATCGCGTACGATCTGCTCACGAAGTTCAATGAGATTGCAAGTTTCATAGTGAAAAACAAGGATAGTTTGCACGA
CAGAGCTCCGCTTTCGTGGACGAACCCTATCTTACCGGCGTGCAAGTTCTGCAAGCAAGAAAACTGCGAGAGGCCGGTGATACCGGGGGAAGAGGGCGACCATATGAACA
TCAATTGGCTTTTCTTGCTTTTAGGACAAACGCTTGGATGTTTGAAGCTCCATCATCTGAAATACTTCTGCACTTACACCAACAATCATCGAACAGGTGCCAAGAATCGC
CTTCTTTATCTCACTTATCTTGCTCTCTGCAAGCAACTTGAACCAAATGGACCTTTCGATCGCTGA
Protein sequence
MPMEFSNRSDDRHSLDLELSLRPPADNTAARHQLFLLPPPADNTAVPQQLFLLSPPADNNAVQHQLFLQPPPVDNINAVQHQLFFRPPPVDDTAAQQPEPLPSPPQPVVL
PIPTLLPEASNQMPTPHPCTSSRYLSPSPSRIEQRNSRTLRPRRSSTSTSTSARNTVRYRSSSARSNPRRSRSRPENSPIEPPYPWSTDLRAVVHSLSYLQSNQILTITG
DVKCDRCERQYEIAYDLLTKFNEIASFIVKNKDSLHDRAPLSWTNPILPACKFCKQENCERPVIPGEEGDHMNINWLFLLLGQTLGCLKLHHLKYFCTYTNNHRTGAKNR
LLYLTYLALCKQLEPNGPFDR
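
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code and that translation begins at the first base; `translate` is a hypothetical helper written for illustration, not a CuGenDB function. Only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS sequence in this record.
print(translate("ATGCCAATGGAATTTTCTAACCGAAGC"))  # prints: MPMEFSNRS
# Last 18 nucleotides of the CDS sequence, ending in the TGA stop codon.
print(translate("GGACCTTTCGATCGCTGA"))  # prints: GPFDR
```

Both fragments agree with the start (MPMEFSNRS) and end (GPFDR) of the protein sequence shown above.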