; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003953 (gene) of Chayote v1 genome

Gene IDSed0003953
OrganismSechium edule (Chayote v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationLG04:44640278..44641396
RNA-Seq ExpressionSed0003953
SyntenySed0003953
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-4955.29Show/hide
Query:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ
        + RR RT+AD R I+PPYPWS  +RA IH +EYL+SNNIV I G+V C +C+  YE+EY+L++KF+EIARFI R  ++ ++RAP  W  P+  NC  C +
Subjt:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ

Query:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
        +NCVE +IP  +    F  INWLFL LGQ++G L+  QLK FC  T NHRTGAKDRL++LTY+ L KQL+
Subjt:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

XP_008439384.1 PREDICTED: protein PAF1 homolog [Cucumis melo]2.0e-5142.74Show/hide
Query:  NLELSLAPPSSSSSPPHHKPKQSKSVKFPSICHQEPPNIPELAPLQLSLSQEPLQPGPLLGPSQPNSPQRGPLLQPWPSRQGSLQL---PGPSQPEPLQP
        NL+LSL PP  S  PP  +   +    F S     PP++  L  L LSL      P P   P+Q +  Q    LQP PS      L   P P  P P   
Subjt:  NLELSLAPPSSSSSPPHHKPKQSKSVKFPSICHQEPPNIPELAPLQLSLSQEPLQPGPLLGPSQPNSPQRGPLLQPWPSRQGSLQL---PGPSQPEPLQP

Query:  WPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTK
         P     L L  P  P S    P          S LP P Q      + LQP P++Q  +Q Q P+ P  +RQ             Q P I K +R+RT+
Subjt:  WPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTK

Query:  ADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVI
        AD+  I+PPYPWST + A IH +EYL +NNI+ I GEV+C +C  K E+EY+L+SKF+EI RFI R  ++ ++RAP  W  P+ LNC  CN++ CVE +I
Subjt:  ADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVI

Query:  PKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
         +++   INWLFL LG  LG L+  QLK FCT+TN HRTGAKDRL+YLTY+ L KQL+
Subjt:  PKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

XP_022135938.1 probable serine/threonine-protein kinase samkC [Momordica charantia]8.3e-5042.28Show/hide
Query:  PSQPNSPQRGPLLQPWPSRQGSLQLPGPSQPEPLQPWPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQV
        P +   P +   L   PS + +     PS P PL   P + +SLQL         Q +PLQP P  Q    +P PS  + +  + L+P   R+ S +   
Subjt:  PSQPNSPQRGPLLQPWPSRQGSLQLPGPSQPEPLQPWPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQV

Query:  PKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFI
          P  ++++            +Q+P I   RR R K  D  I+PPYPWST  RA +H ++YL+ N I+ ITG+V+C+QC+++Y++EYDLV+KF+EIA FI
Subjt:  PKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFI

Query:  ARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIP----KSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
         +N +  ++RAPSSW  P   NC+ C Q++C+  VIP      D++ INWLFL LGQM+G L    LK FCT TNNHRT AKDRL+YLTY++L KQL+
Subjt:  ARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIP----KSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

XP_022972400.1 uncharacterized protein KIAA0754-like [Cucurbita maxima]1.8e-4955.29Show/hide
Query:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ
        + RR RT+AD R I+PPYPWS  +RA IH +EYL+SNNIV I G+V C +C+  YE+EY+L++KF+EIARFI R  ++ ++RAP  W  P+  NC  C +
Subjt:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ

Query:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
        +NCVE +IP  +    F  INWLFL LGQ++G L+  QLK FC  T NHRTGAKDRL++LTY+ L KQL+
Subjt:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]2.2e-5046.3Show/hide
Query:  SLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKE-----RRQRTKA
        SL+LP P    S    P  P P       L  P  P+     PL+ +P    ++ L   K P    +   + N+  + ++Q+P  + E     RR+RT+A
Subjt:  SLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKE-----RRQRTKA

Query:  DDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIP
        D   I+PPYPWST+RRA IH ++YL+SNNIV I GEV+C +C++KYEMEYDL++KF EIARFI    +  ++RAP  W  P+  NC +CN++ CVE VI 
Subjt:  DDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIP

Query:  KSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
        + D+  INWLFL LG+ LG L+  QLK FC +TN HRTGAK+RLLYL Y+TL  QL+
Subjt:  KSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

TrEMBL top hitse value%identityAlignment
A0A1S3AZB1 protein PAF1 homolog9.5e-5242.74Show/hide
Query:  NLELSLAPPSSSSSPPHHKPKQSKSVKFPSICHQEPPNIPELAPLQLSLSQEPLQPGPLLGPSQPNSPQRGPLLQPWPSRQGSLQL---PGPSQPEPLQP
        NL+LSL PP  S  PP  +   +    F S     PP++  L  L LSL      P P   P+Q +  Q    LQP PS      L   P P  P P   
Subjt:  NLELSLAPPSSSSSPPHHKPKQSKSVKFPSICHQEPPNIPELAPLQLSLSQEPLQPGPLLGPSQPNSPQRGPLLQPWPSRQGSLQL---PGPSQPEPLQP

Query:  WPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTK
         P     L L  P  P S    P          S LP P Q      + LQP P++Q  +Q Q P+ P  +RQ             Q P I K +R+RT+
Subjt:  WPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTK

Query:  ADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVI
        AD+  I+PPYPWST + A IH +EYL +NNI+ I GEV+C +C  K E+EY+L+SKF+EI RFI R  ++ ++RAP  W  P+ LNC  CN++ CVE +I
Subjt:  ADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVI

Query:  PKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
         +++   INWLFL LG  LG L+  QLK FCT+TN HRTGAKDRL+YLTY+ L KQL+
Subjt:  PKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

A0A6J1C462 uncharacterized protein LOC1110077681.9e-4740.37Show/hide
Query:  PFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQP-----------NSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPA
        PF   +L +    +P S+          +Q    LP P +P           +++  + +    S + S  L+V +P    R   R          +EPA
Subjt:  PFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQP-----------NSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPA

Query:  --IMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCR
          ++++RR R +  +  I PPYPWST  +A +H + YLR N I+ ITG+V C++C+++Y +EYDL++KFEEIA FI +N    ++RAP SW  P  L+C+
Subjt:  --IMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCR

Query:  MCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
        +C ++NCV   IP+ D + INWLFL LGQM+G L+ + LK FC  TNNHRTGAK+RL+YLTY+TL KQL+
Subjt:  MCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

A0A6J1C690 probable serine/threonine-protein kinase samkC4.0e-5042.28Show/hide
Query:  PSQPNSPQRGPLLQPWPSRQGSLQLPGPSQPEPLQPWPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQV
        P +   P +   L   PS + +     PS P PL   P + +SLQL         Q +PLQP P  Q    +P PS  + +  + L+P   R+ S +   
Subjt:  PSQPNSPQRGPLLQPWPSRQGSLQLPGPSQPEPLQPWPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQV

Query:  PKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFI
          P  ++++            +Q+P I   RR R K  D  I+PPYPWST  RA +H ++YL+ N I+ ITG+V+C+QC+++Y++EYDLV+KF+EIA FI
Subjt:  PKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFI

Query:  ARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIP----KSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
         +N +  ++RAPSSW  P   NC+ C Q++C+  VIP      D++ INWLFL LGQM+G L    LK FCT TNNHRT AKDRL+YLTY++L KQL+
Subjt:  ARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIP----KSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

A0A6J1GM83 mucin-16-like1.2e-4955.29Show/hide
Query:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ
        + RR RT+AD R I+PPYPWS  +RA IH +EYL+SNNIV I G+V C +C+  YE+EY+L++KF+EIARFI R  ++ ++RAP  W  P+  NC  C +
Subjt:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ

Query:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
        +NCVE +IP  +    F  INWLFL LGQ++G L+  QLK FC  T NHRTGAKDRL++LTY+ L KQL+
Subjt:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

A0A6J1I8I0 uncharacterized protein KIAA0754-like8.9e-5055.29Show/hide
Query:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ
        + RR RT+AD R I+PPYPWS  +RA IH +EYL+SNNIV I G+V C +C+  YE+EY+L++KF+EIARFI R  ++ ++RAP  W  P+  NC  C +
Subjt:  KERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQ

Query:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
        +NCVE +IP  +    F  INWLFL LGQ++G L+  QLK FC  T NHRTGAKDRL++LTY+ L KQL+
Subjt:  QNCVEAVIPKSD----FETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

SwissProt top hitse value%identityAlignment
Q55CW2 Probable serine/threonine-protein kinase samkC1.1e-0430.43Show/hide
Query:  LSLAPPSSSSSPPHHKPKQSKSVKFPSI------CHQEPPNIPELAPLQLSLSQEPLQPGPLLGPSQPNSPQRGPLLQPWPSRQGSLQLPGP-----SQP
        L+ + P      P  KP+ S+S   PS          +PP   +  P       +P QP   L PS P S     L +P P  Q S   P P     S+P
Subjt:  LSLAPPSSSSSPPHHKPKQSKSVKFPSI------CHQEPPNIPELAPLQLSLSQEPLQPGPLLGPSQPNSPQRGPLLQPWPSRQGSLQLPGP-----SQP

Query:  EPLQPWPFRQESLQLPGPSQP-NSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKE
         PL+P P  Q S   P PS   +SS+  PL+P P+ Q S   P  S+P   Q +P QP P++  SS+ Q P+    Q+Q +++        +Q+    K 
Subjt:  EPLQPWPFRQESLQLPGPSQP-NSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKE

Query:  RRQRTKA
        + +++K+
Subjt:  RRQRTKA

Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein8.9e-3434.47Show/hide
Query:  SLQLPGPSQPEPLQPWPFRQESLQLPGPSQPNSSQREPLQPWPSRQG----SSQLPGPSQP---------NSSQREPLQPW-------PSRQGSSQLQVP
        SL L   S    ++P       ++ P P  P      P+  WP+        S +P P  P         N  Q+ P  P        PS        + 
Subjt:  SLQLPGPSQPEPLQPWPFRQESLQLPGPSQPNSSQREPLQPWPSRQG----SSQLPGPSQP---------NSSQREPLQPW-------PSRQGSSQLQVP

Query:  KPPLNQRQHKRRANDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIA
         PP+     KR     +   R    + K      K+D   I PP+PW+TNRR +I ++EYL SN I  ITGEV+C  C++ Y++ Y+L  +F E+ +F  
Subjt:  KPPLNQRQHKRRANDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIA

Query:  RNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK
               +RA   WA P    C +C ++  V+ VI +   + INWLFL LGQ LG    +QLK FC  + NHRTGAKDR+LYLTY+ L K L+
Subjt:  RNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLK

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)8.0e-3537.99Show/hide
Query:  PFRQES----LQLPGPSQPNSSQREPLQP---WPSRQGSSQLPGPSQ--------PNSSQREPLQPWPSRQGSSQLQVPKPPLNQ--------RQHKRRA
        P RQE     +QL     P ++Q  P QP        G++ +  P+Q        PN S R PL   PS +      +P P LNQ         +  R  
Subjt:  PFRQES----LQLPGPSQPNSSQREPLQP---WPSRQGSSQLPGPSQ--------PNSSQREPLQPWPSRQGSSQLQVPKPPLNQ--------RQHKRRA

Query:  NDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSS
              N + P    ER       DREI PPYPW+T +  KI +   L SNNI  I+G+V C  C     +EY+L  KF E+  +I  N E+  +RAP S
Subjt:  NDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSS

Query:  WATPVALNCRMCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQL
        W+TP  + CR C  +  ++ V+ +   E INWLFL LGQMLG    DQL+ FC   + HRTG+KDR++Y+TY++L KQL
Subjt:  WATPVALNCRMCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQL

AT2G16190.2 FUNCTIONS IN: molecular_function unknown6.4e-2436.14Show/hide
Query:  PFRQES----LQLPGPSQPNSSQREPLQP---WPSRQGSSQLPGPSQ--------PNSSQREPLQPWPSRQGSSQLQVPKPPLNQ--------RQHKRRA
        P RQE     +QL     P ++Q  P QP        G++ +  P+Q        PN S R PL   PS +      +P P LNQ         +  R  
Subjt:  PFRQES----LQLPGPSQPNSSQREPLQP---WPSRQGSSQLPGPSQ--------PNSSQREPLQPWPSRQGSSQLQVPKPPLNQ--------RQHKRRA

Query:  NDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSS
              N + P    ER       DREI PPYPW+T +  KI +   L SNNI  I+G+V C  C     +EY+L  KF E+  +I  N E+  +RAP S
Subjt:  NDILTANRQEPAIMKERRQRTKADDREIDPPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSS

Query:  WATPVALNCRMCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQL
        W+TP  + CR C  +  ++ V+ +   E INWLFL LGQMLG    DQL
Subjt:  WATPVALNCRMCNQQNCVEAVIPKSDFETINWLFLFLGQMLGNLRHDQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGATTGATCTAAACGTTACTCCCCAAACTGAGGATGAGGGTTTGAATCTTGAACTCTCTCTCGCTCCTCCCTCGTCGTCATCGTCCCCACCGCACCATAAACCGAA
ACAATCCAAATCGGTTAAGTTTCCATCGATATGCCACCAAGAACCCCCCAATATCCCCGAGCTAGCGCCATTGCAACTAAGTTTATCCCAAGAGCCATTGCAACCGGGGC
CATTGCTTGGGCCATCTCAACCGAATTCACCCCAACGGGGGCCATTATTGCAACCTTGGCCATCCCGACAGGGGTCATTGCAATTACCGGGGCCATCCCAACCAGAGCCA
TTGCAACCTTGGCCATTCCGACAAGAGTCATTGCAATTACCGGGGCCATCCCAACCGAATTCATCCCAACGGGAGCCTTTGCAACCTTGGCCATCCCGACAGGGATCATC
GCAATTACCGGGGCCATCCCAACCAAATTCATCCCAACGGGAGCCTTTGCAACCTTGGCCATCCCGACAGGGATCATCGCAATTACAGGTGCCAAAACCACCATTAAATC
AGAGGCAACATAAAAGAAGAGCGAACGACATATTGACGGCAAATCGCCAGGAACCAGCAATAATGAAAGAGAGACGACAAAGAACAAAAGCAGACGACAGGGAGATCGAT
CCACCGTATCCCTGGTCGACGAACCGTCGAGCCAAGATCCACACAATGGAGTACCTACGATCGAACAACATAGTAAGGATCACAGGGGAGGTAGAGTGCAACCAATGCAA
AGAAAAGTACGAAATGGAGTACGACTTAGTATCGAAGTTTGAGGAGATAGCGAGGTTCATAGCAAGAAACGGGGAAGATTGTAACAACAGAGCTCCGAGTTCATGGGCGA
CGCCAGTTGCACTGAATTGTAGGATGTGCAACCAACAAAACTGCGTAGAAGCCGTGATTCCGAAGAGCGATTTCGAGACCATAAATTGGTTGTTCTTGTTCTTGGGACAA
ATGCTTGGAAATTTGCGACACGACCAACTCAAAAAATTCTGTACTGAGACCAACAATCATCGAACTGGTGCAAAGGATCGCCTTCTTTATCTCACTTACATTACTTTGTA
TAAACAACTTAAGGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGATTGATCTAAACGTTACTCCCCAAACTGAGGATGAGGGTTTGAATCTTGAACTCTCTCTCGCTCCTCCCTCGTCGTCATCGTCCCCACCGCACCATAAACCGAA
ACAATCCAAATCGGTTAAGTTTCCATCGATATGCCACCAAGAACCCCCCAATATCCCCGAGCTAGCGCCATTGCAACTAAGTTTATCCCAAGAGCCATTGCAACCGGGGC
CATTGCTTGGGCCATCTCAACCGAATTCACCCCAACGGGGGCCATTATTGCAACCTTGGCCATCCCGACAGGGGTCATTGCAATTACCGGGGCCATCCCAACCAGAGCCA
TTGCAACCTTGGCCATTCCGACAAGAGTCATTGCAATTACCGGGGCCATCCCAACCGAATTCATCCCAACGGGAGCCTTTGCAACCTTGGCCATCCCGACAGGGATCATC
GCAATTACCGGGGCCATCCCAACCAAATTCATCCCAACGGGAGCCTTTGCAACCTTGGCCATCCCGACAGGGATCATCGCAATTACAGGTGCCAAAACCACCATTAAATC
AGAGGCAACATAAAAGAAGAGCGAACGACATATTGACGGCAAATCGCCAGGAACCAGCAATAATGAAAGAGAGACGACAAAGAACAAAAGCAGACGACAGGGAGATCGAT
CCACCGTATCCCTGGTCGACGAACCGTCGAGCCAAGATCCACACAATGGAGTACCTACGATCGAACAACATAGTAAGGATCACAGGGGAGGTAGAGTGCAACCAATGCAA
AGAAAAGTACGAAATGGAGTACGACTTAGTATCGAAGTTTGAGGAGATAGCGAGGTTCATAGCAAGAAACGGGGAAGATTGTAACAACAGAGCTCCGAGTTCATGGGCGA
CGCCAGTTGCACTGAATTGTAGGATGTGCAACCAACAAAACTGCGTAGAAGCCGTGATTCCGAAGAGCGATTTCGAGACCATAAATTGGTTGTTCTTGTTCTTGGGACAA
ATGCTTGGAAATTTGCGACACGACCAACTCAAAAAATTCTGTACTGAGACCAACAATCATCGAACTGGTGCAAAGGATCGCCTTCTTTATCTCACTTACATTACTTTGTA
TAAACAACTTAAGGGCTAG
Protein sequenceShow/hide protein sequence
MGIDLNVTPQTEDEGLNLELSLAPPSSSSSPPHHKPKQSKSVKFPSICHQEPPNIPELAPLQLSLSQEPLQPGPLLGPSQPNSPQRGPLLQPWPSRQGSLQLPGPSQPEP
LQPWPFRQESLQLPGPSQPNSSQREPLQPWPSRQGSSQLPGPSQPNSSQREPLQPWPSRQGSSQLQVPKPPLNQRQHKRRANDILTANRQEPAIMKERRQRTKADDREID
PPYPWSTNRRAKIHTMEYLRSNNIVRITGEVECNQCKEKYEMEYDLVSKFEEIARFIARNGEDCNNRAPSSWATPVALNCRMCNQQNCVEAVIPKSDFETINWLFLFLGQ
MLGNLRHDQLKKFCTETNNHRTGAKDRLLYLTYITLYKQLKG