CuGenDBv2

CmoCh18G002570 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G002570
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: serine/arginine repetitive matrix protein 1-like
Genome location: Cmo_Chr18:1709535..1711596
RNA-Seq Expression: CmoCh18G002570
Synteny: CmoCh18G002570
Gene Ontology terms: NA
InterPro domains: NA
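
The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this record states), the string can be split and the genomic span computed as follows. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("Cmo_Chr18:1709535..1711596")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2062 bp on Cmo_Chr18.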


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.2e-218 | % identity: 99.5
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPIT PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.1e-217 | % identity: 99.01
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPIT PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST+KQSE+IEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata] | e value: 2.4e-220 | % identity: 100
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSL
        RKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSL
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSL

Query:  DDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLI
        DDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLI
Subjt:  DDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLI

Query:  GLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPV
        GLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPV
Subjt:  GLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPV

Query:  RC
        RC
Subjt:  RC

XP_022994450.1 sporozoite surface protein 2-like [Cucurbita maxima] | e value: 3.5e-86 | % identity: 93.72
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPS+PNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP
        +KLDTKT  KTTTRFSSPRPTKPIT PPSKSN KGA+GSGSRSDFSRAKPSDS KG+PKNLRSGRL+DQQDEQIVRSYSDGSYGA TLSDP
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo] | e value: 5.2e-215 | % identity: 97.77
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPIN NELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPIT PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID PKCSCGST+KQSE+IEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGL+DSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQ+LKADQNEAMS+PIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C5Z8 uncharacterized protein LOC111008588 | e value: 8.6e-83 | % identity: 52.36
Query:  VRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPS-RPNSKLAPSRPAAPSP--
        +RG+ISPPRSRSSPR+ RP +NNGA NPPSRPNYMSPRRRPTT       +THRK P  T  +   K      +PPRI PS +P + + PSR   P+   
Subjt:  VRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPS-RPNSKLAPSRPAAPSP--

Query:  NERKLDTKTAPKT-TTRFSSPR-----------PTKPITPPSKSNGKG-----ASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGS
        N++KLDTKT PK    + SSPR           P +  TP SK+         A  S SRSD   A  S S K  P +  S + +D ++      YS G+
Subjt:  NERKLDTKTAPKT-TTRFSSPR-----------PTKPITPPSKSNGKG-----ASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGS

Query:  YGARTLSDPDVHK-LHQLSLDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYI-DTPK-C
        Y    L+DP ++  L +LS+D KDLA+I+LHAN +YES+ S+T EE   S   N+ R+FQIYK+IASH Q NSS+TSY TKLK LWDELE Y  D P+ C
Subjt:  YGARTLSDPDVHK-LHQLSLDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYI-DTPK-C

Query:  SCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLK
        SCG+ EK S  +EREKVMQFL+GLN+SYSTIC QIL ++PFPT+EKA   I+REEKR ELV SLE+VAAKV++N WLLQN  S NG ++ + H  +    
Subjt:  SCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLK

Query:  ADQNEAMSIPIEPLLIDLGSPVRC
         D  E  S P E LLIDLGSPVRC
Subjt:  ADQNEAMSIPIEPLLIDLGSPVRC

A0A6J1C6T8 uncharacterized protein LOC111008934 isoform X2 | e value: 5.5e-37 | % identity: 40.17
Query:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNY-MSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNER
        RG+ISPPR+ S P       NN A NPP RPNY MSP  RPTT +NP+E   +    + T           SSN  R       + +AP  P     +++
Subjt:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNY-MSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNER

Query:  KLDTKTAPKTTTRFSSPRPTKP---------ITPPSKSNGKGASGSGSRSD-----FSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTL
        KLD      ++T+ ++P PT+          I+  +    KG + SGSRSD         K SDS   +  N  S   + Q+ + I    ++G     +L
Subjt:  KLDTKTAPKTTTRFSSPRPTKP---------ITPPSKSNGKGASGSGSRSD-----FSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTL

Query:  SDPDVHK-LHQLSL-----DDKDLA-------NIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID
        + P ++  LH+LS      +D  +        +IVLH+              +CSSQ +N  R+F+IYK+IASH QGNSSITSY T+LK LWDELE Y D
Subjt:  SDPDVHK-LHQLSL-----DDKDLA-------NIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID

Query:  TPKCSCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKR
          +C C S     E +EREKVMQFL+GLND YSTIC QIL ++PFPTVEKA   ++REEKR
Subjt:  TPKCSCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKR

A0A6J1C7L7 uncharacterized protein LOC111008986 | e value: 6.9e-40 | % identity: 42.73
Query:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTP--INPNELRTHRKEPQPTVVKRPTKPTDRSS-NPPRIDPSRPNSKLAPSRPAAPSPN
        RG+ISPP+SR S  ES    NN A NPPS PNYMS  RR T    +NP + +T     +PT      + T  SS  P  + PSR     AP+ P   + N
Subjt:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTP--INPNELRTHRKEPQPTVVKRPTKPTDRSS-NPPRIDPSRPNSKLAPSRPAAPSPN

Query:  ERKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDV-HKLHQL
                   +TT+ +  +    IT    SN  G      R     +  S S  G      SG   D  +  I     D +     +  P V ++L QL
Subjt:  ERKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDV-HKLHQL

Query:  SLDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST--EKQSEQIEREKVM
        S+D K  A +V  AN + ES+   TK EECS Q +N+ R+ +IYK+IASH QGNSSITSY TKL+ LW+ELE Y D P+C   S   +K S+ +EREKVM
Subjt:  SLDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST--EKQSEQIEREKVM

Query:  QFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREE
        QFL+GLNDSYSTIC+QIL ++PFPTVEKA   I+ +E
Subjt:  QFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like | e value: 1.2e-220 | % identity: 100
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSL
        RKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSL
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSL

Query:  DDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLI
        DDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLI
Subjt:  DDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLI

Query:  GLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPV
        GLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPV
Subjt:  GLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPV

Query:  RC
        RC
Subjt:  RC

A0A6J1K179 sporozoite surface protein 2-like | e value: 1.7e-86 | % identity: 93.72
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPS+PNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP
        +KLDTKT  KTTTRFSSPRPTKPIT PPSKSN KGA+GSGSRSDFSRAKPSDS KG+PKNLRSGRL+DQQDEQIVRSYSDGSYGA TLSDP
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPIT-PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 9.6e-10 | % identity: 31.25
Query:  RMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGS-----TEKQSEQIEREKVMQFLIG--LNDSYSTICAQILHMKPFPTVEKA
        +++Q+ + +A+  QG  S+  Y  KL  +W EL  Y   P+C CG      T++  E  E+E+  +FL+G  LN  +  +  +I+  KP P++ +A
Subjt:  RMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGS-----TEKQSEQIEREKVMQFLIG--LNDSYSTICAQILHMKPFPTVEKA


Sequences
CDS sequence
ATGGTTCGAGGCATTATCAGCCCCCCAAGATCCCGATCTTCCCCCAGAGAAAGCAGACCATTCAACAACAATGGAGCCACCAATCCACCGTCCAGACCCAATTAC
ATGTCCCCACGCCGCCGCCCAACGACACCCATTAACCCCAACGAGCTGCGAACCCATAGAAAAGAACCCCAACCCACCGTCGTGAAACGCCCAACCAAACCCACC
GATAGATCTTCAAACCCTCCACGCATCGACCCATCAAGACCCAATTCAAAGCTTGCCCCCTCAAGGCCAGCAGCCCCAAGCCCAAATGAGAGGAAATTAGACACC
AAAACAGCGCCTAAAACCACCACTAGATTCTCTTCCCCTCGCCCAACCAAACCAATAACTCCGCCGTCGAAGAGCAATGGAAAAGGAGCAAGTGGGTCTGGTTCA
AGATCCGATTTTTCTCGTGCCAAACCAAGCGATTCTCAAAAGGGTACTCCCAAAAACTTGCGGAGCGGTCGGTTGAATGATCAGCAAGATGAACAAATTGTTCGT
AGTTATTCGGATGGATCTTATGGTGCTCGTACTCTTAGCGACCCTGATGTTCATAAGTTACATCAGCTTTCTCTTGATGATAAGGATCTTGCGAACATCGTTCTT
CATGCAAATTTGGTATACGAATCACTCGCCTCAGAAACAAAGGAAGAAGAATGTTCTTCCCAAGGCAATAATAGTTCAAGAATGTTTCAAATTTACAAGGAAATT
GCATCCCATCATCAAGGAAACTCATCCATTACATCTTACATTACAAAGCTGAAGGCTTTATGGGATGAACTCGAGGCCTACATTGACACCCCCAAATGTTCTTGC
GGTTCAACAGAGAAACAGAGTGAGCAAATTGAAAGGGAAAAAGTCATGCAATTTCTTATAGGATTAAACGATTCTTATTCCACCATTTGTGCTCAAATTCTTCAT
ATGAAGCCATTTCCAACGGTGGAGAAAGCTTGTTGTGCAATACTTCGTGAAGAAAAACGCAGGGAATTGGTGTTATCATTGGAGATTGTTGCCGCCAAAGTGATC
CAAAACAATTGGCTTCTTCAAAATGGTCATTCGATCAATGGTGATAATGAAGAAGTTGATCATATGAATCTTCAGGACCTGAAAGCTGATCAAAATGAAGCTATG
AGTATTCCCATTGAGCCATTGCTCATAGACCTTGGCTCTCCTGTTCGATGTTGA
mRNA sequence
CCCACCGTGTAACAATTAGGAAATGGCCTCATAACCTTCCATGAAATGCCCCAAACTTCATCACTGAGTTCACCAAAAAGCAGAGAAGAAACAGAACAAACCAGC
TCTGTTTATTAGCTCAGCCAGAGAAACAATGGTTCGAGGCATTATCAGCCCCCCAAGATCCCGATCTTCCCCCAGAGAAAGCAGACCATTCAACAACAATGGAGC
CACCAATCCACCGTCCAGACCCAATTACATGTCCCCACGCCGCCGCCCAACGACACCCATTAACCCCAACGAGCTGCGAACCCATAGAAAAGAACCCCAACCCAC
CGTCGTGAAACGCCCAACCAAACCCACCGATAGATCTTCAAACCCTCCACGCATCGACCCATCAAGACCCAATTCAAAGCTTGCCCCCTCAAGGCCAGCAGCCCC
AAGCCCAAATGAGAGGAAATTAGACACCAAAACAGCGCCTAAAACCACCACTAGATTCTCTTCCCCTCGCCCAACCAAACCAATAACTCCGCCGTCGAAGAGCAA
TGGAAAAGGAGCAAGTGGGTCTGGTTCAAGATCCGATTTTTCTCGTGCCAAACCAAGCGATTCTCAAAAGGGTACTCCCAAAAACTTGCGGAGCGGTCGGTTGAA
TGATCAGCAAGATGAACAAATTGTTCGTAGTTATTCGGATGGATCTTATGGTGCTCGTACTCTTAGCGACCCTGATGTTCATAAGTTACATCAGCTTTCTCTTGA
TGATAAGGATCTTGCGAACATCGTTCTTCATGCAAATTTGGTATACGAATCACTCGCCTCAGAAACAAAGGAAGAAGAATGTTCTTCCCAAGGCAATAATAGTTC
AAGAATGTTTCAAATTTACAAGGAAATTGCATCCCATCATCAAGGAAACTCATCCATTACATCTTACATTACAAAGCTGAAGGCTTTATGGGATGAACTCGAGGC
CTACATTGACACCCCCAAATGTTCTTGCGGTTCAACAGAGAAACAGAGTGAGCAAATTGAAAGGGAAAAAGTCATGCAATTTCTTATAGGATTAAACGATTCTTA
TTCCACCATTTGTGCTCAAATTCTTCATATGAAGCCATTTCCAACGGTGGAGAAAGCTTGTTGTGCAATACTTCGTGAAGAAAAACGCAGGGAATTGGTGTTATC
ATTGGAGATTGTTGCCGCCAAAGTGATCCAAAACAATTGGCTTCTTCAAAATGGTCATTCGATCAATGGTGATAATGAAGAAGTTGATCATATGAATCTTCAGGA
CCTGAAAGCTGATCAAAATGAAGCTATGAGTATTCCCATTGAGCCATTGCTCATAGACCTTGGCTCTCCTGTTCGATGTTGAATGTTTTCAGCTCAAGAGAGGTT
AGCTCCATGACTGGTGAGTAGCAGCTTGTACTCATTTATTAAATGCAATAGATATCGTTGAATTAATGCAAACAACCCATATCTTGAACTGAATAGCTATTTTTT
ACTTTCTAAGGTAGAGGGACGTTTAGTTTATTCTTCAAAAGTAAGTAATTATGTTGAAAAGTTTCAATTATCTCATCTTCTTAA
Protein sequence
MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNERKLDT
KTAPKTTTRFSSPRPTKPITPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSLDDKDLANIVL
HANLVYESLASETKEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILH
MKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPVRC
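
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the `translate` helper and the table construction are illustrative and not part of the database. To keep the example short, it translates only the first 21 and the last 9 bases of the CDS as listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence; stop codons are shown as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 21 bases and last 9 bases of the CDS sequence listed above.
cds_start = "ATGGTTCGAGGCATTATCAGC"
cds_end = "CGATGTTGA"

print(translate(cds_start))  # should agree with the protein N-terminus
print(translate(cds_end))    # should agree with the protein C-terminus plus stop
```

The two fragments translate to `MVRGIIS` and `RC*`, which agree with the start (`MVRGIIS...`) and end (`...VRC`) of the protein sequence. Passing the full CDS to the same function would give the complete comparison.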