; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g044000 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g044000
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Descriptionserine/arginine repetitive matrix protein 1-like
Genome locationCsor_Chr18:1630575..1632294
RNA-Seq ExpressionCsor.00g044000
SyntenyCsor.00g044000
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia]7.18e-283100Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma]5.90e-28299.5Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST+KQSE+IEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata]3.15e-27999.5Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPP SKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

XP_022994450.1 sporozoite surface protein 2-like [Cucurbita maxima]1.18e-11094.24Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPS+PNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP
        +KLDTKT  KTTTRFSSPRPTKPITPPPSKSN KGA+GSGSRSDFSRAKPSDS KG+PKNLRSGRL+DQQDEQIVRSYSDGSYGA TLSDP
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo]8.96e-27797.77Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPIN NELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID PKCSCGST+KQSE+IEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGL+DSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQ+LKADQNEAMS+PIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

TrEMBL top hitse value%identityAlignment
A0A6J1C5Z8 uncharacterized protein LOC1110085885.44e-10151.87Show/hide
Query:  VRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPS-RPNSKLAPSRPAAPSP--
        +RG+ISPPRSRSSPR+ RP NN GA NPPSRPNYMSPRRRPTT       +THRK P  T     T+ T +S  PPRI PS +P + + PSR   P+   
Subjt:  VRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPS-RPNSKLAPSRPAAPSP--

Query:  NERKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNG----KG----------------ASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSY
        N++KLDTKT PK        +P+ P T P    NG    +G                A  S SRSD   A  S S K  P    S       D +    Y
Subjt:  NERKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNG----KG----------------ASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSY

Query:  SDGSYGARTLSDPDVHK-LHQLSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID-T
        S G+Y    L+DP ++  L +LS+D KDLA+I+LHAN +YES+ S+T EE   S   N+ R+FQIYK+IASH Q NSS+TSY TKLK LWDELE Y D  
Subjt:  SDGSYGARTLSDPDVHK-LHQLSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID-T

Query:  PKC-SCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNL
        P+C SCG+ EK S  +EREKVMQFL+GLN+SYSTIC QIL ++PFPT+EKA   I+REEKR ELV SLE+VAAKV++N WLLQN  S NG ++ + H  +
Subjt:  PKC-SCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNL

Query:  QDLKADQNEAMSIPIEPLLIDLGSPVRC
             D  E  S P E LLIDLGSPVRC
Subjt:  QDLKADQNEAMSIPIEPLLIDLGSPVRC

A0A6J1C6T8 uncharacterized protein LOC111008934 isoform X23.60e-4340.16Show/hide
Query:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNY-MSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNER
        RG+ISPPR+ S P       NN A NPP RPNY MSP  RPTT +NP+E   +    + T           SSN  R       + +AP  P     +++
Subjt:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNY-MSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNER

Query:  KLDTKTAPKTTTRFSSPRPTKP-------------ITPPPSKSNGKGASGSGSRSDFSRA-----KPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSY
        KLD      ++T+ ++P PT+                P P     KG + SGSRSD   A     K SDS   +  N  S   + Q+ + I    ++G  
Subjt:  KLDTKTAPKTTTRFSSPRPTKP-------------ITPPPSKSNGKGASGSGSRSDFSRA-----KPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSY

Query:  GARTLSDPDVHK-LHQLSL-----DDKDLA-------NIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDEL
           +L+ P ++  LH+LS      +D  +        +IVLH+              +CSSQ N   R+F+IYK+IASH QGNSSITSY T+LK LWDEL
Subjt:  GARTLSDPDVHK-LHQLSL-----DDKDLA-------NIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDEL

Query:  EAYIDTPKCSCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKR
        E Y D  +C C S     E +EREKVMQFL+GLND YSTIC QIL ++PFPTVEKA   ++REEKR
Subjt:  EAYIDTPKCSCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKR

A0A6J1C7L7 uncharacterized protein LOC1110089861.18e-4641.5Show/hide
Query:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTP--INPNELRT-HRKEPQPTVVKRPTKPTDRSS-NPPRIDPSRPNSKLAPSRPAAPSP
        RG+ISPP+SR S  ES    NN A NPPS PNYMS  RR T    +NP + +T H K   PT      + T  SS  P  + PSR     AP+ P     
Subjt:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTP--INPNELRT-HRKEPQPTVVKRPTKPTDRSS-NPPRIDPSRPNSKLAPSRPAAPSP

Query:  NERKLDTKTAPKTTTRFSS--------PRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSD
        N+      T  K TT  +S        PR    I+     S+G                             SG   D  +  I     D +     +  
Subjt:  NERKLDTKTAPKTTTRFSS--------PRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSD

Query:  PDV-HKLHQLSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST--EKQ
        P V ++L QLS+D K  A +V  AN + ES+   T +EECS Q N + R+ +IYK+IASH QGNSSITSY TKL+ LW+ELE Y D P+C   S   +K 
Subjt:  PDV-HKLHQLSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST--EKQ

Query:  SEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREE
        S+ +EREKVMQFL+GLNDSYSTIC+QIL ++PFPTVEKA   I+ +E
Subjt:  SEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like1.52e-27999.5Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPP SKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

A0A6J1K179 sporozoite surface protein 2-like5.69e-11194.24Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPS+PNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP
        +KLDTKT  KTTTRFSSPRPTKPITPPPSKSN KGA+GSGSRSDFSRAKPSDS KG+PKNLRSGRL+DQQDEQIVRSYSDGSYGA TLSDP
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).9.7e-1031.25Show/hide
Query:  RMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGS-----TEKQSEQIEREKVMQFLIG--LNDSYSTICAQILHMKPFPTVEKA
        +++Q+ + +A+  QG  S+  Y  KL  +W EL  Y   P+C CG      T++  E  E+E+  +FL+G  LN  +  +  +I+  KP P++ +A
Subjt:  RMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGS-----TEKQSEQIEREKVMQFLIG--LNDSYSTICAQILHMKPFPTVEKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGCATTATCAGCCCCCCAAGATCCCGATCTTCCCCCAGAGAAAGCAGACCATTCAACAACAATGGAGCCACCAATCCACCGTCCAGACCCAATTACATGTC
CCCACGCCGCCGCCCAACGACACCCATTAACCCCAACGAGCTGCGAACCCATAGAAAAGAACCCCAACCCACCGTCGTGAAACGCCCAACCAAACCCACCGATAGATCTT
CAAACCCTCCACGCATCGACCCATCAAGACCCAATTCAAAGCTTGCCCCCTCAAGGCCAGCAGCCCCAAGCCCAAATGAGAGGAAATTAGACACCAAAACAGCGCCTAAA
ACCACCACTAGATTCTCTTCCCCTCGCCCAACCAAACCAATAACTCCACCGCCGTCGAAGAGCAATGGAAAAGGAGCAAGTGGGTCTGGTTCAAGATCCGATTTTTCTCG
TGCCAAACCAAGCGATTCTCAAAAGGGTACTCCCAAAAACTTGCGGAGCGGTCGGTTGAATGATCAGCAAGATGAACAAATTGTTCGTAGTTATTCGGATGGATCTTATG
GTGCTCGTACTCTTAGCGACCCTGATGTTCATAAGTTACATCAGCTTTCTCTTGATGATAAGGATCTTGCGAACATCGTTCTTCATGCAAATTTGGTATACGAATCACTC
GCCTCAGAAACAAACGAAGAAGAATGTTCTTCCCAAGGGAATAATAGTTCAAGAATGTTTCAAATTTACAAGGAAATTGCATCCCATCATCAAGGAAACTCATCCATTAC
ATCTTACATTACAAAGCTGAAGGCTTTATGGGATGAACTCGAGGCCTACATTGACACCCCCAAATGTTCTTGCGGTTCAACAGAGAAACAGAGTGAGCAAATTGAAAGGG
AAAAAGTCATGCAATTTCTTATAGGATTAAACGATTCTTATTCCACCATTTGTGCTCAAATTCTTCATATGAAGCCATTTCCAACGGTGGAAAAAGCTTGTTGTGCAATA
CTTCGTGAAGAAAAACGCAGGGAATTGGTGTTGTCATTGGAGATTGTTGCGGCCAAAGTGATCCAAAACAATTGGCTTCTTCAAAATGGTCATTCGATCAATGGTGATAA
TGAAGAAGTTGATCATATGAATCTTCAGGACCTGAAAGCTGATCAAAACGAAGCTATGAGTATTCCCATTGAGCCATTGCTCATAGACCTTGGCTCTCCTGTTCGATGTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAGGCATTATCAGCCCCCCAAGATCCCGATCTTCCCCCAGAGAAAGCAGACCATTCAACAACAATGGAGCCACCAATCCACCGTCCAGACCCAATTACATGTC
CCCACGCCGCCGCCCAACGACACCCATTAACCCCAACGAGCTGCGAACCCATAGAAAAGAACCCCAACCCACCGTCGTGAAACGCCCAACCAAACCCACCGATAGATCTT
CAAACCCTCCACGCATCGACCCATCAAGACCCAATTCAAAGCTTGCCCCCTCAAGGCCAGCAGCCCCAAGCCCAAATGAGAGGAAATTAGACACCAAAACAGCGCCTAAA
ACCACCACTAGATTCTCTTCCCCTCGCCCAACCAAACCAATAACTCCACCGCCGTCGAAGAGCAATGGAAAAGGAGCAAGTGGGTCTGGTTCAAGATCCGATTTTTCTCG
TGCCAAACCAAGCGATTCTCAAAAGGGTACTCCCAAAAACTTGCGGAGCGGTCGGTTGAATGATCAGCAAGATGAACAAATTGTTCGTAGTTATTCGGATGGATCTTATG
GTGCTCGTACTCTTAGCGACCCTGATGTTCATAAGTTACATCAGCTTTCTCTTGATGATAAGGATCTTGCGAACATCGTTCTTCATGCAAATTTGGTATACGAATCACTC
GCCTCAGAAACAAACGAAGAAGAATGTTCTTCCCAAGGGAATAATAGTTCAAGAATGTTTCAAATTTACAAGGAAATTGCATCCCATCATCAAGGAAACTCATCCATTAC
ATCTTACATTACAAAGCTGAAGGCTTTATGGGATGAACTCGAGGCCTACATTGACACCCCCAAATGTTCTTGCGGTTCAACAGAGAAACAGAGTGAGCAAATTGAAAGGG
AAAAAGTCATGCAATTTCTTATAGGATTAAACGATTCTTATTCCACCATTTGTGCTCAAATTCTTCATATGAAGCCATTTCCAACGGTGGAAAAAGCTTGTTGTGCAATA
CTTCGTGAAGAAAAACGCAGGGAATTGGTGTTGTCATTGGAGATTGTTGCGGCCAAAGTGATCCAAAACAATTGGCTTCTTCAAAATGGTCATTCGATCAATGGTGATAA
TGAAGAAGTTGATCATATGAATCTTCAGGACCTGAAAGCTGATCAAAACGAAGCTATGAGTATTCCCATTGAGCCATTGCTCATAGACCTTGGCTCTCCTGTTCGATGTT
GA
Protein sequenceShow/hide protein sequence
MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNERKLDTKTAPK
TTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSLDDKDLANIVLHANLVYESL
ASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTEKQSEQIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAI
LREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPVRC