; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg06838 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg06838
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionserine/arginine repetitive matrix protein 1-like
Genome locationCarg_Chr18:1570789..1572507
RNA-Seq ExpressionCarg06838
SyntenyCarg06838
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia]2.4e-22099.5Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL
        LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST+KQSE+IEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma]4.8e-221100Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL
        LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata]1.5e-21799.01Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPIT PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST+KQSE+IEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

XP_022994450.1 sporozoite surface protein 2-like [Cucurbita maxima]3.7e-8894.24Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPS+PNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP
        +KLDTKT  KTTTRFSSPRPTKPITPPPSKSN KGA+GSGSRSDFSRAKPSDS KG+PKNLRSGRL+DQQDEQIVRSYSDGSYGA TLSDP
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo]4.2e-21798.26Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPIN NELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID PKCSCGSTQKQSEEIEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGL+DSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQ+LKADQNEAMS+PIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

TrEMBL top hitse value%identityAlignment
A0A6J1C5Z8 uncharacterized protein LOC1110085881.9e-8251.17Show/hide
Query:  VRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPS-RPNSKLAPSRPAAPSP--
        +RG+ISPPRSRSSPR+ RP +NNGA NPPSRPNYMSPRRRPTT       +THRK P  T  +   K      +PPRI PS +P + + PSR   P+   
Subjt:  VRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPS-RPNSKLAPSRPAAPSP--

Query:  NERKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNG----KG----------------ASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSY
        N++KLDTKT PK        +P+ P T P    NG    +G                A  S SRSD   A  S S K  P +  S + +D ++      Y
Subjt:  NERKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNG----KG----------------ASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSY

Query:  SDGSYGARTLSDPDVHK-LHQLSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYI-DT
        S G+Y    L+DP ++  L +LS+D KDLA+I+LHAN +YES+ S+T EE   S   N+ R+FQIYK+IASH Q NSS+TSY TKLK LWDELE Y  D 
Subjt:  SDGSYGARTLSDPDVHK-LHQLSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYI-DT

Query:  PK-CSCGSTQKQSEEIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNL
        P+ CSCG+ +K S  +EREKVMQFL+GLN+SYSTIC QIL ++PFPT+EKA   I+REEKR ELV SLE+VAAKV++N WLLQN  S NG ++ + H  +
Subjt:  PK-CSCGSTQKQSEEIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNL

Query:  QDLKADQNEAMSIPIEPLLIDLGSPVRC
             D  E  S P E LLIDLGSPVRC
Subjt:  QDLKADQNEAMSIPIEPLLIDLGSPVRC

A0A6J1C6T8 uncharacterized protein LOC111008934 isoform X22.1e-3639.89Show/hide
Query:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNY-MSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNER
        RG+ISPPR+ S P       NN A NPP RPNY MSP  RPTT +NP+E   +    + T           SSN  R       + +AP  P     +++
Subjt:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNY-MSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNER

Query:  KLDTKTAPKTTTRFSSPRPTK--------PITPPPSKSNGKGASGSGSRSD-----FSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTL
        KLD      ++T+ ++P PT+         +    +    KG + SGSRSD         K SDS   +  N  S   + Q+ + I    ++G     +L
Subjt:  KLDTKTAPKTTTRFSSPRPTK--------PITPPPSKSNGKGASGSGSRSD-----FSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTL

Query:  SDPDVHK-LHQLSL-----DDKDLA-------NIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID
        + P ++  LH+LS      +D  +        +IVLH+              +CSSQ +N  R+F+IYK+IASH QGNSSITSY T+LK LWDELE Y D
Subjt:  SDPDVHK-LHQLSL-----DDKDLA-------NIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYID

Query:  TPKCSCGSTQKQSEEIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKR
          +C C S     E +EREKVMQFL+GLND YSTIC QIL ++PFPTVEKA   ++REEKR
Subjt:  TPKCSCGSTQKQSEEIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREEKR

A0A6J1C7L7 uncharacterized protein LOC1110089865.9e-3942.01Show/hide
Query:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTP--INPNELRTHRKEPQPTVVKRPTKPTDRSS-NPPRIDPSRPNSKLAPSRPAAPSPN
        RG+ISPP+SR S  ES    NN A NPPS PNYMS  RR T    +NP + +T     +PT      + T  SS  P  + PSR     AP+ P   + N
Subjt:  RGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTP--INPNELRTHRKEPQPTVVKRPTKPTDRSS-NPPRIDPSRPNSKLAPSRPAAPSPN

Query:  ERKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDV-HKLHQ
             T  A  TT+R S+    +     P+  +    S  G                      SG   D  +  I     D +     +  P V ++L Q
Subjt:  ERKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDV-HKLHQ

Query:  LSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST--QKQSEEIEREKV
        LS+D K  A +V  AN + ES+   T +EECS Q +N+ R+ +IYK+IASH QGNSSITSY TKL+ LW+ELE Y D P+C   S   QK S+ +EREKV
Subjt:  LSLDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST--QKQSEEIEREKV

Query:  MQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREE
        MQFL+GLNDSYSTIC+QIL ++PFPTVEKA   I+ +E
Subjt:  MQFLIGLNDSYSTICAQILHMKPFPTVEKACCAILREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like7.1e-21899.01Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
        RKLDTKTAPKTTTRFSSPRPTKPIT PPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLS

Query:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL
        LDDKDLANIVLHANLVYESLASET EEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGST+KQSE+IEREKVMQFL
Subjt:  LDDKDLANIVLHANLVYESLASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFL

Query:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
        IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP
Subjt:  IGLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSP

Query:  VRC
        VRC
Subjt:  VRC

A0A6J1K179 sporozoite surface protein 2-like1.8e-8894.24Show/hide
Query:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE
        MVRGIISPPRSRSSPRESRPFNNNGATNPPS+PNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKL PSRPAAPSPNE
Subjt:  MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNE

Query:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP
        +KLDTKT  KTTTRFSSPRPTKPITPPPSKSN KGA+GSGSRSDFSRAKPSDS KG+PKNLRSGRL+DQQDEQIVRSYSDGSYGA TLSDP
Subjt:  RKLDTKTAPKTTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.4e-1031.25Show/hide
Query:  RMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGS-----TQKQSEEIEREKVMQFLIG--LNDSYSTICAQILHMKPFPTVEKA
        +++Q+ + +A+  QG  S+  Y  KL  +W EL  Y   P+C CG      T++  E  E+E+  +FL+G  LN  +  +  +I+  KP P++ +A
Subjt:  RMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGS-----TQKQSEEIEREKVMQFLIG--LNDSYSTICAQILHMKPFPTVEKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGCATTATCAGCCCCCCAAGATCCCGATCTTCCCCCAGAGAAAGCAGACCATTCAACAACAATGGAGCCACCAATCCACCGTCCAGACCCAATTACATGTC
CCCACGCCGCCGCCCAACGACACCCATTAACCCCAACGAGCTGCGAACCCATAGAAAAGAACCCCAACCCACCGTCGTGAAACGCCCAACCAAACCCACCGATAGATCTT
CAAACCCTCCACGCATCGACCCATCAAGACCCAATTCAAAGCTTGCCCCCTCAAGGCCAGCAGCCCCAAGCCCAAATGAGAGGAAATTAGACACCAAAACAGCGCCTAAA
ACCACCACTAGATTCTCTTCCCCTCGCCCAACCAAACCAATAACTCCACCGCCGTCGAAGAGCAATGGAAAAGGAGCAAGTGGGTCTGGTTCAAGATCCGATTTTTCTCG
TGCCAAACCAAGCGATTCTCAAAAGGGTACTCCCAAAAACTTGCGGAGCGGTCGGTTGAATGATCAGCAAGATGAACAAATTGTTCGTAGTTATTCGGATGGATCTTATG
GTGCTCGTACTCTTAGCGACCCTGATGTTCATAAGTTACATCAGCTTTCTCTTGATGATAAGGATCTTGCGAACATCGTTCTTCATGCAAATTTGGTATACGAATCACTC
GCCTCAGAAACAAACGAAGAAGAATGTTCTTCCCAAGGGAATAATAGTTCAAGAATGTTTCAAATTTACAAGGAAATTGCATCCCATCATCAAGGAAACTCATCCATTAC
ATCTTACATTACAAAGCTGAAGGCTTTATGGGATGAACTCGAGGCCTACATTGACACCCCCAAATGTTCTTGCGGTTCAACACAGAAACAGAGTGAGGAAATTGAAAGGG
AAAAAGTCATGCAATTTCTTATAGGATTAAACGATTCTTATTCCACCATTTGTGCTCAAATTCTTCATATGAAGCCATTTCCAACGGTGGAAAAAGCTTGTTGTGCAATA
CTTCGTGAAGAAAAACGCAGGGAATTGGTGTTGTCATTGGAGATTGTTGCGGCCAAAGTGATCCAAAACAATTGGCTTCTTCAAAATGGTCATTCGATCAATGGTGATAA
TGAAGAAGTTGATCATATGAATCTTCAGGACCTGAAAGCTGATCAAAACGAAGCTATGAGTATTCCCATTGAGCCATTGCTCATAGACCTTGGCTCTCCTGTTCGATGTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAGGCATTATCAGCCCCCCAAGATCCCGATCTTCCCCCAGAGAAAGCAGACCATTCAACAACAATGGAGCCACCAATCCACCGTCCAGACCCAATTACATGTC
CCCACGCCGCCGCCCAACGACACCCATTAACCCCAACGAGCTGCGAACCCATAGAAAAGAACCCCAACCCACCGTCGTGAAACGCCCAACCAAACCCACCGATAGATCTT
CAAACCCTCCACGCATCGACCCATCAAGACCCAATTCAAAGCTTGCCCCCTCAAGGCCAGCAGCCCCAAGCCCAAATGAGAGGAAATTAGACACCAAAACAGCGCCTAAA
ACCACCACTAGATTCTCTTCCCCTCGCCCAACCAAACCAATAACTCCACCGCCGTCGAAGAGCAATGGAAAAGGAGCAAGTGGGTCTGGTTCAAGATCCGATTTTTCTCG
TGCCAAACCAAGCGATTCTCAAAAGGGTACTCCCAAAAACTTGCGGAGCGGTCGGTTGAATGATCAGCAAGATGAACAAATTGTTCGTAGTTATTCGGATGGATCTTATG
GTGCTCGTACTCTTAGCGACCCTGATGTTCATAAGTTACATCAGCTTTCTCTTGATGATAAGGATCTTGCGAACATCGTTCTTCATGCAAATTTGGTATACGAATCACTC
GCCTCAGAAACAAACGAAGAAGAATGTTCTTCCCAAGGGAATAATAGTTCAAGAATGTTTCAAATTTACAAGGAAATTGCATCCCATCATCAAGGAAACTCATCCATTAC
ATCTTACATTACAAAGCTGAAGGCTTTATGGGATGAACTCGAGGCCTACATTGACACCCCCAAATGTTCTTGCGGTTCAACACAGAAACAGAGTGAGGAAATTGAAAGGG
AAAAAGTCATGCAATTTCTTATAGGATTAAACGATTCTTATTCCACCATTTGTGCTCAAATTCTTCATATGAAGCCATTTCCAACGGTGGAAAAAGCTTGTTGTGCAATA
CTTCGTGAAGAAAAACGCAGGGAATTGGTGTTGTCATTGGAGATTGTTGCGGCCAAAGTGATCCAAAACAATTGGCTTCTTCAAAATGGTCATTCGATCAATGGTGATAA
TGAAGAAGTTGATCATATGAATCTTCAGGACCTGAAAGCTGATCAAAACGAAGCTATGAGTATTCCCATTGAGCCATTGCTCATAGACCTTGGCTCTCCTGTTCGATGTT
GA
Protein sequenceShow/hide protein sequence
MVRGIISPPRSRSSPRESRPFNNNGATNPPSRPNYMSPRRRPTTPINPNELRTHRKEPQPTVVKRPTKPTDRSSNPPRIDPSRPNSKLAPSRPAAPSPNERKLDTKTAPK
TTTRFSSPRPTKPITPPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGARTLSDPDVHKLHQLSLDDKDLANIVLHANLVYESL
ASETNEEECSSQGNNSSRMFQIYKEIASHHQGNSSITSYITKLKALWDELEAYIDTPKCSCGSTQKQSEEIEREKVMQFLIGLNDSYSTICAQILHMKPFPTVEKACCAI
LREEKRRELVLSLEIVAAKVIQNNWLLQNGHSINGDNEEVDHMNLQDLKADQNEAMSIPIEPLLIDLGSPVRC