; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0008984 (gene) of Chayote v1 genome

Gene IDSed0008984
OrganismSechium edule (Chayote v1)
Descriptionserine/arginine repetitive matrix protein 1-like
Genome locationLG08:36996616..36999278
RNA-Seq ExpressionSed0008984
SyntenySed0008984
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia]6.2e-10860.33Show/hide
Query:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP
        MVRGIISPPRSRSSPRE+RPF  NNN   NPPSRPNYMSP RRP T    ++ R  RKE QP    RP++   DRSS  PR DPS+P +SK  PSR  P 
Subjt:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP

Query:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY
        AP+P+++KLD  +   P   TR +S RP KP        P PSK N KGA GSGSRSD S    K  DS+ G+     SG   D       R YSDG Y 
Subjt:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY

Query:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG
             DP V  KLHQLSLDDKDLAN+VLHAN +YES  S+T EE+CSSQ NN  +R+ QI+KEI+SH QGNSSITSYITKL  LWDEL  YID P+CSCG
Subjt:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG

Query:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH
        S EK SEQI+REKVMQF++GL+DSYST CA+IL MKPFPTVEKAC  I+REEKRRE+V SLE  A KVIQNNWL+ NG+  N DN +     +++ + D 
Subjt:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH

Query:  NEVACIPVEPLLIDLGSPVRC
        NE   IP+EPLLIDLGSPVRC
Subjt:  NEVACIPVEPLLIDLGSPVRC

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-10759.86Show/hide
Query:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP
        MVRGIISPPRSRSSPRE+RPF  NNN   NPPSRPNYMSP RRP T    ++ R  RKE QP    RP++   DRSS  PR DPS+P +SK  PSR  P 
Subjt:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP

Query:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY
        AP+P+++KLD  +   P   TR +S RP KP        P PSK N KGA GSGSRSD S    K  DS+ G+     SG   D       R YSDG Y 
Subjt:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY

Query:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG
             DP V  KLHQLSLDDKDLAN+VLHAN +YES  S+T EE+CSSQ NN  +R+ QI+KEI+SH QGNSSITSYITKL  LWDEL  YID P+CSCG
Subjt:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG

Query:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH
        S +K SE+I+REKVMQF++GL+DSYST CA+IL MKPFPTVEKAC  I+REEKRRE+V SLE  A KVIQNNWL+ NG+  N DN +     +++ + D 
Subjt:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH

Query:  NEVACIPVEPLLIDLGSPVRC
        NE   IP+EPLLIDLGSPVRC
Subjt:  NEVACIPVEPLLIDLGSPVRC

XP_022137024.1 uncharacterized protein LOC111008588 [Momordica charantia]5.4e-7248.49Show/hide
Query:  EQTMVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQP-ATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSR
        E+  +RG+ISPPRSRSSPR+ RP   +NN   NPPSRPNYMSP RRP TA    + ++ +   +P AT  R ++ +P R   +P+  P+ PI    +PSR
Subjt:  EQTMVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQP-ATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSR

Query:  RTPPAPNPHDKKLDANSN-KTPVAKTRPASSRPVKPGN-TTPARGPTPSKGNVKG------AIGSGSRSDASGLGHKYLDSRNGSSGGSGHRVDGDVGNP
        R  P    + KKLD  +  K   AK     ++P +  N   P RGPTP   N         AI S SRSD S   +    S++  S GS    D    +P
Subjt:  RTPPAPNPHDKKLDANSN-KTPVAKTRPASSRPVKPGN-TTPARGPTPSKGNVKG------AIGSGSRSDASGLGHKYLDSRNGSSGGSGHRVDGDVGNP

Query:  NRCYSDGHYYGAFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATY
           YS G  Y    DP +   L +LS+D KDLA+++LHANSIYES  SDT EE  S QSN    RI QI+K+I+SHRQ NSS+TSY TKL  LWDEL TY
Subjt:  NRCYSDGHYYGAFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATY

Query:  I-DVPQ-CSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLVNGNFNNVDNNDGIK
          DVPQ CSCG++EK S  ++REKVMQF++GL++SYST C +IL ++PFPT+EKA S+IIREEKR E+V SLE  A KV++N WL+  + ++   +DGI 
Subjt:  I-DVPQ-CSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLVNGNFNNVDNNDGIK

Query:  EQ----QIDHNEVACIPVEPLLIDLGSPVRC
        E+      D+ E+   P E LLIDLGSPVRC
Subjt:  EQ----QIDHNEVACIPVEPLLIDLGSPVRC

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata]3.6e-10860.81Show/hide
Query:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP
        MVRGIISPPRSRSSPRE+RPF  NNN   NPPSRPNYMSP RRP T    ++ R  RKE QP    RP++   DRSS  PR DPS+P +SK  PSR  P 
Subjt:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP

Query:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY
        AP+P+++KLD  +   P   TR +S RP KP   TP     PSK N KGA GSGSRSD S    K  DS+ G+     SG   D       R YSDG Y 
Subjt:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY

Query:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG
             DP V  KLHQLSLDDKDLAN+VLHAN +YES  S+TKEE+CSSQ NN  +R+ QI+KEI+SH QGNSSITSYITKL  LWDEL  YID P+CSCG
Subjt:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG

Query:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH
        S EK SEQI+REKVMQF++GL+DSYST CA+IL MKPFPTVEKAC  I+REEKRRE+V SLE  A KVIQNNWL+ NG+  N DN +     +++ + D 
Subjt:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH

Query:  NEVACIPVEPLLIDLGSPVRC
        NE   IP+EPLLIDLGSPVRC
Subjt:  NEVACIPVEPLLIDLGSPVRC

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo]8.6e-11060.57Show/hide
Query:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP
        MVRGIISPPRSRSSPRE+RPF  NNN   NPPSRPNYMSP RRP T   A++ R  RKE QP    RP++   DRSS  PR DPS+P +SK +PSR  P 
Subjt:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP

Query:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY
        AP+P+++KLD  +   P   TR +S RP KP        P PSK N KGA GSGSRSD S    K  DS+ G+     SG   D       R YSDG Y 
Subjt:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY

Query:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG
             DP V  KLHQLSLDDKDLAN+VLHAN +YES  S+TKEE+CSSQ NN  +R+ QI+KEI+SH QGNSSITSYITKL  LWDEL  YID+P+CSCG
Subjt:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG

Query:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH
        S +K SE+I+REKVMQF++GLDDSYST CA+IL MKPFPTVEKAC  I+REEKRRE+V SLE  A KVIQNNWL+ NG+  N DN +     ++E + D 
Subjt:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH

Query:  NEVACIPVEPLLIDLGSPVRC
        NE   +P+EPLLIDLGSPVRC
Subjt:  NEVACIPVEPLLIDLGSPVRC

TrEMBL top hitse value%identityAlignment
A0A6J1C5Z8 uncharacterized protein LOC1110085882.6e-7248.49Show/hide
Query:  EQTMVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQP-ATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSR
        E+  +RG+ISPPRSRSSPR+ RP   +NN   NPPSRPNYMSP RRP TA    + ++ +   +P AT  R ++ +P R   +P+  P+ PI    +PSR
Subjt:  EQTMVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQP-ATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSR

Query:  RTPPAPNPHDKKLDANSN-KTPVAKTRPASSRPVKPGN-TTPARGPTPSKGNVKG------AIGSGSRSDASGLGHKYLDSRNGSSGGSGHRVDGDVGNP
        R  P    + KKLD  +  K   AK     ++P +  N   P RGPTP   N         AI S SRSD S   +    S++  S GS    D    +P
Subjt:  RTPPAPNPHDKKLDANSN-KTPVAKTRPASSRPVKPGN-TTPARGPTPSKGNVKG------AIGSGSRSDASGLGHKYLDSRNGSSGGSGHRVDGDVGNP

Query:  NRCYSDGHYYGAFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATY
           YS G  Y    DP +   L +LS+D KDLA+++LHANSIYES  SDT EE  S QSN    RI QI+K+I+SHRQ NSS+TSY TKL  LWDEL TY
Subjt:  NRCYSDGHYYGAFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATY

Query:  I-DVPQ-CSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLVNGNFNNVDNNDGIK
          DVPQ CSCG++EK S  ++REKVMQF++GL++SYST C +IL ++PFPT+EKA S+IIREEKR E+V SLE  A KV++N WL+  + ++   +DGI 
Subjt:  I-DVPQ-CSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLVNGNFNNVDNNDGIK

Query:  EQ----QIDHNEVACIPVEPLLIDLGSPVRC
        E+      D+ E+   P E LLIDLGSPVRC
Subjt:  EQ----QIDHNEVACIPVEPLLIDLGSPVRC

A0A6J1C6T8 uncharacterized protein LOC111008934 isoform X25.7e-3539.47Show/hide
Query:  RGIISPPRSRSSPRENRPFNNNNNATANPPSRPNY-MSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPPA
        RG+ISPPR+ S P         NNA ANPP RPNY MSP  RP T    S+        +  +           SS A R     P              
Subjt:  RGIISPPRSRSSPRENRPFNNNNNATANPPSRPNY-MSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPPA

Query:  PNP-----HDKKLDANSNKTPVAKTR---PASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRS---DASGLGHKYLDSRNGSS------GGSGHRVD-G
        PNP     H KKLD N+N     KT    P   + ++ G T  +    P     KG   SGSRS   DA+ +  K+ DS N SS       GSG R +  
Subjt:  PNP-----HDKKLDANSNKTPVAKTR---PASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRS---DASGLGHKYLDSRNGSS------GGSGHRVD-G

Query:  DVGNPNRCYSDGHYYGAFRDPAVRAKLHQLSL-----DDKDLA-------NLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSI
        +VGN     + G  Y +   P +   LH+LS      +D  +        ++VLH+              +CSSQSN    RI +I+K+I+SHRQGNSSI
Subjt:  DVGNPNRCYSDGHYYGAFRDPAVRAKLHQLSL-----DDKDLA-------NLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSI

Query:  TSYITKLNKLWDELATYIDVPQCSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKR
        TSY T+L  LWDEL TY D+ QC       S E ++REKVMQF+VGL+D YST C +IL ++PFPTVEKA S++IREEKR
Subjt:  TSYITKLNKLWDELATYIDVPQCSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKR

A0A6J1C6U3 uncharacterized protein LOC111008934 isoform X19.8e-3539.47Show/hide
Query:  RGIISPPRSRSSPRENRPFNNNNNATANPPSRPNY-MSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPPA
        RG+ISPPR+ S P         NNA ANPP RPNY MSP  RP T    S+        +  +           SS A R     P              
Subjt:  RGIISPPRSRSSPRENRPFNNNNNATANPPSRPNY-MSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPPA

Query:  PNP-----HDKKLDANSNKTPVAKTR---PASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRS---DASGLGHKYLDSRNGSS------GGSGHRVD-G
        PNP     H KKLD N+N     KT    P   + ++ G T  +    P     KG   SGSRS   DA+ +  K+ DS N SS       GSG R +  
Subjt:  PNP-----HDKKLDANSNKTPVAKTR---PASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRS---DASGLGHKYLDSRNGSS------GGSGHRVD-G

Query:  DVGNPNRCYSDGHYYGAFRDPAVRAKLHQLSL----DDKDLANLVLHANSIYESFNSD---TKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYIT
        +VGN     + G  Y +   P +   LH+LS      +     + ++   I      D       +CSSQSN    RI +I+K+I+SHRQGNSSITSY T
Subjt:  DVGNPNRCYSDGHYYGAFRDPAVRAKLHQLSL----DDKDLANLVLHANSIYESFNSD---TKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYIT

Query:  KLNKLWDELATYIDVPQCSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKR
        +L  LWDEL TY D+ QC       S E ++REKVMQF+VGL+D YST C +IL ++PFPTVEKA S++IREEKR
Subjt:  KLNKLWDELATYIDVPQCSCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKR

A0A6J1C7L7 uncharacterized protein LOC1110089868.8e-4442.57Show/hide
Query:  RGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP-A
        RG+ISPP+SR S  E+   ++ NNA ANPPS PNYMS  RR   A   +  +     T+P  G          S+    T  S P  +  +PSRR P  A
Subjt:  RGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP-A

Query:  PNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGGSGHRVDGDVGNPNRCYSDGHYYGAF
        PN  +   D ++ K  +AK   + +  V      P RGPT                         +   +GSS GSGH  D +  N      D       
Subjt:  PNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGGSGHRVDGDVGNPNRCYSDGHYYGAF

Query:  RDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCGSI--
          P V  +L QLS+D K  A +V  ANS+ ES    TKEE CS QSN    RIL+I+K+I+SHRQGNSSITSY TKL  LW+EL TY D+PQC   S   
Subjt:  RDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCGSI--

Query:  EKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREE
        +K S+ ++REKVMQF+VGL+DSYST C++IL ++PFPTVEKA S+II +E
Subjt:  EKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like1.7e-10860.81Show/hide
Query:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP
        MVRGIISPPRSRSSPRE+RPF  NNN   NPPSRPNYMSP RRP T    ++ R  RKE QP    RP++   DRSS  PR DPS+P +SK  PSR  P 
Subjt:  MVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPDRSSTAPRTDPSKPISSKPLPSRRTPP

Query:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY
        AP+P+++KLD  +   P   TR +S RP KP   TP     PSK N KGA GSGSRSD S    K  DS+ G+     SG   D       R YSDG Y 
Subjt:  APNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGG--SGHRVDGDVGNPNRCYSDGHYY

Query:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG
             DP V  KLHQLSLDDKDLAN+VLHAN +YES  S+TKEE+CSSQ NN  +R+ QI+KEI+SH QGNSSITSYITKL  LWDEL  YID P+CSCG
Subjt:  G-AFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCG

Query:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH
        S EK SEQI+REKVMQF++GL+DSYST CA+IL MKPFPTVEKAC  I+REEKRRE+V SLE  A KVIQNNWL+ NG+  N DN +     +++ + D 
Subjt:  SIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLV-NGNFNNVDNND----GIKEQQIDH

Query:  NEVACIPVEPLLIDLGSPVRC
        NE   IP+EPLLIDLGSPVRC
Subjt:  NEVACIPVEPLLIDLGSPVRC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.2e-1028.16Show/hide
Query:  IGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCGS-----IEKSSEQIQREKVMQFVVG--LDDSYSTFCAKILDMKPFPTVEKAC
        +  +I Q+ + +++ RQG  S+  Y  KL+K+W EL+ Y  +P+C CG       +++ E  ++E+  +F++G  L+  +     KI+  KP P++ +A 
Subjt:  IGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQCSCGS-----IEKSSEQIQREKVMQFVVG--LDDSYSTFCAKILDMKPFPTVEKAC

Query:  SVI
        +++
Subjt:  SVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGGATCGCATCGTGGCCGTCAAATTTGTTCACTTGGTAGTTTACCAGAAAACAGAGCACAGAAAAGGCAGGTCTGTTTAGACGAGATCGGAGCAGAGCAAACAAT
GGTGAGAGGAATTATCAGTCCCCCAAGATCCCGTTCTTCTCCAAGAGAAAACAGACCTTTCAATAATAACAACAATGCCACCGCCAATCCACCCTCCAGACCCAACTACA
TGTCTCCTTGTCGCCGCCCGGCAACGGCAACGCCCGCCTCCGATCCACGAAATCCCAGAAAAGAAACCCAACCCGCCACCGGTCCACGCCCCAGCAGAACCAACCCCGAC
AGATCTTCAACCGCTCCCCGAACCGACCCATCCAAACCCATTTCTTCCAAGCCCCTGCCCTCGAGGCGAACACCACCAGCCCCAAATCCTCACGACAAGAAATTAGACGC
CAACAGCAACAAAACGCCGGTCGCTAAAACCCGACCGGCTTCCTCTCGCCCGGTCAAACCGGGAAATACTACACCGGCGCGTGGGCCCACGCCTTCGAAGGGAAATGTGA
AGGGAGCCATCGGGTCTGGTTCCAGATCCGATGCTTCCGGTTTGGGTCATAAGTACTTAGATTCTCGAAATGGTTCCAGCGGCGGGTCGGGTCATCGGGTGGACGGGGAT
GTTGGAAACCCGAACCGTTGTTATTCGGATGGACATTATTATGGTGCGTTTCGTGATCCTGCTGTTCGTGCTAAGTTGCATCAGCTTTCTTTAGATGACAAAGATCTTGC
AAACCTCGTCCTTCACGCAAATTCGATATATGAATCATTCAATTCAGATACAAAGGAAGAACAATGTAGTTCTCAAAGCAACAACATTGGTACAAGAATACTTCAAATCT
TCAAAGAAATTTCATCTCACCGTCAAGGAAACTCCTCCATCACATCCTACATTACAAAGCTAAATAAATTATGGGACGAACTCGCAACCTACATCGACGTGCCTCAATGT
TCTTGTGGTTCTATCGAGAAGTCAAGCGAGCAAATACAAAGGGAAAAAGTAATGCAATTTGTCGTCGGATTAGACGATTCTTATTCCACATTTTGCGCGAAAATCCTCGA
CATGAAGCCATTTCCAACCGTGGAGAAAGCTTGTTCTGTGATAATTCGAGAAGAAAAACGCAGAGAAGTGGTTCAGTCATTGGAAAATTTTGCTGAGAAAGTAATTCAAA
ACAATTGGCTTGTTAATGGGAACTTCAACAATGTTGATAATAATGATGGTATTAAGGAGCAACAAATTGATCATAATGAAGTTGCATGCATCCCTGTTGAGCCATTGCTG
ATTGATCTTGGCTCTCCTGTTCGTTGTTGA
mRNA sequenceShow/hide mRNA sequence
GTTGACTTGACCCGAAAAACCCCTCCATTCCACATCTAAGAGAATGACAGGATCGCATCGTGGCCGTCAAATTTGTTCACTTGGTAGTTTACCAGAAAACAGAGCACAGA
AAAGGCAGGTCTGTTTAGACGAGATCGGAGCAGAGCAAACAATGGTGAGAGGAATTATCAGTCCCCCAAGATCCCGTTCTTCTCCAAGAGAAAACAGACCTTTCAATAAT
AACAACAATGCCACCGCCAATCCACCCTCCAGACCCAACTACATGTCTCCTTGTCGCCGCCCGGCAACGGCAACGCCCGCCTCCGATCCACGAAATCCCAGAAAAGAAAC
CCAACCCGCCACCGGTCCACGCCCCAGCAGAACCAACCCCGACAGATCTTCAACCGCTCCCCGAACCGACCCATCCAAACCCATTTCTTCCAAGCCCCTGCCCTCGAGGC
GAACACCACCAGCCCCAAATCCTCACGACAAGAAATTAGACGCCAACAGCAACAAAACGCCGGTCGCTAAAACCCGACCGGCTTCCTCTCGCCCGGTCAAACCGGGAAAT
ACTACACCGGCGCGTGGGCCCACGCCTTCGAAGGGAAATGTGAAGGGAGCCATCGGGTCTGGTTCCAGATCCGATGCTTCCGGTTTGGGTCATAAGTACTTAGATTCTCG
AAATGGTTCCAGCGGCGGGTCGGGTCATCGGGTGGACGGGGATGTTGGAAACCCGAACCGTTGTTATTCGGATGGACATTATTATGGTGCGTTTCGTGATCCTGCTGTTC
GTGCTAAGTTGCATCAGCTTTCTTTAGATGACAAAGATCTTGCAAACCTCGTCCTTCACGCAAATTCGATATATGAATCATTCAATTCAGATACAAAGGAAGAACAATGT
AGTTCTCAAAGCAACAACATTGGTACAAGAATACTTCAAATCTTCAAAGAAATTTCATCTCACCGTCAAGGAAACTCCTCCATCACATCCTACATTACAAAGCTAAATAA
ATTATGGGACGAACTCGCAACCTACATCGACGTGCCTCAATGTTCTTGTGGTTCTATCGAGAAGTCAAGCGAGCAAATACAAAGGGAAAAAGTAATGCAATTTGTCGTCG
GATTAGACGATTCTTATTCCACATTTTGCGCGAAAATCCTCGACATGAAGCCATTTCCAACCGTGGAGAAAGCTTGTTCTGTGATAATTCGAGAAGAAAAACGCAGAGAA
GTGGTTCAGTCATTGGAAAATTTTGCTGAGAAAGTAATTCAAAACAATTGGCTTGTTAATGGGAACTTCAACAATGTTGATAATAATGATGGTATTAAGGAGCAACAAAT
TGATCATAATGAAGTTGCATGCATCCCTGTTGAGCCATTGCTGATTGATCTTGGCTCTCCTGTTCGTTGTTGAATATTTTCAGCTAAATCTAAGGAATAATTCGAGGATA
AAAGTATACTACTTTTCTAAACTATTAATTATTATGAAGAGGCTTCTAGTGGAAAAACTAAACCTTGTTGGGTAGGAGTTTGTGTGCTTTTTCTTTTTATGTAGTTTATT
TGTAGTGGATGAAAATATTAGTGGGATTTAGCAATATAATTGTTTACTAATAAATCAAGTAGGAACTTTTTCTTTTTAA
Protein sequenceShow/hide protein sequence
MTGSHRGRQICSLGSLPENRAQKRQVCLDEIGAEQTMVRGIISPPRSRSSPRENRPFNNNNNATANPPSRPNYMSPCRRPATATPASDPRNPRKETQPATGPRPSRTNPD
RSSTAPRTDPSKPISSKPLPSRRTPPAPNPHDKKLDANSNKTPVAKTRPASSRPVKPGNTTPARGPTPSKGNVKGAIGSGSRSDASGLGHKYLDSRNGSSGGSGHRVDGD
VGNPNRCYSDGHYYGAFRDPAVRAKLHQLSLDDKDLANLVLHANSIYESFNSDTKEEQCSSQSNNIGTRILQIFKEISSHRQGNSSITSYITKLNKLWDELATYIDVPQC
SCGSIEKSSEQIQREKVMQFVVGLDDSYSTFCAKILDMKPFPTVEKACSVIIREEKRREVVQSLENFAEKVIQNNWLVNGNFNNVDNNDGIKEQQIDHNEVACIPVEPLL
IDLGSPVRC