CuGenDBv2

Lag0024700 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024700
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: serine/arginine repetitive matrix protein 1-like
Genome location: chr10:5067346..5069271
RNA-Seq Expression: Lag0024700
Synteny: Lag0024700
Gene Ontology terms: NA
InterPro domains: NA
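The genome location field packs a sequence name and two coordinates into one string. A minimal sketch of splitting it apart follows; the names `Location` and `parse_location` are hypothetical helpers, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "chr10:5067346..5069271".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence name, start and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("chr10:5067346..5069271")
print(loc.seqid, loc.start, loc.end, loc.span)  # chr10 5067346 5069271 1926
```

Under the inclusive-coordinate assumption the gene spans 1926 bp on chr10.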


Homology
GenBank top hits | e value | %identity | Alignment
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia] | 7.5e-119 | 62.86
Query:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ
        MVRGIISPPRSRSSPRE  PF  NNN  TNPPSRPNYMSPRRRPTTP+NPNE  +HR+E   T VK    R TK  DRS N  RIDPS+  SK A SR  
Subjt:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ

Query:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD
         AAP+PNE+KLD     TKTA K +TR SSPR TKPI+ P     PSK N KG A G G RSD S  AK S        DS+  +PK+L S G L  QQD
Subjt:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD

Query:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDT-KEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK
         ++      + YS G +GA       + +L QLS+D KD ANIVL  HAN +YES +S+T +EECSSQ NN++RMFQIYKEIASH QGNSSITSYITKLK
Subjt:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDT-KEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK

Query:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS
        ALWDEL AY+D P+CSC ++EK +E IEREKVMQFL+GL+DSYS ICAQ+L M+P+PTV+KA   I+REEKRRELVLSLEIVAAKV+QNNWLLQNGHS +
Subjt:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS

Query:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC
        GDN      E VD  NLQ    D QNE  + P  EPLLIDLGSPVRC
Subjt:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.7e-118 | 62.64
Query:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ
        MVRGIISPPRSRSSPRE  PF  NNN  TNPPSRPNYMSPRRRPTTP+NPNE  +HR+E   T VK    R TK  DRS N  RIDPS+  SK A SR  
Subjt:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ

Query:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD
         AAP+PNE+KLD     TKTA K +TR SSPR TKPI+ P     PSK N KG A G G RSD S  AK S        DS+  +PK+L S G L  QQD
Subjt:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD

Query:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDT-KEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK
         ++      + YS G +GA       + +L QLS+D KD ANIVL  HAN +YES +S+T +EECSSQ NN++RMFQIYKEIASH QGNSSITSYITKLK
Subjt:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDT-KEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK

Query:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS
        ALWDEL AY+D P+CSC +++K +E IEREKVMQFL+GL+DSYS ICAQ+L M+P+PTV+KA   I+REEKRRELVLSLEIVAAKV+QNNWLLQNGHS +
Subjt:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS

Query:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC
        GDN      E VD  NLQ    D QNE  + P  EPLLIDLGSPVRC
Subjt:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC

XP_022137024.1 uncharacterized protein LOC111008588 [Momordica charantia] | 8.7e-83 | 51.01
Query:  VRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPS-KSTSKPASSRRQ
        +RG+ISPPRSRSSPR+  P  +NN AP NPPSRPNYMSPRRRPTT    + Q +HR+ +      ++A RATK      +  RI PS K T+    SRR 
Subjt:  VRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPS-KSTSKPASSRRQ

Query:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSP--KHLASGGSLGHQ
              N+KKLD+        AK S+  + P+  +   +PPRG TP   N           +     A A   AS++   + N SP  KHL S GS  H 
Subjt:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSP--KHLASGGSLGHQ

Query:  QDVKVGDVGNQKRYSGGHFGAL------DQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK
              D+ N   YSGG +  L      + LQ+LSIDGKD A+I+L  HANSIYES  SDT EE  S  +NA R+FQIYK+IASHRQ NSS+TSY TKLK
Subjt:  QDVKVGDVGNQKRYSGGHFGAL------DQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK

Query:  ALWDELAAYM-DVPQ-CSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHS
         LWDEL  Y  DVPQ CSC A EK++ H+EREKVMQFL+GL++SYS IC Q+L+++P+PT++KAYS+IIREEKR ELV SLE+VAAKVM+N WLLQN  S
Subjt:  ALWDELAAYM-DVPQ-CSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHS

Query:  NSGDNSNDGIGEVVDNLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC
        ++G   +DGI E V+     + +   E+ +FP  E LLIDLGSPVRC
Subjt:  NSGDNSNDGIGEVVDNLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata] | 1.7e-118 | 62.86
Query:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ
        MVRGIISPPRSRSSPRE  PF  NNN  TNPPSRPNYMSPRRRPTTP+NPNE  +HR+E   T VK    R TK  DRS N  RIDPS+  SK A SR  
Subjt:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ

Query:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD
         AAP+PNE+KLD     TKTA K +TR SSPR TKPI+       PSK N KG A G G RSD S  AK S        DS+  +PK+L S G L  QQD
Subjt:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD

Query:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDTK-EECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK
         ++      + YS G +GA       + +L QLS+D KD ANIVL  HAN +YES +S+TK EECSSQ NN++RMFQIYKEIASH QGNSSITSYITKLK
Subjt:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDTK-EECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK

Query:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS
        ALWDEL AY+D P+CSC ++EK +E IEREKVMQFL+GL+DSYS ICAQ+L M+P+PTV+KA   I+REEKRRELVLSLEIVAAKV+QNNWLLQNGHS +
Subjt:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS

Query:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC
        GDN      E VD  NLQ    D QNE  + P  EPLLIDLGSPVRC
Subjt:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo] | 2.2e-118 | 62.11
Query:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ
        MVRGIISPPRSRSSPRE  PF  NNN  TNPPSRPNYMSPRRRPTTP+N NE  +HR+E   T VK    R TK  DRS N  RIDPS+  SK   SR  
Subjt:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ

Query:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD
         AAP+PNE+KLD     TKTA K +TR SSPR TKPI+ P     PSK N KG A G G RSD S  AK S        DS+  +PK+L S G L  QQD
Subjt:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD

Query:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDTK-EECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK
         ++      + YS G +GA       + +L QLS+D KD ANIVL  HAN +YES +S+TK EECSSQ NN++RMFQIYKEIASH QGNSSITSYITKLK
Subjt:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDTK-EECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK

Query:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS
        ALWDEL AY+D+P+CSC +++K +E IEREKVMQFL+GLDDSYS ICAQ+L M+P+PTV+KA   I+REEKRRELVLSLEIVAAKV+QNNWLLQNGHS +
Subjt:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS

Query:  GDNSNDGIGEVVDNLQGQHVD-QQNEVTNFPATEPLLIDLGSPVRC
        GDN      E VD++  Q +   QNE  + P  EPLLIDLGSPVRC
Subjt:  GDNSNDGIGEVVDNLQGQHVD-QQNEVTNFPATEPLLIDLGSPVRC
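The alignments above are BLAST-style: in the middle line a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how a percent identity can be read off such a midline follows. The function name `midline_identity` is hypothetical, and the fragment is only the first 21 columns of the first GenBank alignment, so the result is not expected to match the %identity column, which presumably covers the full alignment.

```python
def midline_identity(query_row: str, midline: str) -> float:
    """Percent identity for one aligned block, read from the midline.

    Letters in the midline count as identical positions; '+' and spaces
    do not. The aligned length is taken from the query row, since trailing
    spaces in the midline are often lost when a page is saved as text.
    """
    if not query_row:
        raise ValueError("empty alignment row")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query_row)


# First 21 alignment columns of the KAG6573173.1 hit.
query_row = "MVRGIISPPRSRSSPRERTPF"
midline = "MVRGIISPPRSRSSPRE  PF"

print(f"{midline_identity(query_row, midline):.2f}")  # 90.48
```

Summing identical positions and aligned lengths over every block of an alignment, rather than averaging per-block percentages, gives the figure for the whole alignment.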

TrEMBL top hits | e value | %identity | Alignment
A0A6J1C5Z8 uncharacterized protein LOC111008588 | 4.2e-83 | 51.01
Query:  VRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPS-KSTSKPASSRRQ
        +RG+ISPPRSRSSPR+  P  +NN AP NPPSRPNYMSPRRRPTT    + Q +HR+ +      ++A RATK      +  RI PS K T+    SRR 
Subjt:  VRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPS-KSTSKPASSRRQ

Query:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSP--KHLASGGSLGHQ
              N+KKLD+        AK S+  + P+  +   +PPRG TP   N           +     A A   AS++   + N SP  KHL S GS  H 
Subjt:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSP--KHLASGGSLGHQ

Query:  QDVKVGDVGNQKRYSGGHFGAL------DQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK
              D+ N   YSGG +  L      + LQ+LSIDGKD A+I+L  HANSIYES  SDT EE  S  +NA R+FQIYK+IASHRQ NSS+TSY TKLK
Subjt:  QDVKVGDVGNQKRYSGGHFGAL------DQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK

Query:  ALWDELAAYM-DVPQ-CSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHS
         LWDEL  Y  DVPQ CSC A EK++ H+EREKVMQFL+GL++SYS IC Q+L+++P+PT++KAYS+IIREEKR ELV SLE+VAAKVM+N WLLQN  S
Subjt:  ALWDELAAYM-DVPQ-CSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHS

Query:  NSGDNSNDGIGEVVDNLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC
        ++G   +DGI E V+     + +   E+ +FP  E LLIDLGSPVRC
Subjt:  NSGDNSNDGIGEVVDNLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC

A0A6J1C6T8 uncharacterized protein LOC111008934 isoform X2 | 1.9e-43 | 41.51
Query:  RGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNY-MSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQT
        RG+ISPPR+ S P         NNA  NPP RPNY MSP  RPTT VNP+EQ  +     +T+  S+AIRAT                            
Subjt:  RGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNY-MSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQT

Query:  AAPNP-----NEKKLD---SNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGG
        A PNP     ++KKLD   +N  STKT A   TR    R          G T    N      G       S    A+ +A +      N SP HL+  G
Subjt:  AAPNP-----NEKKLD---SNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGG

Query:  SLGHQQDVKVGDVGNQKRYSGGHFGALDQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLKA
        S   Q    VG+ G     S  H    + L +LS  G    +  + ++     +        +CSSQS N  R+F+IYK+IASHRQGNSSITSY T+LK 
Subjt:  SLGHQQDVKVGDVGNQKRYSGGHFGALDQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLKA

Query:  LWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKR
        LWDEL  Y D+ QC CS+     EH+EREKVMQFLVGL+D YS IC Q+L++RP+PTV+KAYS++IREEKR
Subjt:  LWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKR

A0A6J1C6U3 uncharacterized protein LOC111008934 isoform X1 | 5.0e-44 | 41.91
Query:  RGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNY-MSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQT
        RG+ISPPR+ S P         NNA  NPP RPNY MSP  RPTT VNP+EQ  +     +T+  S+AIRAT                            
Subjt:  RGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNY-MSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQT

Query:  AAPNP-----NEKKLD---SNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGG
        A PNP     ++KKLD   +N  STKT A   TR    R          G T    N      G       S    A+ +A +      N SP HL+  G
Subjt:  AAPNP-----NEKKLD---SNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGG

Query:  SLGHQQDVKVGDVGNQKRYSGGHFGALDQLQQLSIDGKD----FANIVLQVHANSIYESASSD--TKEECSSQSNNATRMFQIYKEIASHRQGNSSITSY
        S   Q    VG+ G     S  H    + L +LS  G      +  I + V    IY           +CSSQS N  R+F+IYK+IASHRQGNSSITSY
Subjt:  SLGHQQDVKVGDVGNQKRYSGGHFGALDQLQQLSIDGKD----FANIVLQVHANSIYESASSD--TKEECSSQSNNATRMFQIYKEIASHRQGNSSITSY

Query:  ITKLKALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKR
         T+LK LWDEL  Y D+ QC CS+     EH+EREKVMQFLVGL+D YS IC Q+L++RP+PTV+KAYS++IREEKR
Subjt:  ITKLKALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKR

A0A6J1C7L7 uncharacterized protein LOC111008986 | 3.4e-48 | 45.36
Query:  RGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTP--VNP-NEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRR
        RG+ISPP+SR S  E    ++ NNA  NPPS PNYMS  RR T    VNP  +Q +H + T      S+AIRATK      N+S     K T    S RR
Subjt:  RGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTP--VNP-NEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRR

Query:  QTAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQ
         T APN N    + N  +  T AKI+T  +S      + Q PRG T                  + S A  S   S +H D+ N++ +        G ++
Subjt:  QTAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQ

Query:  DVKVGDVGNQKRYSGGHFGALDQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLKALWDELA
        D              GH   ++QLQQLSIDGK  A +V +  ANS+ ES    TKEECS QS NA R+ +IYK+IASHRQGNSSITSY TKL+ LW+EL 
Subjt:  DVKVGDVGNQKRYSGGHFGALDQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLKALWDELA

Query:  AYMDVPQCSCSAS---EKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREE
         Y D+PQC CS S   +K ++ +EREKVMQFLVGL+DSYS IC+Q+L++RP+PTV+KAYS+II +E
Subjt:  AYMDVPQCSCSAS---EKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like | 8.1e-119 | 62.86
Query:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ
        MVRGIISPPRSRSSPRE  PF  NNN  TNPPSRPNYMSPRRRPTTP+NPNE  +HR+E   T VK    R TK  DRS N  RIDPS+  SK A SR  
Subjt:  MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQ

Query:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD
         AAP+PNE+KLD     TKTA K +TR SSPR TKPI+       PSK N KG A G G RSD S  AK S        DS+  +PK+L S G L  QQD
Subjt:  TAAPNPNEKKLDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQD

Query:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDTK-EECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK
         ++      + YS G +GA       + +L QLS+D KD ANIVL  HAN +YES +S+TK EECSSQ NN++RMFQIYKEIASH QGNSSITSYITKLK
Subjt:  VKVGDVGNQKRYSGGHFGA-------LDQLQQLSIDGKDFANIVLQVHANSIYESASSDTK-EECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLK

Query:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS
        ALWDEL AY+D P+CSC ++EK +E IEREKVMQFL+GL+DSYS ICAQ+L M+P+PTV+KA   I+REEKRRELVLSLEIVAAKV+QNNWLLQNGHS +
Subjt:  ALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVGLDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNS

Query:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC
        GDN      E VD  NLQ    D QNE  + P  EPLLIDLGSPVRC
Subjt:  GDNSNDGIGEVVD--NLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 1.4e-09 | 26
Query:  RMFQIYKEIASHRQGNSSITSYITKLKALWDELAAYMDVPQ-----CSCSASEKVNEHIEREKVMQFLVG--LDDSYSNICAQVLVMRPYPTVDKAYSVI
        +++Q+ + +A+ RQG  S+  Y  KL  +W EL+ Y  +P+     C+C  +++  E  E+E+  +FL+G  L+  +  +  +++  +P P++ +A++++
Subjt:  RMFQIYKEIASHRQGNSSITSYITKLKALWDELAAYMDVPQ-----CSCSASEKVNEHIEREKVMQFLVG--LDDSYSNICAQVLVMRPYPTVDKAYSVI


Sequences
CDS sequence
ATGGTAAGAGGCATTATCAGCCCTCCAAGATCCAGATCTTCTCCCAGAGAAAGAACACCCTTCAACAACAACAACAATGCCCCCACCAATCCGCCCTCCAGACCCAATTA
CATGTCCCCACGTCGGCGCCCGACGACTCCGGTCAATCCCAACGAGCAGCCAAGTCATCGAAGAGAAACCCACTCCACCGCCGTAAAATCCAGCGCCATACGGGCAACCA
AAGCCAACGACAGATCTTTAAATACTTCACGCATCGACCCATCAAAATCCACTTCAAAGCCTGCGTCCTCAAGGCGACAGACAGCAGCCCCAAATCCAAACGAGAAGAAA
TTAGACAGCAACGGTCCCAGCACCAAAACAGCCGCTAAAATCAGCACTAGATCATCTTCTCCTCGCGCAACCAAACCGATTAGTCAACCGCCGCGAGGACTTACGCCTTC
GAAGGGCAATGTAAAAGGATCGGCGATTGGTTGTGGTCTGAGATCCGATGTTTCGTCTGGTGCTAAGGCGAGCGGCGTTGCGAGTCAAAACCATTTTGATTCTCGAAACG
ATTCTCCCAAGCATTTGGCGAGCGGTGGGTCGCTGGGTCATCAGCAAGATGTGAAAGTTGGCGATGTTGGGAATCAGAAGCGTTATTCGGGTGGACATTTCGGTGCTCTT
GATCAGTTACAACAGCTTTCTATTGACGGTAAGGATTTTGCGAACATCGTCCTTCAAGTTCATGCAAACTCAATATACGAATCAGCGAGTTCAGATACAAAGGAAGAATG
TTCTTCTCAAAGCAACAATGCTACAAGAATGTTTCAAATTTACAAGGAAATTGCATCTCATCGTCAAGGAAACTCCTCCATTACATCTTACATCACAAAGCTGAAGGCAT
TATGGGATGAACTTGCAGCCTACATGGATGTGCCTCAATGTTCTTGCAGTGCAAGTGAGAAGGTGAATGAGCACATAGAGAGAGAAAAAGTTATGCAATTTCTTGTGGGA
TTAGACGATTCTTATTCCAACATTTGCGCCCAAGTCCTTGTTATGAGGCCATATCCAACTGTTGACAAAGCTTATTCTGTAATAATTCGAGAAGAAAAACGTAGGGAATT
GGTTTTATCATTAGAAATTGTTGCAGCTAAAGTGATGCAAAATAATTGGCTTCTTCAGAATGGTCATTCCAATAGTGGTGATAATAGTAATGATGGTATTGGAGAAGTTG
TTGATAATCTTCAAGGGCAGCATGTCGATCAACAAAATGAAGTTACGAACTTCCCGGCCACTGAGCCATTGCTGATAGACCTTGGCTCTCCTGTGCGATGTTGA
mRNA sequence
ATGGTAAGAGGCATTATCAGCCCTCCAAGATCCAGATCTTCTCCCAGAGAAAGAACACCCTTCAACAACAACAACAATGCCCCCACCAATCCGCCCTCCAGACCCAATTA
CATGTCCCCACGTCGGCGCCCGACGACTCCGGTCAATCCCAACGAGCAGCCAAGTCATCGAAGAGAAACCCACTCCACCGCCGTAAAATCCAGCGCCATACGGGCAACCA
AAGCCAACGACAGATCTTTAAATACTTCACGCATCGACCCATCAAAATCCACTTCAAAGCCTGCGTCCTCAAGGCGACAGACAGCAGCCCCAAATCCAAACGAGAAGAAA
TTAGACAGCAACGGTCCCAGCACCAAAACAGCCGCTAAAATCAGCACTAGATCATCTTCTCCTCGCGCAACCAAACCGATTAGTCAACCGCCGCGAGGACTTACGCCTTC
GAAGGGCAATGTAAAAGGATCGGCGATTGGTTGTGGTCTGAGATCCGATGTTTCGTCTGGTGCTAAGGCGAGCGGCGTTGCGAGTCAAAACCATTTTGATTCTCGAAACG
ATTCTCCCAAGCATTTGGCGAGCGGTGGGTCGCTGGGTCATCAGCAAGATGTGAAAGTTGGCGATGTTGGGAATCAGAAGCGTTATTCGGGTGGACATTTCGGTGCTCTT
GATCAGTTACAACAGCTTTCTATTGACGGTAAGGATTTTGCGAACATCGTCCTTCAAGTTCATGCAAACTCAATATACGAATCAGCGAGTTCAGATACAAAGGAAGAATG
TTCTTCTCAAAGCAACAATGCTACAAGAATGTTTCAAATTTACAAGGAAATTGCATCTCATCGTCAAGGAAACTCCTCCATTACATCTTACATCACAAAGCTGAAGGCAT
TATGGGATGAACTTGCAGCCTACATGGATGTGCCTCAATGTTCTTGCAGTGCAAGTGAGAAGGTGAATGAGCACATAGAGAGAGAAAAAGTTATGCAATTTCTTGTGGGA
TTAGACGATTCTTATTCCAACATTTGCGCCCAAGTCCTTGTTATGAGGCCATATCCAACTGTTGACAAAGCTTATTCTGTAATAATTCGAGAAGAAAAACGTAGGGAATT
GGTTTTATCATTAGAAATTGTTGCAGCTAAAGTGATGCAAAATAATTGGCTTCTTCAGAATGGTCATTCCAATAGTGGTGATAATAGTAATGATGGTATTGGAGAAGTTG
TTGATAATCTTCAAGGGCAGCATGTCGATCAACAAAATGAAGTTACGAACTTCCCGGCCACTGAGCCATTGCTGATAGACCTTGGCTCTCCTGTGCGATGTTGA
Protein sequence
MVRGIISPPRSRSSPRERTPFNNNNNAPTNPPSRPNYMSPRRRPTTPVNPNEQPSHRRETHSTAVKSSAIRATKANDRSLNTSRIDPSKSTSKPASSRRQTAAPNPNEKK
LDSNGPSTKTAAKISTRSSSPRATKPISQPPRGLTPSKGNVKGSAIGCGLRSDVSSGAKASGVASQNHFDSRNDSPKHLASGGSLGHQQDVKVGDVGNQKRYSGGHFGAL
DQLQQLSIDGKDFANIVLQVHANSIYESASSDTKEECSSQSNNATRMFQIYKEIASHRQGNSSITSYITKLKALWDELAAYMDVPQCSCSASEKVNEHIEREKVMQFLVG
LDDSYSNICAQVLVMRPYPTVDKAYSVIIREEKRRELVLSLEIVAAKVMQNNWLLQNGHSNSGDNSNDGIGEVVDNLQGQHVDQQNEVTNFPATEPLLIDLGSPVRC
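The protein sequence should follow from the CDS by codon-wise translation. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used. `translate` is a hypothetical helper, and the two fragments are the first 30 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so a sequence wrapped over several lines can be
    passed in directly. Codons with non-ACGT letters raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


cds_head = "ATGGTAAGAGGCATTATCAGCCCTCCAAGA"  # first 30 nt of the CDS
cds_tail = "TCTCCTGTGCGATGTTGA"              # last 18 nt, including the stop

print(translate(cds_head))  # MVRGIISPPR
print(translate(cds_tail))  # SPVRC
```

Both fragments agree with the start (MVRGIISPPR) and end (SPVRC) of the protein sequence above. Passing the full CDS to `translate` would extend the same check to the whole sequence.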