CuGenDBv2

Sgr002954 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr002954
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: serine/arginine repetitive matrix protein 1-like
Genome location: tig00001864:45699..47471
RNA-Seq Expression: Sgr002954
Synteny: Sgr002954
Gene Ontology terms: NA
InterPro domains: NA
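The genome location in the record above follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00001864:45699..47471")
print(contig, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 1773 bp on contig tig00001864.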


Homology
GenBank top hits | e value | % identity | Alignment
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.6e-86 | % identity: 49.46
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK    R  K     ++PPR   S   S+ + +   + +
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN
        P+      KT  KT+   +S RP+ P              + PP       PS+  G         KGA  S SRSD   A    +          G  N
Subjt:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN

Query:  ---------PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKAL
                  YS G +   TL DPDVH +L QLS+D KDLANIVLHAN +YES+ S+T +EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKAL
Subjt:  ---------PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKAL

Query:  WDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQA
        WDEL +Y D P+CSC + EK SE  EREKVMQFL+GL+DSYSTICAQI  ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  
Subjt:  WDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQA

Query:  NEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC
        NE           EVD+ NLQ+ + DQ+E+                S P E LLIDLGSPVRC
Subjt:  NEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.5e-85 | % identity: 49.24
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK    R  K     ++PPR   S   S+ + +   + +
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN
        P+      KT  KT+   +S RP+ P              + PP       PS+  G         KGA  S SRSD   A    +          G  N
Subjt:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN

Query:  ---------PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKAL
                  YS G +   TL DPDVH +L QLS+D KDLANIVLHAN +YES+ S+T +EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKAL
Subjt:  ---------PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKAL

Query:  WDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQA
        WDEL +Y D P+CSC + +K SE  EREKVMQFL+GL+DSYSTICAQI  ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  
Subjt:  WDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQA

Query:  NEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC
        NE           EVD+ NLQ+ + DQ+E+                S P E LLIDLGSPVRC
Subjt:  NEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC

XP_022137024.1 uncharacterized protein LOC111008588 [Momordica charantia] | e value: 1.1e-104 | % identity: 56.74
Query:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS
        MTLRGLISPPRSRSSPR+ RPHNN A  NPPS+P Y+SP RRPT        + RK +      A+A R  K      SPPR R SPKP+ P    S   
Subjt:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS

Query:  NPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQ-----AVNI
        NP+  H+  K   KT+     ++PSSPR  T  QRLR  NGVEPP  PT   K + P+    PK      AIASASRSD P A++  +S       +   
Subjt:  NPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQ-----AVNI

Query:  SNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELAS
         ++ N +PYSGG +T L DP ++N LQ+LSIDGKDLA+I+LHANSIYESIGSDT EE S +++APRIFQIYKDIASHRQ N S+TSYFTKLK LWDEL +
Subjt:  SNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELAS

Query:  YS-DLPQ-CSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYS
        YS D+PQ CSC AMEKLS H EREKVMQFL+GL++SYSTIC QI L++PFPT+EKAY +IIREEKR ELV SLE+VAAKVM+NKWL QNDQSS   N Y 
Subjt:  YS-DLPQ-CSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYS

Query:  LLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC
             DD I  EV+ N ++                   +VE+PSFP+E LLIDLGSPVRC
Subjt:  LLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata] | e value: 1.7e-86 | % identity: 50.66
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK    R  K     ++PPR   S   S+ + +   + +
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN
        P+      KT  KT+   +S RP+ P T  ++   +  +G    S  +  KPS  +  T PKN           RS   G  N     Q V         
Subjt:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN

Query:  PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSD
         YS G +   TL DPDVH +L QLS+D KDLANIVLHAN +YES+ S+TK EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKALWDEL +Y D
Subjt:  PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSD

Query:  LPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNG
         P+CSC + EK SE  EREKVMQFL+GL+DSYSTICAQI  ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  NE       
Subjt:  LPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNG

Query:  DDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC
            EVD+ NLQ+ + DQ+E+                S P E LLIDLGSPVRC
Subjt:  DDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo] | e value: 1.7e-86 | % identity: 50.54
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSR--PSRNASGS
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +N NE  + RKE   T VK    R  K     ++PPR   S   S+  PSR A+ S
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSR--PSRNASGS

Query:  SNPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGN
          P+      KT  KT+   +S RP+ P              + PP       PS+  G         KGA  S SRSD   A    +          G 
Subjt:  SNPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGN

Query:  QN---------PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLK
         N          YS G +   TL DPDVH +L QLS+D KDLANIVLHAN +YES+ S+TK EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLK
Subjt:  QN---------PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLK

Query:  ALWDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSS
        ALWDEL +Y D+P+CSC + +K SE  EREKVMQFL+GLDDSYSTICAQI  ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +
Subjt:  ALWDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSS

Query:  QANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC
          NE           EVD+ NLQE + DQ+E+                S P E LLIDLGSPVRC
Subjt:  QANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C5Z8 uncharacterized protein LOC111008588 | e value: 5.3e-105 | % identity: 56.74
Query:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS
        MTLRGLISPPRSRSSPR+ RPHNN A  NPPS+P Y+SP RRPT        + RK +      A+A R  K      SPPR R SPKP+ P    S   
Subjt:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS

Query:  NPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQ-----AVNI
        NP+  H+  K   KT+     ++PSSPR  T  QRLR  NGVEPP  PT   K + P+    PK      AIASASRSD P A++  +S       +   
Subjt:  NPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQ-----AVNI

Query:  SNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELAS
         ++ N +PYSGG +T L DP ++N LQ+LSIDGKDLA+I+LHANSIYESIGSDT EE S +++APRIFQIYKDIASHRQ N S+TSYFTKLK LWDEL +
Subjt:  SNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELAS

Query:  YS-DLPQ-CSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYS
        YS D+PQ CSC AMEKLS H EREKVMQFL+GL++SYSTIC QI L++PFPT+EKAY +IIREEKR ELV SLE+VAAKVM+NKWL QNDQSS   N Y 
Subjt:  YS-DLPQ-CSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYS

Query:  LLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC
             DD I  EV+ N ++                   +VE+PSFP+E LLIDLGSPVRC
Subjt:  LLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC

A0A6J1C6U3 uncharacterized protein LOC111008934 isoform X1 | e value: 5.0e-47 | % identity: 42.13
Query:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMY-VSPYRRP-TVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASG
        MT RGLISPPR+ S P       NNAA NPP +P Y +SP  RP TVVNP+EQ         T+  ++AIR             + + P P     +   
Subjt:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMY-VSPYRRP-TVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASG

Query:  SSNPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLR-------RNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQA
          + ++N+S  KTTA                 T  QRLR        N    P  PT +   S P  A     N +    + +  +  P   +I  S Q 
Subjt:  SSNPHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLR-------RNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQA

Query:  VNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKD----LANIVLHANSIYESIGSD----TKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFT
         NISNVGN     G  +T+L  P ++N L +LS  G         I ++   I    G D       +CSS+++ PRIF+IYKDIASHRQGN SITSYFT
Subjt:  VNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKD----LANIVLHANSIYESIGSD----TKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFT

Query:  KLKALWDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKR
        +LK LWDEL +Y+DL QC C +     EH EREKVMQFL+GL+D YSTIC QI L+RPFPTVEKAY ++IREEKR
Subjt:  KLKALWDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKR

A0A6J1C6Z1 uncharacterized protein LOC111008978 | e value: 5.9e-48 | % identity: 39.51
Query:  RGLISPPRSRSSPRESRPHNNNAAGNPPSKPM------YVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNAS
        RGLISPPRS              AG+ PS+P+      Y+S  RRPT  +  +    R                       +PPR R+S +     +NA 
Subjt:  RGLISPPRSRSSPRESRPHNNNAAGNPPSKPM------YVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNAS

Query:  GSSN------PHHNHSGCKTTAKTSYAANSSRPS-SPRTITAQQRLR-----RNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNI
           N       ++N+S  KTTAK S    SSRPS SPR     ++ R      +N    P P   +    P  +T    N +      +  +  P   +I
Subjt:  GSSN------PHHNHSGCKTTAKTSYAANSSRPS-SPRTITAQQRLR-----RNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNI

Query:  AASVQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSID------------------GKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKD
          S Q  NI+NVG+     G  HT L  P +++ L +LS D                  GK +ANIVL  NSIY+ +GS+TK+E SS+++   IFQIYK 
Subjt:  AASVQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSID------------------GKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKD

Query:  IASHRQGNFSITSYFTKLKALWDELASYSDLPQC-SCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLE
         ASHRQ + S+TSYF KLK LWD+L +YSDLPQC S  A +KLSEH EREKV+QFL+GL+DSYSTI +QI  +RP PTVEKAY++ I+EEK+R L   L+
Subjt:  IASHRQGNFSITSYFTKLKALWDELASYSDLPQC-SCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLE

Query:  IVAAK
        ++  K
Subjt:  IVAAK

A0A6J1C7L7 uncharacterized protein LOC111008986 | e value: 5.0e-55 | % identity: 47.54
Query:  RGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPT---VVNPNEQGSQRKENHT-TTVKASAIRVGKNNHAGNSPPRNRSSPKPSR--PSRN--
        RGLISPP+SR S  ES    NNAA NPPS P Y+S  RR T   VVNP +Q    +  HT  T  ++AIR  KN           SSPKP+   PSR   
Subjt:  RGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPT---VVNPNEQGSQRKENHT-TTVKASAIRVGKNNHAGNSPPRNRSSPKPSR--PSRN--

Query:  --ASGSSNPHHNHSGCK-TTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAV
          A  + N H+++S  K T AK + + NS+                 NGV+           QPRG T          I+ AS     G+S+ +   Q  
Subjt:  --ASGSSNPHHNHSGCK-TTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAV

Query:  NISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDEL
        N +N+  +   S    T +G P V NQLQQLSIDGK  A +V  ANS+ ES+G  TKEECS +++A RI +IYKDIASHRQGN SITSYFTKL+ LW+EL
Subjt:  NISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDEL

Query:  ASYSDLPQCSCRAM--EKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREE
         +YSDLPQC   +   +K S+  EREKVMQFL+GL+DSYSTIC+QI L+RPFPTVEKAY +II +E
Subjt:  ASYSDLPQCSCRAM--EKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like | e value: 8.5e-87 | % identity: 50.66
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK    R  K     ++PPR   S   S+ + +   + +
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN
        P+      KT  KT+   +S RP+ P T  ++   +  +G    S  +  KPS  +  T PKN           RS   G  N     Q V         
Subjt:  PHHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQN

Query:  PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSD
         YS G +   TL DPDVH +L QLS+D KDLANIVLHAN +YES+ S+TK EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKALWDEL +Y D
Subjt:  PYSGGHH--TTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSD

Query:  LPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNG
         P+CSC + EK SE  EREKVMQFL+GL+DSYSTICAQI  ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  NE       
Subjt:  LPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYSTICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNG

Query:  DDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC
            EVD+ NLQ+ + DQ+E+                S P E LLIDLGSPVRC
Subjt:  DDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 8.3e-10 | % identity: 29
Query:  RIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQ-----CSCRAMEKLSEHGEREKVMQFLLG--LDDSYSTICAQIFLLRPFPTVEKAYYVI
        +I+Q+ + +A+ RQG  S+  YF KL  +W EL+ Y+ +P+     C+C   ++  E  E+E+  +FL+G  L+  +  +  +I   +P P++ +A+ ++
Subjt:  RIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQ-----CSCRAMEKLSEHGEREKVMQFLLG--LDDSYSTICAQIFLLRPFPTVEKAYYVI
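
The % identity values listed with each hit can be related to the alignment blocks. In standard BLAST protein output, the unlabeled middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. Percent identity is the number of identical positions divided by the alignment length. The sketch below illustrates this on a short made-up fragment, not on data from this page. The function name `percent_identity` is hypothetical, and it is an assumption that this page's alignments follow the standard midline convention.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline mark identical residues; '+' and spaces
    are not counted. alignment_length is the number of aligned
    columns, including gap columns.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length


# Toy example: 8 aligned columns, 6 of them identical.
query = "MTLRGLIS"
midline = "MT RG+IS"
print(percent_identity(midline, len(query)))
```

For a multi-block alignment, the midlines of all blocks would be concatenated and the total column count used as the alignment length.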


Sequences
CDS sequence
ATGACACTAAGAGGCCTCATCAGTCCCCCGAGATCCAGATCTTCTCCAAGAGAAAGCAGACCACACAATAATAATGCAGCCGGCAACCCGCCGTCGAAACCCATGTACGT
GTCACCATATCGGAGACCGACGGTGGTGAATCCTAATGAGCAGGGGAGTCAGAGAAAAGAAAACCACACCACCACCGTAAAAGCCAGTGCTATACGTGTAGGCAAGAATA
ACCACGCCGGTAACAGTCCGCCGAGAAATAGATCTTCTCCAAAGCCAAGCAGACCATCGCGCAACGCCTCCGGCAGCAGCAACCCACATCACAACCATTCCGGCTGCAAA
ACGACCGCTAAGACCAGCTACGCTGCAAATAGTTCTAGACCATCTTCCCCTCGTACAATCACTGCTCAGCAGAGACTGCGTAGAAACAATGGGGTTGAACCTCCTTCACC
AACGACCACTGTGAAACCAAGTCAACCGCGAGGAGCCACACATCCCAAGAACAATACTGTAAAAGGAGCAATTGCTTCTGCTTCAAGATCCGATGTTCCTGGTGCTTCTA
ATATTGCAGCAAGCGTCCAAGCTGTGAACATCAGTAATGTTGGGAATCAGAACCCTTATTCGGGTGGACATCACACTACACTCGGTGATCCTGATGTTCACAATCAATTA
CAACAGCTTTCTATAGACGGTAAAGATCTTGCAAACATAGTCCTTCATGCAAACTCAATATATGAATCAATTGGCTCTGATACGAAGGAAGAATGTTCTTCTGAAACCGA
TGCTCCGAGAATATTTCAAATTTACAAGGACATTGCATCTCATCGTCAAGGAAACTTCTCCATTACATCCTACTTTACAAAGCTGAAGGCATTATGGGATGAACTTGCAT
CCTACAGTGATCTGCCTCAGTGTTCCTGCCGTGCAATGGAGAAGCTAAGTGAGCATGGGGAGAGAGAGAAGGTGATGCAATTTCTTTTGGGATTAGACGATTCTTATTCC
ACCATCTGCGCCCAAATCTTTCTTTTGCGGCCATTTCCAACAGTCGAGAAAGCTTATTATGTAATCATTCGAGAAGAAAAACGGAGGGAACTGGTTTTTTCATTAGAAAT
TGTTGCAGCAAAAGTAATGCAAAACAAGTGGCTTTTCCAAAATGATCAGTCGAGCAGCCAAGCAAATGAGTACAGTTTATTAAGTAATGGAGATGATCCTATTGAAGTTG
ATAATAATCTTCAGGAGCAGCAAGTTGATCAAAGTGAAAGTGGCACAACGACGCCGGCGTCTATATACAACCCTGACGTCGAAGTTCCGAGCTTCCCGCATGAGCAATTG
CTGATAGACCTTGGCTCTCCCGTGCGATGTTGA
mRNA sequence
ATGACACTAAGAGGCCTCATCAGTCCCCCGAGATCCAGATCTTCTCCAAGAGAAAGCAGACCACACAATAATAATGCAGCCGGCAACCCGCCGTCGAAACCCATGTACGT
GTCACCATATCGGAGACCGACGGTGGTGAATCCTAATGAGCAGGGGAGTCAGAGAAAAGAAAACCACACCACCACCGTAAAAGCCAGTGCTATACGTGTAGGCAAGAATA
ACCACGCCGGTAACAGTCCGCCGAGAAATAGATCTTCTCCAAAGCCAAGCAGACCATCGCGCAACGCCTCCGGCAGCAGCAACCCACATCACAACCATTCCGGCTGCAAA
ACGACCGCTAAGACCAGCTACGCTGCAAATAGTTCTAGACCATCTTCCCCTCGTACAATCACTGCTCAGCAGAGACTGCGTAGAAACAATGGGGTTGAACCTCCTTCACC
AACGACCACTGTGAAACCAAGTCAACCGCGAGGAGCCACACATCCCAAGAACAATACTGTAAAAGGAGCAATTGCTTCTGCTTCAAGATCCGATGTTCCTGGTGCTTCTA
ATATTGCAGCAAGCGTCCAAGCTGTGAACATCAGTAATGTTGGGAATCAGAACCCTTATTCGGGTGGACATCACACTACACTCGGTGATCCTGATGTTCACAATCAATTA
CAACAGCTTTCTATAGACGGTAAAGATCTTGCAAACATAGTCCTTCATGCAAACTCAATATATGAATCAATTGGCTCTGATACGAAGGAAGAATGTTCTTCTGAAACCGA
TGCTCCGAGAATATTTCAAATTTACAAGGACATTGCATCTCATCGTCAAGGAAACTTCTCCATTACATCCTACTTTACAAAGCTGAAGGCATTATGGGATGAACTTGCAT
CCTACAGTGATCTGCCTCAGTGTTCCTGCCGTGCAATGGAGAAGCTAAGTGAGCATGGGGAGAGAGAGAAGGTGATGCAATTTCTTTTGGGATTAGACGATTCTTATTCC
ACCATCTGCGCCCAAATCTTTCTTTTGCGGCCATTTCCAACAGTCGAGAAAGCTTATTATGTAATCATTCGAGAAGAAAAACGGAGGGAACTGGTTTTTTCATTAGAAAT
TGTTGCAGCAAAAGTAATGCAAAACAAGTGGCTTTTCCAAAATGATCAGTCGAGCAGCCAAGCAAATGAGTACAGTTTATTAAGTAATGGAGATGATCCTATTGAAGTTG
ATAATAATCTTCAGGAGCAGCAAGTTGATCAAAGTGAAAGTGGCACAACGACGCCGGCGTCTATATACAACCCTGACGTCGAAGTTCCGAGCTTCCCGCATGAGCAATTG
CTGATAGACCTTGGCTCTCCCGTGCGATGTTGA
Protein sequence
MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSNPHHNHSGCK
TTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVQAVNISNVGNQNPYSGGHHTTLGDPDVHNQL
QQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCRAMEKLSEHGEREKVMQFLLGLDDSYS
TICAQIFLLRPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDNNLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQL
LIDLGSPVRC
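
A quick sanity check on the sequences above is that the CDS length equals three times the protein length plus one stop codon. The sketch below checks only lengths and the start and stop codons. It does not translate the sequence. The function name `cds_matches_protein` is hypothetical, and it assumes the CDS includes its terminal stop codon, as the CDS listed here appears to (it ends in TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_matches_protein(cds: str, protein: str) -> bool:
    """Check that a CDS and protein have consistent lengths and ends.

    Whitespace (including line breaks) is stripped from both inputs.
    Assumes the CDS includes its stop codon and the protein does not
    carry a trailing '*'.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    if len(cds) % 3 != 0:
        return False
    has_start = cds.startswith("ATG") and protein.startswith("M")
    has_stop = cds[-3:] in STOP_CODONS
    return has_start and has_stop and len(cds) == 3 * (len(protein) + 1)


# Toy example: the first five codons of the CDS above plus a stop codon.
print(cds_matches_protein("ATGACACTAAGAGGCTGA", "MTLRG"))
```

Pasting the full CDS and protein strings from this page into the function applies the same check to the whole gene.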