; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019619 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019619
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionserine/arginine repetitive matrix protein 1-like
Genome locationtig00153349:432082..433856
RNA-Seq ExpressionSgr019619
SyntenySgr019619
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573173.1 hypothetical protein SDJN03_27060, partial [Cucurbita argyrosperma subsp. sororia]1.3e-9048.16Show/hide
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK                          RP++    SSN
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK
         PRID S+P + +  SR   AAP+PN    ++KLD      KT  KT+   +S RP+ P              + PP       PS+  G         K
Subjt:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK

Query:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH
        GA  S SRSD   A                  K  DS+  +PK+L S     QQ   I    +   Y      TL DPDVH +L QLS+D KDLANIVLH
Subjt:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH

Query:  ANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA
        AN +YES+ S+T +EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKALWDEL +Y D P+CSCG+ EK SE  EREKVMQFL+GL++SYSTICA
Subjt:  ANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA

Query:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP
        QIL ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  NE           EVD+ NLQ+ + DQ+E+                
Subjt:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP

Query:  SFPHEQLLIDLGSPVRC
        S P E LLIDLGSPVRC
Subjt:  SFPHEQLLIDLGSPVRC

KAG7012356.1 hypothetical protein SDJN02_25108, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-9047.97Show/hide
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK                          RP++    SSN
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK
         PRID S+P + +  SR   AAP+PN    ++KLD      KT  KT+   +S RP+ P              + PP       PS+  G         K
Subjt:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK

Query:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH
        GA  S SRSD   A                  K  DS+  +PK+L S     QQ   I    +   Y      TL DPDVH +L QLS+D KDLANIVLH
Subjt:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH

Query:  ANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA
        AN +YES+ S+T +EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKALWDEL +Y D P+CSCG+ +K SE  EREKVMQFL+GL++SYSTICA
Subjt:  ANSIYESIGSDT-KEECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA

Query:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP
        QIL ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  NE           EVD+ NLQ+ + DQ+E+                
Subjt:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP

Query:  SFPHEQLLIDLGSPVRC
        S P E LLIDLGSPVRC
Subjt:  SFPHEQLLIDLGSPVRC

XP_022137024.1 uncharacterized protein LOC111008588 [Momordica charantia]1.0e-11155.02Show/hide
Query:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS
        MTLRGLISPPRSRSSPR+ RPHNN A  NPPS+P Y+SP RRPT        + RK +      A+A R  K      SPPR R SP             
Subjt:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS

Query:  NPRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTV
               KPTAPIV SRR    PNPNDL +QKKLD          KT+     ++PSSPR  T  QRLR  NGVEPP  PT   K + P+    PK    
Subjt:  NPRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTV

Query:  KGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPSHGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH
          AIASASRSD P A++                      + S KHLPS G  Q      ++ N +PYSGG +T L DP ++N LQ+LSIDGKDLA+I+LH
Subjt:  KGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPSHGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH

Query:  ANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYS-DLPQ-CSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA
        ANSIYESIGSDT EE S +++APRIFQIYKDIASHRQ N S+TSYFTKLK LWDEL +YS D+PQ CSCGAMEKLS H EREKVMQFL+GL+NSYSTIC 
Subjt:  ANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYS-DLPQ-CSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA

Query:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEV
        QILL+QPFPT+EKAY +IIREEKR ELV SLE+VAAKVM+NKWL QNDQSS   N Y      DD I  EV+ N ++                   +VE+
Subjt:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEV

Query:  PSFPHEQLLIDLGSPVRC
        PSFP+E LLIDLGSPVRC
Subjt:  PSFPHEQLLIDLGSPVRC

XP_022954810.1 serine/arginine repetitive matrix protein 1-like [Cucurbita moschata]1.0e-9048.36Show/hide
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK                          RP++    SSN
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK
         PRID S+P + +  SR   AAP+PN    ++KLD      KT  KT+   +S RP+ P              + PPS                K+N  K
Subjt:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK

Query:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH
        GA  S SRSD   A                  K  DS+  +PK+L S     QQ   I    +   Y      TL DPDVH +L QLS+D KDLANIVLH
Subjt:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH

Query:  ANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA
        AN +YES+ S+TK EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKALWDEL +Y D P+CSCG+ EK SE  EREKVMQFL+GL++SYSTICA
Subjt:  ANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA

Query:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP
        QIL ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  NE           EVD+ NLQ+ + DQ+E+                
Subjt:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP

Query:  SFPHEQLLIDLGSPVRC
        S P E LLIDLGSPVRC
Subjt:  SFPHEQLLIDLGSPVRC

XP_023542694.1 uncharacterized protein LOC111802521 [Cucurbita pepo subsp. pepo]2.1e-9148.55Show/hide
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +N NE  + RKE   T VK                          RP++    SSN
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK
         PRID S+P + +V SR   AAP+PN    ++KLD      KT  KT+   +S RP+ P              + PP       PS+  G         K
Subjt:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK

Query:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH
        GA  S SRSD   A                  K  DS+  +PK+L S     QQ   I    +   Y      TL DPDVH +L QLS+D KDLANIVLH
Subjt:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH

Query:  ANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA
        AN +YES+ S+TK EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKALWDEL +Y D+P+CSCG+ +K SE  EREKVMQFL+GLD+SYSTICA
Subjt:  ANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA

Query:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP
        QIL ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  NE           EVD+ NLQE + DQ+E+                
Subjt:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP

Query:  SFPHEQLLIDLGSPVRC
        S P E LLIDLGSPVRC
Subjt:  SFPHEQLLIDLGSPVRC

TrEMBL top hitse value%identityAlignment
A0A6J1C5Z8 uncharacterized protein LOC1110085885.1e-11255.02Show/hide
Query:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS
        MTLRGLISPPRSRSSPR+ RPHNN A  NPPS+P Y+SP RRPT        + RK +      A+A R  K      SPPR R SP             
Subjt:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSS

Query:  NPRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTV
               KPTAPIV SRR    PNPNDL +QKKLD          KT+     ++PSSPR  T  QRLR  NGVEPP  PT   K + P+    PK    
Subjt:  NPRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPP-SPTTTVKPSQPRGATHPKNNTV

Query:  KGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPSHGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH
          AIASASRSD P A++                      + S KHLPS G  Q      ++ N +PYSGG +T L DP ++N LQ+LSIDGKDLA+I+LH
Subjt:  KGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPSHGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH

Query:  ANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYS-DLPQ-CSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA
        ANSIYESIGSDT EE S +++APRIFQIYKDIASHRQ N S+TSYFTKLK LWDEL +YS D+PQ CSCGAMEKLS H EREKVMQFL+GL+NSYSTIC 
Subjt:  ANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYS-DLPQ-CSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA

Query:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEV
        QILL+QPFPT+EKAY +IIREEKR ELV SLE+VAAKVM+NKWL QNDQSS   N Y      DD I  EV+ N ++                   +VE+
Subjt:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPI--EVDNNLQEQQVDQSESGTTTPASIYNPDVEV

Query:  PSFPHEQLLIDLGSPVRC
        PSFP+E LLIDLGSPVRC
Subjt:  PSFPHEQLLIDLGSPVRC

A0A6J1C6U3 uncharacterized protein LOC111008934 isoform X13.1e-5342.43Show/hide
Query:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMY-VSPYRRP-TVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASG
        MT RGLISPPR+ S P       NNAA NPP +P Y +SP  RP TVVNP+EQ       +   +KA++                         S NA  
Subjt:  MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMY-VSPYRRP-TVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASG

Query:  SSNPRIDASKPTAPIVSSRRPTAAPNPNDLQS-QKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNN
        +++                   A PNP DLQ+ QKKLDHN++            NSS  ++    T  QRLR                    GAT   NN
Subjt:  SSNPRIDASKPTAPIVSSRRPTAAPNPNDLQS-QKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNN

Query:  TV---KGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNN-SPKHLPSHGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKD-
        T    KG  AS SRSD   A             NLI+ K+ DS NN SP HL  HG  Q+  NISNVGN     G  +T+L  P ++N L +LS  G   
Subjt:  TV---KGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNN-SPKHLPSHGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKD-

Query:  ---LANIVLHANSIYESIGSD----TKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFL
              I ++   I    G D       +CSS+++ PRIF+IYKDIASHRQGN SITSYFT+LK LWDEL +Y+DL QC C +     EH EREKVMQFL
Subjt:  ---LANIVLHANSIYESIGSD----TKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFL

Query:  LGLDNSYSTICAQILLLQPFPTVEKAYYVIIREEKR
        +GL++ YSTIC QILL++PFPTVEKAY ++IREEKR
Subjt:  LGLDNSYSTICAQILLLQPFPTVEKAYYVIIREEKR

A0A6J1C6Z1 uncharacterized protein LOC1110089788.2e-5444.53Show/hide
Query:  SPPRNRSSPKPSRP-----------SRNASGSS--NPRIDASKPTAPIVSSRRPTAAPNPNDLQSQ-KKLD--HNHSGCKTTAKTSYAANSSRPS-SPRT
        SPPR+ +   PSRP           SR  + SS  +P  D   P     S RR TA  N +DLQ+  KKLD  +N+S  KTTAK S    SSRPS SPR 
Subjt:  SPPRNRSSPKPSRP-----------SRNASGSS--NPRIDASKPTAPIVSSRRPTAAPNPNDLQSQ-KKLD--HNHSGCKTTAKTSYAANSSRPS-SPRT

Query:  ITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDS-RNNSPKHLPSHGGHQQAVNISNV
                  NG+E           QPRG T   +N      A+      P   + A       + NLI+ K+FDS  N+SP HL  HG  Q+  NI+NV
Subjt:  ITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDS-RNNSPKHLPSHGGHQQAVNISNV

Query:  GNQNPYSGGHHTTLGDPDVHNQLQQLSID------------------GKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSIT
        G+     G  HT L  P +++ L +LS D                  GK +ANIVL  NSIY+ +GS+TK+E SS+++   IFQIYK  ASHRQ + S+T
Subjt:  GNQNPYSGGHHTTLGDPDVHNQLQQLSID------------------GKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSIT

Query:  SYFTKLKALWDELASYSDLPQC-SCGAMEKLSEHGEREKVMQFLLGLDNSYSTICAQILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAK
        SYF KLK LWD+L +YSDLPQC S GA +KLSEH EREKV+QFL+GL++SYSTI +QIL ++P PTVEKAY++ I+EEK+R L   L+++  K
Subjt:  SYFTKLKALWDELASYSDLPQC-SCGAMEKLSEHGEREKVMQFLLGLDNSYSTICAQILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAK

A0A6J1C7L7 uncharacterized protein LOC1110089864.1e-5342.69Show/hide
Query:  RGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPT---VVNPNEQGSQRKENHT-TTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGS
        RGLISPP+SR S  ES    NNAA NPPS P Y+S  RR T   VVNP +Q    +  HT  T  ++AIR  KN           SSPKP+         
Subjt:  RGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPT---VVNPNEQGSQRKENHT-TTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGS

Query:  SNPRIDASKPTAPIVSSRRPTAAP-NPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNT
                 P  P  S RRPT AP NPN           H+   TT  T     +SR S+             NGV+           QPRG T      
Subjt:  SNPRIDASKPTAPIVSSRRPTAAP-NPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNT

Query:  VKGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPSHG-GHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIV
                              +++  SG+                  SHG GH Q  N +N+  +   S    T +G P V NQLQQLSIDGK  A +V
Subjt:  VKGAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPSHG-GHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIV

Query:  LHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCS--CGAMEKLSEHGEREKVMQFLLGLDNSYSTI
          ANS+ ES+G  TKEECS +++A RI +IYKDIASHRQGN SITSYFTKL+ LW+EL +YSDLPQC       +K S+  EREKVMQFL+GL++SYSTI
Subjt:  LHANSIYESIGSDTKEECSSETDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCS--CGAMEKLSEHGEREKVMQFLLGLDNSYSTI

Query:  CAQILLLQPFPTVEKAYYVIIREE
        C+QILL++PFPTVEKAY +II +E
Subjt:  CAQILLLQPFPTVEKAYYVIIREE

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like4.9e-9148.36Show/hide
Query:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN
        +RG+ISPPRSRSSPRESRP NNN A NPPS+P Y+SP RRPT  +NPNE  + RKE   T VK                          RP++    SSN
Subjt:  LRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTV-VNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSN

Query:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK
         PRID S+P + +  SR   AAP+PN    ++KLD      KT  KT+   +S RP+ P              + PPS                K+N  K
Subjt:  -PRIDASKPTAPIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVK

Query:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH
        GA  S SRSD   A                  K  DS+  +PK+L S     QQ   I    +   Y      TL DPDVH +L QLS+D KDLANIVLH
Subjt:  GAIASASRSDVPGASNIAASVVAKESGNLISQKYFDSRNNSPKHLPS-HGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLH

Query:  ANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA
        AN +YES+ S+TK EECSS+  ++ R+FQIYK+IASH QGN SITSY TKLKALWDEL +Y D P+CSCG+ EK SE  EREKVMQFL+GL++SYSTICA
Subjt:  ANSIYESIGSDTK-EECSSE-TDAPRIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICA

Query:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP
        QIL ++PFPTVEKA   I+REEKRRELV SLEIVAAKV+QN WL QN  S +  NE           EVD+ NLQ+ + DQ+E+                
Subjt:  QILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKWLFQNDQSSSQANEYSLLSNGDDPIEVDN-NLQEQQVDQSESGTTTPASIYNPDVEVP

Query:  SFPHEQLLIDLGSPVRC
        S P E LLIDLGSPVRC
Subjt:  SFPHEQLLIDLGSPVRC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.9e-1030Show/hide
Query:  RIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCG-----AMEKLSEHGEREKVMQFLLG--LDNSYSTICAQILLLQPFPTVEKAYYVI
        +I+Q+ + +A+ RQG  S+  YF KL  +W EL+ Y+ +P+C CG       ++  E  E+E+  +FL+G  L+  +  +  +I+  +P P++ +A+ ++
Subjt:  RIFQIYKDIASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCG-----AMEKLSEHGEREKVMQFLLG--LDNSYSTICAQILLLQPFPTVEKAYYVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTAAGAGGCCTCATCAGTCCCCCGAGATCCAGATCTTCTCCGAGAGAAAGCAGACCACACAATAATAATGCAGCCGGCAACCCGCCGTCGAAACCCATGTACGT
GTCACCATATCGGAGACCGACGGTGGTGAATCCTAATGAGCAGGGGAGTCAGAGAAAAGAAAACCACACCACCACCGTAAAAGCCAGTGCTATACGTGTAGGCAAGAATA
ACCACGCCGGTAACAGTCCGCCGAGAAATAGATCTTCTCCAAAGCCAAGCAGACCATCGCGCAACGCCTCCGGCAGCAGCAACCCACGTATTGATGCATCAAAACCCACT
GCTCCGATCGTCTCTTCCCGACGACCCACGGCGGCACCAAATCCTAACGATCTGCAAAGTCAGAAGAAATTAGATCACAACCATTCCGGCTGCAAAACGACCGCTAAGAC
CAGCTACGCTGCAAATAGTTCTAGACCATCTTCCCCTCGTACAATCACTGCTCAGCAGAGACTGCGTAGAAACAATGGGGTTGAACCTCCTTCACCAACGACCACTGTGA
AACCAAGTCAACCGCGAGGAGCCACACATCCCAAGAACAATACTGTAAAAGGAGCAATTGCTTCTGCTTCAAGATCCGATGTTCCTGGTGCTTCTAATATTGCAGCAAGC
GTCGTAGCCAAGGAGAGTGGGAATCTGATCAGTCAAAAATATTTTGATTCTCGAAATAATTCTCCCAAGCATTTACCAAGCCATGGCGGTCATCAGCAAGCTGTGAACAT
CAGTAATGTTGGGAATCAGAACCCTTATTCGGGTGGACATCACACTACACTCGGTGATCCTGATGTTCACAATCAATTACAACAGCTTTCTATAGACGGTAAAGATCTTG
CAAACATAGTCCTTCATGCAAACTCAATATATGAATCAATTGGCTCTGATACGAAGGAAGAATGTTCTTCTGAAACCGATGCTCCGAGAATATTTCAAATTTACAAGGAC
ATTGCATCTCATCGTCAAGGAAACTTCTCCATTACATCCTACTTTACAAAGCTGAAGGCATTATGGGATGAACTTGCATCCTACAGTGATCTGCCTCAGTGTTCCTGCGG
TGCAATGGAGAAGCTAAGTGAGCATGGGGAGAGAGAGAAGGTGATGCAATTTCTTTTGGGATTAGACAATTCTTATTCCACCATCTGCGCCCAAATCCTTCTTTTGCAGC
CATTTCCAACAGTCGAGAAAGCTTATTATGTAATCATTCGAGAAGAAAAACGGAGGGAACTGGTTTTTTCATTAGAAATTGTTGCAGCAAAAGTAATGCAAAACAAGTGG
CTTTTCCAAAATGATCAGTCGAGCAGCCAAGCAAATGAGTACAGTTTATTAAGTAATGGAGATGATCCTATTGAAGTTGATAATAATCTTCAGGAGCAGCAAGTTGATCA
AAGTGAAAGTGGCACAACGACGCCGGCGTCTATATACAACCCTGACGTCGAAGTTCCGAGCTTCCCGCATGAGCAATTGCTGATAGACCTTGGCTCTCCCGTGCGATGTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGACACTAAGAGGCCTCATCAGTCCCCCGAGATCCAGATCTTCTCCGAGAGAAAGCAGACCACACAATAATAATGCAGCCGGCAACCCGCCGTCGAAACCCATGTACGT
GTCACCATATCGGAGACCGACGGTGGTGAATCCTAATGAGCAGGGGAGTCAGAGAAAAGAAAACCACACCACCACCGTAAAAGCCAGTGCTATACGTGTAGGCAAGAATA
ACCACGCCGGTAACAGTCCGCCGAGAAATAGATCTTCTCCAAAGCCAAGCAGACCATCGCGCAACGCCTCCGGCAGCAGCAACCCACGTATTGATGCATCAAAACCCACT
GCTCCGATCGTCTCTTCCCGACGACCCACGGCGGCACCAAATCCTAACGATCTGCAAAGTCAGAAGAAATTAGATCACAACCATTCCGGCTGCAAAACGACCGCTAAGAC
CAGCTACGCTGCAAATAGTTCTAGACCATCTTCCCCTCGTACAATCACTGCTCAGCAGAGACTGCGTAGAAACAATGGGGTTGAACCTCCTTCACCAACGACCACTGTGA
AACCAAGTCAACCGCGAGGAGCCACACATCCCAAGAACAATACTGTAAAAGGAGCAATTGCTTCTGCTTCAAGATCCGATGTTCCTGGTGCTTCTAATATTGCAGCAAGC
GTCGTAGCCAAGGAGAGTGGGAATCTGATCAGTCAAAAATATTTTGATTCTCGAAATAATTCTCCCAAGCATTTACCAAGCCATGGCGGTCATCAGCAAGCTGTGAACAT
CAGTAATGTTGGGAATCAGAACCCTTATTCGGGTGGACATCACACTACACTCGGTGATCCTGATGTTCACAATCAATTACAACAGCTTTCTATAGACGGTAAAGATCTTG
CAAACATAGTCCTTCATGCAAACTCAATATATGAATCAATTGGCTCTGATACGAAGGAAGAATGTTCTTCTGAAACCGATGCTCCGAGAATATTTCAAATTTACAAGGAC
ATTGCATCTCATCGTCAAGGAAACTTCTCCATTACATCCTACTTTACAAAGCTGAAGGCATTATGGGATGAACTTGCATCCTACAGTGATCTGCCTCAGTGTTCCTGCGG
TGCAATGGAGAAGCTAAGTGAGCATGGGGAGAGAGAGAAGGTGATGCAATTTCTTTTGGGATTAGACAATTCTTATTCCACCATCTGCGCCCAAATCCTTCTTTTGCAGC
CATTTCCAACAGTCGAGAAAGCTTATTATGTAATCATTCGAGAAGAAAAACGGAGGGAACTGGTTTTTTCATTAGAAATTGTTGCAGCAAAAGTAATGCAAAACAAGTGG
CTTTTCCAAAATGATCAGTCGAGCAGCCAAGCAAATGAGTACAGTTTATTAAGTAATGGAGATGATCCTATTGAAGTTGATAATAATCTTCAGGAGCAGCAAGTTGATCA
AAGTGAAAGTGGCACAACGACGCCGGCGTCTATATACAACCCTGACGTCGAAGTTCCGAGCTTCCCGCATGAGCAATTGCTGATAGACCTTGGCTCTCCCGTGCGATGTT
GA
Protein sequenceShow/hide protein sequence
MTLRGLISPPRSRSSPRESRPHNNNAAGNPPSKPMYVSPYRRPTVVNPNEQGSQRKENHTTTVKASAIRVGKNNHAGNSPPRNRSSPKPSRPSRNASGSSNPRIDASKPT
APIVSSRRPTAAPNPNDLQSQKKLDHNHSGCKTTAKTSYAANSSRPSSPRTITAQQRLRRNNGVEPPSPTTTVKPSQPRGATHPKNNTVKGAIASASRSDVPGASNIAAS
VVAKESGNLISQKYFDSRNNSPKHLPSHGGHQQAVNISNVGNQNPYSGGHHTTLGDPDVHNQLQQLSIDGKDLANIVLHANSIYESIGSDTKEECSSETDAPRIFQIYKD
IASHRQGNFSITSYFTKLKALWDELASYSDLPQCSCGAMEKLSEHGEREKVMQFLLGLDNSYSTICAQILLLQPFPTVEKAYYVIIREEKRRELVFSLEIVAAKVMQNKW
LFQNDQSSSQANEYSLLSNGDDPIEVDNNLQEQQVDQSESGTTTPASIYNPDVEVPSFPHEQLLIDLGSPVRC