; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024565 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024565
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00001291:4320495..4321271
RNA-Seq ExpressionSgr024565
SyntenySgr024565
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.1e-4845.34Show/hide
Query:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH
        T NPA+ +W RQDRL+SSWLLGSM+E+IL+ M+ C SA+EIWETL+ I+S+    + MQ K +L N+KKG M +K+Y  K+   VD+L ++   +S  +H
Subjt:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH

Query:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS
        I++IL+GLGSDY S++SVISA++    +QE+ +LL+T E++ E   +  S+ +LPSV++V Q  EK + S   T++ N+HN      S N RGGR  GRS
Subjt:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS

Query:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN
        +RG R   N ++PQCQ C K G++A +C+  +    NS  ++P   N
Subjt:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.1e-4845.34Show/hide
Query:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH
        T NPA+ +W RQDRL+SSWLLGSM+E+IL+ M+ C SA+EIWETL+ I+S+    + MQ K +L N+KKG M +K+Y  K+   VD+L ++   +S  +H
Subjt:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH

Query:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS
        I++IL+GLGSDY S++SVISA++    +QE+ +LL+T E++ E   +  S+ +LPSV++V Q  EK + S   T++ N+HN      S N RGGR  GRS
Subjt:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS

Query:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN
        +RG R   N ++PQCQ C K G++A +C+  +    NS  ++P   N
Subjt:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]8.3e-4949.54Show/hide
Query:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS
        +QD+L++SWL  SM E+IL  M+ CN+ARE+W+ LE +Y++ N  ++MQLK +L+N+KKG + +KDY  KVK+LVDSL A G K+++++HI+ IL+GL S
Subjt:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS

Query:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS
        +++S VSVISA+++ + LQE+Y+LL++HE R ERN+ IN+DG+LPSV+L Q   K+SNS  +                NNR   +G  + R  R WN+N+
Subjt:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS

Query:  RPQCQFCGKFGHTAIKCY
        RPQCQ  GKFGHTA++CY
Subjt:  RPQCQFCGKFGHTAIKCY

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]8.3e-4949.54Show/hide
Query:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS
        +QD+L++SWL  SM E+IL  M+ CN+ARE+W+ LE +Y++ N  ++MQLK +L+N+KKG + +KDY  KVK+LVDSL A G K+++++HI+ IL+GL S
Subjt:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS

Query:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS
        +++S VSVISA+++ + LQE+Y+LL++HE R ERN+ IN+DG+LPSV+L Q   K+SNS  +                NNR   +G  + R  R WN+N+
Subjt:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS

Query:  RPQCQFCGKFGHTAIKCY
        RPQCQ  GKFGHTA++CY
Subjt:  RPQCQFCGKFGHTAIKCY

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]8.6e-5448.61Show/hide
Query:  QNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHI
        QNPA+  W +QD+L+S+WLLGSMNEDILS M+ C SAREIW  LE ++++    ++MQLK +L+N KKG +S+KDY  K+K+LVDSL   G K+S ++HI
Subjt:  QNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHI

Query:  VFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGS--HTQQSNNNRGGRNGGRS
        + IL+GLG ++D+I+SVI+A++ P+ LQE+ +LL+  E R ERN +INSDGSLPSV+L          N+++ K N H     +  QSN ++ GR     
Subjt:  VFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGS--HTQQSNNNRGGRNGGRS

Query:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLHELNSQRFAPGFQNFQNFSP
        S   R W  N++PQCQ CG+FGHTA++CY   E N   F     N   FSP
Subjt:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLHELNSQRFAPGFQNFQNFSP

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-4845.34Show/hide
Query:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH
        T NPA+ +W RQDRL+SSWLLGSM+E+IL+ M+ C SA+EIWETL+ I+S+    + MQ K +L N+KKG M +K+Y  K+   VD+L ++   +S  +H
Subjt:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH

Query:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS
        I++IL+GLGSDY S++SVISA++    +QE+ +LL+T E++ E   +  S+ +LPSV++V Q  EK + S   T++ N+HN      S N RGGR  GRS
Subjt:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS

Query:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN
        +RG R   N ++PQCQ C K G++A +C+  +    NS  ++P   N
Subjt:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-4845.34Show/hide
Query:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH
        T NPA+ +W RQDRL+SSWLLGSM+E+IL+ M+ C SA+EIWETL+ I+S+    + MQ K +L N+KKG M +K+Y  K+   VD+L ++   +S  +H
Subjt:  TQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEH

Query:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS
        I++IL+GLGSDY S++SVISA++    +QE+ +LL+T E++ E   +  S+ +LPSV++V Q  EK + S   T++ N+HN      S N RGGR  GRS
Subjt:  IVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLV-QNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS

Query:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN
        +RG R   N ++PQCQ C K G++A +C+  +    NS  ++P   N
Subjt:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLH--ELNSQRFAPGFQN

A0A6J1C6N9 dr1-associated corepressor homolog isoform X14.0e-4949.54Show/hide
Query:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS
        +QD+L++SWL  SM E+IL  M+ CN+ARE+W+ LE +Y++ N  ++MQLK +L+N+KKG + +KDY  KVK+LVDSL A G K+++++HI+ IL+GL S
Subjt:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS

Query:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS
        +++S VSVISA+++ + LQE+Y+LL++HE R ERN+ IN+DG+LPSV+L Q   K+SNS  +                NNR   +G  + R  R WN+N+
Subjt:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS

Query:  RPQCQFCGKFGHTAIKCY
        RPQCQ  GKFGHTA++CY
Subjt:  RPQCQFCGKFGHTAIKCY

A0A6J1C8R2 dr1-associated corepressor homolog isoform X24.0e-4949.54Show/hide
Query:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS
        +QD+L++SWL  SM E+IL  M+ CN+ARE+W+ LE +Y++ N  ++MQLK +L+N+KKG + +KDY  KVK+LVDSL A G K+++++HI+ IL+GL S
Subjt:  RQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGLGS

Query:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS
        +++S VSVISA+++ + LQE+Y+LL++HE R ERN+ IN+DG+LPSV+L Q   K+SNS  +                NNR   +G  + R  R WN+N+
Subjt:  DYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNS

Query:  RPQCQFCGKFGHTAIKCY
        RPQCQ  GKFGHTA++CY
Subjt:  RPQCQFCGKFGHTAIKCY

A0A6J1DLT9 uncharacterized protein LOC1110217574.1e-5448.61Show/hide
Query:  QNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHI
        QNPA+  W +QD+L+S+WLLGSMNEDILS M+ C SAREIW  LE ++++    ++MQLK +L+N KKG +S+KDY  K+K+LVDSL   G K+S ++HI
Subjt:  QNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHI

Query:  VFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGS--HTQQSNNNRGGRNGGRS
        + IL+GLG ++D+I+SVI+A++ P+ LQE+ +LL+  E R ERN +INSDGSLPSV+L          N+++ K N H     +  QSN ++ GR     
Subjt:  VFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGS--HTQQSNNNRGGRNGGRS

Query:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLHELNSQRFAPGFQNFQNFSP
        S   R W  N++PQCQ CG+FGHTA++CY   E N   F     N   FSP
Subjt:  SRGGRQWNNNSRPQCQFCGKFGHTAIKCYSLHELNSQRFAPGFQNFQNFSP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-0423.53Show/hide
Query:  WSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLK-KGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSG
        W + +R   S ++  +++  L+      +AR+I E L+ +Y   +    + L+ +L +LK    MS+  +      L+  L A G KI   + I  +L  
Subjt:  WSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLK-KGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSG

Query:  LGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWN
        L S YD I++ I   S     +E   L       L++   I +D +  S  ++       ++NNNT K N      T+     +G              N
Subjt:  LGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWN

Query:  NNSRPQCQFCGKFGHTAIKCY
        +  + +C  CG+ GH    C+
Subjt:  NNSRPQCQFCGKFGHTAIKCY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-0422.53Show/hide
Query:  WSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKG-GMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSG
        W+  D   +S +   +++D++++++  ++AR IW  LE +Y +      + LK QL  L    G +   ++     L+  L  +G KI  ++  + +L+ 
Subjt:  WSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKG-GMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSG

Query:  LGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNS-NNNTSKGNFHNGSHTQQSN
        L S YD++ + I        L+++ + L+ +E    R    N   +L +    +++++SSN+   + ++G   N S ++  N
Subjt:  LGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNS-NNNTSKGNFHNGSHTQQSN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.8e-2129.92Show/hide
Query:  PTQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQE
        P  NP +  W RQD+L+ S +LG+++  +   +    +A +IWETL +IY+  +   + QL+ QL+   KG  +I DY+  + +  D L  +G  +   E
Subjt:  PTQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQE

Query:  HIVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS
         +  +L  L  +Y  ++  I+AK  P  L EI+  L+ HE+++     ++S   +P      +   ++ +NNN + GN +N       NNN   +   +S
Subjt:  HIVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRS

Query:  SRGGRQWNNNSRP---QCQFCGKFGHTAIKC----YSLHELNSQRFAPGFQNFQ
        S      NN S+P   +CQ CG  GH+A +C    + L  +NSQ+    F  +Q
Subjt:  SRGGRQWNNNSRP---QCQFCGKFGHTAIKC----YSLHELNSQRFAPGFQNFQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-1527.92Show/hide
Query:  MPTQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQ
        +P  NP +  W RQD+L+ S +LG+++  +   +    +A +IWETL +IY+  +   + QL+               ++ +     D L  +G  +   
Subjt:  MPTQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQ

Query:  EHIVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGR
        E +  +L  L  DY  ++  I+AK  P  L EI+  L+  E++L     +NS   +P   +  N     N+N N ++ N   G +   +NNN    +   
Subjt:  EHIVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGR

Query:  SSRGGRQWNNNSRP---QCQFCGKFGHTAIKCYSLHELNS
        SS G R  N   +P   +CQ C   GH+A +C  LH+  S
Subjt:  SSRGGRQWNNNSRP---QCQFCGKFGHTAIKCYSLHELNS

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.4e-0828.05Show/hide
Query:  NPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSL
        +P +  W + + ++  WL+ SM + +L  ++   +A ++WE L R++     +KI QL+ +L  L++GG S+++Y  K+  +
Subjt:  NPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.9e-0625.69Show/hide
Query:  MPTQNPAFVIWSRQDRLMSSWLLGSMN-EDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISI
        +PT N   V W ++D ++   L G++  +      V  +++R+IW  ++  +  +   + ++L  +L+    G M + DY  K+K L DSL  V   ++ 
Subjt:  MPTQNPAFVIWSRQDRLMSSWLLGSMN-EDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISI

Query:  QEHIVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINS---DGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGR
        +  ++++L+GL   +D+I++VI  +       +   +L   E+RL+R    N    D S  S  L  + E    +N   S GN        + NN   GR
Subjt:  QEHIVFILSGLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINS---DGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGR

Query:  NGGRSSRGGRQWNNNSRP
         G  S      +N+ +RP
Subjt:  NGGRSSRGGRQWNNNSRP

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.6e-1127.32Show/hide
Query:  WSRQDRLMSSWLLGSMNEDILSHM--VGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILS
        W  +D L+  W+ G++ + +L  +  VGC +AR++W +LE ++  +   + +Q + +L+      +S+ +Y  K+KSL D L  V + IS +  ++ +L+
Subjt:  WSRQDRLMSSWLLGSMNEDILSHM--VGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILS

Query:  GLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQW
        GL   YD I++VI  KS      E  ++L+  E+RL   +   S  S  +   + N   +           +HN +       ++    GG SS G    
Subjt:  GLGSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQW

Query:  NNNSR
        NNN R
Subjt:  NNNSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTACACAAAATCCAGCATTTGTTATCTGGTCCAGACAAGATCGCTTGATGTCTTCATGGCTTCTTGGTTCGATGAATGAGGATATCCTATCCCATATGGTAGGCTG
TAATTCTGCTCGTGAAATTTGGGAGACTCTTGAAAGGATCTATTCGACCTCGAATACTGTCAAGATAATGCAATTGAAGGGTCAGTTGCAGAATTTAAAAAAGGGAGGTA
TGAGCATTAAAGACTATGTAGCCAAAGTCAAAAGTCTGGTCGATTCTTTGCATGCTGTGGGCAATAAGATTTCTATTCAAGAACATATTGTATTCATTCTTTCGGGTTTA
GGTTCTGATTATGACTCAATTGTGTCAGTTATTTCTGCAAAATCGAAGCCTAAACCTCTTCAAGAAATATATGCCTTACTGATGACTCATGAGAATAGATTAGAAAGGAA
TGCAGTGATCAATTCAGATGGTTCCTTGCCCAGTGTGGATCTTGTTCAAAATTTTGAGAAGAGTTCTAATTCTAACAACAATACTTCAAAGGGTAACTTTCATAATGGGT
CTCATACACAGCAATCTAACAATAATAGAGGTGGTCGTAATGGTGGTCGTTCTTCTCGCGGTGGTCGACAATGGAATAACAATTCACGACCTCAATGTCAGTTTTGTGGC
AAGTTTGGTCACACTGCTATAAAGTGTTACTCTCTTCACGAATTAAATTCTCAACGTTTTGCTCCTGGTTTTCAGAATTTTCAAAATTTCTCTCCCAATTTTCAGCGACA
GCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTACACAAAATCCAGCATTTGTTATCTGGTCCAGACAAGATCGCTTGATGTCTTCATGGCTTCTTGGTTCGATGAATGAGGATATCCTATCCCATATGGTAGGCTG
TAATTCTGCTCGTGAAATTTGGGAGACTCTTGAAAGGATCTATTCGACCTCGAATACTGTCAAGATAATGCAATTGAAGGGTCAGTTGCAGAATTTAAAAAAGGGAGGTA
TGAGCATTAAAGACTATGTAGCCAAAGTCAAAAGTCTGGTCGATTCTTTGCATGCTGTGGGCAATAAGATTTCTATTCAAGAACATATTGTATTCATTCTTTCGGGTTTA
GGTTCTGATTATGACTCAATTGTGTCAGTTATTTCTGCAAAATCGAAGCCTAAACCTCTTCAAGAAATATATGCCTTACTGATGACTCATGAGAATAGATTAGAAAGGAA
TGCAGTGATCAATTCAGATGGTTCCTTGCCCAGTGTGGATCTTGTTCAAAATTTTGAGAAGAGTTCTAATTCTAACAACAATACTTCAAAGGGTAACTTTCATAATGGGT
CTCATACACAGCAATCTAACAATAATAGAGGTGGTCGTAATGGTGGTCGTTCTTCTCGCGGTGGTCGACAATGGAATAACAATTCACGACCTCAATGTCAGTTTTGTGGC
AAGTTTGGTCACACTGCTATAAAGTGTTACTCTCTTCACGAATTAAATTCTCAACGTTTTGCTCCTGGTTTTCAGAATTTTCAAAATTTCTCTCCCAATTTTCAGCGACA
GCAGTAA
Protein sequenceShow/hide protein sequence
MPTQNPAFVIWSRQDRLMSSWLLGSMNEDILSHMVGCNSAREIWETLERIYSTSNTVKIMQLKGQLQNLKKGGMSIKDYVAKVKSLVDSLHAVGNKISIQEHIVFILSGL
GSDYDSIVSVISAKSKPKPLQEIYALLMTHENRLERNAVINSDGSLPSVDLVQNFEKSSNSNNNTSKGNFHNGSHTQQSNNNRGGRNGGRSSRGGRQWNNNSRPQCQFCG
KFGHTAIKCYSLHELNSQRFAPGFQNFQNFSPNFQRQQ