CuGenDBv2

Lag0040873 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040873
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr13:9187299..9188660
RNA-Seq Expression: Lag0040873
Synteny: Lag0040873
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: NA
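
The genome location in this record uses a `chromosome:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span length could be derived as follows. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a string such as 'chr13:9187299..9188660' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr13:9187299..9188660")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 1362 bp on chr13.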


Homology
GenBank top hits (e value; % identity; alignment)
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa] (e value: 3.3e-53; % identity: 46.15)
Query:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI
        PP +S+ LP     N +PQ+L+    TSP    P++  PL +KL D NY++WK QLLN +IA  +E F++G+   P +FLD  Q Q NP+F  W+++NR+
Subjt:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI

Query:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF
        +MSWIY+S+ E  +G+I+G +SA +IWE L  +Y ++S A +  +R+ LQ I+K+ L+   Y+ + + + +   +IGEP++Y DHL Y L GLG +YNPF
Subjt:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF

Query:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVD
        VTSIQ++  RPS+ +V SLLL+YDARLE+QS+ D
Subjt:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVD

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] (e value: 1.9e-48; % identity: 35.1)
Query:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI
        PP +S+ LP       +PQ+++    TSP    P++  PL +KL D NY++WK QLLN +IA  +E F++G+   P +FLD  Q Q NP+F  W+++NR+
Subjt:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI

Query:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF
        +MSWIY+S+ E  +G+I+G +SA +IWE L  +Y ++S A +  +R+ LQ I+K+ L+   Y+ + + + +   +IGEP++Y DHL Y L GLG +YNPF
Subjt:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF

Query:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGP-GVLGRPN
        VTSIQ++  RPS+                                             P+     T  P+   PS+ S+P+     N Y  P G    P+
Subjt:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGP-GVLGRPN

Query:  FSPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPT
        +SP     PS+   RP+CQI  K GHTA  CY+ H  L +    P P+     ++ NP+
Subjt:  FSPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPT

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale] (e value: 2.8e-52; % identity: 42.31)
Query:  NPTTGPAGFPFPPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNP
        NP+TG      PP   +++P     N   Q++  Q    P    P++  P  IKL   NYL+WKNQLLN IIA  +E FI+G+ P P +F D A+  +N 
Subjt:  NPTTGPAGFPFPPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNP

Query:  QFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYI
        ++  W++FNR++MSWIY+SLT+  +G+I+G +SA+EIWE L  +Y SSS A+I  +R++LQ +RKD L+  +Y+ + K+I +   A+GEP+S +DHL Y+
Subjt:  QFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYI

Query:  LEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPS-GNP--RYTSVPRPTTPSSFSYPSPFPVP
          GL  EYN FVTSI  R D   L ++ SLLL+Y+ RLE Q++  QL+ +QANLA+L    N +K+  RP+  NP   +T   +  T    S+P   P  
Subjt:  LEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPS-GNP--RYTSVPRPTTPSSFSYPSPFPVP

Query:  NPYPGPGVLGRP
        N +  P +LG+P
Subjt:  NPYPGPGVLGRP

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 2.3e-54; % identity: 40.81)
Query:  FPFPPASSSSLPFFPAYNAHPQLLSPQQTTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFIN-GTPAPTKFLDHAQTQMNPQFFVWKKFNRI
        FP  PAS+S+       N  PQ+    Q T P P+L+  L+IKL ++N LL K+QLLN IIA  +E FI+    +P K+LD A  Q+NP+F  W + N++
Subjt:  FPFPPASSSSLPFFPAYNAHPQLLSPQQTTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFIN-GTPAPTKFLDHAQTQMNPQFFVWKKFNRI

Query:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF
        +MSWIYSSLT   +G+I+  S+A +IW  L   YES S A +M++ SQLQ+I+K  + +++YLS++K + D+F  IGEPLSYRD L  ILEGL  EY+ F
Subjt:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF

Query:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNF
        VTSI NR+DRPSL +V SLL  Y+ RL  Q S+DQ         NL+ PQ N                                                
Subjt:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNF

Query:  SPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPTPTPVSAAQSESS
            PR P  N+S PQCQI GK GH AL  Y+R N  YH    P   A F       T +P+SA  + S+
Subjt:  SPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPTPTPVSAAQSESS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value: 3.9e-86; % identity: 51.82)
Query:  TTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWE
        + +P+PTL  PLN+KL+D+N+LLWKNQLLN +IA  +  +++GT   P +FLDH Q Q NP ++ W+++NR+LM WIYSSL+E+K+GE++   + ++IW 
Subjt:  TTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWE

Query:  HLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLE
         L  VY+S +TARIM ++++LQ +RKD  SV+QYL++IK+IADKF A+GEPLSYRDHLA++L+GLGSEYN FVTSI NR D PSL DVRSLLLAY+ARL+
Subjt:  HLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLE

Query:  KQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNFSPRSPRWPSTNSSRPQCQIYGKMGHTAL
        KQ++VDQLN+ QANL NLSL Q+NSKR             P+ + P+ + +  P    +      +LG+P    + P  P  +SS+ QCQI GK+GH+A 
Subjt:  KQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNFSPRSPRWPSTNSSRPQCQIYGKMGHTAL

Query:  VCYNRHNPLYHASSTPTPQAMFTQLSQNPT
        VCY+R N  YH +S   PQA++  +  +PT
Subjt:  VCYNRHNPLYHASSTPTPQAMFTQLSQNPT

TrEMBL top hits (e value; % identity; alignment)
A0A2P5BGF8 Uncharacterized protein (e value: 1.4e-52; % identity: 42.31)
Query:  NPTTGPAGFPFPPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNP
        NP+TG      PP   +++P     N   Q++  Q    P    P++  P  IKL   NYL+WKNQLLN IIA  +E FI+G+ P P +F D A+  +N 
Subjt:  NPTTGPAGFPFPPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNP

Query:  QFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYI
        ++  W++FNR++MSWIY+SLT+  +G+I+G +SA+EIWE L  +Y SSS A+I  +R++LQ +RKD L+  +Y+ + K+I +   A+GEP+S +DHL Y+
Subjt:  QFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYI

Query:  LEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPS-GNP--RYTSVPRPTTPSSFSYPSPFPVP
          GL  EYN FVTSI  R D   L ++ SLLL+Y+ RLE Q++  QL+ +QANLA+L    N +K+  RP+  NP   +T   +  T    S+P   P  
Subjt:  LEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPS-GNP--RYTSVPRPTTPSSFSYPSPFPVP

Query:  NPYPGPGVLGRP
        N +  P +LG+P
Subjt:  NPYPGPGVLGRP

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.1e-54; % identity: 40.81)
Query:  FPFPPASSSSLPFFPAYNAHPQLLSPQQTTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFIN-GTPAPTKFLDHAQTQMNPQFFVWKKFNRI
        FP  PAS+S+       N  PQ+    Q T P P+L+  L+IKL ++N LL K+QLLN IIA  +E FI+    +P K+LD A  Q+NP+F  W + N++
Subjt:  FPFPPASSSSLPFFPAYNAHPQLLSPQQTTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFIN-GTPAPTKFLDHAQTQMNPQFFVWKKFNRI

Query:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF
        +MSWIYSSLT   +G+I+  S+A +IW  L   YES S A +M++ SQLQ+I+K  + +++YLS++K + D+F  IGEPLSYRD L  ILEGL  EY+ F
Subjt:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF

Query:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNF
        VTSI NR+DRPSL +V SLL  Y+ RL  Q S+DQ         NL+ PQ N                                                
Subjt:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNF

Query:  SPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPTPTPVSAAQSESS
            PR P  N+S PQCQI GK GH AL  Y+R N  YH    P   A F       T +P+SA  + S+
Subjt:  SPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPTPTPVSAAQSESS

A0A6J1DQX7 uncharacterized protein LOC111022315 (e value: 1.9e-86; % identity: 51.82)
Query:  TTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWE
        + +P+PTL  PLN+KL+D+N+LLWKNQLLN +IA  +  +++GT   P +FLDH Q Q NP ++ W+++NR+LM WIYSSL+E+K+GE++   + ++IW 
Subjt:  TTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWE

Query:  HLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLE
         L  VY+S +TARIM ++++LQ +RKD  SV+QYL++IK+IADKF A+GEPLSYRDHLA++L+GLGSEYN FVTSI NR D PSL DVRSLLLAY+ARL+
Subjt:  HLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLE

Query:  KQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNFSPRSPRWPSTNSSRPQCQIYGKMGHTAL
        KQ++VDQLN+ QANL NLSL Q+NSKR             P+ + P+ + +  P    +      +LG+P    + P  P  +SS+ QCQI GK+GH+A 
Subjt:  KQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNFSPRSPRWPSTNSSRPQCQIYGKMGHTAL

Query:  VCYNRHNPLYHASSTPTPQAMFTQLSQNPT
        VCY+R N  YH +S   PQA++  +  +PT
Subjt:  VCYNRHNPLYHASSTPTPQAMFTQLSQNPT

A0A7J0EGI5 Uncharacterized protein (e value: 1.6e-53; % identity: 46.15)
Query:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI
        PP +S+ LP     N +PQ+L+    TSP    P++  PL +KL D NY++WK QLLN +IA  +E F++G+   P +FLD  Q Q NP+F  W+++NR+
Subjt:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI

Query:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF
        +MSWIY+S+ E  +G+I+G +SA +IWE L  +Y ++S A +  +R+ LQ I+K+ L+   Y+ + + + +   +IGEP++Y DHL Y L GLG +YNPF
Subjt:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF

Query:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVD
        VTSIQ++  RPS+ +V SLLL+YDARLE+QS+ D
Subjt:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVD

A0A7J0GPN0 UBX domain-containing protein (e value: 9.2e-49; % identity: 35.1)
Query:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI
        PP +S+ LP       +PQ+++    TSP    P++  PL +KL D NY++WK QLLN +IA  +E F++G+   P +FLD  Q Q NP+F  W+++NR+
Subjt:  PPASSSSLPFFPAYNAHPQLLSPQQTTSP---YPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRI

Query:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF
        +MSWIY+S+ E  +G+I+G +SA +IWE L  +Y ++S A +  +R+ LQ I+K+ L+   Y+ + + + +   +IGEP++Y DHL Y L GLG +YNPF
Subjt:  LMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPF

Query:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGP-GVLGRPN
        VTSIQ++  RPS+                                             P+     T  P+   PS+ S+P+     N Y  P G    P+
Subjt:  VTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGP-GVLGRPN

Query:  FSPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPT
        +SP     PS+   RP+CQI  K GHTA  CY+ H  L +    P P+     ++ NP+
Subjt:  FSPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNPT

SwissProt top hits (e value; % identity; alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.3e-19; % identity: 22.76)
Query:  KLSDSNYLLWKNQLLNHIIAFDMECFING--TPAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTAR
        KL+ +NYL+W  Q+      +++  F++G  T  P      A  ++NP +  WK+ ++++ S +  +++      +   ++A +IWE LR +Y + S   
Subjt:  KLSDSNYLLWKNQLLNHIIAFDMECFING--TPAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTAR

Query:  IMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQL----N
        +  +R+QL++  K + ++  Y+  +    D+   +G+P+ + + +  +LE L  EY P +  I  +   P+L ++   LL +++++   SS   +    N
Subjt:  IMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQL----N

Query:  LVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNFSPRSPRWPSTNSSRP---QCQIYGKMGHTALVCYNRH
         V       +   NN  R +R   N    +  +P   SS ++                            P+ N S+P   +CQI G  GH+A  C    
Subjt:  LVQANLANLSLPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNFSPRSPRWPSTNSSRP---QCQIYGKMGHTALVCYNRH

Query:  NPLYHASSTPTP
        + L   +S   P
Subjt:  NPLYHASSTPTP

Arabidopsis top hits (e value; % identity; alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 4.6e-08; % identity: 23.49)
Query:  PASSSSLPFF-PAYNAHPQLLSPQQTTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRILMS
        P S    P++ P    HP   S Q+             +   + NY+ WK +  + +       FI+GT P P  F        +P +  W++ N ++M 
Subjt:  PASSSSLPFF-PAYNAHPQLLSPQQTTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGT-PAPTKFLDHAQTQMNPQFFVWKKFNRILMS

Query:  WIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDI
        W+ +S+T+  +  ++   +A+++WE LR V+      +I  +R +L  +R+   SV +Y  ++  +
Subjt:  WIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESSSTARIMAIRSQLQKIRKDSLSVTQYLSQIKDI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 2.0e-16; % identity: 25.13)
Query:  PLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGTPAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKI-GEIIGCSSAYEIWEHLRTVYESSS
        P+ + + +SNY  W+   L H ++FD+   I+GT  PT          N     W+K + I+   +Y +LT  +  G  +  S++ +IW  ++  + ++ 
Subjt:  PLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGTPAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKI-GEIIGCSSAYEIWEHLRTVYESSS

Query:  TARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEK
         AR + + S+L+      + V  Y  ++K +AD    +  P++ R+ + Y+L GL  +++  +  I++R   PS  D  ++L   + RL++
Subjt:  TARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 4.6e-16; % identity: 23.1)
Query:  LNIKLSDSNYLLWKNQLLNHIIAFDMECFINGTPAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEII--GCSSAYEIWEHLRTVYESSS
        + + L+  NY +W+       ++F +   I+G+  PT   +            WK+ + ++  WIY ++T+  +  II  GC +A ++W  L  ++  + 
Subjt:  LNIKLSDSNYLLWKNQLLNHIIAFDMECFINGTPAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEII--GCSSAYEIWEHLRTVYESSS

Query:  TARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNL
         AR +   ++L+    D LSV +Y  ++K ++D    +  P+S R  + ++L GL  +Y+  +  I++++  PS  + RS+LL  ++RL  +S     + 
Subjt:  TARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNL

Query:  VQANLANL--------------------SLPQNNSKRQSRPSGNP--RYTSVP--RPTTPSSFSY-PSPFPVPNPYPGPGVLGRPNFSPRSPR-WPSTNS
           +L+N+                    ++ +  SK+++R  G+   RY +    R   P ++ Y P   P   P+ GP    +  + P+ P  + S  S
Subjt:  VQANLANL--------------------SLPQNNSKRQSRPSGNP--RYTSVP--RPTTPSSFSY-PSPFPVPNPYPGPGVLGRPNFSPRSPR-WPSTNS

Query:  SRP
         +P
Subjt:  SRP
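
The alignment blocks above appear to follow the usual BLAST pairwise layout: in the row between Query and Subjt, a letter marks an identical residue, `+` marks a similar (positively scoring) residue, and a blank marks a mismatch or gap. The sketch below assumes that convention and counts identities and positives for a single block. `block_stats` is a hypothetical helper, and the example strings are made up rather than taken from this record. The % identity listed for each hit is computed over the whole alignment, so values for a single block will generally differ from it.

```python
def block_stats(query, midline):
    """Count identical and positive columns in one alignment block."""
    # The match row may have lost trailing blanks; pad it to the query length.
    midline = midline.ljust(len(query))
    # A column is identical when the match row repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # '+' marks a similar residue that is not identical.
    similar = midline.count("+")
    columns = len(query)
    return {
        "columns": columns,
        "identical": identical,
        "positives": identical + similar,
        "percent_identity": round(100.0 * identical / columns, 2),
    }


# Made-up example block (not from the record above).
stats = block_stats("MKT-LLVA", "MK   L+A")
print(stats)
```

Summing `identical` and `columns` over all blocks of one hit before dividing would give a figure comparable to the per-hit % identity, assuming the database uses alignment length as the denominator.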


Sequences
CDS sequence
ATGGCTTCTTCATCAAGCTCATCTGGTTCAGACTCCTCCCGGTTGTCGGTCATCTCCTCCTCTGTGACAAATCCAATCACCACTCCTATCACTATAGTGACTACACGTCT
TGTTTCGTCTACAATTTCTACCCCTTTTGCCTCCCCACGTTTACCCCCACATTCTCCACCGCCTCCATTAAACCCTAATGCCCCAAATTTTCTTAACAGTCAGAATCCCT
ACCCATTCAACCCGACTACAGGCCCAGCTGGTTTCCCTTTTCCACCAGCATCATCTTCCTCTCTGCCATTCTTTCCCGCTTACAATGCTCACCCGCAGCTCCTTTCACCT
CAGCAAACCACCTCACCATATCCTACCCTAACCCCACCCCTTAATATTAAGCTCTCCGATTCCAACTATCTACTTTGGAAGAATCAACTCTTGAACCACATCATCGCTTT
TGATATGGAGTGTTTTATCAATGGCACCCCTGCTCCTACGAAGTTCTTAGACCATGCTCAAACTCAGATGAATCCCCAATTCTTTGTATGGAAAAAGTTTAACCGGATTC
TAATGAGCTGGATTTACTCTTCCCTTACCGAAGACAAGATTGGCGAAATCATCGGGTGTTCTTCTGCATACGAGATTTGGGAGCATTTGCGTACAGTATATGAATCCTCT
TCCACAGCTCGCATAATGGCCATCCGATCCCAGTTGCAAAAGATCCGCAAGGACAGTCTATCAGTTACCCAATACCTCTCTCAGATCAAAGATATTGCCGATAAATTCTT
CGCAATTGGTGAGCCCCTTTCATATCGCGACCACCTTGCTTACATTCTGGAAGGATTAGGATCAGAGTACAATCCGTTCGTCACTTCCATACAGAATCGCACCGACAGAC
CTTCTTTGGTAGATGTTCGAAGCTTGTTGCTGGCTTATGATGCTCGGTTGGAGAAACAATCGTCGGTTGATCAGTTGAATCTCGTTCAGGCCAATCTCGCCAACCTCTCC
CTTCCCCAAAACAATTCCAAGCGCCAATCTCGTCCCTCAGGTAATCCTCGTTATACCTCTGTTCCACGACCTACTACTCCCTCTTCCTTTTCCTATCCTTCTCCTTTCCC
TGTCCCAAATCCTTATCCTGGTCCTGGTGTGTTGGGTCGCCCAAATTTCTCCCCTCGTTCCCCGCGTTGGCCTTCCACAAATTCGTCAAGACCCCAATGTCAAATCTACG
GAAAAATGGGTCATACTGCCCTTGTATGTTATAACCGCCACAACCCCTTGTACCATGCCTCTTCGACTCCCACCCCACAGGCCATGTTTACTCAATTATCTCAAAATCCC
ACCCCAACCCCTGTATCTGCTGCACAGTCTGAATCTTCTTAG
mRNA sequence
ATGGCTTCTTCATCAAGCTCATCTGGTTCAGACTCCTCCCGGTTGTCGGTCATCTCCTCCTCTGTGACAAATCCAATCACCACTCCTATCACTATAGTGACTACACGTCT
TGTTTCGTCTACAATTTCTACCCCTTTTGCCTCCCCACGTTTACCCCCACATTCTCCACCGCCTCCATTAAACCCTAATGCCCCAAATTTTCTTAACAGTCAGAATCCCT
ACCCATTCAACCCGACTACAGGCCCAGCTGGTTTCCCTTTTCCACCAGCATCATCTTCCTCTCTGCCATTCTTTCCCGCTTACAATGCTCACCCGCAGCTCCTTTCACCT
CAGCAAACCACCTCACCATATCCTACCCTAACCCCACCCCTTAATATTAAGCTCTCCGATTCCAACTATCTACTTTGGAAGAATCAACTCTTGAACCACATCATCGCTTT
TGATATGGAGTGTTTTATCAATGGCACCCCTGCTCCTACGAAGTTCTTAGACCATGCTCAAACTCAGATGAATCCCCAATTCTTTGTATGGAAAAAGTTTAACCGGATTC
TAATGAGCTGGATTTACTCTTCCCTTACCGAAGACAAGATTGGCGAAATCATCGGGTGTTCTTCTGCATACGAGATTTGGGAGCATTTGCGTACAGTATATGAATCCTCT
TCCACAGCTCGCATAATGGCCATCCGATCCCAGTTGCAAAAGATCCGCAAGGACAGTCTATCAGTTACCCAATACCTCTCTCAGATCAAAGATATTGCCGATAAATTCTT
CGCAATTGGTGAGCCCCTTTCATATCGCGACCACCTTGCTTACATTCTGGAAGGATTAGGATCAGAGTACAATCCGTTCGTCACTTCCATACAGAATCGCACCGACAGAC
CTTCTTTGGTAGATGTTCGAAGCTTGTTGCTGGCTTATGATGCTCGGTTGGAGAAACAATCGTCGGTTGATCAGTTGAATCTCGTTCAGGCCAATCTCGCCAACCTCTCC
CTTCCCCAAAACAATTCCAAGCGCCAATCTCGTCCCTCAGGTAATCCTCGTTATACCTCTGTTCCACGACCTACTACTCCCTCTTCCTTTTCCTATCCTTCTCCTTTCCC
TGTCCCAAATCCTTATCCTGGTCCTGGTGTGTTGGGTCGCCCAAATTTCTCCCCTCGTTCCCCGCGTTGGCCTTCCACAAATTCGTCAAGACCCCAATGTCAAATCTACG
GAAAAATGGGTCATACTGCCCTTGTATGTTATAACCGCCACAACCCCTTGTACCATGCCTCTTCGACTCCCACCCCACAGGCCATGTTTACTCAATTATCTCAAAATCCC
ACCCCAACCCCTGTATCTGCTGCACAGTCTGAATCTTCTTAG
Protein sequence
MASSSSSSGSDSSRLSVISSSVTNPITTPITIVTTRLVSSTISTPFASPRLPPHSPPPPLNPNAPNFLNSQNPYPFNPTTGPAGFPFPPASSSSLPFFPAYNAHPQLLSP
QQTTSPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMECFINGTPAPTKFLDHAQTQMNPQFFVWKKFNRILMSWIYSSLTEDKIGEIIGCSSAYEIWEHLRTVYESS
STARIMAIRSQLQKIRKDSLSVTQYLSQIKDIADKFFAIGEPLSYRDHLAYILEGLGSEYNPFVTSIQNRTDRPSLVDVRSLLLAYDARLEKQSSVDQLNLVQANLANLS
LPQNNSKRQSRPSGNPRYTSVPRPTTPSSFSYPSPFPVPNPYPGPGVLGRPNFSPRSPRWPSTNSSRPQCQIYGKMGHTALVCYNRHNPLYHASSTPTPQAMFTQLSQNP
TPTPVSAAQSESS
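
The protein sequence corresponds to the conceptual translation of the CDS. A minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a hypothetical helper `translate`, reproduces the first residues of the protein from the opening 42 nucleotides of the CDS:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nucleotides of the CDS listed above.
print(translate("ATGGCTTCTTCATCAAGCTCATCTGGTTCAGACTCCTCCCGG"))
```

Passing the full CDS (with its line breaks, which the helper strips) should give a string that can be compared directly against the protein sequence listed in the record.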