CuGenDBv2

Spg023868 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg023868
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold13:7269829..7275352
RNA-Seq Expression: Spg023868
Synteny: Spg023868
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283
                  IPR026960 - Reverse transcriptase zinc-binding domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
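The genome location field above appears to follow a `seqid:start..end` pattern. The following is a minimal Python sketch for splitting it into its parts and deriving the span of the gene on the scaffold. It assumes 1-based, inclusive coordinates, which this page does not state explicitly; `GenomeLocation` and `parse_location` are hypothetical names introduced only for this illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""

    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold13:7269829..7275352'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("scaffold13:7269829..7275352")
print(loc.seqid, loc.start, loc.end, loc.length)
# prints: scaffold13 7269829 7275352 5524
```

Under that coordinate assumption the locus spans 5,524 bp of scaffold13.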


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.5e-50 | % identity: 32.31
Query:  HNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRL
        H++  ALL KW+W+FL E+  LW+ LII+K Y+ E +  +P   +  S  SPW+ +   I      I   + +G    FW D+W     LS + PRL+ L
Subjt:  HNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRL

Query:  TTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTILN---NLYSVIWKDNYPKK
        +TN    V ++WN     WN+   R L + E   W N+   L P  L D   S   W L ++ +F   S+  DL   S +  N   +LY  +WK ++PKK
Subjt:  TTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTILN---NLYSVIWKDNYPKK

Query:  IKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNR
         K F+W L  G INTADR Q+R+P+++LSP+WC MC+ + E+  HLF+ C ++ + W+       W+   + N    LA        +  K ++      
Subjt:  IKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNR

Query:  VFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL
        +  W +W ERN RIF+     F   +E +L     W      F++Y   S+    + F+
Subjt:  VFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 6.6e-51 | % identity: 32.1
Query:  WLQIASFREV--LDNWWNHNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKD
        W Q+ S +E   L   + H++  ALL KW+W+FL E+  LW+ LII+K Y+ E +  +P   +  S  SPW+ +   I      I   + +G    FW D
Subjt:  WLQIASFREV--LDNWWNHNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKD

Query:  SWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTIL
        +W     LS   PRL+ L+TN    V ++WN     WN+   R L + E   W N+   L P  L D   S   W L ++ +F   S+  DL   S +  
Subjt:  SWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTIL

Query:  N---NLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIF
        N   +LY  +WK ++PKK K F+W L  G INTADR Q+R+P+++LSP+WC MC+ + E+  HLF+ C ++ + W+       W+   + N    LA   
Subjt:  N---NLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIF

Query:  VGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL
             +  K ++      +  W +W ERN RIF+     F   +E +L     W      F++Y   S+    + F+
Subjt:  VGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL

RVW19678.1 Structural maintenance of chromosomes protein 3, partial [Vitis vinifera] | e value: 1.5e-50 | % identity: 30.91
Query:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL
        N YLSL  R  + + CL+   S  L     + S    +     D  W G G  + ++   W  ++  RE+    +   S++  ALL KW+WRF  E   L
Subjt:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL

Query:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL
        W   +IA  Y +         + R S + PW+ I     + +  +  ++GNG    FW+D W     L   F  LYR+ +  N  V+ V  NS   +WN 
Subjt:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL

Query:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ
        + RR+L + EI     L     S LLSP      +DS +W L +S  FSVKS    L   S+ ++      +W    P K+K   W ++ G +NT D+ Q
Subjt:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ

Query:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV
         R P+ +L P WC++C  N E+  HLF+ C      W+ +    G  W  P S  I D L   F G       +ILW        W +W ERN RIF D 
Subjt:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV

Query:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNF
          T +  ++ + FY+  W  C   F    LS L   W  F
Subjt:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNF

RVW19678.1 Structural maintenance of chromosomes protein 3, partial [Vitis vinifera] | e value: 2.6e-15 | % identity: 40.18
Query:  ITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFRE
        +T +MR F++ I    LLD PL+N  FTWS+   +P    +DRFL S +  + F  +    L R TSDH P+ L      WGP PFRFEN WL    F+E
Subjt:  ITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFRE

Query:  VLDNWWNHNSLQ
           +WW   +++
Subjt:  VLDNWWNHNSLQ

RVW19678.1 Structural maintenance of chromosomes protein 3, partial [Vitis vinifera] | e value: 3.3e-50 | % identity: 30.89
Query:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL
        N YLSL  R  + + CL+   S  L     + S    +     D  W G G  + ++   W  ++  RE+    +   S++  ALL KW+WRF  E   L
Subjt:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL

Query:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL
        W   +IA  Y +         + R S + PW+ I     + +  +  ++GNG    FW+D W     L   F  LYR+ +  N  V+ V  NS   +WN 
Subjt:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL

Query:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ
        + RR+L + EI     L     S LLSP      +DS +W L +S  FSVKS    L   S+ ++      +W    P K+K   W ++ G +NT D+ Q
Subjt:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ

Query:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV
         R P+ +L P WC++C  N E+  HLF+ C      W+ +    G  W  P S  I D L   F G       +ILW        W +W ERN RIF D 
Subjt:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV

Query:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATW
          T +  ++ + FY+  W  C   F    LS L   W
Subjt:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATW

RVX02903.1 Actin-related protein 7 [Vitis vinifera] | e value: 4.5e-52 | % identity: 30.89
Query:  YLSLIDRFLISKDCLNKFGSSHLLRLDRIT----SDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSLW
        YLSL  R  + + CL+    S+ L L +I+    S    +   F     G G       W  ++  +E+    +   SL+  ALL KW+WRF  E   LW
Subjt:  YLSLIDRFLISKDCLNKFGSSHLLRLDRIT----SDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSLW

Query:  RNLIIAKY------YNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVW-NSIE
          +I++ Y      +N+  V  W       S + PW+ I     + +  +  ++GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS  
Subjt:  RNLIIAKY------YNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVW-NSIE

Query:  SAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-DMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRF
         AWNL+ RR+L + EI     L   LS +R    + DS +W L +S LFSVKS  +     S+ IL      +W    P K+K   W ++ G +NT D+ 
Subjt:  SAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-DMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRF

Query:  QRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVS
        Q R P+ SL P WC++C  N E+  HLF+ C      WN + +  G       +  D L   F G       + LW        W +W ERN RIF+D  
Subjt:  QRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVS

Query:  STFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWN
         + ++ ++ +LFY+  W  C   F    L+ L   WN
Subjt:  STFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWN

TrEMBL top hits (e value, % identity, alignment)
A0A438C911 Structural maintenance of chromosomes protein 3 (Fragment) | e value: 7.1e-51 | % identity: 30.91
Query:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL
        N YLSL  R  + + CL+   S  L     + S    +     D  W G G  + ++   W  ++  RE+    +   S++  ALL KW+WRF  E   L
Subjt:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL

Query:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL
        W   +IA  Y +         + R S + PW+ I     + +  +  ++GNG    FW+D W     L   F  LYR+ +  N  V+ V  NS   +WN 
Subjt:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL

Query:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ
        + RR+L + EI     L     S LLSP      +DS +W L +S  FSVKS    L   S+ ++      +W    P K+K   W ++ G +NT D+ Q
Subjt:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ

Query:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV
         R P+ +L P WC++C  N E+  HLF+ C      W+ +    G  W  P S  I D L   F G       +ILW        W +W ERN RIF D 
Subjt:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV

Query:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNF
          T +  ++ + FY+  W  C   F    LS L   W  F
Subjt:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNF

A0A438C911 Structural maintenance of chromosomes protein 3 (Fragment) | e value: 1.3e-15 | % identity: 40.18
Query:  ITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFRE
        +T +MR F++ I    LLD PL+N  FTWS+   +P    +DRFL S +  + F  +    L R TSDH P+ L      WGP PFRFEN WL    F+E
Subjt:  ITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFRE

Query:  VLDNWWNHNSLQ
           +WW   +++
Subjt:  VLDNWWNHNSLQ

A0A438C911 Structural maintenance of chromosomes protein 3 (Fragment) | e value: 1.6e-50 | % identity: 30.89
Query:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL
        N YLSL  R  + + CL+   S  L     + S    +     D  W G G  + ++   W  ++  RE+    +   S++  ALL KW+WRF  E   L
Subjt:  NPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINW-GPGPFRFEN--SWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSL

Query:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL
        W   +IA  Y +         + R S + PW+ I     + +  +  ++GNG    FW+D W     L   F  LYR+ +  N  V+ V  NS   +WN 
Subjt:  WRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNL

Query:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ
        + RR+L + EI     L     S LLSP      +DS +W L +S  FSVKS    L   S+ ++      +W    P K+K   W ++ G +NT D+ Q
Subjt:  SPRRHLNEFEIIEWANL-----SYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQ

Query:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV
         R P+ +L P WC++C  N E+  HLF+ C      W+ +    G  W  P S  I D L   F G       +ILW        W +W ERN RIF D 
Subjt:  RRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDV

Query:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATW
          T +  ++ + FY+  W  C   F    LS L   W
Subjt:  SSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATW

A0A438J1R4 Actin-related protein 7 | e value: 2.2e-52 | % identity: 30.89
Query:  YLSLIDRFLISKDCLNKFGSSHLLRLDRIT----SDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSLW
        YLSL  R  + + CL+    S+ L L +I+    S    +   F     G G       W  ++  +E+    +   SL+  ALL KW+WRF  E   LW
Subjt:  YLSLIDRFLISKDCLNKFGSSHLLRLDRIT----SDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWWNHNSLQ--ALLAKWIWRFLHEEGSLW

Query:  RNLIIAKY------YNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVW-NSIE
          +I++ Y      +N+  V  W       S + PW+ I     + +  +  ++GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS  
Subjt:  RNLIIAKY------YNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVW-NSIE

Query:  SAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-DMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRF
         AWNL+ RR+L + EI     L   LS +R    + DS +W L +S LFSVKS  +     S+ IL      +W    P K+K   W ++ G +NT D+ 
Subjt:  SAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-DMNDSWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRF

Query:  QRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVS
        Q R P+ SL P WC++C  N E+  HLF+ C      WN + +  G       +  D L   F G       + LW        W +W ERN RIF+D  
Subjt:  QRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVS

Query:  STFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWN
         + ++ ++ +LFY+  W  C   F    L+ L   WN
Subjt:  STFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWN

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein | e value: 7.1e-51 | % identity: 32.31
Query:  HNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRL
        H++  ALL KW+W+FL E+  LW+ LII+K Y+ E +  +P   +  S  SPW+ +   I      I   + +G    FW D+W     LS + PRL+ L
Subjt:  HNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRL

Query:  TTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTILN---NLYSVIWKDNYPKK
        +TN    V ++WN     WN+   R L + E   W N+   L P  L D   S   W L ++ +F   S+  DL   S +  N   +LY  +WK ++PKK
Subjt:  TTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTILN---NLYSVIWKDNYPKK

Query:  IKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNR
         K F+W L  G INTADR Q+R+P+++LSP+WC MC+ + E+  HLF+ C ++ + W+       W+   + N    LA        +  K ++      
Subjt:  IKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNR

Query:  VFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL
        +  W +W ERN RIF+     F   +E +L     W      F++Y   S+    + F+
Subjt:  VFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein | e value: 3.2e-51 | % identity: 32.1
Query:  WLQIASFREV--LDNWWNHNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKD
        W Q+ S +E   L   + H++  ALL KW+W+FL E+  LW+ LII+K Y+ E +  +P   +  S  SPW+ +   I      I   + +G    FW D
Subjt:  WLQIASFREV--LDNWWNHNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKD

Query:  SWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTIL
        +W     LS   PRL+ L+TN    V ++WN     WN+   R L + E   W N+   L P  L D   S   W L ++ +F   S+  DL   S +  
Subjt:  SWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWS-WPLEASKLFSVKSLMVDLLGGSDTIL

Query:  N---NLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIF
        N   +LY  +WK ++PKK K F+W L  G INTADR Q+R+P+++LSP+WC MC+ + E+  HLF+ C ++ + W+       W+   + N    LA   
Subjt:  N---NLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIF

Query:  VGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL
             +  K ++      +  W +W ERN RIF+     F   +E +L     W      F++Y   S+    + F+
Subjt:  VGHPFQGTKRILWLAFNRVFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 4.9e-17 | % identity: 26.5
Query:  QALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVR--HWPIPIQRGSFKSPWRFICTTI-DKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLT
        +AL++K  WR L E+ SLW  L++ K Y+  ++R   W IP  +GS+ S WR I   + D V+  +  I G+G    FW D W++G  L  L     R T
Subjt:  QALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVR--HWPIPIQRGSFKSPWRFICTTI-DKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLT

Query:  TNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RDMNDSWSWPLEASKLFSVKS----LMVDLLGGSDTILNNLYSVIWKDNYP
                ++W      W+ +      + +     N    L  + L       D  SW       FSV+S    L VD +   +  + + ++ +WK   P
Subjt:  TNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RDMNDSWSWPLEASKLFSVKS----LMVDLLGGSDTILNNLYSVIWKDNYP

Query:  KKIKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAF
        +++K FLW +   A+ T +   RR  H S S + C +C   +E+  H+   C      W  ++         S ++ ++L          G + I W   
Subjt:  KKIKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAF

Query:  NRVFFWFLWNERNGRIF
          V  W+ W  R G IF
Subjt:  NRVFFWFLWNERNGRIF

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACTTCTCCTTCAGCCTCACTCCCATGGAACCATCCTACCCGATCCATTTCCATAGATCGAAAAACCTTCTCTATAGCTTTTGATGAATACTCACGTGGAAGTAAAGC
AAAAATCACTGAAAAAGGTAGAAATTTCTCCAAGTCTCTGTCTCTCACTTGGAAATCCCTCAACTGGTTAGCCTCCACTTTCAACACCCTTGCCAAAGAACCTTGTACGT
ACAAATTCTTCTCAGAATTTTGTGGGGATGAATATATTCTTTGTGTTGAAAAACTCAACAATAAACATGGATACTTTGTGGAGATCAACCAACTGCAGAACTCTGGGAGC
AGAATCAGCATTCTCATCCCTTCCGAAAGCAACAAACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCA
ATCACGATCGTATAAGGAGATCCTCCAACAGAAGCAACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGT
TCTATCAGCGTGATGATTGGCCATCCATTCGTTCATCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTAT
GATCGCAAGACAGCCCTTGAGTTATGCAAATCGACTGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTT
CACTGTTTCTTATGGTGGTTGGATAGAAGTTCGAAACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCGTTGGTGATAGTTGTGGAGGCTACCTAACAACCT
CAAGTCACACCGATAGGATGATCAATCTCATGGAAGCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGC
GAAGAAATTACAGTCCAAATTCGGGGACTTACCGGAGAGACAATCGGACGGGAACAATTTAATGAGCGGAACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAA
ATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAGAGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCAT
TTAATGAAGATTCAATATCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATCGGTGGAATTAAAAAAT
AACATATCTCTTCCTGTGGGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACATCTGGTCTGGGAGGTGATCGTTGGATCCTTGGAGGAGACTTTAATGTTACCCG
ATGGTCATGGGAGAAATCTCATGGTCGTCACATCACTCGGAGTATGCGTATTTTCAACCAATTGATTGCAACTTACAAGCTTCTGGATATCCCATTACAAAATGGTTGTT
TCACCTGGTCCAGTTTTGGTGACAATCCGTATCTCTCCTTAATAGACAGATTTTTGATTTCCAAAGATTGTCTGAATAAATTCGGGTCTTCTCATCTTCTTCGGCTTGAC
AGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTTCGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCCGTGA
GGTGTTGGATAATTGGTGGAATCACAATTCTCTTCAAGCCTTGTTAGCTAAATGGATTTGGAGATTTTTGCACGAAGAAGGATCTCTTTGGCGTAATCTCATTATTGCTA
AATATTACAACTCGGAGGATGTTAGACATTGGCCTATTCCCATTCAAAGGGGATCTTTCAAATCTCCTTGGCGCTTTATTTGTACTACTATCGATAAGGTTACTAGTCGT
ATTCATCGAATTATTGGTAATGGTTGTAGCACATTTTTTTGGAAGGATTCCTGGCTGAATGGAGTGATTCTCTCAAATCTCTTCCCTCGCCTTTATCGGTTAACTACCAA
TCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTATAGAATCAGCATGGAATCTGAGTCCTCGTCGTCACCTTAATGAGTTTGAGATTATTGAATGGGCAAATTTATCCT
ATCTTCTGTCTCCTATTCGACTTCGGGATATGAATGACTCTTGGTCTTGGCCCCTTGAAGCATCCAAATTATTCTCTGTTAAATCCTTGATGGTTGACCTTTTGGGTGGT
TCTGATACTATTTTGAATAATTTATATTCGGTGATATGGAAAGATAATTATCCTAAAAAGATAAAAATCTTTCTATGGGAGCTTAGCCTTGGGGCTATCAATACAGCGGA
TCGTTTTCAACGTAGAATGCCTCATTTTTCACTTTCGCCATCTTGGTGTGTTATGTGCTCATCAAATATGGAGAATTCGGGCCATCTATTTGTCACTTGTTCTTTTGCTA
CCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGTTGGCATTTACCGATGTCGAATAACATTCATGACTTTCTGGCGTCTATTTTTGTTGGTCATCCTTTTCAAGGAACA
AAGAGGATTTTATGGCTGGCTTTTAATAGAGTCTTCTTTTGGTTTCTTTGGAACGAAAGAAATGGAAGAATTTTTAGGGATGTATCCTCAACCTTTGATTCTTTTTTTGA
AAAGGTTCTTTTCTATGCGTTGTATTGGTGTAAATGTCAACACCCCTTTGCTTCTTATAGTCTTTCTTCTTTGATTGCTACTTGGAATAATTTCTTGTAA
mRNA sequence
ATGACTTCTCCTTCAGCCTCACTCCCATGGAACCATCCTACCCGATCCATTTCCATAGATCGAAAAACCTTCTCTATAGCTTTTGATGAATACTCACGTGGAAGTAAAGC
AAAAATCACTGAAAAAGGTAGAAATTTCTCCAAGTCTCTGTCTCTCACTTGGAAATCCCTCAACTGGTTAGCCTCCACTTTCAACACCCTTGCCAAAGAACCTTGTACGT
ACAAATTCTTCTCAGAATTTTGTGGGGATGAATATATTCTTTGTGTTGAAAAACTCAACAATAAACATGGATACTTTGTGGAGATCAACCAACTGCAGAACTCTGGGAGC
AGAATCAGCATTCTCATCCCTTCCGAAAGCAACAAACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCA
ATCACGATCGTATAAGGAGATCCTCCAACAGAAGCAACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGT
TCTATCAGCGTGATGATTGGCCATCCATTCGTTCATCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTAT
GATCGCAAGACAGCCCTTGAGTTATGCAAATCGACTGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTT
CACTGTTTCTTATGGTGGTTGGATAGAAGTTCGAAACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCGTTGGTGATAGTTGTGGAGGCTACCTAACAACCT
CAAGTCACACCGATAGGATGATCAATCTCATGGAAGCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGC
GAAGAAATTACAGTCCAAATTCGGGGACTTACCGGAGAGACAATCGGACGGGAACAATTTAATGAGCGGAACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAA
ATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAGAGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCAT
TTAATGAAGATTCAATATCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATCGGTGGAATTAAAAAAT
AACATATCTCTTCCTGTGGGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACATCTGGTCTGGGAGGTGATCGTTGGATCCTTGGAGGAGACTTTAATGTTACCCG
ATGGTCATGGGAGAAATCTCATGGTCGTCACATCACTCGGAGTATGCGTATTTTCAACCAATTGATTGCAACTTACAAGCTTCTGGATATCCCATTACAAAATGGTTGTT
TCACCTGGTCCAGTTTTGGTGACAATCCGTATCTCTCCTTAATAGACAGATTTTTGATTTCCAAAGATTGTCTGAATAAATTCGGGTCTTCTCATCTTCTTCGGCTTGAC
AGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTTCGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCCGTGA
GGTGTTGGATAATTGGTGGAATCACAATTCTCTTCAAGCCTTGTTAGCTAAATGGATTTGGAGATTTTTGCACGAAGAAGGATCTCTTTGGCGTAATCTCATTATTGCTA
AATATTACAACTCGGAGGATGTTAGACATTGGCCTATTCCCATTCAAAGGGGATCTTTCAAATCTCCTTGGCGCTTTATTTGTACTACTATCGATAAGGTTACTAGTCGT
ATTCATCGAATTATTGGTAATGGTTGTAGCACATTTTTTTGGAAGGATTCCTGGCTGAATGGAGTGATTCTCTCAAATCTCTTCCCTCGCCTTTATCGGTTAACTACCAA
TCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTATAGAATCAGCATGGAATCTGAGTCCTCGTCGTCACCTTAATGAGTTTGAGATTATTGAATGGGCAAATTTATCCT
ATCTTCTGTCTCCTATTCGACTTCGGGATATGAATGACTCTTGGTCTTGGCCCCTTGAAGCATCCAAATTATTCTCTGTTAAATCCTTGATGGTTGACCTTTTGGGTGGT
TCTGATACTATTTTGAATAATTTATATTCGGTGATATGGAAAGATAATTATCCTAAAAAGATAAAAATCTTTCTATGGGAGCTTAGCCTTGGGGCTATCAATACAGCGGA
TCGTTTTCAACGTAGAATGCCTCATTTTTCACTTTCGCCATCTTGGTGTGTTATGTGCTCATCAAATATGGAGAATTCGGGCCATCTATTTGTCACTTGTTCTTTTGCTA
CCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGTTGGCATTTACCGATGTCGAATAACATTCATGACTTTCTGGCGTCTATTTTTGTTGGTCATCCTTTTCAAGGAACA
AAGAGGATTTTATGGCTGGCTTTTAATAGAGTCTTCTTTTGGTTTCTTTGGAACGAAAGAAATGGAAGAATTTTTAGGGATGTATCCTCAACCTTTGATTCTTTTTTTGA
AAAGGTTCTTTTCTATGCGTTGTATTGGTGTAAATGTCAACACCCCTTTGCTTCTTATAGTCTTTCTTCTTTGATTGCTACTTGGAATAATTTCTTGTAA
Protein sequence
MTSPSASLPWNHPTRSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFCGDEYILCVEKLNNKHGYFVEINQLQNSGS
RISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVY
DRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFVGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVG
EEITVQIRGLTGETIGREQFNERNQLRKGSYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKN
NISLPVGPTNLKIGQKGSTSGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLD
RITSDHYPLSLTFGDINWGPGPFRFENSWLQIASFREVLDNWWNHNSLQALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSR
IHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGG
SDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRFQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGT
KRILWLAFNRVFFWFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL