; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016033 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016033
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold6:41462658..41468298
RNA-Seq ExpressionSpg016033
SyntenySpg016033
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG98819.1 VIRB2-interacting protein 2 [Prunus dulcis]3.1e-10336.52Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   SG+ +N +K  ++GI++DD  + EM   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC  G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R++AL AKW+WRF +E  +LW  +I + Y    D   W   P  RGS + PWR I +  +        ++G+G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W  + S LF+  S    +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +  +   Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI
        SI +    +G K ++LW +  +     LW ERN RIF D         +++V F+A  W      F      S++
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI

BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis]3.1e-10336.52Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   SG+ +N +K  ++GI++DD  + EM   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC  G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R++AL AKW+WRF +E  +LW  +I + Y    D   W   P  RGS + PWR I +  +        ++G+G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W  + S LF+  S    +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +  +   Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI
        SI +    +G K ++LW +  +     LW ERN RIF D         +++V F+A  W      F      S++
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.0e-10235.84Show/hide
Query:  WRVWEVIVGSLEETLMLPDGHGRNLMVVTSLGI---TTLFPLLLVILIGVLG--LFDLKIPGCKLHHSVS-----LNHLQFADDTLLFSSFDSEALNKLF
        WR+W     S     +L +G+ +   V  S G+     L P L  ++  VL   LF  +  G     SV      ++ LQFADDT+ FS    E L  L 
Subjt:  WRVWEVIVGSLEETLMLPDGHGRNLMVVTSLGI---TTLFPLLLVILIGVLG--LFDLKIPGCKLHHSVS-----LNHLQFADDTLLFSSFDSEALNKLF

Query:  EVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKFGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMP
         ++ +F   SGL +NL KS I GI+     L  + + F C+   WP +YLGLPLGGNPK   FW PV+E+I  +L  WK A++S GGR+TLIQ+ LS +P
Subjt:  EVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKFGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMP

Query:  TYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPIP
        +YFLSLFK+P+ +A  ++K+ R+F W G+      H V W+    PK LGGLG G   LRNIALL KW+WRF  E   LW   +I   Y +         
Subjt:  TYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPIP

Query:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNLSPRRHLNEFEIIEWANLSYLL
        + R S + PW+ I     + +  +  ++GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS   AWNL+ RR+L + EI     L   L
Subjt:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNLSPRRHLNEFEIIEWANLSYLL

Query:  SPIR-LRDMNDSWSWPLEASKLFSVKSLMVDLLGGFDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGH
        S +R    + DS +W L +S LF+VKS  + L    + IL      +W    P K+K   W ++ G +NT D+LQ R P+ SL P WC++C  N E+  H
Subjt:  SPIR-LRDMNDSWSWPLEASKLFSVKSLMVDLLGGFDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGH

Query:  LFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFLFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFAS
        LF+ C      WN + +  G       +  D L   F G       + LW          +W ERN RIF D   + ++ ++ +LFY+  W  C   F  
Subjt:  LFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFLFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFAS

Query:  YSLSSLIATWN
          L+ +   W+
Subjt:  YSLSSLIATWN

VVA25489.1 Hypothetical predicted protein, partial [Prunus dulcis]1.5e-10237.1Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   SG+ +N +K  ++GI++DD  + EM   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC  G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R++AL AKW+WRF +E  +LW  +I + Y    D   W   P  RGS + PWR I +  +        ++G+G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W  + S LF+  SL   +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +      Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF
        SI +    +G K +ILW +  +     LW ERN RIF +         +++V F+A  W      F
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF

VVA39726.1 Hypothetical predicted protein, partial [Prunus dulcis]2.0e-10237.1Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   S + +N +K  ++GI++DD  + E+   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC+ G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R+  L AKW+WRF +E  +LW  +I + Y    D   W   P  RGS +SPWR I +  +        ++G G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W L+ S LF+  SL   +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +  +   Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF
        SI +    +G K +ILW +  +     LW ERN RIF D         +++V F+A  W      F
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF

TrEMBL top hitse value%identityAlignment
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein9.6e-10335.84Show/hide
Query:  WRVWEVIVGSLEETLMLPDGHGRNLMVVTSLGI---TTLFPLLLVILIGVLG--LFDLKIPGCKLHHSVS-----LNHLQFADDTLLFSSFDSEALNKLF
        WR+W     S     +L +G+ +   V  S G+     L P L  ++  VL   LF  +  G     SV      ++ LQFADDT+ FS    E L  L 
Subjt:  WRVWEVIVGSLEETLMLPDGHGRNLMVVTSLGI---TTLFPLLLVILIGVLG--LFDLKIPGCKLHHSVS-----LNHLQFADDTLLFSSFDSEALNKLF

Query:  EVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKFGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMP
         ++ +F   SGL +NL KS I GI+     L  + + F C+   WP +YLGLPLGGNPK   FW PV+E+I  +L  WK A++S GGR+TLIQ+ LS +P
Subjt:  EVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKFGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMP

Query:  TYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPIP
        +YFLSLFK+P+ +A  ++K+ R+F W G+      H V W+    PK LGGLG G   LRNIALL KW+WRF  E   LW   +I   Y +         
Subjt:  TYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPIP

Query:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNLSPRRHLNEFEIIEWANLSYLL
        + R S + PW+ I     + +  +  ++GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS   AWNL+ RR+L + EI     L   L
Subjt:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNLSPRRHLNEFEIIEWANLSYLL

Query:  SPIR-LRDMNDSWSWPLEASKLFSVKSLMVDLLGGFDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGH
        S +R    + DS +W L +S LF+VKS  + L    + IL      +W    P K+K   W ++ G +NT D+LQ R P+ SL P WC++C  N E+  H
Subjt:  SPIR-LRDMNDSWSWPLEASKLFSVKSLMVDLLGGFDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGH

Query:  LFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFLFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFAS
        LF+ C      WN + +  G       +  D L   F G       + LW          +W ERN RIF D   + ++ ++ +LFY+  W  C   F  
Subjt:  LFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFLFLWNERNGRIFRDVSSTFDSFFEKVLFYALYWCKCQHPFAS

Query:  YSLSSLIATWN
          L+ +   W+
Subjt:  YSLSSLIATWN

A0A4Y1R3V4 VIRB2-interacting protein 21.5e-10336.52Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   SG+ +N +K  ++GI++DD  + EM   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC  G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R++AL AKW+WRF +E  +LW  +I + Y    D   W   P  RGS + PWR I +  +        ++G+G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W  + S LF+  S    +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +  +   Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI
        SI +    +G K ++LW +  +     LW ERN RIF D         +++V F+A  W      F      S++
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI

A0A5E4FED8 Reverse transcriptase domain-containing protein (Fragment)7.4e-10337.1Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   SG+ +N +K  ++GI++DD  + EM   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC  G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R++AL AKW+WRF +E  +LW  +I + Y    D   W   P  RGS + PWR I +  +        ++G+G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W  + S LF+  SL   +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +      Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF
        SI +    +G K +ILW +  +     LW ERN RIF +         +++V F+A  W      F
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF

A0A5E4GJ11 Reverse transcriptase domain-containing protein (Fragment)9.6e-10337.1Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   S + +N +K  ++GI++DD  + E+   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC+ G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R+  L AKW+WRF +E  +LW  +I + Y    D   W   P  RGS +SPWR I +  +        ++G G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W L+ S LF+  SL   +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +  +   Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF
        SI +    +G K +ILW +  +     LW ERN RIF D         +++V F+A  W      F
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPF

A0A5H2Y6K0 VIRB2-interacting protein 21.5e-10336.52Show/hide
Query:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF
        L P L  +++ VL     K       H +S       ++HLQFADDT+ F     E  N L +++ +F   SG+ +N +K  ++GI++DD  + EM   +
Subjt:  LFPLLLVILIGVLGLFDLKIPGCKLHHSVS-------LNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEEMVAKF

Query:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV
        GC  G WP  YLGLPLGGNP+   FW PV+EK++++L  WK A +SKGGRLT+IQA L S+P Y++SLF++P  VA  ++KL+RDF WEG  G    H V
Subjt:  GCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNV

Query:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW
        +W+     K  GGLG+G+ + R++AL AKW+WRF +E  +LW  +I + Y    D   W   P  RGS + PWR I +  +        ++G+G    FW
Subjt:  NWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPI-PIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFW

Query:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF
        +D W  G +L  +FPRL+ L+   N  ++   +S  +  +W+   RR+LNE EI E A L  LL  +RL     D   W  + S LF+  S    +    
Subjt:  KDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RDMNDSWSWPLEASKLFSVKSLMVDLLGGF

Query:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA
        +  +   Y+ IWK   P K+KIF+W+  LG +NT D LQRR P+  +SP WC +C+   ++  HL + C F+ K W  +L+     W +P        L 
Subjt:  DTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG--WHLPMSNNIHDFLA

Query:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI
        SI +    +G K ++LW +  +     LW ERN RIF D         +++V F+A  W      F      S++
Subjt:  SIFVGHPFQGTK-RILWLAFNRVFFLFLWNERNGRIFRDVSST-FDSFFEKVLFYALYWCKCQHPFASYSLSSLI

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.7e-3528.4Show/hide
Query:  VIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLA
        ++E++  ++  W+   +S  GRLTL +A LSSMP + +S   LP  +   LD+L R F W  +      H V W K   PK  GGLG+   +  N AL++
Subjt:  VIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLA

Query:  KWIWRFLHEERSLWRNLIIAKYYNSEDVR--HWPIPIQRGSFKSPWRFICTTI-DKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNA
        K  WR L E+ SLW  L++ K Y+  ++R   W IP  +GS+ S WR I   + D V+  +  I G+G    FW D W++G  L  L     R T     
Subjt:  KWIWRFLHEERSLWRNLIIAKYYNSEDVR--HWPIPIQRGSFKSPWRFICTTI-DKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNA

Query:  RVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RDMNDSWSWPLEASKLFSVKS----LMVDLLGGFDTILNNLYSVIWKDNYPKKIKI
           ++W      W+ +      + +     N    L  + L       D  SW       FSV+S    L VD +   +  + + ++ +WK   P+++K 
Subjt:  RVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RDMNDSWSWPLEASKLFSVKS----LMVDLLGGFDTILNNLYSVIWKDNYPKKIKI

Query:  FLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFF
        FLW +   A+ T +   RR  H S S + C +C   +E+  H+   C      W  ++         S ++ ++L          G + I W     V  
Subjt:  FLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFF

Query:  LFLWNERNGRIF
         + W  R G IF
Subjt:  LFLWNERNGRIF

P93295 Uncharacterized mitochondrial protein AtMg003106.0e-0928.08Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRH
        ++P Y +S F+L   + K L   + +F+W   +    +  V W K    K   GGLG  +    N ALLAK  +R +H+  +L   L+ ++Y+    +  
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRH

Query:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL
          +   R S+   WR I    + ++  + R IG+G  T  W D W+
Subjt:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein4.8e-3025.96Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHW
        ++PTY ++ F LP  V K +  ++ DF+W   Q   GMH   WD     K  GG+G  + +  N+ALL K +WR L    SL   +  ++Y++  D  + 
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHW

Query:  PIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSY
        P+   R SF   W+ I  + + +      ++GNG     W+  WL+    S    R+ R+     A V+ +    +           +  E++       
Subjt:  PIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSY

Query:  LLSPIRL--RDMNDSWSWPLEASKLFSVKS---LMVDLLGG-------FDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSW
        L+  +R   R + DS++W   +S  ++VKS   ++  ++          +  LN +Y  IWK     KI+ FLW+    ++  A  L  R  H S   S 
Subjt:  LLSPIRL--RDMNDSWSWPLEASKLFSVKS---LMVDLLGG-------FDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSW

Query:  CVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIF---VGHP--FQGTKRILWLAFNRVFFLFLWNERNGRIFR
        C+ C S  E   HL   C+FA   W +           +++I+  L  +F    G+P   + ++ + WL +       LW  RN  +FR
Subjt:  CVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIF---VGHP--FQGTKRILWLAFNRVFFLFLWNERNGRIFR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.2e-1028.08Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRH
        ++P Y +S F+L   + K L   + +F+W   +    +  V W K    K   GGLG  +    N ALLAK  +R +H+  +L   L+ ++Y+    +  
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRH

Query:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL
          +   R S+   WR I    + ++  + R IG+G  T  W D W+
Subjt:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTCGGACGACGGCGGCGGAGGCTTAAGCAGGGAACGGCGGCGCCGGCGACTGCTGTGGAAGCGGCGGCACAGAATCAGCATTCTCATCCCTTCCGAAAGCAACAA
ACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCAATCACGATCGTATAAGGAGATCCTCCAACAGAAGC
AACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGTTCTATCAGCGTGATGATTGGCCATCCATTCGTTCA
TCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTATGATCGCAAGACAGCCCTTGAGTTATGCAAATCGAC
TGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTTCACTGTTTCATATGGTGGTTGGATAGAAGTTCGAA
ACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCATTGGTGATAGTTGTGGAGGCTACCTAACAACCTCAAGTCACACCGATAGAATGATCAATCTCATGGAA
GCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGCGAAGAAATTACAGTCCAAATTCGGGGACTTACCGG
AGAGACAATCGGACGGGAACAATTTAATGAGCGGCACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAAATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAG
AGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCATTTAATGAAGATTCAATATCGGTTGATTTTCCTAAT
TTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAAGAATGGAAGAGCCATCGGTGGAATTAAAAAATAACATATCTCTTCCTGTGGGCCCTGCGAATTTGAA
AATTGGTCAAAAGGGCTCAACATCTGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTGATACTGAAGCTTATTTATCCAGCCCATCTCCAATCAATTCACCTCATAAGA
TTAATTTGGACCCACCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAACATCATCTTCCTTTAGCCATTATGCCGCCAAAATCAACAAATGCTGGCCAA
TCAGCCATAAATAACAAAGTGAATGCTCCAGCTCCAGATATTTTAACCAACCATCCCCAACCAGAAACACCTCCCATCAATCAGCCAATGTTTGCCCTCCCAGAATATCT
CCGTCATATAGCTCCAATTCTTAGTGAGCATGGGTTGTGTATCATGGCTATCCCTCCATTTCTACCACCTAAAAGGAAGACAGTTACTACTACCGGGAAGAAAACAAAAC
TCCAGAGAGAGCTTGATAACCTAAAAACTACAGTGCATTATGATAAAACTGCTTCTTTGGCCTTAACGGAGGGAGTACAGAATTACATGATCTGGCGGGTCTGGGAGGTG
ATCGTTGGATCCTTGGAGGAGACTTTAATGTTACCTGATGGTCATGGGAGAAATCTCATGGTCGTCACATCACTCGGAATCACTACCCTCTTTCCCTTACTTTTGGTGAT
ATTAATTGGGGTCCTGGGCCTTTTCGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCCGTGAGCCTGAATCATTTACAATTTGCGGACGACACACTTTTATTCTCTT
CTTTTGATTCAGAGGCTTTGAACAAGCTTTTTGAAGTTATCAATATATTTGAATTGGCTTCTGGTCTAAATGTCAACCTTGCCAAGAGTGAAATCTTAGGGATCCATATT
GATGATACAGAGTTGGAAGAAATGGTTGCTAAATTTGGTTGTAAGCGTGGTTTTTGGCCTAGCACATATCTTGGGCTTCCTTTGGGTGGTAACCCGAAAAACTTTGTTTT
CTGGCAACCAGTTATTGAGAAGATTCAACATAAATTACATAGTTGGAAATATGCCTTTATATCCAAAGGAGGAAGGCTTACCCTCATCCAAGCTACTCTTTCGAGTATGC
CGACGTATTTTCTGTCTTTGTTTAAACTTCCAAGTAAGGTTGCTAAATCTCTTGACAAGCTAGTGCGAGATTTTTTCTGGGAAGGTTCTCAAGGGGATGGTGGTATGCAT
AATGTTAATTGGGATAAGACTCAGCTTCCGAAATTACTGGGAGGTCTTGGCATTGGCAATTTTCAGCTTCGAAATATAGCCTTGTTAGCTAAATGGATTTGGAGATTTTT
GCACGAAGAAAGATCTCTTTGGCGTAATCTCATTATTGCTAAATATTACAACTCGGAGGATGTTAGACATTGGCCTATTCCCATTCAAAGGGGATCTTTCAAATCTCCTT
GGCGCTTTATTTGTACTACTATCGATAAGGTTACTAGTCGTATTCATCGAATTATTGGTAATGGTTGTAGCACATTTTTTTGGAAGGATTCCTGGCTGAATGGAGTGATT
CTCTCAAATCTCTTCCCTCGCCTTTATCGGTTAACTACCAATCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTATAGAATCAGCATGGAATCTGAGTCCTCGTCGTCA
CCTTAATGAGTTTGAGATTATTGAATGGGCAAATTTATCCTATCTTCTGTCTCCTATTCGACTTCGGGATATGAATGACTCTTGGTCTTGGCCCCTTGAAGCATCCAAAT
TATTCTCTGTTAAATCCTTGATGGTTGACCTTTTGGGTGGTTTTGATACTATTTTGAATAATTTATATTCGGTGATATGGAAAGATAATTATCCTAAAAAGATAAAAATC
TTTCTATGGGAGCTTAGCCTTGGGGCTATCAATACAGCGGATCGTCTTCAACGTAGAATGCCTCATTTTTCACTTTCGCCATCTTGGTGTGTTATGTGCTCATCAAATAT
GGAGAATTCGGGCCATCTATTTGTCACTTGTTCTTTTGCTACCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGTTGGCATTTACCGATGTCGAATAACATTCATGACT
TTCTGGCGTCTATTTTTGTTGGTCATCCTTTTCAAGGAACAAAGAGGATTTTATGGCTGGCTTTTAATAGAGTCTTCTTTTTGTTTCTTTGGAACGAAAGAAATGGAAGA
ATTTTTAGGGATGTATCCTCAACCTTTGATTCTTTTTTTGAAAAGGTTCTTTTCTATGCGTTGTATTGGTGTAAATGTCAACACCCTTTTGCTTCTTATAGTCTTTCTTC
TTTGATTGCTACTTGGAATAATTTCTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGTCGGACGACGGCGGCGGAGGCTTAAGCAGGGAACGGCGGCGCCGGCGACTGCTGTGGAAGCGGCGGCACAGAATCAGCATTCTCATCCCTTCCGAAAGCAACAA
ACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCAATCACGATCGTATAAGGAGATCCTCCAACAGAAGC
AACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGTTCTATCAGCGTGATGATTGGCCATCCATTCGTTCA
TCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTATGATCGCAAGACAGCCCTTGAGTTATGCAAATCGAC
TGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTTCACTGTTTCATATGGTGGTTGGATAGAAGTTCGAA
ACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCATTGGTGATAGTTGTGGAGGCTACCTAACAACCTCAAGTCACACCGATAGAATGATCAATCTCATGGAA
GCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGCGAAGAAATTACAGTCCAAATTCGGGGACTTACCGG
AGAGACAATCGGACGGGAACAATTTAATGAGCGGCACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAAATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAG
AGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCATTTAATGAAGATTCAATATCGGTTGATTTTCCTAAT
TTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAAGAATGGAAGAGCCATCGGTGGAATTAAAAAATAACATATCTCTTCCTGTGGGCCCTGCGAATTTGAA
AATTGGTCAAAAGGGCTCAACATCTGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTGATACTGAAGCTTATTTATCCAGCCCATCTCCAATCAATTCACCTCATAAGA
TTAATTTGGACCCACCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAACATCATCTTCCTTTAGCCATTATGCCGCCAAAATCAACAAATGCTGGCCAA
TCAGCCATAAATAACAAAGTGAATGCTCCAGCTCCAGATATTTTAACCAACCATCCCCAACCAGAAACACCTCCCATCAATCAGCCAATGTTTGCCCTCCCAGAATATCT
CCGTCATATAGCTCCAATTCTTAGTGAGCATGGGTTGTGTATCATGGCTATCCCTCCATTTCTACCACCTAAAAGGAAGACAGTTACTACTACCGGGAAGAAAACAAAAC
TCCAGAGAGAGCTTGATAACCTAAAAACTACAGTGCATTATGATAAAACTGCTTCTTTGGCCTTAACGGAGGGAGTACAGAATTACATGATCTGGCGGGTCTGGGAGGTG
ATCGTTGGATCCTTGGAGGAGACTTTAATGTTACCTGATGGTCATGGGAGAAATCTCATGGTCGTCACATCACTCGGAATCACTACCCTCTTTCCCTTACTTTTGGTGAT
ATTAATTGGGGTCCTGGGCCTTTTCGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCCGTGAGCCTGAATCATTTACAATTTGCGGACGACACACTTTTATTCTCTT
CTTTTGATTCAGAGGCTTTGAACAAGCTTTTTGAAGTTATCAATATATTTGAATTGGCTTCTGGTCTAAATGTCAACCTTGCCAAGAGTGAAATCTTAGGGATCCATATT
GATGATACAGAGTTGGAAGAAATGGTTGCTAAATTTGGTTGTAAGCGTGGTTTTTGGCCTAGCACATATCTTGGGCTTCCTTTGGGTGGTAACCCGAAAAACTTTGTTTT
CTGGCAACCAGTTATTGAGAAGATTCAACATAAATTACATAGTTGGAAATATGCCTTTATATCCAAAGGAGGAAGGCTTACCCTCATCCAAGCTACTCTTTCGAGTATGC
CGACGTATTTTCTGTCTTTGTTTAAACTTCCAAGTAAGGTTGCTAAATCTCTTGACAAGCTAGTGCGAGATTTTTTCTGGGAAGGTTCTCAAGGGGATGGTGGTATGCAT
AATGTTAATTGGGATAAGACTCAGCTTCCGAAATTACTGGGAGGTCTTGGCATTGGCAATTTTCAGCTTCGAAATATAGCCTTGTTAGCTAAATGGATTTGGAGATTTTT
GCACGAAGAAAGATCTCTTTGGCGTAATCTCATTATTGCTAAATATTACAACTCGGAGGATGTTAGACATTGGCCTATTCCCATTCAAAGGGGATCTTTCAAATCTCCTT
GGCGCTTTATTTGTACTACTATCGATAAGGTTACTAGTCGTATTCATCGAATTATTGGTAATGGTTGTAGCACATTTTTTTGGAAGGATTCCTGGCTGAATGGAGTGATT
CTCTCAAATCTCTTCCCTCGCCTTTATCGGTTAACTACCAATCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTATAGAATCAGCATGGAATCTGAGTCCTCGTCGTCA
CCTTAATGAGTTTGAGATTATTGAATGGGCAAATTTATCCTATCTTCTGTCTCCTATTCGACTTCGGGATATGAATGACTCTTGGTCTTGGCCCCTTGAAGCATCCAAAT
TATTCTCTGTTAAATCCTTGATGGTTGACCTTTTGGGTGGTTTTGATACTATTTTGAATAATTTATATTCGGTGATATGGAAAGATAATTATCCTAAAAAGATAAAAATC
TTTCTATGGGAGCTTAGCCTTGGGGCTATCAATACAGCGGATCGTCTTCAACGTAGAATGCCTCATTTTTCACTTTCGCCATCTTGGTGTGTTATGTGCTCATCAAATAT
GGAGAATTCGGGCCATCTATTTGTCACTTGTTCTTTTGCTACCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGTTGGCATTTACCGATGTCGAATAACATTCATGACT
TTCTGGCGTCTATTTTTGTTGGTCATCCTTTTCAAGGAACAAAGAGGATTTTATGGCTGGCTTTTAATAGAGTCTTCTTTTTGTTTCTTTGGAACGAAAGAAATGGAAGA
ATTTTTAGGGATGTATCCTCAACCTTTGATTCTTTTTTTGAAAAGGTTCTTTTCTATGCGTTGTATTGGTGTAAATGTCAACACCCTTTTGCTTCTTATAGTCTTTCTTC
TTTGATTGCTACTTGGAATAATTTCTTGTAA
Protein sequenceShow/hide protein sequence
MWSDDGGGGLSRERRRRRLLWKRRHRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDLVWTDIIVVQRFYQRDDWPSIRS
SILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLME
ARLKVRQNSTGFIPSSIALPIALVGEEITVQIRGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPN
LTDSTKSSKGKEPLRMEEPSVELKNNISLPVGPANLKIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPKSTNAGQ
SAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQNYMIWRVWEV
IVGSLEETLMLPDGHGRNLMVVTSLGITTLFPLLLVILIGVLGLFDLKIPGCKLHHSVSLNHLQFADDTLLFSSFDSEALNKLFEVINIFELASGLNVNLAKSEILGIHI
DDTELEEMVAKFGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLHSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMH
NVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEERSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVI
LSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDSWSWPLEASKLFSVKSLMVDLLGGFDTILNNLYSVIWKDNYPKKIKI
FLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASIFVGHPFQGTKRILWLAFNRVFFLFLWNERNGR
IFRDVSSTFDSFFEKVLFYALYWCKCQHPFASYSLSSLIATWNNFL