; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008564 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008564
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:25480901..25484198
RNA-Seq ExpressionLag0008564
SyntenyLag0008564
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis]2.2e-7526.24Show/hide
Query:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV
        V  N F++   +   +DR+    PW FD  L ++          E+ FD  ++WVQFHN+P+   N++    LG  IGEV  ++ +E D   GR LRV++
Subjt:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV

Query:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPKNKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRPMGR
         L   +PL RG T+      KV   +KYE++P FC+ CG + H    C   K       QFGSWLRAE+    R       S D G              
Subjt:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPKNKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYRPMGR

Query:  QEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAERDAEGKKS
            S E+  E+N  +                 D   P K                 +    SM     +  + L +D  S N          D      
Subjt:  QEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAERDAEGKKS

Query:  FFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYI--GPSNDFYNMDVEISNPSPNTLRNKFEAVDGQF
        + ++  +       DL  +     +  H    +    G       GPS   G  + +A    T  +  GP        + ++    +T  +  +   G F
Subjt:  FFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYI--GPSNDFYNMDVEISNPSPNTLRNKFEAVDGQF

Query:  LNIRPMDPN--QSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYRRSLLNRPA
        L   P++ +  + ++G+    +Q++ P   + +      +  +       N       S + + +  D  +L++ +     +   +    ++  L     
Subjt:  LNIRPMDPN--QSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYRRSLLNRPA

Query:  ESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII---ENDGKTFRFTGFYGEPKPENRHLSWTLLRRLSDFPHQA
                                 V C G SGG+ALFW++   + I ++SK HI   +   E + + +  TGFYG      RH SW LLR L     + 
Subjt:  ESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII---ENDGKTFRFTGFYGEPKPENRHLSWTLLRRLSDFPHQA

Query:  WVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI-------------------------------
        W++ GDFN I    EK GG     ++ + F   I++C L DLGF+GN FTW N +    CI + LDR +                               
Subjt:  WVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI-------------------------------

Query:  -----------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKEEKR
                               DC   IQ  W    G    + ++ +I+   E+L  W +   G  +  + ++K R+  + +      +   L++    
Subjt:  -----------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKEEKR

Query:  LDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEI-----LQATTGRVT
        +   L   EI WKQRS+  WL  GD+N+++FH KA  RR+KN I+++ DSNG  ++N E  +     Y++ LFK       A DE+     LQ   GR+T
Subjt:  LDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEI-----LQATTGRVT

Query:  DAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEIP
          M   LD  F+  EV+ AL +MHPTKAPGPDG+  LFYQKYW+VVG  VT + L  LN G  P
Subjt:  DAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEIP

KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis]9.2e-8227.55Show/hide
Query:  VQGLEFLSRFNDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNE
        V+G++F      ++     LV     + + +++ E PW FD+ L+LL       Q  +I      +WV+ H++P+  RNE + R++GG +GEVL +D + 
Subjt:  VQGLEFLSRFNDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNE

Query:  VDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP--KNKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDS
         +   G ++RVRV +  ++PL R   +         V   YERLPD C+ CG+LGH    C      + +  +  +G WLRA   +  RS   GGR+   
Subjt:  VDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP--KNKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDS

Query:  GRRGRGRFRYRPMGRQEWGSDESEDEENEASTPERTGR-SDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNA
                    M      +  + D     S  +  G+ S A+++     +  P+ +    P        N+      +++P  S   E + I   S + 
Subjt:  GRRGRGRFRYRPMGRQEWGSDESEDEENEASTPERTGR-SDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNA

Query:  GLFPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSP
        G                 KE      L V               +RN  +  +G M    +  S+    ++           GPS++     V       
Subjt:  GLFPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSP

Query:  NTLRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSN---VPNQ
        ++ RN  +   GQ +  R +   Q++  E            GR +         TG    +   +   L S+ G+KR      + + E +K N   +P  
Subjt:  NTLRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSN---VPNQ

Query:  RLTRLY--RRSLLNRPAESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTIIEN-DGKTFRFTGFYGEPKPENRHL
        R + L   RR L+  PA S+E+  LE     EP     T+     +G SGGLALFW  +I+L IVSYS+ HI   I+N DG  +  TG YG P+   R  
Subjt:  RLTRLY--RRSLLNRPAESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTIIEN-DGKTFRFTGFYGEPKPENRHL

Query:  SWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI----------------
         W LL+ L    +  W+V GDFN I   +EK GG+     + + F + ++DC L+DLG+ G  FTW N +  +  +++ LDR +                
Subjt:  SWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI----------------

Query:  --------------------------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTED-
                                              +C+S I++ W +  G  S   ++ RI   + EL RW +   G  +  +A +KR+++ L  + 
Subjt:  --------------------------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTED-

Query:  ----KLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNA
             L+     CL+ ++      L  +E+ WKQRSR  WL+ GD N+++FHSKA+ RR+KN I ++ D +G C++  + ++     Y+ +LF ++  + 
Subjt:  ----KLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNA

Query:  QAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEIPE
          + +IL     RVT  MN  L + + A EVE AL QMHP+KAPGPDG+  LF+QKYW V+G+ +T+  L  LN G +P+
Subjt:  QAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEIPE

OMO70912.1 reverse transcriptase [Corchorus capsularis]1.1e-7125.86Show/hide
Query:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV
        VG+  FL   +    RDR++   PW F R L+LL      DQPE+IVFD +  WV+   +P G   + +   +G  +G V  +D ++     GR++R+RV
Subjt:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV

Query:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP----KNKKSEKPKKQFGSWLRAETYHMPRSPFTGGR------SYDSGRRGR
         L   +PL +G T+T     +     +Y++ PD C++CG L H +  CP    + ++     K++ + ++AET  +  +P  GGR      S  SG    
Subjt:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP----KNKKSEKPKKQFGSWLRAETYHMPRSPFTGGR------SYDSGRRGR

Query:  GRFRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKT
          F        E G     D   +A  P+ +     + +  +   +R  + +      +  R+  E   +  S  P K +E  + +++    ++    + 
Subjt:  GRFRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKT

Query:  AERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNK
         E +  G +S       +   +V D+     E +   H +N+ +          +G    +G +E+ A   +    G S     +   ++      +  +
Subjt:  AERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNK

Query:  FEAVDGQFLNIRPMDPNQSKTGELP--IAEQAVDPNV------------GRKIKKLWKRVNRTGGQSVTKNSTSQP---------------LTSSIGKKR
             G    +       +  G++   +A  A  P+              RK KKL +  ++       +N + QP                TS   + R
Subjt:  FEAVDGQFLNIRPMDPNQSKTGELP--IAEQAVDPNV------------GRKIKKLWKRVNRTGGQSVTKNSTSQP---------------LTSSIGKKR

Query:  DADEPYLEDNEGRKSNVPNQRLTRLYRRSLLNRPAESHENIQLECSRGGEPSNIPSTKKF-----VDCIGLSGGLALFWHSSINLNIVSYSKVHIDTIIE
        + +E      E   ++V  +   R          AE+    + +CS   E   I S   F     V CIG  GGLAL W +   ++++SYS  HID II 
Subjt:  DADEPYLEDNEGRKSNVPNQRLTRLYRRSLLNRPAESHENIQLECSRGGEPSNIPSTKKF-----VDCIGLSGGLALFWHSSINLNIVSYSKVHIDTIIE

Query:  NDGKTFRFTGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRK
        +D  ++ FTGFYG P+  +RH SW LLR L+      W+ AGDFN I   +EK GGS     + ++F S I+DC  ++L   G   +W   Q G+  + +
Subjt:  NDGKTFRFTGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRK

Query:  HLDRC-------------------------------------IDC-----------------ASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNK
         LDRC                                     ID                  AS + Q W      ++  N+  +I     +L+ W R  
Subjt:  HLDRC-------------------------------------IDC-----------------ASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNK

Query:  TGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIER
         G  +  +   KR  E L   K D+      ++ +  LD +  +EE+ W+QRS+  WLK GD+NT++FHS A+ R+Q+  I  I D  G  +     +E+
Subjt:  TGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIER

Query:  HFFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGE
            YY D+F S+    ++I+++L+    R+ + M + LD  FT +E++ A  QM   KAPGPDG+   F+Q  W+ VG+ V S  L  LNEG+
Subjt:  HFFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGE

XP_012847426.1 PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]1.2e-7626.76Show/hide
Query:  DNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDT-NEVDECCGRFLR
        D +GD  F+   +    R R +EE PWCFD+ LI+L      + P+ +  D   ++V    +P   RN  +A  +G  IG + ++ T N+     G  LR
Subjt:  DNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDT-NEVDECCGRFLR

Query:  VRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPK----NKKSEKPKKQFGSWLRAETYHMPRSPFTG-GRSYDSGRRGRGR
        +R  +  N+PL R   L       V+V ++YERLP+FCY CG++ HI G C K    + +       +G WL+A     P     G   S+ SG      
Subjt:  VRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPK----NKKSEKPKKQFGSWLRAETYHMPRSPFTG-GRSYDSGRRGRGR

Query:  FRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFP--KT
                Q  G   S + +N AS    T     +  L    + +    +NQ   + +                      E+   D    N  + P    
Subjt:  FRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFP--KT

Query:  AERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNK
          RD +   +   ++ SH    V    PN E    N    N T  ++G M   ++                     GP    +N   +I++         
Subjt:  AERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNK

Query:  FEAVDGQFLNIRP-MDPNQSKTGELPIAEQAV-DPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYR
                ++I+P     + +T ++ I++  V D N           V R  G S  K S  + L    GKK+ +    ++  E  +            +
Subjt:  FEAVDGQFLNIRP-MDPNQSKTGELPIAEQAV-DPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYR

Query:  RSLLNRPAESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII--ENDGKTFRFTGFYGEPKPENRHLSWTLLRRL
        + L+ R + S E   L                  +  G SGGLAL W   + +++ ++S  HID  I   N   T+RFTGFYG P    RH SW LLR+L
Subjt:  RSLLNRPAESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII--ENDGKTFRFTGFYGEPKPENRHLSWTLLRRL

Query:  SDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI------------------------
        S+  ++AW+ AGDFN++   +EK G      K+ Q F   + D  L DLGF G  FTW NN+      R+ LDR                          
Subjt:  SDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI------------------------

Query:  --------------------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKLDVRSS
                                        +C   I++ W       +  +  + ++     L RW R   G  + RI + K +I  L +  L   + 
Subjt:  --------------------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKLDVRSS

Query:  VCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEILQATT
          +    + LD +L +EE+ W+QR++  W++ GD+NTK+FH+KA+ RR+KN I  + +S G   +   DIE+    Y+SD+F S       ++E+L A  
Subjt:  VCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEILQATT

Query:  GRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEIP
         RV+D +N  L   +T  EV++AL  M P K+PGPDG P +F+Q++W+VVG  V+   L  LN  E+P
Subjt:  GRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEIP

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]7.3e-7927.34Show/hide
Query:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV
        VG N+F    QS    +RI+   PW FD  L+LL           I F++ + W+Q  + P  + +  VAR +G R+G+V  ++  +  +    F+RVRV
Subjt:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV

Query:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLG----HIDGSCPKNKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYR
         L   +P+ RG  + G    K  V  KYERLP FC+ CG+LG    H  G     KK E+ + Q+G +LRA           GGRS     +  G+    
Subjt:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLG----HIDGSCPKNKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYR

Query:  PMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAERDAE
                +D  E  +  A T  + G             N  + D N    E+ RR +N+    P ++                              AE
Subjt:  PMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAERDAE

Query:  GKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNKFEAVDG
           +  ++  SHA                N H   +T +  G+ ++ +       GQ+  E +  +  ++G S             + N L +      G
Subjt:  GKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNKFEAVDG

Query:  QFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTG-GQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYRRSLLNRP
        Q L+   + P  +    +             K K  W RVNR   G     N+   P    +GK+                                  P
Subjt:  QFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTG-GQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYRRSLLNRP

Query:  AESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHI-DTIIENDGKTFRFTGFYGEPKPENRHLSWTLLRRLSDFPHQAW
         E +E               P+ K+        GGLA  W + + L +++++  H+   + E DG  +  TGFYG P  + +  SW LL+ L  F    W
Subjt:  AESHENIQLECSRGGEPSNIPSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHI-DTIIENDGKTFRFTGFYGEPKPENRHLSWTLLRRLSDFPHQAW

Query:  VVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI--------------------------------
        VV GDFN+    +EK         + + F  A++ C L DLGF G  +TW N +PG+A  +  LDR +                                
Subjt:  VVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI--------------------------------

Query:  -----------------------DCASKIQQGWDKGAGD-NSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKEEK
                               +CA+ IQ+ W  G G+ +    +  +IK+   EL  WG + T      I E +++++ L E +L   S        K
Subjt:  -----------------------DCASKIQQGWDKGAGD-NSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKEEK

Query:  RLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMN
        ++D +L ++EIYW QRSR +WL+ GD+NTK+FH+KA+ RR+KN I+ I +S G   +N E++ +    Y+ +LF++       ++E L A   +VT+ M 
Subjt:  RLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMN

Query:  AHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGE-IPE
          L   FTA EV+ AL QM PTKAPGPDG+ ALFYQK+W++VGD V S  L  LN G  +PE
Subjt:  AHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGE-IPE

TrEMBL top hitse value%identityAlignment
A0A2N9F9E4 Reverse transcriptase domain-containing protein2.7e-8728.19Show/hide
Query:  SRFNDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGR
        S    +VG+NI L+        +R+    PW +D+ LI           E++ F+   +WVQ HN+PI    + VA  +G  IGEVLR  T+E +   GR
Subjt:  SRFNDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGR

Query:  FLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSC------PKNKKSEKPKK--QFGSWLRAETYHMPRSPFTGGRSYDSG
         +R+RV++  ++PL RG  +     G+  V  +YERLP+FCY CG+  H +  C       + +K + P+K  ++G WLRA    + R            
Subjt:  FLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSC------PKNKKSEKPKK--QFGSWLRAETYHMPRSPFTGGRSYDSG

Query:  RRGRGRFRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGL
             R +    GR   G+  +   ++  S P +T                 ++ E+++PP             P    P     TE L  D        
Subjt:  RRGRGRFRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGL

Query:  FPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNT
            +  D   KKS   E +     +  +  P + E                            LG    E   +   +IGP            +P+  T
Subjt:  FPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNT

Query:  LRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPY-LEDNEGRKSNVPNQRLTR
         ++ +      F+ +R   P    T +  ++ Q   P  G      WK+  R  GQ       ++P   ++ +KR ++  + LED + R   + N R   
Subjt:  LRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPY-LEDNEGRKSNVPNQRLTR

Query:  LYRRSLLNRPAESHENIQLECSRGGEPSNIP-----STKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRFTGFYGEPKPENRHLS
           +S++   A S   +    S       I      S+K  V      GGL LFW+   NL I SYS  HID II E     +R TG YG P+   R  +
Subjt:  LYRRSLLNRPAESHENIQLECSRGGEPSNIP-----STKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRFTGFYGEPKPENRHLS

Query:  WTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQ--PGQACIR-------------------KHL
        WTLLR LS      W   GDFN I K +E  G      ++ + F SA++DC L +L F G  +TW NN+  P    +R                   +H+
Subjt:  WTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQ--PGQACIR-------------------KHL

Query:  D------RCI--DCASK-------------------------IQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKL
        D      +CI  DC+ +                         I+Q W+          +  ++K  S++L  W R   G  K +I   K++I     + +
Subjt:  D------RCI--DCASK-------------------------IQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKL

Query:  DVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEI
          R    +    + L+ +L +EE  W+QRSR  WL  GD+NTK+FH KA+ RR++N I ++ D  G  Y + E+I      YY+ LF +S  N  +++E 
Subjt:  DVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEI

Query:  LQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI
        +      VT  MN +L R F   EVE+A+ QM P+KAPGPDG+P +FYQKYW+VVG  VT+  L CLN G +
Subjt:  LQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI

A0A2N9G3I8 Reverse transcriptase domain-containing protein2.8e-8427.54Show/hide
Query:  NVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVR
        +VGDN+ LV  +     +R++   PW +D+ LI         + E+++F+   +WVQ HN+PI    + VA  LG  IGEV+R    + D   GR +RVR
Subjt:  NVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVR

Query:  VKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP---KNKKSEKPK-KQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRY
        VK+   +PL RG  +     G+  V  KYERLP+FCY CG+  H +  C    K    EK K  ++G WLRA    +        R       GR R   
Subjt:  VKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP---KNKKSEKPK-KQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRY

Query:  RPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAERDA
        +PMG                                          E   PP  Q        H P +      MET            G   K  +  A
Subjt:  RPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAERDA

Query:  EGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNKFEAVD
        +   +F  + ++        +   +      P K       + +   + IGP +P+       HK                 + S  + +T         
Subjt:  EGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNKFEAVD

Query:  GQFLNIRPMDPNQSKTGEL--PIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYRRSLLN
                    +S  GE+   +A     PNVG      WK++ R  GQ      T      S+ +KR  DE    + E  +S    Q+  R+     L+
Subjt:  GQFLNIRPMDPNQSKTGEL--PIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYRRSLLN

Query:  RPAESHENIQLECSRGGEPSNI--------------------PSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRFTGFYGEPK
          A + + +     R  +PS I                     ++K  V      GGLAL+W     + I SYS  HID II E     +R TG YG P+
Subjt:  RPAESHENIQLECSRGGEPSNI--------------------PSTKKFVDCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRFTGFYGEPK

Query:  PENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQ--PGQACIR---------------
         + R  +W LLR L       W   GDFN I K  E  G  A   ++ + F SA++DC L DL + G  FTW NN+  P    +R               
Subjt:  PENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQ--PGQACIR---------------

Query:  ----KHLD------RCI--------------------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAES
            +H+D      +C+                                 C   I + W+          +  ++K   ++L  W R   G  K +I   
Subjt:  ----KHLD------RCI--------------------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAES

Query:  KRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFK
        ++RI+      +  R    +    K L+ +L +EE +W+QRSR  WL  GD+NTK+FH +A  RR++N + K+ D  G  +++ E+I      YY+ LF 
Subjt:  KRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFK

Query:  SSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI
        +S  N   + E        VT  MN +L R F A EVE+A+ QM P+KAPGPDG+P +FYQKYW+VVG  VT+  L CLN G +
Subjt:  SSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI

A0A2N9HB41 Uncharacterized protein6.6e-8627.97Show/hide
Query:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV
        + DN+FL         +R+I +SPW FD+ LI +       QP+ + F  + +W++ +N+PI    ++V   +G  IG ++ +D  E     GRFLR+RV
Subjt:  VGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRV

Query:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPKNKKS----EKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYR
        ++   +PL RG  L         V  +YE LP FCY CG +GH    C + ++S         +FGSWLRA       +P  GGR   S RR R   +  
Subjt:  KLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPKNKKS----EKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRFRYR

Query:  PMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRP----PESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAE
            ++W    S+  + E     + G       ++ E+     ++E   P    P +QR   +        M    + E   + ++    N  LF K A+
Subjt:  PMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRP----PESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAE

Query:  RDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNKFE
           +G+              V +  P +   EE        +++E  + Q  + PS    Q++                   MD++       T+ +   
Subjt:  RDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNKFE

Query:  AVDGQ---FLNIRPMDPNQSKTGELPIAEQ-------AVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQR
          D      L +     N SK+  +P+  +           +VG K    WK+  RT            P T    +KR  +E ++  +    +   + +
Subjt:  AVDGQ---FLNIRPMDPNQSKTGELPIAEQ-------AVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQR

Query:  LTRLYRRSLLNRP--AESHENIQLECSR--GGEPSNIPSTK--------KFVDCIG-----LSGGLALFWHSSINLNIVSYSKVHIDT-IIENDGKTFRF
           L  R L N+   AE H  ++LE  +      + +P  K            C G     L GGLALFW +S+++++ SYS  HID  +I++DG  +RF
Subjt:  LTRLYRRSLLNRP--AESHENIQLECSR--GGEPSNIPSTK--------KFVDCIG-----LSGGLALFWHSSINLNIVSYSKVHIDT-IIENDGKTFRF

Query:  TGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDR----
        TGFYG P+   RH SW+LLRRL    +  W++ GDFN I    EK G      ++   F  A+ DC+L DLGF G  FTW NN+     +R  LDR    
Subjt:  TGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDR----

Query:  -------------------------------CI---------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKS
                                       C+                      C   I + W           L  +IK    +L +W +++      
Subjt:  -------------------------------CI---------------------DCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKS

Query:  RIAESKRRIEALTEDKLDVRSSVCLKKEE-----KRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERH
         I   KR+++     +L+ +S+ C K  E     + L  ++ +EEI+W+QRSR  WLK GD N+++FH  A+ RR+ N +  + DSNG    +   +E  
Subjt:  RIAESKRRIEALTEDKLDVRSSVCLKKEE-----KRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERH

Query:  FFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI
           Y+ +LF SS  +   I+E+ Q    RV+  MNA L   F++ EV  AL Q+ P+KAPGPDG+ ALF+Q+YWN+VG  VT   L CLN G +
Subjt:  FFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI

A0A2N9J4C9 Uncharacterized protein1.2e-9028.05Show/hide
Query:  NVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVR
        ++ DN  +   + A  R+R++   PW +D+ L++L      +  +E++F  T++WVQ H +P+   N + A ILG  +G ++ +   E +   G  +R+R
Subjt:  NVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVR

Query:  VKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP---KNKKS-EKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRF-R
        V L   +PLCRG  +      +  +  +YERLP+FCY CG + H D  CP   +NK S +   +QFG WLRA T      P+   R  +    G  R  +
Subjt:  VKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP---KNKKS-EKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRF-R

Query:  YRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETE---------SLKIDTKSFNAG
         +P   Q       +        P +   +   N    +DQ  P   +N  PP SQ+   ++T  Q  +  PA +   +         +++++    N G
Subjt:  YRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETE---------SLKIDTKSFNAG

Query:  LFPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPN
          P  +E    G+  F  E++        D  P     +EN  +R++T  +   +  Q                                    S P P 
Subjt:  LFPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPN

Query:  TLRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTR
         + +   AV     N+ P   +                      KK WK++ R       K +   P+   I  K  +   YLEDNE      P      
Subjt:  TLRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTR

Query:  LYRRSLLNRPAESHENIQLECSRGGEPS-------------------NIPSTKKFV-DCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRF
         +    L  P    E  +L   R  +PS                    +    KFV       GGL LFW   +NL + S+S  HID+II E+   T+R 
Subjt:  LYRRSLLNRPAESHENIQLECSRGGEPS-------------------NIPSTKKFV-DCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRF

Query:  TGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRK--------
        TGFYG P+  NR  SW LLRRLS      W   GDFN + +  EK G      ++ Q F   ++DC   DLGF+G  FTW NN+ G     +        
Subjt:  TGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRK--------

Query:  ------------HLD-------------------------------RCIDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAES
                    HLD                                 + CA  I+  W K        ++  +I+     LK+W +   G  K +I E+
Subjt:  ------------HLD-------------------------------RCIDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAES

Query:  KRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFK
        + +++A     +  R  V +   + +L  +L ++E  W+QRSR +WL+ GDQNT++FHSKA HRR++N + ++ D++G     ++ +   F +YY+ LF 
Subjt:  KRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFK

Query:  SSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI
        ++  N   I+++       VTD MN+ L R FT+ EV  AL QM P KAPGPDGLP +FYQKYW+++G  VT   L CLN G+I
Subjt:  SSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI

A0A2N9J809 Uncharacterized protein1.2e-9028.05Show/hide
Query:  NVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVR
        ++ DN  +   + A  R+R++   PW +D+ L++L      +  +E++F  T++WVQ H +P+   N + A ILG  +G ++ +   E +   G  +R+R
Subjt:  NVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVR

Query:  VKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP---KNKKS-EKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRF-R
        V L   +PLCRG  +      +  +  +YERLP+FCY CG + H D  CP   +NK S +   +QFG WLRA T      P+   R  +    G  R  +
Subjt:  VKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCP---KNKKS-EKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGRF-R

Query:  YRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETE---------SLKIDTKSFNAG
         +P   Q       +        P +   +   N    +DQ  P   +N  PP SQ+   ++T  Q  +  PA +   +         +++++    N G
Subjt:  YRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETE---------SLKIDTKSFNAG

Query:  LFPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPN
          P  +E    G+  F  E++        D  P     +EN  +R++T  +   +  Q                                    S P P 
Subjt:  LFPKTAERDAEGKKSFFKEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPN

Query:  TLRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTR
         + +   AV     N+ P   +                      KK WK++ R       K +   P+   I  K  +   YLEDNE      P      
Subjt:  TLRNKFEAVDGQFLNIRPMDPNQSKTGELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTR

Query:  LYRRSLLNRPAESHENIQLECSRGGEPS-------------------NIPSTKKFV-DCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRF
         +    L  P    E  +L   R  +PS                    +    KFV       GGL LFW   +NL + S+S  HID+II E+   T+R 
Subjt:  LYRRSLLNRPAESHENIQLECSRGGEPS-------------------NIPSTKKFV-DCIGLSGGLALFWHSSINLNIVSYSKVHIDTII-ENDGKTFRF

Query:  TGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRK--------
        TGFYG P+  NR  SW LLRRLS      W   GDFN + +  EK G      ++ Q F   ++DC   DLGF+G  FTW NN+ G     +        
Subjt:  TGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRK--------

Query:  ------------HLD-------------------------------RCIDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAES
                    HLD                                 + CA  I+  W K        ++  +I+     LK+W +   G  K +I E+
Subjt:  ------------HLD-------------------------------RCIDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAES

Query:  KRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFK
        + +++A     +  R  V +   + +L  +L ++E  W+QRSR +WL+ GDQNT++FHSKA HRR++N + ++ D++G     ++ +   F +YY+ LF 
Subjt:  KRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFK

Query:  SSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI
        ++  N   I+++       VTD MN+ L R FT+ EV  AL QM P KAPGPDGLP +FYQKYW+++G  VT   L CLN G+I
Subjt:  SSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.1e-0423.6Show/hide
Query:  ESKRRIEALTEDKLDV-RSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKAN-----------HRRQKNKIQKIVDSNGNCYKNRED
        + + +I+ LT    ++ +      K  +R +   +  E+  K+   +  L+  +++  WF  + N            +R+KN+I  I +  G+   +  +
Subjt:  ESKRRIEALTEDKLDV-RSSVCLKKEEKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKAN-----------HRRQKNKIQKIVDSNGNCYKNRED

Query:  IERHFFHYYSDLFKSSPLNAQAIDEILQA-TTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKY
        I+     YY  L+ +   N + +D  L   T  R+       L+R  T SE+   +  +   K+PGPDG  A FYQ+Y
Subjt:  IERHFFHYYSDLFKSSPLNAQAIDEILQA-TTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKY

P14381 Transposon TX1 uncharacterized 149 kDa protein6.5e-0622.77Show/hide
Query:  KSRIAESKRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQ----------RSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKN
        KS   +    IEAL  + LD+   +   +++      L  +E               RSR   L   D+ +++F++    +  + +I  +   +G   ++
Subjt:  KSRIAESKRRIEALTEDKLDVRSSVCLKKEEKRLDHILLEEEIYWKQ----------RSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKN

Query:  REDIERHFFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGE
         E I      +Y +LF   P++  A +E+       V++     L+   T  E+ +AL  M   K+PG DGL   F+Q +W+ +G        +   +GE
Subjt:  REDIERHFFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGE

Query:  IP
        +P
Subjt:  IP

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.8e-1223.22Show/hide
Query:  QAWVVAGDFNSI--TKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI-------DCASKIQQGWDKGAGDNSP-
        Q  ++ GDF+ I  T ++     ++   +  + F + + D  L D+   G  +TW N+Q     IRK LDR I          S I      G  D+SP 
Subjt:  QAWVVAGDFNSI--TKENEKDGGSAYDSKESQNFVSAINDCALKDLGFSGNRFTWLNNQPGQACIRKHLDRCI-------DCASKIQQGWDKGAGDNSP-

Query:  ----TNLLTRIK----------------------------------SVSEELKR-------WGRNKTGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKE
             NL  R K                                  S+ E LK          R   G  + +  E+   +E++    L   S    + E
Subjt:  ----TNLLTRIK----------------------------------SVSEELKR-------WGRNKTGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKE

Query:  ---EKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSP--LNAQAIDEILQATTG
            K+ +      E +++Q+SR  WL+ GD NT++FH      + KN I+ +   +    +N   ++     YY+ L  S    L   ++  I      
Subjt:  ---EKRLDHILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSP--LNAQAIDEILQATTG

Query:  RVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI
        R  D + + L  + +  E+  A+  M   KAPGPD   A F+ + W VV D   +   +    G +
Subjt:  RVTDAMNAHLDRIFTASEVEEALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEI

AT2G01050.1 zinc ion binding;nucleic acid binding1.2e-1029.46Show/hide
Query:  PW-CFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKV
        PW      L++    S FD   + +  TT  WV+  N+P    +  +   +   +G  L++D N ++   GRF RV +++   +PL +G  L  G     
Subjt:  PW-CFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKV

Query:  LVGIKYERLPDFCYICGMLGHIDGSCPKN
           + YE L   C  CG+ GH+  SCP+N
Subjt:  LVGIKYERLPDFCYICGMLGHIDGSCPKN

AT3G31430.1 unknown protein1.3e-1229.75Show/hide
Query:  DRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPL--CRGLTL
        + ++   PW F+  +ILL       +P+  +F    +WVQ   +P    N  V   +G  +G+VL  D N        F RV +      PL   R    
Subjt:  DRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPL--CRGLTL

Query:  TGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSC-PKNKKSEKPKKQFGSWLRAETYH
        T G     L+  +YERL  FC +CGML H  G+C  +N   E+           +TYH
Subjt:  TGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSC-PKNKKSEKPKKQFGSWLRAETYH

AT5G36228.1 nucleic acid binding;zinc ion binding1.3e-1227.69Show/hide
Query:  IEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPT
        +  +PW F+   I L     F  P E        WV    +P+   +E+   I+   +GEV+ +D NE       F+RV+V++ F EPL     +     
Subjt:  IEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLRIDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPT

Query:  GKVLVGIKYERLPDFCYICGMLGHIDGSCP
         + ++G +YE+L   C  C  + H    CP
Subjt:  GKVLVGIKYERLPDFCYICGMLGHIDGSCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCCCGTTTTCTCTTTGGGAGATGCGTTCCGAAGAGTAATGTGCAAGGCCTGGAATTTTTGTCGAGATTTAACGATAACGTTGGTGATAATATCTTCCTTGTTAA
TCCGCAATCCGCTCAAGGGCGTGACAGAATCATCGAGGAAAGTCCTTGGTGTTTTGATAGATGCTTAATCCTCCTTGCAACACCATCAACATTTGACCAACCAGAAGAGA
TTGTGTTCGATACGACAACATATTGGGTGCAATTCCACAATGTTCCTATTGGTCTGAGAAACGAAAAGGTGGCCCGAATCTTAGGTGGTCGGATAGGCGAGGTCCTAAGA
ATTGATACCAACGAAGTTGACGAATGTTGCGGTCGTTTCCTCCGTGTGAGAGTCAAATTACAATTCAATGAACCACTTTGCCGTGGATTAACTTTAACCGGCGGCCCAAC
AGGTAAAGTTCTAGTTGGAATAAAATATGAACGCTTACCCGACTTCTGCTATATTTGTGGAATGCTAGGGCACATCGATGGATCCTGTCCGAAGAACAAAAAATCGGAGA
AACCGAAAAAACAGTTTGGGTCTTGGCTTCGAGCAGAAACATACCATATGCCAAGAAGCCCATTCACCGGCGGCCGTTCTTACGATTCAGGACGAAGAGGACGTGGACGA
TTCAGGTATCGTCCAATGGGGAGGCAAGAATGGGGATCTGATGAATCTGAGGATGAAGAAAATGAAGCATCCACACCGGAAAGAACAGGTCGTTCCGATGCCCAGAACAT
GCTCGCTGAAGAAGACCAAAACCGGCCGGAAAAAGATGAAAACCAGAGGCCACCGGAATCCCAGCGACGGAAATGGAATGAAACCGACCATCAACCACAATCAATGTCGC
CTGCGAAATCTATGGAAACTGAATCACTCAAAATCGATACAAAATCATTTAATGCAGGTTTATTTCCAAAAACGGCCGAACGTGATGCTGAAGGGAAGAAATCGTTCTTC
AAGGAAGCCAAATCCCACGCTCCATTAATGGTTGAAGATTTATTCCCAAATATAGAGGAGAGAGAAGAGAACCCACACAAGAGGAATCTAACGGATCAAATTGAAGGGAG
GATGCATCAACAAGAAATTGGGCCTAGCAACCCTTTGGGCCAAATGGAAGATGAAGCCCACAAACAGAACACTTATTACATTGGGCCCTCCAACGATTTTTACAACATGG
ATGTGGAAATTAGCAACCCCTCGCCTAATACATTACGAAATAAATTTGAGGCAGTGGATGGGCAATTTTTAAATATCCGTCCAATGGATCCCAACCAAAGCAAGACTGGA
GAATTACCCATTGCTGAACAAGCAGTTGATCCAAACGTGGGGAGGAAAATTAAAAAGTTGTGGAAGAGAGTGAACAGAACGGGAGGTCAAAGTGTTACTAAAAATTCCAC
CTCACAACCTCTCACGAGCTCTATTGGAAAGAAAAGAGATGCAGATGAACCATATTTGGAAGACAATGAAGGAAGAAAATCAAATGTACCAAACCAGAGATTGACACGAT
TATATCGGCGGAGCCTGTTAAACAGGCCCGCCGAGAGCCATGAAAATATTCAGTTGGAATGCTCGAGGGGCGGGGAACCCTCGAACATTCCGAGCACTAAAAAATTTGTT
GATTGTATTGGACTTAGTGGTGGTTTGGCTTTATTTTGGCATTCTTCTATTAATCTTAATATTGTTTCTTATTCTAAAGTGCACATTGATACTATTATTGAGAATGATGG
AAAAACGTTCAGGTTTACAGGTTTCTATGGCGAACCTAAGCCAGAAAATCGTCACCTATCTTGGACATTGCTTAGACGTCTCTCTGATTTCCCCCACCAGGCTTGGGTGG
TAGCGGGAGATTTTAACTCTATTACTAAAGAGAATGAAAAAGATGGCGGAAGTGCATATGATAGTAAAGAGAGTCAAAATTTCGTAAGCGCCATAAATGATTGTGCGTTG
AAAGATCTTGGTTTTTCAGGCAACCGTTTCACCTGGCTAAACAATCAACCTGGACAAGCTTGTATCAGGAAGCATTTGGACAGATGCATTGATTGCGCCTCTAAAATTCA
ACAGGGGTGGGATAAAGGAGCAGGGGACAATTCTCCTACGAATTTACTAACCCGTATCAAAAGCGTGTCTGAGGAACTCAAGAGGTGGGGCAGAAACAAAACAGGAAAAT
TTAAGTCTCGAATTGCAGAATCTAAAAGGAGAATAGAGGCGTTGACGGAAGACAAGTTGGATGTGAGGTCCTCGGTATGCCTAAAGAAGGAAGAAAAACGCCTAGATCAT
ATCCTGCTGGAGGAGGAAATATATTGGAAGCAACGCTCTAGGGAGGATTGGCTGAAGTGGGGGGACCAAAACACAAAATGGTTCCACTCTAAAGCTAATCACCGTCGGCA
GAAAAATAAAATTCAAAAGATAGTTGACTCCAATGGAAATTGCTACAAAAACAGGGAAGATATCGAACGTCATTTCTTTCATTACTATTCTGATCTGTTTAAATCTTCTC
CTTTAAATGCGCAGGCAATAGATGAAATTCTCCAAGCTACCACAGGTAGGGTAACAGATGCCATGAATGCACACTTAGATCGGATATTTACAGCGTCAGAAGTGGAGGAA
GCCCTTATGCAGATGCACCCAACAAAAGCTCCCGGTCCAGATGGGCTTCCAGCCCTATTTTATCAAAAATACTGGAATGTAGTGGGAGATCAGGTCACTAGCACATGTCT
TAAATGCCTCAATGAGGGTGAAATTCCAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCTCCCGTTTTCTCTTTGGGAGATGCGTTCCGAAGAGTAATGTGCAAGGCCTGGAATTTTTGTCGAGATTTAACGATAACGTTGGTGATAATATCTTCCTTGTTAA
TCCGCAATCCGCTCAAGGGCGTGACAGAATCATCGAGGAAAGTCCTTGGTGTTTTGATAGATGCTTAATCCTCCTTGCAACACCATCAACATTTGACCAACCAGAAGAGA
TTGTGTTCGATACGACAACATATTGGGTGCAATTCCACAATGTTCCTATTGGTCTGAGAAACGAAAAGGTGGCCCGAATCTTAGGTGGTCGGATAGGCGAGGTCCTAAGA
ATTGATACCAACGAAGTTGACGAATGTTGCGGTCGTTTCCTCCGTGTGAGAGTCAAATTACAATTCAATGAACCACTTTGCCGTGGATTAACTTTAACCGGCGGCCCAAC
AGGTAAAGTTCTAGTTGGAATAAAATATGAACGCTTACCCGACTTCTGCTATATTTGTGGAATGCTAGGGCACATCGATGGATCCTGTCCGAAGAACAAAAAATCGGAGA
AACCGAAAAAACAGTTTGGGTCTTGGCTTCGAGCAGAAACATACCATATGCCAAGAAGCCCATTCACCGGCGGCCGTTCTTACGATTCAGGACGAAGAGGACGTGGACGA
TTCAGGTATCGTCCAATGGGGAGGCAAGAATGGGGATCTGATGAATCTGAGGATGAAGAAAATGAAGCATCCACACCGGAAAGAACAGGTCGTTCCGATGCCCAGAACAT
GCTCGCTGAAGAAGACCAAAACCGGCCGGAAAAAGATGAAAACCAGAGGCCACCGGAATCCCAGCGACGGAAATGGAATGAAACCGACCATCAACCACAATCAATGTCGC
CTGCGAAATCTATGGAAACTGAATCACTCAAAATCGATACAAAATCATTTAATGCAGGTTTATTTCCAAAAACGGCCGAACGTGATGCTGAAGGGAAGAAATCGTTCTTC
AAGGAAGCCAAATCCCACGCTCCATTAATGGTTGAAGATTTATTCCCAAATATAGAGGAGAGAGAAGAGAACCCACACAAGAGGAATCTAACGGATCAAATTGAAGGGAG
GATGCATCAACAAGAAATTGGGCCTAGCAACCCTTTGGGCCAAATGGAAGATGAAGCCCACAAACAGAACACTTATTACATTGGGCCCTCCAACGATTTTTACAACATGG
ATGTGGAAATTAGCAACCCCTCGCCTAATACATTACGAAATAAATTTGAGGCAGTGGATGGGCAATTTTTAAATATCCGTCCAATGGATCCCAACCAAAGCAAGACTGGA
GAATTACCCATTGCTGAACAAGCAGTTGATCCAAACGTGGGGAGGAAAATTAAAAAGTTGTGGAAGAGAGTGAACAGAACGGGAGGTCAAAGTGTTACTAAAAATTCCAC
CTCACAACCTCTCACGAGCTCTATTGGAAAGAAAAGAGATGCAGATGAACCATATTTGGAAGACAATGAAGGAAGAAAATCAAATGTACCAAACCAGAGATTGACACGAT
TATATCGGCGGAGCCTGTTAAACAGGCCCGCCGAGAGCCATGAAAATATTCAGTTGGAATGCTCGAGGGGCGGGGAACCCTCGAACATTCCGAGCACTAAAAAATTTGTT
GATTGTATTGGACTTAGTGGTGGTTTGGCTTTATTTTGGCATTCTTCTATTAATCTTAATATTGTTTCTTATTCTAAAGTGCACATTGATACTATTATTGAGAATGATGG
AAAAACGTTCAGGTTTACAGGTTTCTATGGCGAACCTAAGCCAGAAAATCGTCACCTATCTTGGACATTGCTTAGACGTCTCTCTGATTTCCCCCACCAGGCTTGGGTGG
TAGCGGGAGATTTTAACTCTATTACTAAAGAGAATGAAAAAGATGGCGGAAGTGCATATGATAGTAAAGAGAGTCAAAATTTCGTAAGCGCCATAAATGATTGTGCGTTG
AAAGATCTTGGTTTTTCAGGCAACCGTTTCACCTGGCTAAACAATCAACCTGGACAAGCTTGTATCAGGAAGCATTTGGACAGATGCATTGATTGCGCCTCTAAAATTCA
ACAGGGGTGGGATAAAGGAGCAGGGGACAATTCTCCTACGAATTTACTAACCCGTATCAAAAGCGTGTCTGAGGAACTCAAGAGGTGGGGCAGAAACAAAACAGGAAAAT
TTAAGTCTCGAATTGCAGAATCTAAAAGGAGAATAGAGGCGTTGACGGAAGACAAGTTGGATGTGAGGTCCTCGGTATGCCTAAAGAAGGAAGAAAAACGCCTAGATCAT
ATCCTGCTGGAGGAGGAAATATATTGGAAGCAACGCTCTAGGGAGGATTGGCTGAAGTGGGGGGACCAAAACACAAAATGGTTCCACTCTAAAGCTAATCACCGTCGGCA
GAAAAATAAAATTCAAAAGATAGTTGACTCCAATGGAAATTGCTACAAAAACAGGGAAGATATCGAACGTCATTTCTTTCATTACTATTCTGATCTGTTTAAATCTTCTC
CTTTAAATGCGCAGGCAATAGATGAAATTCTCCAAGCTACCACAGGTAGGGTAACAGATGCCATGAATGCACACTTAGATCGGATATTTACAGCGTCAGAAGTGGAGGAA
GCCCTTATGCAGATGCACCCAACAAAAGCTCCCGGTCCAGATGGGCTTCCAGCCCTATTTTATCAAAAATACTGGAATGTAGTGGGAGATCAGGTCACTAGCACATGTCT
TAAATGCCTCAATGAGGGTGAAATTCCAGAATAG
Protein sequenceShow/hide protein sequence
MLSRFLFGRCVPKSNVQGLEFLSRFNDNVGDNIFLVNPQSAQGRDRIIEESPWCFDRCLILLATPSTFDQPEEIVFDTTTYWVQFHNVPIGLRNEKVARILGGRIGEVLR
IDTNEVDECCGRFLRVRVKLQFNEPLCRGLTLTGGPTGKVLVGIKYERLPDFCYICGMLGHIDGSCPKNKKSEKPKKQFGSWLRAETYHMPRSPFTGGRSYDSGRRGRGR
FRYRPMGRQEWGSDESEDEENEASTPERTGRSDAQNMLAEEDQNRPEKDENQRPPESQRRKWNETDHQPQSMSPAKSMETESLKIDTKSFNAGLFPKTAERDAEGKKSFF
KEAKSHAPLMVEDLFPNIEEREENPHKRNLTDQIEGRMHQQEIGPSNPLGQMEDEAHKQNTYYIGPSNDFYNMDVEISNPSPNTLRNKFEAVDGQFLNIRPMDPNQSKTG
ELPIAEQAVDPNVGRKIKKLWKRVNRTGGQSVTKNSTSQPLTSSIGKKRDADEPYLEDNEGRKSNVPNQRLTRLYRRSLLNRPAESHENIQLECSRGGEPSNIPSTKKFV
DCIGLSGGLALFWHSSINLNIVSYSKVHIDTIIENDGKTFRFTGFYGEPKPENRHLSWTLLRRLSDFPHQAWVVAGDFNSITKENEKDGGSAYDSKESQNFVSAINDCAL
KDLGFSGNRFTWLNNQPGQACIRKHLDRCIDCASKIQQGWDKGAGDNSPTNLLTRIKSVSEELKRWGRNKTGKFKSRIAESKRRIEALTEDKLDVRSSVCLKKEEKRLDH
ILLEEEIYWKQRSREDWLKWGDQNTKWFHSKANHRRQKNKIQKIVDSNGNCYKNREDIERHFFHYYSDLFKSSPLNAQAIDEILQATTGRVTDAMNAHLDRIFTASEVEE
ALMQMHPTKAPGPDGLPALFYQKYWNVVGDQVTSTCLKCLNEGEIPE