; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001372 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001372
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:30335161..30340570
RNA-Seq ExpressionLag0001372
SyntenyLag0001372
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5475845.1 hypothetical protein F2P56_007609 [Juglans regia]1.2e-18730.73Show/hide
Query:  ILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFDRALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWV
        I+  K IN E F+  M K+W  EG V+   VG N FL +FN   +++R+ KG PW+FDR L+ +   +G   ++  +F   +FW+ +H++P+     +  
Subjt:  ILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFDRALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWV

Query:  TALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDPVQPEEDQPF----------GEWMRASPIMGGSNRNNISDSRK
          +G ++G+ +    D+    W   +R++V++ ITK L RG+ +    +  +TW +   + E    F          G+  ++SP     N+  +     
Subjt:  TALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDPVQPEEDQPF----------GEWMRASPIMGGSNRNNISDSRK

Query:  GSTWGRGRGSR-GEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVRPTENLVSQGRE---------SQTASMSGEKGEESAEKSEH
           W R   ++ G+  + R+G  S Q+ +G                 +W  +  +G+EV   + +  +G +          +         E SA+  E 
Subjt:  GSTWGRGRGSR-GEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVRPTENLVSQGRE---------SQTASMSGEKGEESAEKSEH

Query:  TPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNR---KNSPSGPNGPEKEMAKALVMSNMQNESEEVEKKLEQTFESGK
              T  Q PK N T + +E   G                 + T K+   P R   + SP  P GP K  +    + N +  S+   + +  T  +  
Subjt:  TPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNR---KNSPSGPNGPEKEMAKALVMSNMQNESEEVEKKLEQTFESGK

Query:  GIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREPKKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGH
         IK + W+   R        E +         +      +   E + K+ +    + ++K     F+N F +  +G SGGLA  W N  +  + S+S+ H
Subjt:  GIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREPKKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGH

Query:  IDATIKDSD--QNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTW---R
        I   +K +   ++W  TGFYG P  + R+ SW+LL+ ++    + W+  GDFNEI+F  +K GG +R    +E+FR+ +    L D GF GNKFTW   R
Subjt:  IDATIKDSD--QNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTW---R

Query:  KRKSLQHGIRER--------------------------------KSAEITQLEEEDK----------RNPSHNILRA-----------------------
           S    + +R                                   ++T   + DK              HNIL                         
Subjt:  KRKSLQHGIRER--------------------------------KSAEITQLEEEDK----------RNPSHNILRA-----------------------

Query:  --------------EKDLEALLEEEESGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSM
                      ++DL+  + +     I+  G   E+  ++ K        +    +P  T+ D  LE +   +S + +  L +P+S  +IE AL  M
Subjt:  --------------EKDLEALLEEEESGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSM

Query:  NSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAF
        N   +PG DG  A F+Q +W  VG +     L +LN   S+ +IN T I LIPK K P ++S FRPISLCNV+YK+++KA+ NR+K IL  IIS +Q+AF
Subjt:  NSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAF

Query:  VPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPIS
        +P+RLI+DNVI+ FE +H +K K KGKEG +A+KLDMSKAYDR+EW F+R ++ KMGF   WI  +M C+++V Y+                   GDP+S
Subjt:  VPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPIS

Query:  PYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILG
        PY F++CAE                                 DD+LIFC+A   +   +  +L+ YE+ASGQ +N  K+S + SKN+ +     + +I G
Subjt:  PYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILG

Query:  IEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWIS
        ++       YLG+PS  G+ K   F+ + ++V   L ++  K  S  G+E  IKA+VQ +PTYTMS FKLP S    INS+   +WWG+ Q +NK HW+S
Subjt:  IEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWIS

Query:  WRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWL
        W K+  +KD GGMGFRDL+ FNKAMLAK  WR+++ P+SL ++VL+ +YFK ++F +A LG++PS  W+S +  R +  +G  WRVG G +I I Q  WL
Subjt:  WRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWL

Query:  NSPGSEIPILTKNDLKDKRVK--DIIDSQGR-WKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSF
        ++    + +L+   + D+R K  D+IDS  + WK   + + F   + E I+  PI     KD++ W    KG F+V+SAYH+    + + +  S+    F
Subjt:  NSPGSEIPILTKNDLKDKRVK--DIIDSQGR-WKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSF

Query:  DNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHV
           W++ W     P  K+ +WR   D+LPT+  + +K IV N LC  C  + E   HV
Subjt:  DNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHV

XP_018816058.1 uncharacterized protein LOC108987582 [Juglans regia]2.0e-18728.39Show/hide
Query:  MINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFDRALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGN
        M+N + FK+ M KIW  EG ++ K VG N  L +F    +KE++  G PW+FDRAL+ +++  G   +   KF+   FWV  H++P+ C  ++    L +
Subjt:  MINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFDRALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGN

Query:  SLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDPVQPEEDQPF----GEWMRASPIMGGSNRNNISDSRKGSTWGRGRGSR
         +   +  + D+    W + LR +V+++ITKPL RG ++ +     + W   P + E    F    G  + +    G +  N +  + +   W R    +
Subjt:  SLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDPVQPEEDQPF----GEWMRASPIMGGSNRNNISDSRKGSTWGRGRGSR

Query:  G-EFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEA-----------WSPENSMGEEVRPTENLVSQGRESQTASMSG--EKGEESAEKSE--------
        G +   +R+G     +         + A ++  T EA           W  E S  EE +     V Q     T+   G  E  + S  K E        
Subjt:  G-EFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEA-----------WSPENSMGEEVRPTENLVSQGRESQTASMSG--EKGEESAEKSE--------

Query:  --HTPKNQNTSNQPPKDNCTSIKMEAKK-------GKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSP-----SGPNGPEKEMAKALVMSNMQNESEEV
          +   NQ T + P  D  T + + +             + FNQ  +  +  KV  +       R  SP     S P G      K  V   + + S   
Subjt:  --HTPKNQNTSNQPPKDNCTSIKMEAKK-------GKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSP-----SGPNGPEKEMAKALVMSNMQNESEEV

Query:  EKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREPK-------KQRKPYFEAILKKTELGFENCFTVGSQGRSGGL
          K  +  ++    K  K    A      T    + +       R    L++L +++ P        K  K   E + K   L F++C  V S+G SGGL
Subjt:  EKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREPK-------KQRKPYFEAILKKTELGFENCFTVGSQGRSGGL

Query:  ALYWNNSTDLLIRSFSKGHIDATIKDS-DQN-WRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINR
        AL W  + D+ I ++S+ HI A +K+  D+N W  TGFYG P+ + R +SW LL+ +    ++ W+  GDFNEI    EK GG  R  + ME FRE I  
Subjt:  ALYWNNSTDLLIRSFSKGHIDATIKDS-DQN-WRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINR

Query:  SNLFDPGFSGNKFTWRK--------RKSLQHGIRE-----------------------------------------------------------------
         +L      G+KFTW          ++ L  G+                                                                   
Subjt:  SNLFDPGFSGNKFTWRK--------RKSLQHGIRE-----------------------------------------------------------------

Query:  --------------------------------------RKSAEITQLEEEDKRNPSHNILRAEKDLEALLEEEE--------------------------
                                               K   + ++++ ++ N    I + +K+LE  L E++                          
Subjt:  --------------------------------------RKSAEITQLEEEDKRNPSHNILRAEKDLEALLEEEE--------------------------

Query:  ---------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQ
                 S +++     + ++ EIG   T +F DLF++S+P   +I+  LE V   +S   +  L + F++ +++ A+  M    +PG DG  A+F+Q
Subjt:  ---------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQ

Query:  HYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECI
         +WDI+G+E     L+ILN G S++ +N T I LIPKVK PK +  +RPISLCNV+YK+VAK + NRMK +L DIISLNQ+AFVP R IS+N+++ +E +
Subjt:  HYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECI

Query:  HALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE--------
        H + ++ K K G +A+KLD+SKAYDRVEW F+  +M ++GF + WI+ +++CI S  YS                   GDP+SPY+F++C E        
Subjt:  HALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE--------

Query:  -------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQT
                                 DD+LIFC++   + + + ++L  YE ASGQ +N  KSS   SKN+P E    +  I G+  +     YLG+PS  
Subjt:  -------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQT

Query:  GKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRD
        G+ K + F  + ++ W  + NWK    S  GKE L+K+++QAIPTY M  F+LP S+   ++ +C KFWWG  +G++K  W+ W  +   K +GG+GFR 
Subjt:  GKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRD

Query:  LKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKD
           FN A+LAK  W IL  PNSL +Q+L+ +YF  +N L A +GS PSL W+S++ G EL K+G  WR+GNGH + I +  W+  P   +   T  D + 
Subjt:  LKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKD

Query:  KRVKDI-IDSQGRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKIC
        +RV  + I  + +W    ++  FS  ++E+I + PI  G R+D++ W   + G+F+V+SAYHL   +     A  ++V+   ++WK  W+++     ++ 
Subjt:  KRVKDI-IDSQGRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKIC

Query:  VWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWE-------------------------SDYWEWTSDKLDEGDLEKAVLILWSLWDFRNQA
        +WR  ++++PT  N+ ++ +    LC  C+ + E   HV W                           + W+  S+KL + +L+ A  I   LW  RN+ 
Subjt:  VWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWE-------------------------SDYWEWTSDKLDEGDLEKAVLILWSLWDFRNQA

Query:  NQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHD--AWIPPLFQAWKLNVDASWN
                   H   +R     L K   ++ ++ V + +  P  + N    D  +W  P    +K+N DA+ N
Subjt:  NQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHD--AWIPPLFQAWKLNVDASWN

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]2.4e-18534.26Show/hide
Query:  KTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKD-SDQNWRFTGFYGEPDPSNRKVSWQLLERLSD-SPKIPWIVGGDFNEIMFAKE
        K  L F +       G SGGL L W +   L + +F   + D+ + D SD  + FT FYG P   NR  SW LL RL D +P  PW++ GDFNEI+    
Subjt:  KTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKD-SDQNWRFTGFYGEPDPSNRKVSWQLLERLSD-SPKIPWIVGGDFNEIMFAKE

Query:  KEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRKRKSLQHGIRER--------------------------------KSAEITQLEEEDKRNPS--
        K GG++R E  M  FR A++  +L DP F G+ FTW K ++    ++ER                                 +   TQ   +  R  S  
Subjt:  KEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRKRKSLQHGIRER--------------------------------KSAEITQLEEEDKRNPS--

Query:  ----------------------HNILRA----EKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTA
                              H+   A    E  L+ LLE+EE                                   + +  ++G     + ++    
Subjt:  ----------------------HNILRA----EKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTA

Query:  TRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRT
          YF  LF TSH   T++D+ L  ++++++ + +  L +PFSK D++ AL SM   K+PG DGM A+F+Q  WDIVG   +   L +LN G +   +N T
Subjt:  TRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRT

Query:  LIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWI
        +I LIPK+K  K M  FRPISLCNVV K++ K LV R K +L  +IS  Q+AF+P RLI+DN+++ FE IH +K K  G++G+ AMKLDMSKA+DRVEW 
Subjt:  LIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWI

Query:  FIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLI
        FI+ +MD+MGF  EW + IM C+ +  ++                   G P+SPY+FLIC E                                 DD+L+
Subjt:  FIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLI

Query:  FCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVG
        FC A E     IK+ L  Y +ASGQ +N  KS    S N+     ++   +L +  +E    YLG+P+ T ++K+ +F ++KE+VW+ ++ W  K FS+G
Subjt:  FCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVG

Query:  GKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRG
        GKE L+KA+VQ+IPTY MSCFKLPI+L + + +M + FWWG  +   K HW SW+ +C +K  GGMGFR    +N+A+LAK +WR+L  P+SLLS++L+ 
Subjt:  GKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRG

Query:  RYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKDIIDSQGRWKSSKIKENFSALDVEII
        RYF  N+FL AP G +PSLTWQ I+ GREL + G RW+VG G  I    DPW+      +PI       +  V ++I    +W    + E FS++DV+ I
Subjt:  RYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKDIIDSQGRWKSSKIKENFSALDVEII

Query:  LNTPIRGGERKDEIFWSREKKGVFTVKSAY-HLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCR
        L  P+      D + W     G++ V S Y HLA+L ++ L ++S   NS  N WK FWK++   K KI  WR+ HD+LP   +++++ I+T+  C  CR
Subjt:  LNTPIRGGERKDEIFWSREKKGVFTVKSAY-HLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCR

Query:  EKNEEAKHVFWESDY-----------WEWTSD-KLDEGD-------------LEKAVLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMGLNS
        +  E   H  +   Y           ++W +  ++++GD             +E  + ILWS+W  RN+     +++S     SF              S
Subjt:  EKNEEAKHVFWESDY-----------WEWTSD-KLDEGD-------------LEKAVLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMGLNS

Query:  GMYLVQQGEPAPSRLQNQ------MSHDAWIPPLFQAWKLNVDASWN
          +  Q     P    NQ      +    W PP   + K+NVDA+ N
Subjt:  GMYLVQQGEPAPSRLQNQ------MSHDAWIPPLFQAWKLNVDASWN

XP_035544642.1 uncharacterized protein LOC109020982 [Juglans regia]3.9e-19130.67Show/hide
Query:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD
        EE+  Q +N +++  E  ++V +  D +E A    +  +L  I+  K IN E F+  M K+W  EG V+   VG N FL +FN   +++R+ KG PW+FD
Subjt:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD

Query:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDP
        R L+ +   +G   ++  +F   +FW+ +H++P+     +    +G ++G+ +    D+    W   +R++V++ ITK L RG+ +    +  +TW +  
Subjt:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDP

Query:  VQPEEDQPF----------GEWMRASPIMGGSNRNNISDSRKGSTWGRGRGSR-GEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEE
         + E    F          G+  ++SP     N+  +        W R   ++ G+  + R+G  S Q+ +G                 +W  +  +G+E
Subjt:  VQPEEDQPF----------GEWMRASPIMGGSNRNNISDSRKGSTWGRGRGSR-GEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEE

Query:  VRPTENLVSQGRE---------SQTASMSGEKGEESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNR---
        V   + +  +G +          +         E SA+  E       T  Q PK N T + +E   G                 + T K+   P R   
Subjt:  VRPTENLVSQGRE---------SQTASMSGEKGEESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNR---

Query:  KNSPSGPNGPEKEMAKALVMSNMQNESEEVEKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREPKKQRKPYFEAI
        + SP  P GP K  +    + N +  S+   + +  T  +   IK + W+   R        E +         +      +   E + K+ +    + +
Subjt:  KNSPSGPNGPEKEMAKALVMSNMQNESEEVEKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREPKKQRKPYFEAI

Query:  LKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKDSD--QNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFA
        +K     F+N F +  +G SGGLA  W N  +  + S+S+ HI   +K +   ++W  TGFYG P  + R+ SW+LL+ ++    + W+  GDFNEI+F 
Subjt:  LKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKDSD--QNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFA

Query:  KEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTW---RKRKSLQHGIRER--------------------------------KSAEITQLEEEDK-
         +K GG +R    +E+FR+ +    L D GF GNKFTW   R   S    + +R                                   ++T   + DK 
Subjt:  KEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTW---RKRKSLQHGIRER--------------------------------KSAEITQLEEEDK-

Query:  ---------RNPSHNILRA-------------------------------------EKDLEALLEEEESGIINKEGVWVEDEYEIGKTATRYF---QDLF
                     HNIL                                       ++DL+  + +     I+  G   E+  ++ K          +LF
Subjt:  ---------RNPSHNILRA-------------------------------------EKDLEALLEEEESGIINKEGVWVEDEYEIGKTATRYF---QDLF

Query:  STSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKV
        ++S P  T+ D  LE +   +S + +  L +P+S  +IE AL  MN   +PG DG  A F+Q +W  VG +     L +LN   S+ +IN T I LIPK 
Subjt:  STSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKV

Query:  KAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDK
        K P ++S FRPISLCNV+YK+++KA+ NR+K IL  IIS +Q+AF+P+RLI+DNVI+ FE +H +K K KGKEG +A+KLDMSKAYDR+EW F+R ++ K
Subjt:  KAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDK

Query:  MGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKD
        MGF   WI  +M C+++V Y+                   GDP+SPY F++CAE                                 DD+LIFC+A   +
Subjt:  MGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKD

Query:  CATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKA
           +  +L+ YE+ASGQ +N  K+S + SKN+ +     + +I G++       YLG+PS  G+ K   F+ + ++V   L ++  K  S  G+E  IKA
Subjt:  CATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKA

Query:  IVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNF
        +VQ +PTYTMS FKLP S    INS+   +WWG+ Q +NK HW+SW K+  +KD GGMGFRDL+ FNKAMLAK  WR+++ P+SL ++VL+ +YFK ++F
Subjt:  IVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNF

Query:  LAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVK--DIIDSQGR-WKSSKIKENFSALDVEIILNTPI
         +A LG++PS  W+S +  R +  +G  WRVG G +I I Q  WL++    + +L+   + D+R K  D+IDS  + WK   + + F   + E I+  PI
Subjt:  LAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVK--DIIDSQGR-WKSSKIKENFSALDVEIILNTPI

Query:  RGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEA
             KD++ W    KG F+V+SAYH+    + + +  S+    F   W++ W     P  K+ +WR   D+LPT+  + +K IV N LC  C  + E  
Subjt:  RGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEA

Query:  KHV
         HV
Subjt:  KHV

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]1.3e-19129.35Show/hide
Query:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD
        E++  + ++++++  E++++V  E+  L    +  +  +L KI   +  N E F+T M KIWN EG +  K +  N +L +F    +KE++  G PW+FD
Subjt:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD

Query:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKA--------
        R L+ +++ +G   +S+ +F    FW+ +H+LP     ++    +G+ +G  ++ + + E   W   LR++ ++N+TK L RG  ++ GSK         
Subjt:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKA--------

Query:  --------------EETWCED-PVQPEEDQPFGEWMRASPIMGGSNRNNISDSRK-----GSTWGRGRGSRGEFQRNRFGWRSGQKGNGDWREKGNPANS
                      E+  C++          +G+W+RA+ +      N      K     GSTW   +    ++        SG  G    +    P N 
Subjt:  --------------EETWCED-PVQPEEDQPFGEWMRASPIMGGSNRNNISDSRK-----GSTWGRGRGSRGEFQRNRFGWRSGQKGNGDWREKGNPANS

Query:  SNPTTEAWSPENSMGEEVRPTENLVSQGRESQTASMSGEKGEESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKD-
           +TEA        E   P E+   QG  S T + + +KG ++    E     Q   +Q P      +           +    +EA     +  E   
Subjt:  SNPTTEAWSPENSMGEEVRPTENLVSQGRESQTASMSGEKGEESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKD-

Query:  --------KSIPNRKNSP-SGPNGPEKEMAKALVMSNMQNESEE------VEKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNET
                +++ N  NSP  G     K  A    +S++ N  ++       ++ L Q F +    K KK K +A  E                     + 
Subjt:  --------KSIPNRKNSP-SGPNGPEKEMAKALVMSNMQNESEE------VEKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNET

Query:  LSILEEEREP-------KKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKDSDQN--WRFTGFYGEPDPSNRKV
        L +L + ++P        K      + I  K  LG+ENCF+V S+G+SG LAL W +S  + + +++  HI A I     N  W+ TGFYG P+ + R  
Subjt:  LSILEEEREP-------KKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKDSDQN--WRFTGFYGEPDPSNRKV

Query:  SWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRKRKSLQHGIRER---------------------
        SW LL  L  +  +PW+  GDFNEI    EK G   R  + M  FR +++   L+D GF G+KFTW   +      +ER                     
Subjt:  SWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRKRKSLQHGIRER---------------------

Query:  ------------------------------KSAEITQLEEED-----------------------------------KRNPSHNILRAEKDLEALLEEEE
                                      +SA   ++E E+                                   K+      L+ + +L  +L+E  
Subjt:  ------------------------------KSAEITQLEEED-----------------------------------KRNPSHNILRAEKDLEALLEEEE

Query:  SG----------------------------------------------------------IINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSV
         G                                                          I  K G+  +D   I +T   +F DLF++SHP  + ID+ 
Subjt:  SG----------------------------------------------------------IINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSV

Query:  LEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPIS
        L  + K I+ +    LS  F++ +++ A+ SMN   +PG DG  A+FFQ YWD VG   +   L++LNGGK    +N TLIALIPK   P  ++ FRPIS
Subjt:  LEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPIS

Query:  LCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMD
        LCNV+YK++AK L NR+K IL  IIS  QAAFVP RLI+DN+I+ FE +H +K++ KG EG +A+KLDMSKAYDR+EW F+R +M KMGFP +WI  +M 
Subjt:  LCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMD

Query:  CIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEE
        C+ +V YS                   GDPISPY+F++C+E                                 DD+L+FC+A   + + +  +L +YE 
Subjt:  CIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEE

Query:  ASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCF
        ASGQ +N  K+S   S N+ ++    +  I GI  T     YLG+P+  G+ +   FK + +RV   L +WK K  S  GKETLIKA++QAIPTY+M  F
Subjt:  ASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCF

Query:  KLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTW
        KLP  L  ++N     FWWG+ + ++K HW+SW++M  SK  GG G RD + FN+A+LAK  WRI+  P+SL ++V + +YFK + FL    GSN S  W
Subjt:  KLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTW

Query:  QSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDL-KDKRVKDIID-SQGRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSRE
        QS +  R L K+G  WRVG+G  I +  D W   P S  P  +   L  D +V  +ID +  +W    ++    + D E +   P+      D++ W   
Subjt:  QSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDL-KDKRVKDIID-SQGRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSRE

Query:  KKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWE----SDYW
          GVFTVKSAY L   L    +  S+     D  W   W++     +K  +WR   +++PT+  ++ K +V    C  CR   E A H  WE     D W
Subjt:  KKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWE----SDYW

Query:  EWTSDKL---------------------DEGDLEKAVLILWSLWDFRNQ
           + ++                     DE +L +  L  W +W  RN+
Subjt:  EWTSDKL---------------------DEGDLEKAVLILWSLWDFRNQ

TrEMBL top hitse value%identityAlignment
A0A2N9ELB0 Uncharacterized protein4.3e-19630.35Show/hide
Query:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD
        EE+      M++S +E+   + +  + +  + ++ Q +IL K+LT +  N E  K  +  +W   G V  + +  N F+  F +  + ERI    PW FD
Subjt:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD

Query:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDL-DEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCED
        + L+ +    G  + +  KF +  FW+ I +LP+    R+    +G ++G  +D D+ D+    W   LR++V++++T PL RG +V++    EET   D
Subjt:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDL-DEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCED

Query:  PVQPE-EDQPFGEWMRASPIMGGSNRNNISDSRKGSTWGRGRGSRGEFQRNRFG-W------RSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVR
        PV  +   +    +     I+G +    I+        GRG  +   F R+++G W      R  Q G     E  + +++++P      P+ ++     
Subjt:  PVQPE-EDQPFGEWMRASPIMGGSNRNNISDSRKGSTWGRGRGSRGEFQRNRFG-W------RSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVR

Query:  PTENLVSQGRESQTASMSGEKG----------------EESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIP
        PT+ +         A  S   G                 E+AE   H P   N  +   K N  +    A  G + T           P         + 
Subjt:  PTENLVSQGRESQTASMSGEKG----------------EESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIP

Query:  NRKNSPSGPN-------------GPEKEMAKALVMSNM--QNESEEVEKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSIL
        +   +P   N             GP  +++ ++   N+   NESE               + +K +    R    G    E     +   ++ + +L  L
Subjt:  NRKNSPSGPN-------------GPEKEMAKALVMSNM--QNESEEVEKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSIL

Query:  EEEREPKKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDA-TIKDSDQNWRFTGFYGEPDPSNRKVSWQLLERLSDSP
         E R   +  +        +  LG   CF V   G  GGLAL WN+S ++ I+S+S  HIDA  I +    WR TGFYG P+ + R  SW LL  L  + 
Subjt:  EEEREPKKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDA-TIKDSDQNWRFTGFYGEPDPSNRKVSWQLLERLSDSP

Query:  KIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTW----------------------------------------------
         +PW+V GDFNEI    EK G   R    M  FRE+++  +L D G+ G  FTW                                              
Subjt:  KIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTW----------------------------------------------

Query:  --------------RKRKSLQH---------------------------------------------------GIRERKSAEITQLEE---EDKRNPSHN
                      +KR   +H                                                    I E K + + QLE    E+  +   N
Subjt:  --------------RKRKSLQH---------------------------------------------------GIRERKSAEITQLEE---EDKRNPSHN

Query:  ILRAEKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKT
        +LR  ++L  L E+EE                                    G+ +  GVW  D   I   A  YF +LF++S+P   SID V++ V   
Subjt:  ILRAEKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKT

Query:  ISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYK
        +S + +  L +P+S  +I  AL  ++ SKAPG DGM A+FFQ YW IVG + S   LD LN G+ +  IN T IALIPKVK P+ M++FRPISLCNV+YK
Subjt:  ISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYK

Query:  MVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEY
        +V+K LVNRMK IL  +IS +Q+AFVP RLI+DNVI+ FE +H LK+ R G    +A KLDMSKAYDRVEW +++ I+ K+GF   W+  IM C+ S  Y
Subjt:  MVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEY

Query:  S-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTIN
        S                   GDP+SPY+FLICAE                                 DD++IFCRA   DC  I+ +L  YE+ASGQ +N
Subjt:  S-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTIN

Query:  TSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLY
          K++   SKN+P  K + +  + G   T     YLG+P   G++K+  F  +KER+WK LQ WK K  S  G+ETLIKA+VQAIPTY MSCFK P  L 
Subjt:  TSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLY

Query:  DDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGR
         +I SM  +FWWG+  G+ K HW+   ++   K  GGMGFRDL +FNKA+LA+  WR+L+ P SL+S+ L+ +YF   +FL  P+ +N S  W+S+   R
Subjt:  DDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGR

Query:  ELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKDIIDSQGR--WKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTV
         + + G RWRVG G  I + +D WL SP S   I     L ++   D +  Q    W S  +++ F   DVEII   P+      D + W     G FTV
Subjt:  ELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKDIIDSQGR--WKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTV

Query:  KSAYHLATL-LNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDYWE--WTS----
        +SAYHL     N  +E+SS  +      W   W  K  PK ++ +WR   D LPT+  +  +GI++   C +C E  E   HV W+ D+ +  W++    
Subjt:  KSAYHLATL-LNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDYWE--WTS----

Query:  -------------------DKLDEGDLEKAVLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIP
                             L E +LE      W LW+ RN+   + +  ++ D +     M  +  ++GL+         +P      ++ S   W P
Subjt:  -------------------DKLDEGDLEKAVLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIP

Query:  PLFQAWKLNV
        P    +K+N+
Subjt:  PLFQAWKLNV

A0A2N9EWI8 Uncharacterized protein1.1e-19629.81Show/hide
Query:  EATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFDRALMVIEDPQGASRISKTKFKYVKFWVHI
        ++ ++ Q ++L K+LT K  N +  K+ + ++W   G +  + +  N F+  F N  + ERI    PW FD+ L+ I    G  + +  KF +V FWV +
Subjt:  EATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFDRALMVIEDPQGASRISKTKFKYVKFWVHI

Query:  HDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDPVQPEEDQPFGEWMRASPIMGGSNRNNISD
         +LP+    R+    +G ++G  ++ D+  +   W   LR++V++ I KPL RG +++          ED      + PF    R   +     R     
Subjt:  HDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDPVQPEEDQPFGEWMRASPIMGGSNRNNISD

Query:  SRKGSTWGRGRGSRGEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVRPTENLVSQGR-ESQTASMSGEKGEESAEKSEHTPKNQN
           G+    GR + G+ +        G+   G W           P   +  P +      + + + V  G   +++AS   ++G  SA K         
Subjt:  SRKGSTWGRGRGSRGEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVRPTENLVSQGR-ESQTASMSGEKGEESAEKSEHTPKNQN

Query:  TSNQPPKDNCTSIKMEAKKG----------KKKTIFNQTSEAADEPKV--------CTEKDKSIPNRKNSPSGPNGPEKEMAKALVMSNMQNESEEVEKK
              + +   I    K+G           ++   NQ +   DE +V        C E  + I +     +      K+  + +++      +E V   
Subjt:  TSNQPPKDNCTSIKMEAKKG----------KKKTIFNQTSEAADEPKV--------CTEKDKSIPNRKNSPSGPNGPEKEMAKALVMSNMQNESEEVEKK

Query:  LEQTFESGKG-----IKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREP----KKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLA
             ++ +       + K WK+RA           N    +G  +   E    +++E        + R P ++    +  LG + CF V   G  GGLA
Subjt:  LEQTFESGKG-----IKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREP----KKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLA

Query:  LYWNNSTDLLIRSFSKGHIDATIKD-SDQNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSN
        L W++S D+ ++S+SK HID+ +   S + WRFTGFYG P  + R+ SW+LL RL      PW+V GDFNEIM + EK+G   R    M  FREA+N   
Subjt:  LYWNNSTDLLIRSFSKGHIDATIKD-SDQNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSN

Query:  LFDPGFSGNKFTWRKRKS--------------------------LQHGI-------------------------RERKSAEITQLEEE------------
        L D GF G +FTW   +                           +QH +                         + R   E   L EE            
Subjt:  LFDPGFSGNKFTWRKRKS--------------------------LQHGI-------------------------RERKSAEITQLEEE------------

Query:  -------------------------------------DKRNPSHNILRAE-------------KDLEALLEEEE--------------------------
                                             DKR     I   +              +L +LL++EE                          
Subjt:  -------------------------------------DKRNPSHNILRAE-------------KDLEALLEEEE--------------------------

Query:  ---------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQ
                  GI + + VW  +E EI      YF ++++T+HP   +ID+V+  V + +S + ++ L +PF++ ++  AL  M+ SKAPG DGM A+FFQ
Subjt:  ---------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQ

Query:  HYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECI
         +W IVG + +N  LD LN G  +K +N T IALIPKVK+P+ M+HFRPISLCNV+YK+++K LVNRMK IL  ++S +Q+AFVP R+ISDN+++ FE I
Subjt:  HYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECI

Query:  HALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE--------
        H LK+KR GK   +A KLDMSKAY+RVEW +++KIM K+GF E+W+A IM+C+ SV YS                   GDP+SPY+FLICAE        
Subjt:  HALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE--------

Query:  -------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQT
                                 DD+LIFCRA E DC  ++++L  YE ASGQ INT K++   S+N+       +  + G   T     YLG+P   
Subjt:  -------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQT

Query:  GKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRD
        G++KK  F ++K+R+W+ LQ WK KF S  GKE LIKA++QAIPTY MSCFKLP  L D+I++M  +FWWG+   + K HW+S +K+C +K  GGMGFRD
Subjt:  GKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRD

Query:  LKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKD
        L+ FN+A+LA+  WR+L+ PNSL+ +V                              +++   G RWRVGNG +I I +D W++SP +   I     L++
Subjt:  LKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKD

Query:  K-RVKDIIDSQG-RWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKI
           V  +I+     W    + E F   D+E+I+  P+      D + W+   KGVFTV+SAYH+    + + +A S+      + W++ W ++  PK K+
Subjt:  K-RVKDIIDSQG-RWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKI

Query:  CVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDY----WEWTSDKLDEG----------------DLEKAVLIL-----WSLWDFRNQ
         +WR   + LPT+  +  +GI  +L C +C E+ E   HV W  ++    W  ++  L  G                DL+  ++ +     W+LW  RN 
Subjt:  CVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDY----WEWTSDKLDEG----------------DLEKAVLIL-----WSLWDFRNQ

Query:  ANQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNV
             +  ++ D    +R  E  +  +       ++Q   P P           W PP    +KLN+
Subjt:  ANQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNV

A0A2N9F2M3 Uncharacterized protein5.3e-19430.42Show/hide
Query:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD
        EE+      M++S +E+ + +    + + ++ ++ Q +IL K+L+TK  N E F   +  +W   G +  + +  N FL  FN   + E+I    PW FD
Subjt:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD

Query:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDP
        + L+ +    G  + +  +F Y  FW+ I +LP+    R+    +G  +G  V+ D+ E    W   LR++V+I+I+KPL+RG ++              
Subjt:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDP

Query:  VQPEEDQPFGEWMRASPIMGGSNR-NNISDSRKGSTWGRGRGSRGEFQRNRFG-W------------RSGQKGNGDWREKGNPANSSNPTT--EAWSPEN
           EE+ PF    R   +     R   I  S      GRG         + FG W            R     + D +++G+  +    TT   A   + 
Subjt:  VQPEEDQPFGEWMRASPIMGGSNR-NNISDSRKGSTWGRGRGSRGEFQRNRFG-W------------RSGQKGNGDWREKGNPANSSNPTT--EAWSPEN

Query:  SMGEEVRPTENLVSQGRESQTASMSGEKGE-ESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSPSG
          G +  PT   V +G  S    +  E  +   A+  EH   N   +         +++   ++G K +  N      D           I     +P  
Subjt:  SMGEEVRPTENLVSQGRESQTASMSGEKGE-ESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSPSG

Query:  PNGPEKEMAKALVMSNMQNES--EEVEKKLEQTF------ESGKGI-----KLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILE----EEREP
         +  +   A   VM+   + S    VE KL            G G+     K   WK+RARS              +   KR    + + E    EE   
Subjt:  PNGPEKEMAKALVMSNMQNES--EEVEKKLEQTF------ESGKGI-----KLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILE----EEREP

Query:  KK----------------QRKPYFEAILK-----------KTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKDSDQNWRFTGFYGE
        KK                +  P    +++           +  LG   CF V   G  GGLAL WN+S  + IR++S  HIDA +   +  WR TGFYG 
Subjt:  KK----------------QRKPYFEAILK-----------KTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATIKDSDQNWRFTGFYGE

Query:  PDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRKRKS--------LQHGI--------
        P+ + R  SW LL  L      PW+V GDFNEI+   E  G N R    M  FREA+   +L D GF G +FTW  R+         L  G+        
Subjt:  PDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRKRKS--------LQHGI--------

Query:  ---------------------------------------------RERKSAEITQ----LEE----------------------------------EDKR
                                                     RE    E+ +    +E+                                  ED +
Subjt:  ---------------------------------------------RERKSAEITQ----LEE----------------------------------EDKR

Query:  N--------PSHN-----ILRAEKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTATRYFQDLFST
        N        PS N     +    ++L  LL++EE                                    G+ +   +W  D+  IG  A  YF  LF++
Subjt:  N--------PSHN-----ILRAEKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTATRYFQDLFST

Query:  SHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKA
        SH    +I  V++ V   ++   +  L +PF   ++  AL  MN SK+PG DGM A+FFQ +W+IVG + +   LD LN G+ +  +N T IALIPKVK+
Subjt:  SHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKA

Query:  PKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMG
        P+ M+ FRPISLCNVVYK+++K LVNRMK IL  +IS +Q+AFVP R+I+DNV+  FE +H LK+ R GK   +A KLDMSKAYDRVEW F+++I  K+G
Subjt:  PKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMG

Query:  FPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCA
        F   W+  +M+C+ SV Y+                   GDP+SPY+FLICAE                                 DD++IFCRA   DC 
Subjt:  FPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCA

Query:  TIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIV
         +   L  YE ASGQ +N +K++   S N+P +    +  + G   T     YLG+P   G++KK  F ++K+RVW+ LQ WK K  S  G+E LIKAI+
Subjt:  TIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIV

Query:  QAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLA
        QAIPTY MSCFK P  L  DI SM  +FWWG+ QG+ K HW+S  K+   K  GGMGFRD+++FN+A+LA+  WR+L  P+SL+ ++ + +YF   NFL 
Subjt:  QAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLA

Query:  APLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKD--IIDSQGRWKSSKIKENFSALDVEIILNTPIRGG
        A +  N S  W+SI   +++   G RWRVG+G  I I +DPWL    +   I   N   +    D  II     W    ++  F   DVE+I   P+   
Subjt:  APLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKD--IIDSQGRWKSSKIKENFSALDVEIILNTPIRGG

Query:  ERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSF-DNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKH
           D   W+  KKGV TVKSAYH       ++EASS+ +N    N W   W  K  PK ++ +WR   + LPT+ N+  +G+     C +C E+ E + H
Subjt:  ERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSF-DNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKH

Query:  VFWESDY----WEWT--------SDKLDEGD-------------LEKAVLILWSLWDFRNQANQNHQSRSIQD
        V W+ ++    W+ +          +L  GD             LE   +  W LW  RN+   N  + ++ D
Subjt:  VFWESDY----WEWT--------SDKLDEGD-------------LEKAVLILWSLWDFRNQANQNHQSRSIQD

A0A2N9HB41 Uncharacterized protein3.4e-20129.52Show/hide
Query:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD
        EE+    + MR+S  E+   + +   +++++  + + ++L ++LT++  N E FK  +  +W  EG +  + +  N FL  FN   + ER+    PW FD
Subjt:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD

Query:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMV--------------
        + L+ +   +   +    KF Y  FW+ I++LP++   ++    +GNS+G  V+ D+ E    W   LR++V+I++TKPL RG ++              
Subjt:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMV--------------

Query:  -----------RIGSKAEETWCEDPVQPEEDQP-----FGEWMRASPIMGG-SNRNNISDSRKGSTW--GRGRGSRGEFQRNRFGWRSGQKGNGDWREKG
                   RIG    E  C +  +   DQ      FG W+RA+P+ GG S+R N    +    W  G  +G  GEF+                 + G
Subjt:  -----------RIGSKAEETWCEDPVQPEEDQP-----FGEWMRASPIMGG-SNRNNISDSRKGSTW--GRGRGSRGEFQRNRFGWRSGQKGNGDWREKG

Query:  NPANSSNPTTEAWSPENSM-------------GEEVRPTEN------LVSQGRESQTASMSGEKGEESAEK-SEHTPKNQNTSN------------QPP-
        NP   S+   E    E  +             G+ +  T N      L +    +Q    + + GE+   K S   P+  +T              QP  
Subjt:  NPANSSNPTTEAWSPENSM-------------GEEVRPTEN------LVSQGRESQTASMSGEKGEESAEK-SEHTPKNQNTSN------------QPP-

Query:  KDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSPSGP--NGPEKEMAKALVMSNMQNESEEVEKKLEQTFESGKGI-----KLKKWK
        +DN   + M+    +  T+ +  S   D  +      ++I N   S + P  N  +K M    +      +  + +K+       G  +     +L   K
Subjt:  KDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSPSGP--NGPEKEMAKALVMSNMQNESEEVEKKLEQTFESGKGI-----KLKKWK

Query:  RRARSEKQGTFTEENTDYSM----------GTEKRRNETLSILEEEREPK-----KQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLI
        R       G   E+    SM          G +    E   +++ E  PK     + R P  +    + +LG   CF V  +   GGLAL+W+ S  + +
Subjt:  RRARSEKQGTFTEENTDYSM----------GTEKRRNETLSILEEEREPK-----KQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLI

Query:  RSFSKGHIDA-TIKDSDQNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKF
        +S+S  HIDA  I+D    WRFTGFYG P+ + R  SW LL RL     +PW++ GDFNEI+  +EK G   R  + M  FREA+   +L D GF G +F
Subjt:  RSFSKGHIDA-TIKDSDQNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKF

Query:  TW--------------------------------------------------------RKRKSLQH-----GIRERKSAEI-------------------
        TW                                                        RK+K+ Q       + E K  E+                   
Subjt:  TW--------------------------------------------------------RKRKSLQH-----GIRERKSAEI-------------------

Query:  ----------------------------TQLEEEDKRN----PSHNILRAEKDLEALLEEEE-----------------------------------SGI
                                     QL+E + ++     +  +    ++L  L+ +EE                                    G+
Subjt:  ----------------------------TQLEEEDKRN----PSHNILRAEKDLEALLEEEE-----------------------------------SGI

Query:  INKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSN
         +  GVW  D   +      YFQ+LF +S P  ++I+ V + V + +S   +  L  PFS  ++  AL  +  SKAPG DGM A+FFQ YW+IVG   ++
Subjt:  INKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSN

Query:  LCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEG
          LD LN G+ +  +N T I LIPK+KAP  MSHFRPISLCNV+YK+++K LVNRMK IL  IIS  Q+AFVP R+I+DN+I+ FE +H LK+K+ G+ G
Subjt:  LCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEG

Query:  LLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE--------------------
         +A KLDMSKAYDRVEW ++R I+ K+GF E W++ IM C+ SV YS                   GDP+SPY+FLICAE                    
Subjt:  LLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYS-------------------GDPISPYIFLICAE--------------------

Query:  -------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVK
                     DD++IFC A   +C  + ++L TYE ASGQ IN SK++   S ++ Q+    +  + G   T     YLG+P   G+ KK  F ++K
Subjt:  -------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVK

Query:  ERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKV
        +RV + LQ WK K  S  G+E LIKA++QA+PTY MSCFKLP+ L  +I++M  KFWWG+   + K HW++ +++  +KDRGGMGFR+L +FN AMLA+ 
Subjt:  ERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKV

Query:  SWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLN-SPGSEIPILTKNDLKDKRVKDIID-SQ
         WR+L+ PNSL+S+VL+ +Y+ + +FL A +  NPS  W+SI   +++   G  WRVG+G +I I +D WLN SP S+I    +    +  V ++I+  Q
Subjt:  SWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLN-SPGSEIPILTKNDLKDKRVKDIID-SQ

Query:  GRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPT
        G W S  I + F   D E I   P+     +D++ W+  + GV++VKSAY L  +  + + + S+   S    W   W  K  PK +  +WR   + LPT
Subjt:  GRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPT

Query:  KMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDY----WEWTSDKLDEG---------------------DLEKAVLILWSLWDFRNQANQNHQSRSIQD
        +  +  K I+++  C +C ++ E   H+ W+ ++    W  +S  +  G                     ++E  + + W LW  RN+   +    +++D
Subjt:  KMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDY----WEWTSDKLDEG---------------------DLEKAVLILWSLWDFRNQANQNHQSRSIQD

Query:  ---HFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNV
             SF+               +  ++ G+      ++    + W PP    +KL++
Subjt:  ---HFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNV

A0A2N9IPS8 Reverse transcriptase domain-containing protein2.6e-20130.45Show/hide
Query:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD
        EE++ + +   ++  +      ++DD +  +       +L K++T +  N E  K  M ++W     +  + +G N F+ +F N++E++R+  G PW FD
Subjt:  EEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWNKEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFD

Query:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDP
          L+ + +  G+   SK +F +  FWVH H +P+    +     +G +LG  +D D+ E+   W  SLRV+++++ TKP+ RG MV  GS          
Subjt:  RALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVKINITKPLKRGLMVRIGSKAEETWCEDP

Query:  VQPEEDQPFGEWMRASPIMGGSNRNNISDSRKGSTWGRGRG-----SRGEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVRPTEN
        V   + + +G W+RA   +   +      + +G      RG     +   F+ N     SG   NG  + +      S  +    +P   +   V     
Subjt:  VQPEEDQPFGEWMRASPIMGGSNRNNISDSRKGSTWGRGRG-----SRGEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPENSMGEEVRPTEN

Query:  LVSQGRESQTASMSGEKGEESAEKSEH--TPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSPSGPNGPEKEMAK
                    ++ +K     ++S H    KNQ   N    D            K+K +     E    P        + P + N   G      +  +
Subjt:  LVSQGRESQTASMSGEKGEESAEKSEH--TPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSPSGPNGPEKEMAK

Query:  ALVMSNMQNESEEVEK------KLEQTFESGKGIK---LKK----WKRRARSEKQGTFTE--------ENTDYSMGTEKR-------------RNETLSI
         +V  N   E+ +  K      K+++   S  G+     KK    WKR AR + + T  +        E+T+ +  T  R                 L +
Subjt:  ALVMSNMQNESEEVEK------KLEQTFESGKGIK---LKK----WKRRARSEKQGTFTE--------ENTDYSMGTEKR-------------RNETLSI

Query:  LEEEREP-----KKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATI--KDSDQNWRFTGFYGEPDPSNRKVSWQLL
        L +E++P      + R         +  + F+  F V  +G  GGLA+ W    D+ + ++S+ HIDA I  K+  + +R TGFYG P+   RK SW LL
Subjt:  LEEEREP-----KKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALYWNNSTDLLIRSFSKGHIDATI--KDSDQNWRFTGFYGEPDPSNRKVSWQLL

Query:  ERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRK-------------------------------------
        + LS     PW+  GDFNEI+   E+ G   R E  + +FREA+    L D G+ GN +TWR+                                     
Subjt:  ERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTWRK-------------------------------------

Query:  -----------------------------------RKSLQH----GIRE----------------------------------RKSAEITQLEEEDKRNP
                                           R+ + H    G+ E                                  RK  ++  L  E     
Subjt:  -----------------------------------RKSLQH----GIRE----------------------------------RKSAEITQLEEEDKRNP

Query:  SHNILRAEKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGV
        S  IL  + DL  LLE+EE                                   SG+ +++GVW  ++ +I + A  YFQ +F++S+P   SI +VL+G+
Subjt:  SHNILRAEKDLEALLEEEE-----------------------------------SGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGV

Query:  VKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNV
           ++   + +L   F+K ++  ALK M  +KAPG DGM AIF+Q YWDIVG E +   L IL+ G  ++ IN T IALIPKVK P+ ++ FRPISLCNV
Subjt:  VKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNV

Query:  VYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIES
        +YK+V+K L NR+K +L  +IS  Q+AFVP RLI+DNV++ FE +H++  KRKGK+G +A+KLDMSKAYDRVEW+F+  IM  MGF +EWI  +M C+ S
Subjt:  VYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIES

Query:  VEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQ
        V YS                   GD +SPY+FLICAE                                 DD+L+FC+A   +C  +  +L+ YE ASGQ
Subjt:  VEYS-------------------GDPISPYIFLICAE---------------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQ

Query:  TINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPI
         +N  K+S   +K++      ++ D   + + +    YLG+PS  G++K + F ++K RVW+ +  WK KF S  G+E LIKA+ Q+IPTY+MSCFKLP 
Subjt:  TINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPI

Query:  SLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIV
        SL +D+N+M + FWWG      K HW+ W K+C SK  GG+GFRDLK FN A+LAK  WR L+  NSL+ +V + +YF   +F++A LG+ PS  W+SI 
Subjt:  SLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIV

Query:  WGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKDII---DSQGRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKG
          R++ + G +W +G+G  + I +DPWL    S   +  ++ L  K    I+   DSQ  W    I   FS  + +II   P+   ++ D +FW+  K G
Subjt:  WGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKDII---DSQGRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKG

Query:  VFTVKSAYHLATLLNNSLEASSTKVNSFDNS-WKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFW----ESDYWEW
        +FTVKSAYHL      +     +     D   WK  W +   PK K+ +WR     LPT   +  + +  N LC  C    E   H  W     +D W  
Subjt:  VFTVKSAYHLATLLNNSLEASSTKVNSFDNS-WKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFW----ESDYWEW

Query:  TSDKLDEGD
        +S KL + D
Subjt:  TSDKLDEGD

SwissProt top hitse value%identityAlignment
P08548 LINE-1 reverse transcriptase homolog6.8e-2923.97Show/hide
Query:  KRKSLQHGIRERKSAEITQLEEEDKRNPSHNILRAEKDLEALLEEEESGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGV-VKTISK
        + K +   I + KS    ++ + DK  P  N+ R +K +++L+    S I N       D  EI K    Y++ L+S  +  +  ID  LE   +  +S+
Subjt:  KRKSLQHGIRERKSAEITQLEEEDKRNPSHNILRAEKDLEALLEEEESGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGV-VKTISK

Query:  NQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKV-KAPKEMSHFRPISLCNVVYKMV
         +   L+RP S ++I   ++++   K+PG DG  + F+Q + + +     NL  +I   G          I LIPK  K P    ++RPISL N+  K++
Subjt:  NQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKV-KAPKEMSHFRPISLCNVVYKMV

Query:  AKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCI-------
         K L NR++  ++ II  +Q  F+P      N+      I  + +K K K+ ++ + +D  KA+D ++  F+ + + K+G  E    K+++ I       
Subjt:  AKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCI-------

Query:  -------------ESVEYSGDPISPYIFLICAE----------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKS-S
                      S    G P+SP +F I  E                            DD +++          + +V++ Y   SG  INT KS +
Subjt:  -------------ESVEYSGDPISPYIFLICAE----------------------------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKS-S

Query:  FMASKNSPQEKIIK--LGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKK----VKERVWKALQNWKGKFFSVGGKETLIKAIV--QAIPTYTMSCFKLPI
        F+ + N+  EK +K  +   +  +K + LG+YL       K+ K ++K+    +++ + + +  WK    S  G+  ++K  +  +AI  +     K P+
Subjt:  FMASKNSPQEKIIK--LGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKK----VKERVWKALQNWKGKFFSVGGKETLIKAIV--QAIPTYTMSCFKLPI

Query:  SLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSW
        S + D+  +   F W +     K   I+   +      GG+   DL+++ K+++ K +W
Subjt:  SLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSW

P0C2F6 Putative ribonuclease H protein At1g657501.2e-3826.68Show/hide
Query:  MPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGG
        MP    +  K  F ++ ERV   +  W+ K  S  G+ TL KA++ ++P ++MS   LP S+ + ++ +   F WG    K K H + W K+C  K  GG
Subjt:  MPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGG

Query:  MGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRY----FKDNNFLAAPLGSNPSLTWQSIVWG-RELFKKGYRWRVGNGHHI------WIDQDPWLN
        +G R  K  N+A+++KV WR+L+  NSL + VL+ +Y     +D+ +L  P GS  S TW+SI  G R++   G  W  G+G  I      W+   P L 
Subjt:  MGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRY----FKDNNFLAAPLGSNPSLTWQSIVWG-RELFKKGYRWRVGNGHHI------WIDQDPWLN

Query:  SPGSEIPILTKNDLKDKRVKDIIDSQGRWKSSKI---KENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFD
            E P     D      KD+      W  +KI     N + L++  ++   + G   +D + W   + G F+V+SAY + T+     E     + SF 
Subjt:  SPGSEIPILTKNDLKDKRVKDIIDSQGRWKSSKI---KENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEASSTKVNSFD

Query:  NSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHV----------------------FWESDYWEWTSDKL-DEGDLE
        N     WK++   + K  +W + + ++ T+    ++ +  + +C  C+   E   HV                      F+    +EW  D L D    E
Subjt:  NSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHV----------------------FWESDYWEWTSDKL-DEGDLE

Query:  KA------VLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNVDAS
                 +I+W  W +R   N   ++   +D   FV+  E  +E    +SG  LV   +P   R+        W+ P     K+N D +
Subjt:  KA------VLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNVDAS

P11369 LINE-1 retrotransposable element ORF2 protein7.2e-3124.25Show/hide
Query:  IINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGV-VKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEET
        I N++G    D  EI  T   +++ L+ST    +  +D  L+   V  ++++Q   L+ P S  +IE  + S+ + K+PG DG  A F+Q + + +    
Subjt:  IINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGV-VKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEET

Query:  SNLCLDILNGGKSVKDINRTLIALIPK-VKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKG
          L   I   G          I LIPK  K P ++ +FRPISL N+  K++ K L NR++  ++ II  +Q  F+P      N+      IH + +K K 
Subjt:  SNLCLDILNGGKSVKDINRTLIALIPK-VKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKG

Query:  KEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDC-------------------IESVEYSGDPISPYIFLICAE-----------------
        K  ++ + LD  KA+D+++  F+ K++++ G    ++  I                      ++S    G P+SPY+F I  E                 
Subjt:  KEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDC-------------------IESVEYSGDPISPYIFLICAE-----------------

Query:  -----------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKS-SFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKE
                   DD +++    +     +  ++ ++ E  G  IN++KS +F+ +KN   EK I+      I       L + +  +        FK +K+
Subjt:  -----------DDNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKS-SFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKE

Query:  RVWKALQNWKGKFFSVGGKETLIKAIV--QAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAK
         + + L+ WK    S  G+  ++K  +  +AI  +     K+P   ++++     KF W      NK   I+   +   +  GG+   DLK++ +A++ K
Subjt:  RVWKALQNWKGKFFSVGGKETLIKAIV--QAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAK

Query:  VSW
         +W
Subjt:  VSW

P14381 Transposon TX1 uncharacterized 149 kDa protein3.7e-2724.39Show/hide
Query:  KEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLC
        ++G  +ED   I   A  ++Q+LFS       + + + +G +  +S+ +  RL  P +  ++  AL+ M  +K+PG DG+   FFQ +WD +G +   + 
Subjt:  KEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLC

Query:  LDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLL
         +    G+      R +++L+PK    + + ++RP+SL +  YK+VAKA+  R+K +L ++I  +Q+  VP R I DNV L  + +H   ++R G   L 
Subjt:  LDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLL

Query:  AMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKI------MDCIESVEYS-------------GDPISPYIFLICAED---------------------
         + LD  KA+DRV+  ++   +    F  +++  +       +C+  + +S             G P+S  ++ +  E                      
Subjt:  AMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKI------MDCIESVEYS-------------GDPISPYIFLICAED---------------------

Query:  ------DNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKN-------------SPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSV
              D++I    +  D    ++    Y  AS   IN SKSS +   +             S + KIIK            LG+YL             
Subjt:  ------DNLIFCRAEEKDCATIKKVLRTYEEASGQTINTSKSSFMASKN-------------SPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSV

Query:  FKKVKERVWKALQNWKG--KFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMG
        F +++E V   L  WKG  K  S+ G+  +I  +V +   Y + C          I      F W    GK   HW+S     +    GG G
Subjt:  FKKVKERVWKALQNWKG--KFFSVGGKETLIKAIVQAIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMG

P93295 Uncharacterized mitochondrial protein AtMg003102.9e-3244.68Show/hide
Query:  AIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISK-DRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLA
        A+P Y MSCF+L   L   + S   +FWW   + K K  W++W+K+C SK D GG+GFRDL  FN+A+LAK S+RI+ +P++LLS++LR RYF  ++ + 
Subjt:  AIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISK-DRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLA

Query:  APLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHH--IWIDQ
          +G+ PS  W+SI+ GREL  +G    +G+G H  +W+D+
Subjt:  APLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHH--IWIDQ

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.0e-1130.61Show/hide
Query:  EGVWVEDEYEIGKTATRYFQDLFSTSHPMMT--SIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNL
        + V VE+  ++ +    Y+  L  +   ++T  S+  + +      +   + RLS   S  +I  A+ +M  +KAPG D   A FF   W +V + T   
Subjt:  EGVWVEDEYEIGKTATRYFQDLFSTSHPMMT--SIDSVLEGVVKTISKNQSRRLSRPFSKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNL

Query:  CLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMV
          +    G  +K  N T I LIPKV    ++S FRP+S C VVYK++
Subjt:  CLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMV

AT3G25270.1 Ribonuclease H-like superfamily protein5.2e-0823.71Show/hide
Query:  KFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDYWE--WTSDKLDEGDLEK------------------------
        K WK+KT PK K  +W+++  +L T  N+ ++ I  +  C  C +++E ++H+F++  Y +  W +  +   +L                          
Subjt:  KFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDYWE--WTSDKLDEGDLEK------------------------

Query:  ---AVLILWSLWDFRNQANQNHQSRSIQDHFSFVRA-----MERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNVDASWN
           A+ ILW LW  RNQ     +S S Q+     R       + N     LN  ++  +  +P  +R + Q     WI       K N D ++N
Subjt:  ---AVLILWSLWDFRNQANQNHQSRSIQDHFSFVRA-----MERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNVDASWN

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.1e-1336.05Show/hide
Query:  LVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKI
        +V R+K ++ ++I   QA+F+P R+ +DN++   E +H+++ ++KG +G + +KLD+ KAYDR+ W ++   +   GFPE W+ +I
Subjt:  LVNRMKYILEDIISLNQAAFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKI

AT4G29090.1 Ribonuclease H-like superfamily protein3.8e-5130.11Show/hide
Query:  AIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAA
        A+PTYTM+CF LP ++   I S+ A FWW   Q     HW +W  +   K  GG+GF+D++ FN A+L K  WR+L RP SL+++V + RYF  ++ L A
Subjt:  AIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAA

Query:  PLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDK--------RVKDIIDSQGR-WKSSKIKENFSALDVEIILN
        PLGS PS  W+SI   +E+ ++G R  VGNG  I I +  WL+S  +   +  +     +        +V D+ID  GR W+   I+  F  ++ ++I  
Subjt:  PLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDK--------RVKDIIDSQGR-WKSSKIKENFSALDVEIILN

Query:  TPIRGGERKDEIFWSREKKGVFTVKSAYHLAT-LLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREK
            G    D   W     G +TVKS Y + T ++N           S +  ++K WK +T PK +  +W+ + +SLP    +  + +     C+ C   
Subjt:  TPIRGGERKDEIFWSREKKGVFTVKSAYHLAT-LLNNSLEASSTKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREK

Query:  NEEAKHVFWESDY----W-----------EWTSD---------KLDEGD--LEKA----VLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMG
         E   H+ ++  +    W           EW             L  G+   EKA      +LW LW  RN+     +  + Q+    +R  E +LE+  
Subjt:  NEEAKHVFWESDY----W-----------EWTSD---------KLDEGD--LEKA----VLILWSLWDFRNQANQNHQSRSIQDHFSFVRAMERNLEKMG

Query:  LNSGMYLVQQGEPAPSRLQ-NQMSHDAWIPPLFQAWKLNVDASWN
        + +      + E   ++ Q N+ S   W PP  Q  K N DA+WN
Subjt:  LNSGMYLVQQGEPAPSRLQ-NQMSHDAWIPPLFQAWKLNVDASWN

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.1e-3344.68Show/hide
Query:  AIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISK-DRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLA
        A+P Y MSCF+L   L   + S   +FWW   + K K  W++W+K+C SK D GG+GFRDL  FN+A+LAK S+RI+ +P++LLS++LR RYF  ++ + 
Subjt:  AIPTYTMSCFKLPISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISK-DRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLA

Query:  APLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHH--IWIDQ
          +G+ PS  W+SI+ GREL  +G    +G+G H  +W+D+
Subjt:  APLGSNPSLTWQSIVWGRELFKKGYRWRVGNGHH--IWIDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAGCAACATGCTGCATGAGGGAACCTCCAGCGACATCGAAGCAGCTGTGATGGAAAACAATGAAGAAACAGAAAGGGTCAGTTTAGAGCAAAGTATGGAAGTCGA
AGATGATCCAGAGAAAACGCAAAAGGCAGAAGAAGAAGTAGTCAACCAGATGGAAAATATGAGAATATCACCTTATGAAAGGGAAAAAGTGGTGGCAATTGAGGACGATG
AATTGGAAGAAGCAACGGAAGACCTTCAGGAAGCAATACTATGCAAAATTCTCACAACAAAGATGATAAATCCTGAGACTTTCAAAACATTCATGCCAAAAATATGGAAC
AAGGAGGGAAGAGTTAGAACAAAGACGGTTGGAAGAAATACGTTCTTGTGCAAATTCAACAACAGTCACGAAAAAGAAAGGATAAAGAAAGGAGGCCCTTGGAATTTTGA
CAGAGCTCTAATGGTCATCGAAGATCCTCAAGGAGCCAGCAGAATATCAAAAACAAAATTCAAGTATGTAAAATTTTGGGTTCATATTCATGATCTACCTATGGTGTGTC
ATTGCAGGAAATGGGTGACTGCGCTAGGAAACTCTTTAGGGGAATTTGTGGATTTTGACTTAGATGAAGAAGACAAAAGATGGGACAACTCTCTAAGGGTCCAAGTGAAA
ATCAACATTACCAAACCTCTAAAGAGGGGACTCATGGTCAGGATAGGATCGAAAGCAGAGGAGACATGGTGCGAGGACCCAGTTCAACCAGAGGAGGACCAACCATTTGG
AGAATGGATGAGGGCAAGCCCAATCATGGGGGGTTCAAACAGAAACAATATCTCTGATTCCAGGAAAGGCTCTACATGGGGTAGGGGTAGAGGAAGTCGCGGGGAATTCC
AAAGAAATAGGTTCGGATGGAGAAGCGGACAAAAAGGAAATGGAGACTGGCGTGAGAAAGGAAATCCAGCTAACAGCAGCAATCCGACCACCGAAGCTTGGTCGCCGGAA
AACTCCATGGGAGAAGAGGTGAGACCGACGGAAAATCTCGTTTCCCAAGGCAGAGAGTCTCAAACGGCTAGTATGTCAGGGGAAAAGGGGGAAGAAAGCGCAGAGAAGTC
AGAACATACCCCAAAAAATCAGAACACGTCAAATCAGCCCCCCAAGGATAATTGTACTAGCATTAAAATGGAAGCCAAAAAAGGCAAGAAGAAAACAATTTTTAACCAAA
CTTCTGAGGCGGCAGATGAACCAAAGGTCTGTACAGAAAAGGACAAATCAATCCCAAATAGGAAAAATAGCCCATCTGGGCCAAATGGGCCTGAAAAAGAAATGGCAAAG
GCCTTAGTTATGTCCAATATGCAAAACGAATCAGAGGAAGTGGAAAAGAAACTCGAACAGACCTTCGAATCAGGCAAAGGAATTAAATTGAAGAAATGGAAGAGACGAGC
TAGAAGTGAAAAACAGGGGACCTTCACAGAGGAAAATACTGATTACTCTATGGGAACAGAAAAAAGGAGAAATGAAACTCTGTCTATTCTGGAAGAGGAGAGGGAACCAA
AGAAACAACGCAAACCTTACTTCGAGGCTATTCTAAAGAAAACTGAGTTGGGTTTTGAAAATTGTTTTACGGTTGGGAGCCAAGGAAGAAGTGGCGGCCTTGCGCTCTAC
TGGAATAATTCTACGGATTTGTTGATCAGATCTTTCTCAAAGGGTCATATTGATGCAACAATAAAAGACAGTGACCAGAACTGGAGATTTACCGGATTCTACGGAGAGCC
GGACCCATCAAACAGGAAAGTATCTTGGCAACTCCTTGAGAGGTTGAGCGATTCTCCAAAAATCCCCTGGATCGTTGGAGGGGATTTTAACGAGATCATGTTTGCAAAAG
AAAAGGAAGGAGGGAACATGAGATTAGAAAAAGGAATGGAGAATTTCAGGGAAGCCATCAACAGAAGTAACCTTTTTGACCCGGGGTTCTCTGGCAACAAGTTTACTTGG
AGAAAAAGGAAATCCCTTCAACATGGCATTAGAGAAAGAAAGTCTGCGGAAATCACCCAGTTAGAAGAAGAAGACAAGAGGAACCCATCCCACAACATCTTAAGGGCGGA
AAAAGATCTCGAAGCCCTTTTGGAGGAAGAAGAAAGTGGGATCATAAATAAAGAGGGGGTTTGGGTCGAAGACGAGTATGAAATTGGAAAGACAGCTACGAGATACTTCC
AAGACCTTTTCTCCACTTCCCATCCCATGATGACAAGTATAGACTCTGTGTTGGAAGGGGTTGTCAAAACGATTTCTAAAAACCAAAGCAGGAGGCTTTCAAGGCCTTTT
TCAAAGGCTGACATTGAAGGGGCTTTGAAGAGTATGAATTCCTCAAAAGCTCCTGGAGAGGATGGGATGCATGCCATATTTTTCCAGCACTATTGGGACATTGTGGGTGA
AGAAACATCAAATCTTTGTCTAGATATCCTAAATGGAGGAAAAAGTGTGAAGGATATCAATCGGACTTTGATCGCCCTGATTCCAAAGGTTAAAGCCCCCAAAGAGATGT
CTCATTTCAGGCCAATTAGCCTTTGTAATGTGGTTTACAAGATGGTGGCTAAAGCCCTCGTTAACAGAATGAAATATATCCTAGAAGACATCATTTCCCTGAACCAAGCT
GCTTTTGTTCCCAAAAGGCTCATTTCAGATAACGTCATTTTGGGGTTCGAATGCATTCATGCTCTGAAATCCAAACGAAAAGGAAAGGAAGGGCTCCTAGCTATGAAACT
GGACATGAGCAAAGCTTACGACCGAGTCGAATGGATCTTCATTCGAAAGATCATGGATAAGATGGGATTCCCAGAGGAGTGGATTGCTAAAATCATGGATTGCATCGAAT
CAGTAGAATACTCTGGCGACCCTATCTCCCCTTACATCTTTCTGATCTGTGCTGAAGACGACAACCTAATCTTTTGCAGGGCGGAAGAAAAGGACTGTGCAACCATAAAG
AAGGTTTTGAGGACATACGAGGAGGCTTCGGGGCAGACTATCAATACGTCTAAATCATCCTTCATGGCAAGCAAAAATTCTCCTCAGGAAAAGATTATCAAGCTGGGCGA
TATACTTGGGATCGAAAAAACAGAGGATCTGGGTCTTTATCTGGGAATGCCTTCTCAAACCGGAAAAAACAAGAAGTCAGTGTTTAAGAAGGTCAAGGAAAGGGTATGGA
AAGCCCTTCAGAATTGGAAAGGAAAATTTTTCTCTGTAGGGGGCAAAGAAACTCTGATAAAAGCAATCGTCCAAGCCATCCCAACTTATACGATGAGTTGTTTCAAACTC
CCGATCTCCCTATATGACGATATAAACAGCATGTGTGCAAAATTTTGGTGGGGAGAATTCCAAGGGAAAAACAAAACTCATTGGATTAGTTGGCGTAAGATGTGCATTAG
CAAAGACAGAGGCGGCATGGGTTTTAGGGATTTAAAGATGTTTAATAAGGCTATGCTAGCAAAAGTTAGTTGGAGGATCCTTAAAAGGCCCAACAGTCTTCTTTCCCAAG
TCCTAAGGGGAAGATACTTCAAAGACAACAACTTCCTAGCAGCCCCTCTTGGATCCAACCCATCTCTAACTTGGCAAAGCATTGTTTGGGGTCGTGAGCTTTTCAAAAAG
GGCTACAGATGGAGAGTGGGTAATGGTCATCACATTTGGATTGACCAAGATCCATGGTTGAATTCCCCTGGAAGTGAAATCCCCATCCTAACCAAAAACGATCTCAAAGA
CAAGAGAGTTAAGGACATTATTGATAGCCAAGGGAGATGGAAATCTTCTAAGATTAAAGAAAATTTCTCGGCCCTGGATGTTGAAATAATCCTAAACACTCCTATCCGAG
GAGGGGAAAGAAAGGATGAGATTTTTTGGAGTAGAGAGAAAAAAGGAGTCTTCACTGTAAAAAGTGCATATCACCTAGCTACTCTCCTCAACAACTCTCTAGAAGCATCA
TCTACAAAAGTGAACTCTTTTGATAATAGCTGGAAAAAATTCTGGAAGATAAAAACAGTCCCAAAATCCAAAATTTGCGTGTGGCGTATAATTCACGACTCCCTCCCAAC
CAAAATGAACATCATCAAAAAAGGGATAGTTACTAATCTTTTGTGTTTATTTTGCAGGGAGAAAAATGAAGAGGCAAAGCACGTGTTTTGGGAGTCGGACTACTGGGAGT
GGACTTCAGACAAGTTGGATGAAGGTGACTTGGAAAAGGCGGTCTTAATCCTATGGAGCCTATGGGATTTCAGGAACCAAGCTAATCAAAATCATCAAAGCAGAAGCATC
CAAGATCACTTTTCTTTCGTCAGAGCCATGGAACGAAATTTAGAGAAAATGGGTCTCAATTCTGGAATGTACCTGGTTCAGCAAGGAGAGCCCGCCCCTTCAAGGTTGCA
GAACCAGATGAGTCACGATGCTTGGATCCCTCCGCTCTTTCAAGCGTGGAAACTCAATGTTGACGCCTCTTGGAATGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCAGCAACATGCTGCATGAGGGAACCTCCAGCGACATCGAAGCAGCTGTGATGGAAAACAATGAAGAAACAGAAAGGGTCAGTTTAGAGCAAAGTATGGAAGTCGA
AGATGATCCAGAGAAAACGCAAAAGGCAGAAGAAGAAGTAGTCAACCAGATGGAAAATATGAGAATATCACCTTATGAAAGGGAAAAAGTGGTGGCAATTGAGGACGATG
AATTGGAAGAAGCAACGGAAGACCTTCAGGAAGCAATACTATGCAAAATTCTCACAACAAAGATGATAAATCCTGAGACTTTCAAAACATTCATGCCAAAAATATGGAAC
AAGGAGGGAAGAGTTAGAACAAAGACGGTTGGAAGAAATACGTTCTTGTGCAAATTCAACAACAGTCACGAAAAAGAAAGGATAAAGAAAGGAGGCCCTTGGAATTTTGA
CAGAGCTCTAATGGTCATCGAAGATCCTCAAGGAGCCAGCAGAATATCAAAAACAAAATTCAAGTATGTAAAATTTTGGGTTCATATTCATGATCTACCTATGGTGTGTC
ATTGCAGGAAATGGGTGACTGCGCTAGGAAACTCTTTAGGGGAATTTGTGGATTTTGACTTAGATGAAGAAGACAAAAGATGGGACAACTCTCTAAGGGTCCAAGTGAAA
ATCAACATTACCAAACCTCTAAAGAGGGGACTCATGGTCAGGATAGGATCGAAAGCAGAGGAGACATGGTGCGAGGACCCAGTTCAACCAGAGGAGGACCAACCATTTGG
AGAATGGATGAGGGCAAGCCCAATCATGGGGGGTTCAAACAGAAACAATATCTCTGATTCCAGGAAAGGCTCTACATGGGGTAGGGGTAGAGGAAGTCGCGGGGAATTCC
AAAGAAATAGGTTCGGATGGAGAAGCGGACAAAAAGGAAATGGAGACTGGCGTGAGAAAGGAAATCCAGCTAACAGCAGCAATCCGACCACCGAAGCTTGGTCGCCGGAA
AACTCCATGGGAGAAGAGGTGAGACCGACGGAAAATCTCGTTTCCCAAGGCAGAGAGTCTCAAACGGCTAGTATGTCAGGGGAAAAGGGGGAAGAAAGCGCAGAGAAGTC
AGAACATACCCCAAAAAATCAGAACACGTCAAATCAGCCCCCCAAGGATAATTGTACTAGCATTAAAATGGAAGCCAAAAAAGGCAAGAAGAAAACAATTTTTAACCAAA
CTTCTGAGGCGGCAGATGAACCAAAGGTCTGTACAGAAAAGGACAAATCAATCCCAAATAGGAAAAATAGCCCATCTGGGCCAAATGGGCCTGAAAAAGAAATGGCAAAG
GCCTTAGTTATGTCCAATATGCAAAACGAATCAGAGGAAGTGGAAAAGAAACTCGAACAGACCTTCGAATCAGGCAAAGGAATTAAATTGAAGAAATGGAAGAGACGAGC
TAGAAGTGAAAAACAGGGGACCTTCACAGAGGAAAATACTGATTACTCTATGGGAACAGAAAAAAGGAGAAATGAAACTCTGTCTATTCTGGAAGAGGAGAGGGAACCAA
AGAAACAACGCAAACCTTACTTCGAGGCTATTCTAAAGAAAACTGAGTTGGGTTTTGAAAATTGTTTTACGGTTGGGAGCCAAGGAAGAAGTGGCGGCCTTGCGCTCTAC
TGGAATAATTCTACGGATTTGTTGATCAGATCTTTCTCAAAGGGTCATATTGATGCAACAATAAAAGACAGTGACCAGAACTGGAGATTTACCGGATTCTACGGAGAGCC
GGACCCATCAAACAGGAAAGTATCTTGGCAACTCCTTGAGAGGTTGAGCGATTCTCCAAAAATCCCCTGGATCGTTGGAGGGGATTTTAACGAGATCATGTTTGCAAAAG
AAAAGGAAGGAGGGAACATGAGATTAGAAAAAGGAATGGAGAATTTCAGGGAAGCCATCAACAGAAGTAACCTTTTTGACCCGGGGTTCTCTGGCAACAAGTTTACTTGG
AGAAAAAGGAAATCCCTTCAACATGGCATTAGAGAAAGAAAGTCTGCGGAAATCACCCAGTTAGAAGAAGAAGACAAGAGGAACCCATCCCACAACATCTTAAGGGCGGA
AAAAGATCTCGAAGCCCTTTTGGAGGAAGAAGAAAGTGGGATCATAAATAAAGAGGGGGTTTGGGTCGAAGACGAGTATGAAATTGGAAAGACAGCTACGAGATACTTCC
AAGACCTTTTCTCCACTTCCCATCCCATGATGACAAGTATAGACTCTGTGTTGGAAGGGGTTGTCAAAACGATTTCTAAAAACCAAAGCAGGAGGCTTTCAAGGCCTTTT
TCAAAGGCTGACATTGAAGGGGCTTTGAAGAGTATGAATTCCTCAAAAGCTCCTGGAGAGGATGGGATGCATGCCATATTTTTCCAGCACTATTGGGACATTGTGGGTGA
AGAAACATCAAATCTTTGTCTAGATATCCTAAATGGAGGAAAAAGTGTGAAGGATATCAATCGGACTTTGATCGCCCTGATTCCAAAGGTTAAAGCCCCCAAAGAGATGT
CTCATTTCAGGCCAATTAGCCTTTGTAATGTGGTTTACAAGATGGTGGCTAAAGCCCTCGTTAACAGAATGAAATATATCCTAGAAGACATCATTTCCCTGAACCAAGCT
GCTTTTGTTCCCAAAAGGCTCATTTCAGATAACGTCATTTTGGGGTTCGAATGCATTCATGCTCTGAAATCCAAACGAAAAGGAAAGGAAGGGCTCCTAGCTATGAAACT
GGACATGAGCAAAGCTTACGACCGAGTCGAATGGATCTTCATTCGAAAGATCATGGATAAGATGGGATTCCCAGAGGAGTGGATTGCTAAAATCATGGATTGCATCGAAT
CAGTAGAATACTCTGGCGACCCTATCTCCCCTTACATCTTTCTGATCTGTGCTGAAGACGACAACCTAATCTTTTGCAGGGCGGAAGAAAAGGACTGTGCAACCATAAAG
AAGGTTTTGAGGACATACGAGGAGGCTTCGGGGCAGACTATCAATACGTCTAAATCATCCTTCATGGCAAGCAAAAATTCTCCTCAGGAAAAGATTATCAAGCTGGGCGA
TATACTTGGGATCGAAAAAACAGAGGATCTGGGTCTTTATCTGGGAATGCCTTCTCAAACCGGAAAAAACAAGAAGTCAGTGTTTAAGAAGGTCAAGGAAAGGGTATGGA
AAGCCCTTCAGAATTGGAAAGGAAAATTTTTCTCTGTAGGGGGCAAAGAAACTCTGATAAAAGCAATCGTCCAAGCCATCCCAACTTATACGATGAGTTGTTTCAAACTC
CCGATCTCCCTATATGACGATATAAACAGCATGTGTGCAAAATTTTGGTGGGGAGAATTCCAAGGGAAAAACAAAACTCATTGGATTAGTTGGCGTAAGATGTGCATTAG
CAAAGACAGAGGCGGCATGGGTTTTAGGGATTTAAAGATGTTTAATAAGGCTATGCTAGCAAAAGTTAGTTGGAGGATCCTTAAAAGGCCCAACAGTCTTCTTTCCCAAG
TCCTAAGGGGAAGATACTTCAAAGACAACAACTTCCTAGCAGCCCCTCTTGGATCCAACCCATCTCTAACTTGGCAAAGCATTGTTTGGGGTCGTGAGCTTTTCAAAAAG
GGCTACAGATGGAGAGTGGGTAATGGTCATCACATTTGGATTGACCAAGATCCATGGTTGAATTCCCCTGGAAGTGAAATCCCCATCCTAACCAAAAACGATCTCAAAGA
CAAGAGAGTTAAGGACATTATTGATAGCCAAGGGAGATGGAAATCTTCTAAGATTAAAGAAAATTTCTCGGCCCTGGATGTTGAAATAATCCTAAACACTCCTATCCGAG
GAGGGGAAAGAAAGGATGAGATTTTTTGGAGTAGAGAGAAAAAAGGAGTCTTCACTGTAAAAAGTGCATATCACCTAGCTACTCTCCTCAACAACTCTCTAGAAGCATCA
TCTACAAAAGTGAACTCTTTTGATAATAGCTGGAAAAAATTCTGGAAGATAAAAACAGTCCCAAAATCCAAAATTTGCGTGTGGCGTATAATTCACGACTCCCTCCCAAC
CAAAATGAACATCATCAAAAAAGGGATAGTTACTAATCTTTTGTGTTTATTTTGCAGGGAGAAAAATGAAGAGGCAAAGCACGTGTTTTGGGAGTCGGACTACTGGGAGT
GGACTTCAGACAAGTTGGATGAAGGTGACTTGGAAAAGGCGGTCTTAATCCTATGGAGCCTATGGGATTTCAGGAACCAAGCTAATCAAAATCATCAAAGCAGAAGCATC
CAAGATCACTTTTCTTTCGTCAGAGCCATGGAACGAAATTTAGAGAAAATGGGTCTCAATTCTGGAATGTACCTGGTTCAGCAAGGAGAGCCCGCCCCTTCAAGGTTGCA
GAACCAGATGAGTCACGATGCTTGGATCCCTCCGCTCTTTCAAGCGTGGAAACTCAATGTTGACGCCTCTTGGAATGATTGA
Protein sequenceShow/hide protein sequence
MGSNMLHEGTSSDIEAAVMENNEETERVSLEQSMEVEDDPEKTQKAEEEVVNQMENMRISPYEREKVVAIEDDELEEATEDLQEAILCKILTTKMINPETFKTFMPKIWN
KEGRVRTKTVGRNTFLCKFNNSHEKERIKKGGPWNFDRALMVIEDPQGASRISKTKFKYVKFWVHIHDLPMVCHCRKWVTALGNSLGEFVDFDLDEEDKRWDNSLRVQVK
INITKPLKRGLMVRIGSKAEETWCEDPVQPEEDQPFGEWMRASPIMGGSNRNNISDSRKGSTWGRGRGSRGEFQRNRFGWRSGQKGNGDWREKGNPANSSNPTTEAWSPE
NSMGEEVRPTENLVSQGRESQTASMSGEKGEESAEKSEHTPKNQNTSNQPPKDNCTSIKMEAKKGKKKTIFNQTSEAADEPKVCTEKDKSIPNRKNSPSGPNGPEKEMAK
ALVMSNMQNESEEVEKKLEQTFESGKGIKLKKWKRRARSEKQGTFTEENTDYSMGTEKRRNETLSILEEEREPKKQRKPYFEAILKKTELGFENCFTVGSQGRSGGLALY
WNNSTDLLIRSFSKGHIDATIKDSDQNWRFTGFYGEPDPSNRKVSWQLLERLSDSPKIPWIVGGDFNEIMFAKEKEGGNMRLEKGMENFREAINRSNLFDPGFSGNKFTW
RKRKSLQHGIRERKSAEITQLEEEDKRNPSHNILRAEKDLEALLEEEESGIINKEGVWVEDEYEIGKTATRYFQDLFSTSHPMMTSIDSVLEGVVKTISKNQSRRLSRPF
SKADIEGALKSMNSSKAPGEDGMHAIFFQHYWDIVGEETSNLCLDILNGGKSVKDINRTLIALIPKVKAPKEMSHFRPISLCNVVYKMVAKALVNRMKYILEDIISLNQA
AFVPKRLISDNVILGFECIHALKSKRKGKEGLLAMKLDMSKAYDRVEWIFIRKIMDKMGFPEEWIAKIMDCIESVEYSGDPISPYIFLICAEDDNLIFCRAEEKDCATIK
KVLRTYEEASGQTINTSKSSFMASKNSPQEKIIKLGDILGIEKTEDLGLYLGMPSQTGKNKKSVFKKVKERVWKALQNWKGKFFSVGGKETLIKAIVQAIPTYTMSCFKL
PISLYDDINSMCAKFWWGEFQGKNKTHWISWRKMCISKDRGGMGFRDLKMFNKAMLAKVSWRILKRPNSLLSQVLRGRYFKDNNFLAAPLGSNPSLTWQSIVWGRELFKK
GYRWRVGNGHHIWIDQDPWLNSPGSEIPILTKNDLKDKRVKDIIDSQGRWKSSKIKENFSALDVEIILNTPIRGGERKDEIFWSREKKGVFTVKSAYHLATLLNNSLEAS
STKVNSFDNSWKKFWKIKTVPKSKICVWRIIHDSLPTKMNIIKKGIVTNLLCLFCREKNEEAKHVFWESDYWEWTSDKLDEGDLEKAVLILWSLWDFRNQANQNHQSRSI
QDHFSFVRAMERNLEKMGLNSGMYLVQQGEPAPSRLQNQMSHDAWIPPLFQAWKLNVDASWND