; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006974 (gene) of Snake gourd v1 genome

Gene IDTan0006974
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG02:77211227..77217209
RNA-Seq ExpressionTan0006974
SyntenyTan0006974
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH80348.1 hypothetical protein [Trifolium medium]9.0e-6627.43Show/hide
Query:  KLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEEPR
        K   +++ + I + DE++   D++     + +  +  P     FK  M + W     + IQ    N +L  F + ++   +  +GPW FD+ LL+     
Subjt:  KLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEEPR

Query:  RNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDFCYK
         N + SE    + +FWV +++LPL       AK LGN +G F E+D  +  N  G  LR++V +D+ KPL RGT  K+    +E+W+   YE+LP+FC+ 
Subjt:  RNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDFCYK

Query:  CGKIGHVFKDCD--------LFAQDSEDELLFSENLREIPYNKSINRGGKEEET--------PRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESK
        CG+IGH  +DC+         +++  E +  F   LR  P  K      KE  +        P     +G+  GT + M+   E Q       S++  +K
Subjt:  CGKIGHVFKDCD--------LFAQDSEDELLFSENLREIPYNKSINRGGKEEET--------PRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESK

Query:  ADINLNL-----PKQNVSED----SPSLTLEENAHQTNIEEKTST------------------PTAKTSTIGKDTPLSPQRLLPTKETFNAANDTEMTDP
         +++ +       +QNV ++    + SL     + QT I E T T                  P     T+GK    S   ++ T  T  A    E    
Subjt:  ADINLNL-----PKQNVSED----SPSLTLEENAHQTNIEEKTST------------------PTAKTSTIGKDTPLSPQRLLPTKETFNAANDTEMTDP

Query:  IGEKVKPLTSDRETKSSGNTILHQYQPIHTKQEIEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDA
            +K   +   +  +   +L +   +   Q +   + + K     N+++  ++  K+  A   NG  +    +GGL L W   L ++I+SFS  HI  
Subjt:  IGEKVKPLTSDRETKSSGNTILHQYQPIHTKQEIEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDA

Query:  SINQD---WSWRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKT
            +    SW  TG YG+P+   + ++W L+  L  QN   WL  GDFN+IL+  EK+GG  R+ +Q     +A+    L+D GF G  FTW  G+E+ 
Subjt:  SINQD---WSWRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKT

Query:  NTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRKLRFEASWTNFGKCSDIVQSCW
          ++ RLDR + N   +++   ++V+HL    SDH  +        P+ +R   +R  RFE SWT   +C ++++S W
Subjt:  NTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRKLRFEASWTNFGKCSDIVQSCW

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]4.9e-7232.37Show/hide
Query:  KQMEKLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALL
        ++ EKL L +++   I  I+    +  +++L  + I +A+++K I    FK+ +  IW  + +V+++  G+N F   F++  D+KRI E GPW+FDK LL
Subjt:  KQMEKLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALL

Query:  LFEEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKL
        +  E   + + ++ +F+Y  FW+ LHNLPL    R+    LG  +G+  E+D+ + G C G  +RI+V IDV  PL RG  V +G   +   +   YE+L
Subjt:  LFEEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKL

Query:  PDFCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQ
        P+FCY CGKIGH+ +DC L    +  E+  S + +  P+ ++++              R R +GTG    + +  +   SS        K     N+ K 
Subjt:  PDFCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQ

Query:  N--VSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPL------SPQRLL--PTKETFNAANDTEMTDP-IGEKVKPLTSDRETKSSGNTILHQYQ
        +  +  D   L L       N  E T T  ++T  + K   L      S +++    T+ +        +T+P IGE V        ++ +G  I +Q +
Subjt:  N--VSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPL------SPQRLL--PTKETFNAANDTEMTDP-IGEKVKPLTSDRETKSSGNTILHQYQ

Query:  PIHTKQE-----IEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASINQDWS--WRFTGFYGHPDP
             +E      E  Q  GK+  +++++ +    RK     + + +    G  GGL L WK+ +++SI SF+ GHIDA I    S  WRFTGFYG P P
Subjt:  PIHTKQE-----IEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASINQDWS--WRFTGFYGHPDP

Query:  QQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDR
          R  SW LL RL   +N PW+V GDFNEIL   EK+GG  R+ + + +F EA++ C L+D G+ G+K+TW   + K   I+ER+DR
Subjt:  QQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDR

XP_022841874.1 uncharacterized protein LOC111365549 [Olea europaea var. sylvestris]2.6e-6527.64Show/hide
Query:  KLKLTENEKAKIIDIRDEDL--KAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLF
        KL L   E+  ++   DE L    +DK L    + + LSSK      FK  M R+W+    +SI     N  +  F   +DK R+  +GPW+F K L+L 
Subjt:  KLKLTENEKAKIIDIRDEDL--KAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLF

Query:  EEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPD
        ++     +  +  F  ANFWV +H+L ++    K    +G  +G+ IEVD D      G  L ++V +D++KPL+RG  + +GS  +  W   +YE+LP+
Subjt:  EEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPD

Query:  FCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQNV
        FCY CG +GH  KD  L+    E     +    +  Y   + R G   +   PI  R     T  +  +      DPSS            N +   +  
Subjt:  FCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQNV

Query:  SEDSP----SLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAAN----DTE--------MTDPIGEKVKPLTSDRETKSSGNTILH
         +  P    +LT  +   QTN E  ++   ++T+T  + TPL    ++      N  +    DT+         TDP+       +++  T + G     
Subjt:  SEDSP----SLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAAN----DTE--------MTDPIGEKVKPLTSDRETKSSGNTILH

Query:  QYQPIHTKQEIEHT--------------QGKGKQMAELNVKTWKRIARKDTHA-HSTNGL-----------------------------------SQHNG
        +++ ++T   I                   +      L + +W     ++    H+ + L                                    Q  G
Subjt:  QYQPIHTKQEIEHT--------------QGKGKQMAELNVKTWKRIARKDTHA-HSTNGL-----------------------------------SQHNG

Query:  TSGGLMLFWKSSLKLSINSFSTGHIDASINQD-WSWRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEA
         SGG+ L WKS++ LSI  +ST HIDA I      W  TG YGHP+  +R ++W LL+RLK  ++  WLV GDFNEIL+  EK GG+ R   Q+ NF+  
Subjt:  TSGGLMLFWKSSLKLSINSFSTGHIDASINQD-WSWRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEA

Query:  INRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRKLRFEASWTNFGKCSDIVQ
           C+L D GF+G  +TW  G+  TN I ERLDRF+AN +         V H S   SDHR +   W +L          +  RFE+ W     CSDIV 
Subjt:  INRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRKLRFEASWTNFGKCSDIVQ

Query:  SCWHKNYT-GRDLLNQKVSHSIRQLQAWNTERLKGSIKGAIDRKAKDLATLENQQIPNQDIILKKNKIEGLFDSLGTWVVREDDM
        + W    +   + + + +S    QLQ WN  +  G ++  +++    L+ ++ +        +    +    + + TW+ RE+ M
Subjt:  SCWHKNYT-GRDLLNQKVSHSIRQLQAWNTERLKGSIKGAIDRKAKDLATLENQQIPNQDIILKKNKIEGLFDSLGTWVVREDDM

XP_022841874.1 uncharacterized protein LOC111365549 [Olea europaea var. sylvestris]2.5e-1523.87Show/hide
Query:  GYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKGRRVNSLISSNGHWNKDILENNFLPGHFSVKSVYRLALNLSQKDEASSSNLLQHKEVWKRFWNL
        G RW VGNG  I I  D WI   G  +          R + +++  N     +   ++ +  +    S YR   N    D  +SS   ++K+ W   W L
Subjt:  GYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKGRRVNSLISSNGHWNKDILENNFLPGHFSVKSVYRLALNLSQKDEASSSNLLQHKEVWKRFWNL

Query:  DSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAIT
            + K   W   Q+ +P++QNL +R +   P C FC    E+  HV+  C     L TD FP L          +  I+    + T +    LS+ + 
Subjt:  DSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAIT

Query:  IMWSIWDARNKALKSGHPPNKEDITKRIELHTLDREFRPQIG-SLDKSSKNQMSHRHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQ
        I W  W  RNK +      + + I        L ++   + G +++ +    +    W  PP    K+N+D     ++ + GLG  + D  G  I   ++
Subjt:  IMWSIWDARNKALKSGHPPNKEDITKRIELHTLDREFRPQIG-SLDKSSKNQMSHRHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQ

Query:  ----LIKTDWTIKILELKAILLAVNL-INHI
            L++ +       ++ + L +NL I+H+
Subjt:  ----LIKTDWTIKILELKAILLAVNL-INHI

XP_022841874.1 uncharacterized protein LOC111365549 [Olea europaea var. sylvestris]3.6e-6227.6Show/hide
Query:  EKLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFE
        + LKLTE E+ +I+ + +E + +++   +   + +  + +      F+T M +IWN EG ++ +    N +L  F+   DK+++    PW FD+ L+  +
Subjt:  EKLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFE

Query:  EPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDF
        E       SE  F    FW+ +HNLP     ++   ++G+ IG  +EV+++ +G   G  LRIK  ++V K L+RG  +K GS  ++ W+   YE+LP F
Subjt:  EPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDF

Query:  CYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSI--NR--GGKEEETPRPIRGRGRGRGTGRAMEARKEYQ----VDPSSGRSR-----EDESK
        C+KCG+  H    C     D+     + + LR    +     NR  GG +E+ P        G     + E   +Y     V  S   S      E    
Subjt:  CYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSI--NR--GGKEEETPRPIRGRGRGRGTGRAMEARKEYQ----VDPSSGRSR-----EDESK

Query:  ADINL-----NLPKQNVSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKD-TPLSPQRLLP-------------------------------TKETF-
         ++NL     NLPK++           +  H+T    +   P +    + +D T L PQ L P                               T+  F 
Subjt:  ADINL-----NLPKQNVSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKD-TPLSPQRLLP-------------------------------TKETF-

Query:  ---NAAND------------------TEMTDPIGEKVKPLTSDR------ETKSSGNTILHQYQ-PIHTKQEIEHTQGKGKQMAELNVKTWK----RIAR
           N++N                   +++T+ I +      S R       T++ G     + Q P  + QE+ H   K KQ   + +   K    R+ R
Subjt:  ---NAAND------------------TEMTDPIGEKVKPLTSDR------ETKSSGNTILHQYQ-PIHTKQEIEHTQGKGKQMAELNVKTWK----RIAR

Query:  KDTHAHSTNGLSQHN-GTSGGLMLFWKSSLKLSINSFSTGHIDASIN---QDWSWRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRR
                N  S ++ G SG L L WK S+K+ + +++T HI A I     +  W+ TGFYGHP+  +R +SW LL  LK   N PWL  GDFNEI ++ 
Subjt:  KDTHAHSTNGLSQHN-GTSGGLMLFWKSSLKLSINSFSTGHIDASIN---QDWSWRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRR

Query:  EKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQR
        EK G   R   Q+  F  +++ C+L D GF GDKFTW   +E     KERLDR   N   I    N  V HL    SDH+ +     +L    S     R
Subjt:  EKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQR

Query:  KLRFEASWTNFGKCSDIVQSCWHKNYTGRDLLN---QKVSHSIRQLQAWNTERLKGSIKGAIDRKAKDLATLE--NQQIPNQDIILKKNKIEGLFDS
          RFE++WT   +C +I++  W  + +G  +L+   Q ++    +L+ W+  + +   K A+  K + L  L+  NQ   +++I      I  + D+
Subjt:  KLRFEASWTNFGKCSDIVQSCWHKNYTGRDLLN---QKVSHSIRQLQAWNTERLKGSIKGAIDRKAKDLATLE--NQQIPNQDIILKKNKIEGLFDS

XP_035545013.1 uncharacterized protein LOC108979776 [Juglans regia]7.4e-6829.01Show/hide
Query:  KLKLTENEKAKI-IDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFE
        KL LTE+E+  + +D+  + L+      +   I + L+ +      FK ++ R+W     + +Q       L  F   +DK+R+  DGPW FD+ L+L +
Subjt:  KLKLTENEKAKI-IDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFE

Query:  EPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDF
        +     +  + +   A+FWV +H+LPL     +  +++G ++G   ++D  D     G  +RI+V ID+ K L+RG  + IGS     W+  +YE+LPDF
Subjt:  EPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDF

Query:  CYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQV---------DPSSGRSREDESKADIN
        C+ C +IGH F+DCD  AQD      F E     PY + +  GG+  +   P   R           A++   V          P       +   AD+ 
Subjt:  CYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQV---------DPSSGRSREDESKADIN

Query:  LNLPKQNVSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAANDTEMTDPIGEKVKPLTSDRETKSSGNTILHQYQPIHTK
          +     S +  S T+  N  +  +E    TP         +  L P             +D  +T+P      P     ETK+               
Subjt:  LNLPKQNVSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAANDTEMTDPIGEKVKPLTSDRETKSSGNTILHQYQPIHTK

Query:  QEIEHTQGKG-KQMAELNVKTWKRIARKDTHA--HSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASINQDWS--WRFTGFYGHPDPQQRHQSW
           +   G G + + EL       I R+D          L      SGGL L W+  L++ + SFS  H+D  +N+D +  WRFTG YG+P    R  +W
Subjt:  QEIEHTQGKG-KQMAELNVKTWKRIARKDTHA--HSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASINQDWS--WRFTGFYGHPDPQQRHQSW

Query:  KLLERLKDQNNS--PWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDH
         L+ +L D ++   PWL+GGDFNE+L   EK+ G+TR+ +Q+  F E +  C L D GF G KFTW  G+E T  I ERLDRFL N +         V H
Subjt:  KLLERLKDQNNS--PWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDH

Query:  LSYHNSDHRGITAAWRELTPTPSRHVPQRKLRFEASWTNFGKCSDIVQSCWHKNYTGRDLLN-------QKVSHSIRQLQAWNTERLKGSIKGAIDRKAK
            +SDH  I   W +      +    R  RFEA W    KC+DI++  W    TG  + N       Q++     +L +WN    K S      +KA+
Subjt:  LSYHNSDHRGITAAWRELTPTPSRHVPQRKLRFEASWTNFGKCSDIVQSCWHKNYTGRDLLN-------QKVSHSIRQLQAWNTERLKGSIKGAIDRKAK

Query:  DLATLENQQIPNQD----IILKKNKIEGLFDSLGTWVVRED
            LE  Q  N++    + L + K     ++L  W+ RE+
Subjt:  DLATLENQQIPNQD----IILKKNKIEGLFDSLGTWVVRED

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]2.8e-1424.7Show/hide
Query:  IPTRVLFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKG-RRVNSLI-SSNGHWNKDILE---------------------------NNFLPG
        I  R L  EG  WRVG+G+ I +  D W  +   +K   +   L    +V  LI  +   WN  ++E                                G
Subjt:  IPTRVLFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKG-RRVNSLI-SSNGHWNKDILE---------------------------NNFLPG

Query:  HFSVKSVYRLALNLSQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDY
         F+VKS Y L L L ++ +  SS  ++   VW   W L      K  +W      IPTR  L  + +     C  CR   E A H +W C  ++ +W+  
Subjt:  HFSVKSVYRLALNLSQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDY

Query:  FPFLTDFLNFCREDRNPIE-CWKALTTHLKNVDLSKAITIMWSIWDARNKALKSGHPPNKEDITKRIEL--HTLDREFRPQIGSLDKSSKNQMSHRHWDP
           +      C   ++ +E C K         +L +     W IW  RN+ + S    +   I + ++L    LD+  +P       SS        W+ 
Subjt:  FPFLTDFLNFCREDRNPIE-CWKALTTHLKNVDLSKAITIMWSIWDARNKALKSGHPPNKEDITKRIEL--HTLDREFRPQIGSLDKSSKNQMSHRHWDP

Query:  PPAGWWKMNSDATWLEEARQGGLGWSVRDSSG
        PP    K N DA   + + + G+G  VR+  G
Subjt:  PPAGWWKMNSDATWLEEARQGGLGWSVRDSSG

TrEMBL top hitse value%identityAlignment
A0A2N9ESV7 Uncharacterized protein1.2e-1827.95Show/hide
Query:  LFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKGRRVNSLISSNGH-WNKDILENNFLPGH---------------------------FSVKS
        L     +W VGNG+ I +  D W+ R    +           +V  LI  + H WN+ +++ NF P                             F+V+S
Subjt:  LFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKGRRVNSLISSNGH-WNKDILENNFLPGH---------------------------FSVKS

Query:  VYRLALNLS-QKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLT
         Y   L+ +     + SSN +  +  WK  W L    + K  +W      +PTR NL  R I  +P CLFC  + E  TH++W C  ++++W      L 
Subjt:  VYRLALNLS-QKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLT

Query:  DFLNFCREDRNPIECWKALTTHLKNVDLSKAITIMWSIWDARNKALKSG---HP
           +    D   +    A +TH ++ +L   IT  WSIW ARNK L  G   HP
Subjt:  DFLNFCREDRNPIECWKALTTHLKNVDLSKAITIMWSIWDARNKALKSG---HP

A0A2N9GJ35 Uncharacterized protein5.8e-2626.43Show/hide
Query:  FRQLFTSTFPNKEHMENLA---------NNIPTRVLFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKG-RRVNSLISSNG-HWNKDILENNF
        +R L    FPN   +E  +         +    + +  +G RWRVGNG+ I I  D W+     ++V+  +  L     V+ LI ++   W+ ++L+  F
Subjt:  FRQLFTSTFPNKEHMENLA---------NNIPTRVLFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKG-RRVNSLISSNG-HWNKDILENNF

Query:  LP---------------------------GHFSVKSVYRLALNLSQKDEASSSNL--LQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREID
        LP                           G FSV+S Y + LN SQ  EA SS+    Q K+ W   W     P+ K  +W   + I+PT+  L  + I 
Subjt:  LP---------------------------GHFSVKSVYRLALNLSQKDEASSSNL--LQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREID

Query:  TNPLCLFCRKKWENATHVIWGCKFSKSLWTD-----------YFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAITIMWSIWDARNKALKSGHPP
            CL+C  + E   H++WGC+F++ +W +             PF T+F++ C ED             L +  L  A T  W++W ARN    +    
Subjt:  TNPLCLFCRKKWENATHVIWGCKFSKSLWTD-----------YFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAITIMWSIWDARNKALKSGHPP

Query:  NKEDITKRIELHTLD-REFRPQIGSLDKSSKNQMSHRHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLA
        N  +I +      LD  E R Q   L  S+ + +    W  P  G +K+N            GLG  +RDS G  +      I  + ++     +A LLA
Subjt:  NKEDITKRIELHTLD-REFRPQIGSLDKSSKNQMSHRHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLA

Query:  V
        +
Subjt:  V

A0A2N9GJ35 Uncharacterized protein7.2e-6926.43Show/hide
Query:  KLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEE
        K+KLT+ E+  I+ I   +   A +  +++ + R L+ +P      KT +   W +  +V +   G       FR+A   + + E  PW FD  LLL   
Subjt:  KLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEE

Query:  PRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDFC
               +   F +A FW+ +  +P      +  + +G  IG FI+VD       +  NLRI+V + ++KPL+RG  V +    + VW+   YE+L  FC
Subjt:  PRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDFC

Query:  YKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRS-----------REDESKADI
        ++CG +GH    C+   Q         E+   +PY + +  G          R   +       + +       P+S  S           + + S +D 
Subjt:  YKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRS-----------REDESKADI

Query:  NLNLPKQNVSED-----SPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAAND-TEMTDPIGEKVKPLTSDRETKSSGNTILHQ
        + +LP  + SED     + +LT         I   T        T+G D        LP+     +AN      + I  K K L S + T+  GN +   
Subjt:  NLNLPKQNVSED-----SPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAAND-TEMTDPIGEKVKPLTSDRETKSSGNTILHQ

Query:  YQPIHTKQEIEHTQGKGKQMA---------------------ELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASIN
          PI       + QG G   A                     +L+VK  +++       +     S+  G SGGL L WK S+++ + +FS  H+D  ++
Subjt:  YQPIHTKQEIEHTQGKGKQMA---------------------ELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASIN

Query:  QDWS--WRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIK
         D +  WR TGFYGHP+P +RH++WKLL  L  +N +PWL  GDFNEIL++ EK G   +   Q+ +F + I  C L+D G+RG  +TW   ++    ++
Subjt:  QDWS--WRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIK

Query:  ERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK--LRFEASWTNFGKCSDIVQSCWH-KNYTGRDL--LNQKVSHSIRQL
        ERLDR LA    +   G     H+    SDH  I   + +  P      P+RK   RFE  W+   +C  ++ + W     TG  +  + +K+    + L
Subjt:  ERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK--LRFEASWTNFGKCSDIVQSCWH-KNYTGRDL--LNQKVSHSIRQL

Query:  QAWNTERLKGSIKGAIDRKAKDLATLENQQIP---NQDIILKKNKIEGLF--DSLGTWVVREDDMGVVAGDYFRQLFTSTFPNKEHMENLANNIPTRVLF
          W      GS+K  +D+K  ++  L +  +    NQ I   K +I  L   D L  W  R   + + +GD   + F      +     +   + +  L+
Subjt:  QAWNTERLKGSIKGAIDRKAKDLATLENQQIP---NQDIILKKNKIEGLF--DSLGTWVVREDDMGVVAGDYFRQLFTSTFPNKEHMENLANNIPTRVLF

Query:  SE
         E
Subjt:  SE

A0A2N9I921 Reverse transcriptase domain-containing protein7.7e-7127.38Show/hide
Query:  INNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAI
        +  + +T  P +W  +   S+++ G NT L +F +  D +R+  + PW +DK ++LF+    +       F     WV LH LP+    R+ A  +G+ I
Subjt:  INNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAI

Query:  GEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDFCYKCGKIGHVFKDCDLFAQ----DSEDELLFSENLR---EIP
        G+ I   S ++        RIKVR+D+ +PL RG  VK+G      WI   YE+LP+FCY+CG + H  KDC   ++     S D+  F   LR   E  
Subjt:  GEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPDFCYKCGKIGHVFKDCDLFAQ----DSEDELLFSENLR---EIP

Query:  YNKS-INRGGKEEETPRPIRGRGRGRGTGRAMEA-RKEYQVDPSSGRSREDESKADINLNLPK-------QNVSEDSPSLTLEENAHQTNIEEKTSTPTA
        + KS +   G+     RP +     +    A ++ RK    + S   + E++    ++  LP        +N+ E   +L    NA   NI      PT 
Subjt:  YNKS-INRGGKEEETPRPIRGRGRGRGTGRAMEA-RKEYQVDPSSGRSREDESKADINLNLPK-------QNVSEDSPSLTLEENAHQTNIEEKTSTPTA

Query:  KTS-----------------TIGKDTPLSPQRLLPTKETFNAAN----------------------DTEMTDPIGEKVKPLTSDRETKSSGNTILHQYQP
        + S                 TI  + P+ P R +P ++  N +                         E+T P    V P+    E +       H+   
Subjt:  KTS-----------------TIGKDTPLSPQRLLPTKETFNAAN----------------------DTEMTDPIGEKVKPLTSDRETKSSGNTILHQYQP

Query:  IHTKQEIEHTQGKGKQMAELNVKTWK-----RIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASIN--QDWSWRFTGFYGHPDPQ
        +   QE+     +    A   ++TW       + R   H  +   + + N   GGL LFWK +L L I+S+S  HID  ++      WRFT FYG P+  
Subjt:  IHTKQEIEHTQGKGKQMAELNVKTWK-----RIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASIN--QDWSWRFTGFYGHPDPQ

Query:  QRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNM
        +R  SW LL  L  Q + PW  GGDFNEI+   EK+G  ++  SQ+ +F EA++ C  +D G+ G  FTW   +    T+ ERLDR +A+ A + +    
Subjt:  QRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNM

Query:  RVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK-LRFEASWTNFGKCSDIVQSCWHKNYTGRDLLN--QKVSHSIRQLQAWNTERLKGSIKGAIDRKAK
        RV HL Y  SDH+ +   W  L+PT +R+ P  K  RFE  W     C++ + + W     G  +     K++H   QL+ W+     GS++  +  K +
Subjt:  RVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK-LRFEASWTNFGKCSDIVQSCWHKNYTGRDLLN--QKVSHSIRQLQAWNTERLKGSIKGAIDRKAK

Query:  DLATLENQQIPNQ------------DIILKK----------------------------------NKIEGLFDSLGTWVVREDDMGVVAGDYFRQLFTST
        +L   E + +  Q             I+L K                                  N I GL DS G W    D +  +   YF+ +F S+
Subjt:  DLATLENQQIPNQ------------DIILKK----------------------------------NKIEGLFDSLGTWVVREDDMGVVAGDYFRQLFTST

Query:  FPNKEHMENLANNIPTRV
         P+   ++ +   IPT +
Subjt:  FPNKEHMENLANNIPTRV

A0A2N9I921 Reverse transcriptase domain-containing protein6.2e-2825.55Show/hide
Query:  LFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLK-GRRVNSLISSNGH-WNKDILENNFLP---------------------------GHFSVK
        +  +G  WR+G+G    I +D W+   G  K++  +       +V+ LI+S    WN  ++E  FLP                           G +SV+
Subjt:  LFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLK-GRRVNSLISSNGH-WNKDILENNFLP---------------------------GHFSVK

Query:  SVYRLALNLSQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLT
        S Y L      +   SSSN    K +W + W+L   P+ K+ +W      +PT+ NL KR++  N  C  C    E+ +H IW C  +  +W        
Subjt:  SVYRLALNLSQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLT

Query:  DFLNFCREDRNPIECWKALTTHLKNVDLS-KAITIMWSIWDARNKALKSGHPPNKEDITKR-IELHTLDREFRPQIGSLDKSSKNQMSHRHWDPPPAGWW
        D+ +  +  R       +     + V++S +  TI WS+W  RNK   + +    ED   R I+L     EF  +  +  ++ K +     W PPP G +
Subjt:  DFLNFCREDRNPIECWKALTTHLKNVDLS-KAITIMWSIWDARNKALKSGHPPNKEDITKR-IELHTLDREFRPQIGSLDKSSKNQMSHRHWDPPPAGWW

Query:  KMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLAV------------------NLINHISKDLGEANSLVVAIEDVASTLG
        K N D    +++ + GLG  +RDSSG  I    Q I    ++ ++E  A   AV                   +I  I++++         I+D+  T  
Subjt:  KMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLAV------------------NLINHISKDLGEANSLVVAIEDVASTLG

Query:  KV---TFAWCPREKNTAAHKIARLPSSPGFWSDLQRSFIAEDDPVVWTHPLPPCIASV
        ++    F    RE N AAH +ARL                  D  VW   +PP I  V
Subjt:  KV---TFAWCPREKNTAAHKIARLPSSPGFWSDLQRSFIAEDDPVVWTHPLPPCIASV

A0A2N9I921 Reverse transcriptase domain-containing protein5.0e-7028.84Show/hide
Query:  KLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEE
        ++KL+E E  + I +R + +  + K  Q++ + + L++KP  +  FK  +  +W+  G V+I+S   N F+  F    D +RI    PW FDK L+    
Subjt:  KLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLFEE

Query:  PRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSM--AEEVWIPATYEKLPD
           + + +E  F +  FW+ + NLP+    R+  + +G  IG  +EVD  + G   G  LRI+V ID+ +PL+RG +++          W+   YE LP 
Subjt:  PRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSM--AEEVWIPATYEKLPD

Query:  FCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQNV
        FCY+CG++GH   +C                         + RGG+  E    + G   G    RA+ AR         G  + DE + + N+   ++  
Subjt:  FCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQNV

Query:  SEDSPSLTLEENA---------HQ-----TNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAANDTEMTDPIGEKVKPLTSDRETKSSGNTILHQY
        +E+ PS  +             H+       +E     P   +   GKD P    R L       + N   + +P  + V  L      K  G  I+   
Subjt:  SEDSPSLTLEENA---------HQ-----TNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAANDTEMTDPIGEKVKPLTSDRETKSSGNTILHQY

Query:  QPIHTKQEIEHTQGKGKQMAELNVKT--WKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASI--NQDWSWRFTGFYGHPDPQQ
        +                    LNV+   W R+           G+ +H G  GGL L W SS+ ++I S+S  HID  +  N    WR TGFYG+P+   
Subjt:  QPIHTKQEIEHTQGKGKQMAELNVKT--WKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASI--NQDWSWRFTGFYGHPDPQQ

Query:  RHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMR
        RH+SW LL  L+  ++ PW++ GDFNEI    EK G + RN +Q+  F EA+  C L D GF G +FTW   +E  + ++ RLDR +A+ A +    +  
Subjt:  RHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMR

Query:  VDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK---LRFEASWTNFGKCSDIVQSCWHKNYTGRDL--LNQKVSHSIRQLQAWNTERLKGSIKGAIDRKA
        ++HL   +SDH G+    R  T  P  HVPQRK    RFE SW     C +++Q  W     G  +  + QK+     +L  W+   ++ + K  ID K 
Subjt:  VDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK---LRFEASWTNFGKCSDIVQSCWHKNYTGRDL--LNQKVSHSIRQLQAWNTERLKGSIKGAIDRKA

Query:  KDLATL---ENQQIPNQDIILKKNKIEGLFDSLG-TWVVREDDMGVVAGDYFRQLF
        K L  L   E +   ++ I L K  + GL +     W  R   + +  GD   + F
Subjt:  KDLATL---ENQQIPNQDIILKKNKIEGLFDSLG-TWVVREDDMGVVAGDYFRQLF

A0A5C7H9Y2 CCHC-type domain-containing protein2.4e-7232.37Show/hide
Query:  KQMEKLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALL
        ++ EKL L +++   I  I+    +  +++L  + I +A+++K I    FK+ +  IW  + +V+++  G+N F   F++  D+KRI E GPW+FDK LL
Subjt:  KQMEKLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALL

Query:  LFEEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKL
        +  E   + + ++ +F+Y  FW+ LHNLPL    R+    LG  +G+  E+D+ + G C G  +RI+V IDV  PL RG  V +G   +   +   YE+L
Subjt:  LFEEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKL

Query:  PDFCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQ
        P+FCY CGKIGH+ +DC L    +  E+  S + +  P+ ++++              R R +GTG    + +  +   SS        K     N+ K 
Subjt:  PDFCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQ

Query:  N--VSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPL------SPQRLL--PTKETFNAANDTEMTDP-IGEKVKPLTSDRETKSSGNTILHQYQ
        +  +  D   L L       N  E T T  ++T  + K   L      S +++    T+ +        +T+P IGE V        ++ +G  I +Q +
Subjt:  N--VSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPL------SPQRLL--PTKETFNAANDTEMTDP-IGEKVKPLTSDRETKSSGNTILHQYQ

Query:  PIHTKQE-----IEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASINQDWS--WRFTGFYGHPDP
             +E      E  Q  GK+  +++++ +    RK     + + +    G  GGL L WK+ +++SI SF+ GHIDA I    S  WRFTGFYG P P
Subjt:  PIHTKQE-----IEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASINQDWS--WRFTGFYGHPDP

Query:  QQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDR
          R  SW LL RL   +N PW+V GDFNEIL   EK+GG  R+ + + +F EA++ C L+D G+ G+K+TW   + K   I+ER+DR
Subjt:  QQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDR

A0A803NQ77 Uncharacterized protein6.1e-7623.59Show/hide
Query:  LKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIIN-NIFKTIMPRIWN--LEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLF
        L L+E E+  +  + + DL   ++  +   + R LSS  I+N   F   M   W+     +V +     + FL     A DK+RI+   PW F   L+L 
Subjt:  LKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIIN-NIFKTIMPRIWN--LEGKVSIQSRGLNTFLCHFRSAKDKKRITEDGPWIFDKALLLF

Query:  EEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPD
          P      ++ +F++A FWV  H LP +   R  AK +G  +GEF+EV  D      G  LR +VR+DV +PL+RG MV +  + +E W+   YE LP 
Subjt:  EEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEKLPD

Query:  FCYKCGKIGHVFKDC----DLFAQDSEDELLFSENL--REIP---YNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADI
        FC+ CGKIGH F  C    +L     + +LL+   +   ++P   Y++      K    P   R   R   T     A     +   S      + K+  
Subjt:  FCYKCGKIGHVFKDC----DLFAQDSEDELLFSENL--REIP---YNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADI

Query:  NLNLPKQNVSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRL-LPTKETFN--AANDTEMTDPIGEKVKPLTSD---RETKSSGNTILHQ
        N  L +++  E+S      E          ++ P A TS I + TP   + L L   ET +  ++  TE +     K K +  D    E      T   Q
Subjt:  NLNLPKQNVSEDSPSLTLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRL-LPTKETFN--AANDTEMTDPIGEKVKPLTSD---RETKSSGNTILHQ

Query:  YQPIHTKQEIEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHN-------------------------------------GTSGGLMLFWKSSLK
          P + +  ++  +   K+  + +  + + ++  D H+ S +GLS  +                                     G  GGLML W +++ 
Subjt:  YQPIHTKQEIEHTQGKGKQMAELNVKTWKRIARKDTHAHSTNGLSQHN-------------------------------------GTSGGLMLFWKSSLK

Query:  LSINSFSTGHIDASINQD--WSWRFTGFYGHPDPQQRHQSWKLLERLKDQN-NSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFR
        +++N++S  H D  ++ D    + FTGFYG P+   R  SW  L  L     N PWLV GDFNE+L+  +K+GG  RN + + NF   I+ C L    F 
Subjt:  LSINSFSTGHIDASINQD--WSWRFTGFYGHPDPQQRHQSWKLLERLKDQN-NSPWLVGGDFNEILNRREKEGGKTRNTSQIRNFEEAINRCQLLDPGFR

Query:  GDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK----LRFEASWTNFGKCSDIVQSCWHKNYT
        GD FTW       N I+ERLDR   N    D      + HL +++SDHR I          P++  P +K     RFE  W     CS I+ + W+ + T
Subjt:  GDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRK----LRFEASWTNFGKCSDIVQSCWHKNYT

Query:  GRDL--LNQKVSHSIRQLQAWNTER---LKGSIKGAIDRKA-----KDLATLENQQIPNQDIILKK----------------------------------
         + L  L Q ++     LQ W+  +   +   I  A    A     K  AT   QQ+ N D IL                                    
Subjt:  GRDL--LNQKVSHSIRQLQAWNTER---LKGSIKGAIDRKA-----KDLATLENQQIPNQDIILKK----------------------------------

Query:  ----NKIEGLFDSLGTWVVREDDMGVVAGDYFRQLFTST-------------------------------------------------------------
            NKI+ L  S G +V  E+++      YF  LF+S                                                              
Subjt:  ----NKIEGLFDSLGTWVVREDDMGVVAGDYFRQLFTST-------------------------------------------------------------

Query:  -FPNKEH-----------------MENLANNIPT------------------------------------------------------------------
          P  +H                 ++ +  +IPT                                                                  
Subjt:  -FPNKEH-----------------MENLANNIPT------------------------------------------------------------------

Query:  ----------------RVLFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKGR---RVNSLISSNGHWNKDILENNF----------LP----
                        R L  +G RW++G+G+++    DPW+         MT     G     V   IS +  W+  IL+ +F          +P    
Subjt:  ----------------RVLFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKGR---RVNSLISSNGHWNKDILENNF----------LP----

Query:  -------------GHFSVKSVYRLALNLSQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHV
                     G +SVKS Y+LA +L  + E SSS+   +++ W R W+L    + K  +W  I   +PT  NL+ R+I ++  C  C+  W  + H 
Subjt:  -------------GHFSVKSVYRLALNLSQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHV

Query:  IWGCKFSKSLWTD-----YFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAITIMWSIWDARNKALKSGHPPNKEDITKRIELHTLDREFRPQIGS
        I+ CK +K++W       Y P + +   +        + +  +     +++L + + +MW+IW  RNK    G  P   D+        LD+  +     
Subjt:  IWGCKFSKSLWTD-----YFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAITIMWSIWDARNKALKSGHPPNKEDITKRIELHTLDREFRPQIGS

Query:  LDKSSKNQMSH--------------RHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSI
           +  +   H              + W  PPAG +K+N DA   +     G G  +RD  G  +
Subjt:  LDKSSKNQMSH--------------RHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein6.1e-0431.58Show/hide
Query:  QRHQSWKLLERLKDQN---NSPWLVGGDFNEILNRREKEGGKTRNTS--QIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLAN
        +R   W  + RL   +   NSPWLV GDFN+I +  E       N S   + + +  +    L+D   RG  +TW    ++ N I  +LDR + N
Subjt:  QRHQSWKLLERLKDQN---NSPWLVGGDFNEILNRREKEGGKTRNTS--QIRNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLAN

AT2G02650.1 Ribonuclease H-like superfamily protein2.7e-1522.19Show/hide
Query:  EVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLTDF----LNFCREDRNPIECWKALT
        EV +  W L   P+ K  +W  +   + T   L  R ID +P+C  C  + E   H+++ C +++S+W      + +      +F       I+  K  T
Subjt:  EVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLTDF----LNFCREDRNPIECWKALT

Query:  THLKNVDLSKAITIMWSIWDARN------KALKSGHPPNK--EDITKRIELHTLDREFRPQIGSLDKSSKNQMSHRHWDPPPAGWWKMNSDATWLEEARQ
        T+  ++D      IMW +W +RN      K     +   K  +D T+ +  +         + + +    ++     W+PPP GW K N D+ + + +  
Subjt:  THLKNVDLSKAITIMWSIWDARN------KALKSGHPPNK--EDITKRIELHTLDREFRPQIGSLDKSSKNQMSHRHWDPPPAGWWKMNSDATWLEEARQ

Query:  GGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLAVNLI----------NHISKDL------GEANSLV-VAIEDVASTLGKVTFA---WCPREKN
           GW++R+ +G  +  G   +++       E    L A+ +I             SK L      GE +SL+   I D+   + K+ +    +  RE+N
Subjt:  GGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLAVNLI----------NHISKDL------GEANSLV-VAIEDVASTLGKVTFA---WCPREKN

Query:  TAAHKIARLPSSPGFWSDLQRSFIAEDDPVVWTHPLPP
        +AA  +A              S +   DP+  ++  PP
Subjt:  TAAHKIARLPSSPGFWSDLQRSFIAEDDPVVWTHPLPP

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.5e-1022.79Show/hide
Query:  LSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSK----AITIMWSIWDARNKALKSGHPP
        ++ R +     C+ C    E   H+++ C F++ +W      +  +      D      +  L   ++   L K       ++W +W +RN+ +  G   
Subjt:  LSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSK----AITIMWSIWDARNKALKSGHPP

Query:  NKEDITKRIELHTLDREFRPQIGSLDKSSKNQMSHR---HWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAIL
        +  ++ +R      +   R ++    K+S  Q+       W  PP  W K N+DATW  E  + G+GW +R+ SG  + +G + +     +   EL+A+ 
Subjt:  NKEDITKRIELHTLDREFRPQIGSLDKSSKNQMSHR---HWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAIL

Query:  LAV------------------NLINHISKDLGEANSLVVAIEDVASTL---GKVTFAWCPREKNTAAHKIAR
         AV                   L+N ++ D     +L  A+ED+   L    +V F + PR  N  A +IAR
Subjt:  LAV------------------NLINHISKDLGEANSLVVAIEDVASTL---GKVTFAWCPREKNTAAHKIAR

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.6e-0532.2Show/hide
Query:  WNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSK
        W+L   P+ K  +W  + N +P    L  R I   P C  CR  +E  TH+++ C F++
Subjt:  WNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSK

AT4G29090.1 Ribonuclease H-like superfamily protein6.3e-2523.17Show/hide
Query:  LFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTK--------DVLKGRRVNSLISSNG-HWNKDILENNF---------------------------L
        +  +G R  VGNG+ III    W+D +     L  +         V    +V+ LI  +G  W KD++E  F                            
Subjt:  LFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTK--------DVLKGRRVNSLISSNG-HWNKDILENNF---------------------------L

Query:  PGHFSVKSVYRLALNL----SQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSK
         G ++VKS Y +   +    S   E S  +L     ++++ W   + P+ +  +W  + N +P    L+ R +     C+ C    E   H+++ C F++
Subjt:  PGHFSVKSVYRLALNL----SQKDEASSSNLLQHKEVWKRFWNLDSIPRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSK

Query:  SLWTDYFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAIT----IMWSIWDARNKALKSGHPPNKEDITKRIELHTLDREFRPQIGSL-DKSSKNQ
          W      +   L     D   +  +        N    KA      ++W +W  RN+ +  G   N +++ +R E    +   R +  S   K   N+
Subjt:  SLWTDYFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAIT----IMWSIWDARNKALKSGHPPNKEDITKRIELHTLDREFRPQIGSL-DKSSKNQ

Query:  MSHRHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLAVNLINHISKD--LGEANSLVV------------
         S   W PPP  W K N+DATW  +  + G+GW +R+  G    +G + +    ++   EL+A+  AV  ++    +  + E++S V+            
Subjt:  MSHRHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLAVNLINHISKD--LGEANSLVV------------

Query:  ---AIEDVASTLGKVT---FAWCPREKNTAAHKIAR
            I+D+   L + T   F + PRE NT A ++AR
Subjt:  ---AIEDVASTLGKVT---FAWCPREKNTAAHKIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTAAATCACTCGCCTCCAATTATTGATGTACTAAAAAAAAAAAAATGTGTGAGAGACAAGCAAATGGAAAAATTGAAACTCACAGAAAACGAAAAAGCAAAAAT
CATCGACATAAGAGACGAAGACCTAAAAGCTGCTGACAAAAATCTTCAAAATGCTTCCATTTGCAGAGCCTTGTCTTCAAAACCAATCATCAACAATATCTTCAAAACTA
TTATGCCAAGAATATGGAACCTAGAAGGGAAGGTGTCAATACAAAGCAGAGGCTTAAACACCTTCCTTTGCCACTTTAGGAGCGCAAAGGACAAAAAAAGGATTACAGAG
GACGGACCGTGGATTTTTGACAAAGCCCTCCTACTGTTCGAAGAACCAAGAAGGAACTGTCGGGGGTCAGAATGGGAGTTCAAATACGCAAATTTCTGGGTCCACTTACA
TAACTTACCGTTGATCTTTTTTTGTAGGAAATGGGCCAAAGTCCTCGGAAATGCGATAGGAGAATTCATAGAAGTCGATTCCGACGATCAAGGAAACTGCGAAGGATTGA
ACCTGAGAATCAAAGTCAGAATTGATGTCAACAAGCCGCTGATCAGGGGCACTATGGTTAAAATTGGCTCCATGGCAGAAGAAGTGTGGATTCCGGCCACCTACGAAAAA
CTACCGGACTTCTGTTATAAATGTGGTAAAATAGGACATGTTTTCAAAGATTGTGATCTTTTTGCCCAGGATTCGGAAGATGAACTTCTTTTCAGCGAGAATCTGAGAGA
AATACCTTATAACAAAAGTATAAATAGAGGAGGGAAGGAAGAAGAAACCCCAAGGCCTATCAGAGGCAGGGGACGAGGTAGGGGAACAGGCAGAGCGATGGAAGCAAGAA
AAGAGTACCAAGTTGATCCCAGTAGCGGGCGGAGCAGAGAGGACGAAAGCAAGGCTGACATAAATTTGAACCTCCCCAAGCAAAATGTTTCAGAAGATTCCCCATCCTTA
ACCCTCGAGGAAAACGCTCACCAAACGAACATCGAAGAAAAGACCTCAACCCCAACTGCAAAGACCAGTACAATAGGAAAAGATACTCCCCTAAGCCCTCAACGACTCCT
ACCCACAAAAGAAACTTTCAACGCTGCAAATGATACTGAAATGACAGACCCCATAGGAGAAAAGGTCAAACCCCTTACCTCAGACAGAGAGACAAAATCCTCAGGAAACA
CTATCCTTCACCAATATCAGCCAATTCATACAAAGCAAGAAATAGAACATACTCAGGGAAAAGGAAAACAAATGGCAGAGCTAAATGTTAAAACCTGGAAAAGGATAGCA
AGGAAAGACACACATGCTCACAGTACAAATGGGTTATCACAGCACAATGGGACTAGCGGCGGTCTTATGCTCTTTTGGAAGAGTTCTCTGAAACTCTCTATAAACTCCTT
CTCCACGGGGCATATCGATGCTTCTATCAATCAGGACTGGTCCTGGAGATTCACTGGCTTCTATGGTCACCCTGACCCTCAACAAAGGCACCAGTCATGGAAGCTTCTTG
AGAGACTAAAGGATCAAAATAACTCCCCTTGGCTCGTAGGAGGAGATTTCAATGAAATTCTCAATAGGAGGGAGAAGGAAGGTGGGAAAACAAGAAATACCTCTCAAATT
AGGAACTTTGAGGAGGCCATTAACAGATGTCAGCTCCTTGATCCAGGCTTCAGAGGGGACAAATTCACCTGGAAAAGAGGAAAAGAAAAAACCAATACAATTAAGGAAAG
GCTTGATAGATTCCTAGCCAACAAGGCCTTGATTGACAAGATTGGTAATATGAGGGTTGATCACCTAAGCTACCACAATTCTGATCATCGGGGCATCACCGCAGCTTGGA
GAGAGCTCACCCCTACCCCTTCCAGACATGTCCCTCAAAGAAAGCTGAGATTTGAAGCTAGCTGGACAAATTTTGGAAAGTGCTCGGACATTGTTCAAAGTTGTTGGCAC
AAGAATTACACAGGAAGAGATCTCCTCAATCAAAAAGTGTCCCATAGTATCCGCCAGTTACAAGCTTGGAATACCGAGAGGCTCAAAGGCTCAATCAAGGGAGCTATAGA
TAGGAAAGCCAAAGATCTAGCCACTTTGGAAAATCAACAAATCCCCAACCAAGATATCATTTTGAAAAAGAACAAAATTGAAGGCCTTTTTGATTCCCTTGGCACTTGGG
TTGTAAGGGAAGACGATATGGGGGTCGTTGCAGGCGATTACTTCAGACAACTTTTCACCTCAACCTTCCCCAACAAGGAACACATGGAGAACTTAGCCAACAACATTCCC
ACCAGAGTCTTATTCTCTGAAGGTTACAGATGGAGAGTGGGAAATGGTAAGCACATTATCATTGACGAAGACCCCTGGATCGATAGGGAAGGTTGTTTCAAAGTTCTAAT
GACAAAGGATGTTCTCAAAGGAAGGCGAGTCAACTCTCTCATTTCTAGTAATGGCCATTGGAACAAGGATATCCTGGAAAACAACTTTCTTCCTGGGCATTTTTCGGTAA
AAAGCGTTTATCGCCTTGCTTTGAACCTTTCCCAAAAGGATGAGGCTTCCTCCTCAAACCTTTTACAGCATAAGGAAGTCTGGAAGAGGTTCTGGAATCTGGATTCTATC
CCGAGGCACAAGACAACCGTCTGGGCCATAATACAAAACATCATTCCTACTCGTCAAAACCTCTCAAAAAGAGAAATCGATACTAACCCTCTGTGTTTATTTTGCAGGAA
AAAATGGGAGAATGCTACACATGTCATTTGGGGTTGTAAGTTTTCTAAGAGCTTGTGGACAGACTACTTCCCTTTTCTAACTGACTTTCTTAATTTTTGCAGAGAAGACA
GGAACCCTATCGAATGTTGGAAAGCTCTCACAACTCACCTCAAGAATGTTGATCTAAGTAAAGCAATAACTATCATGTGGAGTATCTGGGACGCAAGGAATAAAGCATTA
AAGAGTGGCCATCCTCCTAACAAAGAAGACATCACAAAGCGAATTGAACTTCATACCTTGGATCGCGAGTTCCGCCCTCAAATCGGCTCTCTGGACAAATCTTCGAAGAA
CCAAATGAGTCACAGACATTGGGATCCTCCCCCGGCTGGTTGGTGGAAGATGAATTCCGATGCGACCTGGCTTGAAGAAGCACGCCAAGGAGGCTTAGGGTGGTCTGTCC
GTGACTCTTCGGGTTCTTCGATCTGTGTCGGCACTCAATTGATCAAAACAGATTGGACCATCAAAATTCTGGAATTGAAAGCCATTCTTTTGGCAGTCAATCTGATCAAC
CACATATCCAAAGACCTAGGTGAAGCCAATTCCTTGGTTGTTGCTATCGAAGATGTAGCCTCCACCTTGGGCAAAGTGACCTTTGCTTGGTGCCCCCGGGAGAAGAACAC
GGCGGCTCACAAGATCGCTAGGCTTCCTTCCTCCCCTGGTTTTTGGTCGGATCTTCAACGATCCTTTATTGCGGAAGATGATCCAGTAGTTTGGACTCACCCACTTCCCC
CGTGTATCGCCTCTGTCATAAATGAGGCTGATGATTTTGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTAAATCACTCGCCTCCAATTATTGATGTACTAAAAAAAAAAAAATGTGTGAGAGACAAGCAAATGGAAAAATTGAAACTCACAGAAAACGAAAAAGCAAAAAT
CATCGACATAAGAGACGAAGACCTAAAAGCTGCTGACAAAAATCTTCAAAATGCTTCCATTTGCAGAGCCTTGTCTTCAAAACCAATCATCAACAATATCTTCAAAACTA
TTATGCCAAGAATATGGAACCTAGAAGGGAAGGTGTCAATACAAAGCAGAGGCTTAAACACCTTCCTTTGCCACTTTAGGAGCGCAAAGGACAAAAAAAGGATTACAGAG
GACGGACCGTGGATTTTTGACAAAGCCCTCCTACTGTTCGAAGAACCAAGAAGGAACTGTCGGGGGTCAGAATGGGAGTTCAAATACGCAAATTTCTGGGTCCACTTACA
TAACTTACCGTTGATCTTTTTTTGTAGGAAATGGGCCAAAGTCCTCGGAAATGCGATAGGAGAATTCATAGAAGTCGATTCCGACGATCAAGGAAACTGCGAAGGATTGA
ACCTGAGAATCAAAGTCAGAATTGATGTCAACAAGCCGCTGATCAGGGGCACTATGGTTAAAATTGGCTCCATGGCAGAAGAAGTGTGGATTCCGGCCACCTACGAAAAA
CTACCGGACTTCTGTTATAAATGTGGTAAAATAGGACATGTTTTCAAAGATTGTGATCTTTTTGCCCAGGATTCGGAAGATGAACTTCTTTTCAGCGAGAATCTGAGAGA
AATACCTTATAACAAAAGTATAAATAGAGGAGGGAAGGAAGAAGAAACCCCAAGGCCTATCAGAGGCAGGGGACGAGGTAGGGGAACAGGCAGAGCGATGGAAGCAAGAA
AAGAGTACCAAGTTGATCCCAGTAGCGGGCGGAGCAGAGAGGACGAAAGCAAGGCTGACATAAATTTGAACCTCCCCAAGCAAAATGTTTCAGAAGATTCCCCATCCTTA
ACCCTCGAGGAAAACGCTCACCAAACGAACATCGAAGAAAAGACCTCAACCCCAACTGCAAAGACCAGTACAATAGGAAAAGATACTCCCCTAAGCCCTCAACGACTCCT
ACCCACAAAAGAAACTTTCAACGCTGCAAATGATACTGAAATGACAGACCCCATAGGAGAAAAGGTCAAACCCCTTACCTCAGACAGAGAGACAAAATCCTCAGGAAACA
CTATCCTTCACCAATATCAGCCAATTCATACAAAGCAAGAAATAGAACATACTCAGGGAAAAGGAAAACAAATGGCAGAGCTAAATGTTAAAACCTGGAAAAGGATAGCA
AGGAAAGACACACATGCTCACAGTACAAATGGGTTATCACAGCACAATGGGACTAGCGGCGGTCTTATGCTCTTTTGGAAGAGTTCTCTGAAACTCTCTATAAACTCCTT
CTCCACGGGGCATATCGATGCTTCTATCAATCAGGACTGGTCCTGGAGATTCACTGGCTTCTATGGTCACCCTGACCCTCAACAAAGGCACCAGTCATGGAAGCTTCTTG
AGAGACTAAAGGATCAAAATAACTCCCCTTGGCTCGTAGGAGGAGATTTCAATGAAATTCTCAATAGGAGGGAGAAGGAAGGTGGGAAAACAAGAAATACCTCTCAAATT
AGGAACTTTGAGGAGGCCATTAACAGATGTCAGCTCCTTGATCCAGGCTTCAGAGGGGACAAATTCACCTGGAAAAGAGGAAAAGAAAAAACCAATACAATTAAGGAAAG
GCTTGATAGATTCCTAGCCAACAAGGCCTTGATTGACAAGATTGGTAATATGAGGGTTGATCACCTAAGCTACCACAATTCTGATCATCGGGGCATCACCGCAGCTTGGA
GAGAGCTCACCCCTACCCCTTCCAGACATGTCCCTCAAAGAAAGCTGAGATTTGAAGCTAGCTGGACAAATTTTGGAAAGTGCTCGGACATTGTTCAAAGTTGTTGGCAC
AAGAATTACACAGGAAGAGATCTCCTCAATCAAAAAGTGTCCCATAGTATCCGCCAGTTACAAGCTTGGAATACCGAGAGGCTCAAAGGCTCAATCAAGGGAGCTATAGA
TAGGAAAGCCAAAGATCTAGCCACTTTGGAAAATCAACAAATCCCCAACCAAGATATCATTTTGAAAAAGAACAAAATTGAAGGCCTTTTTGATTCCCTTGGCACTTGGG
TTGTAAGGGAAGACGATATGGGGGTCGTTGCAGGCGATTACTTCAGACAACTTTTCACCTCAACCTTCCCCAACAAGGAACACATGGAGAACTTAGCCAACAACATTCCC
ACCAGAGTCTTATTCTCTGAAGGTTACAGATGGAGAGTGGGAAATGGTAAGCACATTATCATTGACGAAGACCCCTGGATCGATAGGGAAGGTTGTTTCAAAGTTCTAAT
GACAAAGGATGTTCTCAAAGGAAGGCGAGTCAACTCTCTCATTTCTAGTAATGGCCATTGGAACAAGGATATCCTGGAAAACAACTTTCTTCCTGGGCATTTTTCGGTAA
AAAGCGTTTATCGCCTTGCTTTGAACCTTTCCCAAAAGGATGAGGCTTCCTCCTCAAACCTTTTACAGCATAAGGAAGTCTGGAAGAGGTTCTGGAATCTGGATTCTATC
CCGAGGCACAAGACAACCGTCTGGGCCATAATACAAAACATCATTCCTACTCGTCAAAACCTCTCAAAAAGAGAAATCGATACTAACCCTCTGTGTTTATTTTGCAGGAA
AAAATGGGAGAATGCTACACATGTCATTTGGGGTTGTAAGTTTTCTAAGAGCTTGTGGACAGACTACTTCCCTTTTCTAACTGACTTTCTTAATTTTTGCAGAGAAGACA
GGAACCCTATCGAATGTTGGAAAGCTCTCACAACTCACCTCAAGAATGTTGATCTAAGTAAAGCAATAACTATCATGTGGAGTATCTGGGACGCAAGGAATAAAGCATTA
AAGAGTGGCCATCCTCCTAACAAAGAAGACATCACAAAGCGAATTGAACTTCATACCTTGGATCGCGAGTTCCGCCCTCAAATCGGCTCTCTGGACAAATCTTCGAAGAA
CCAAATGAGTCACAGACATTGGGATCCTCCCCCGGCTGGTTGGTGGAAGATGAATTCCGATGCGACCTGGCTTGAAGAAGCACGCCAAGGAGGCTTAGGGTGGTCTGTCC
GTGACTCTTCGGGTTCTTCGATCTGTGTCGGCACTCAATTGATCAAAACAGATTGGACCATCAAAATTCTGGAATTGAAAGCCATTCTTTTGGCAGTCAATCTGATCAAC
CACATATCCAAAGACCTAGGTGAAGCCAATTCCTTGGTTGTTGCTATCGAAGATGTAGCCTCCACCTTGGGCAAAGTGACCTTTGCTTGGTGCCCCCGGGAGAAGAACAC
GGCGGCTCACAAGATCGCTAGGCTTCCTTCCTCCCCTGGTTTTTGGTCGGATCTTCAACGATCCTTTATTGCGGAAGATGATCCAGTAGTTTGGACTCACCCACTTCCCC
CGTGTATCGCCTCTGTCATAAATGAGGCTGATGATTTTGGCTGA
Protein sequenceShow/hide protein sequence
MSLNHSPPIIDVLKKKKCVRDKQMEKLKLTENEKAKIIDIRDEDLKAADKNLQNASICRALSSKPIINNIFKTIMPRIWNLEGKVSIQSRGLNTFLCHFRSAKDKKRITE
DGPWIFDKALLLFEEPRRNCRGSEWEFKYANFWVHLHNLPLIFFCRKWAKVLGNAIGEFIEVDSDDQGNCEGLNLRIKVRIDVNKPLIRGTMVKIGSMAEEVWIPATYEK
LPDFCYKCGKIGHVFKDCDLFAQDSEDELLFSENLREIPYNKSINRGGKEEETPRPIRGRGRGRGTGRAMEARKEYQVDPSSGRSREDESKADINLNLPKQNVSEDSPSL
TLEENAHQTNIEEKTSTPTAKTSTIGKDTPLSPQRLLPTKETFNAANDTEMTDPIGEKVKPLTSDRETKSSGNTILHQYQPIHTKQEIEHTQGKGKQMAELNVKTWKRIA
RKDTHAHSTNGLSQHNGTSGGLMLFWKSSLKLSINSFSTGHIDASINQDWSWRFTGFYGHPDPQQRHQSWKLLERLKDQNNSPWLVGGDFNEILNRREKEGGKTRNTSQI
RNFEEAINRCQLLDPGFRGDKFTWKRGKEKTNTIKERLDRFLANKALIDKIGNMRVDHLSYHNSDHRGITAAWRELTPTPSRHVPQRKLRFEASWTNFGKCSDIVQSCWH
KNYTGRDLLNQKVSHSIRQLQAWNTERLKGSIKGAIDRKAKDLATLENQQIPNQDIILKKNKIEGLFDSLGTWVVREDDMGVVAGDYFRQLFTSTFPNKEHMENLANNIP
TRVLFSEGYRWRVGNGKHIIIDEDPWIDREGCFKVLMTKDVLKGRRVNSLISSNGHWNKDILENNFLPGHFSVKSVYRLALNLSQKDEASSSNLLQHKEVWKRFWNLDSI
PRHKTTVWAIIQNIIPTRQNLSKREIDTNPLCLFCRKKWENATHVIWGCKFSKSLWTDYFPFLTDFLNFCREDRNPIECWKALTTHLKNVDLSKAITIMWSIWDARNKAL
KSGHPPNKEDITKRIELHTLDREFRPQIGSLDKSSKNQMSHRHWDPPPAGWWKMNSDATWLEEARQGGLGWSVRDSSGSSICVGTQLIKTDWTIKILELKAILLAVNLIN
HISKDLGEANSLVVAIEDVASTLGKVTFAWCPREKNTAAHKIARLPSSPGFWSDLQRSFIAEDDPVVWTHPLPPCIASVINEADDFG