; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005928 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005928
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold11:3141213..3163210
RNA-Seq ExpressionSpg005928
SyntenySpg005928
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH80348.1 hypothetical protein [Trifolium medium]1.1e-10030.21Show/hide
Query:  KLTEEEAEKVKPALSEGSRELC---------VVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRSLVILK
        ++++  A+    A++    E+C         +VGK+ +    N+ AF+  M  AWR+     I+    NL+  +F +K E   V  +GPW+FDR+L+IL 
Subjt:  KLTEEEAEKVKPALSEGSRELC---------VVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRSLVILK

Query:  DAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYERLPNFC
                 ++     SFWV+++++P++ +++G+AK LG  +G  EE+   + +   G  LR+RV + + +PL+RG  + + +GKE W    YERLPNFC
Subjt:  DAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYERLPNFC

Query:  YECVCLGHSRRECGVVGE------SSSEQGGDQYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQVGE
        + C  +GH  R+C  + +      S  E+    +G WLRA     + S  V++E      ++S+  P+      Q S   KE           ++ +V +
Subjt:  YECVCLGHSRRECGVVGE------SSSEQGGDQYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQVGE

Query:  ASTEVAPQSGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVERAGKTGSSRGWKRR-----------------ARDALAD
                S +   KG     A     KQ+     VE  A  +G   ++     G G++ V    K G  R W R+                  + +L D
Subjt:  ASTEVAPQSGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVERAGKTGSSRGWKRR-----------------ARDALAD

Query:  ISNREVPMESVGGKRKGEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNG----RSGGLALLWTT
        +   +  ME++ G  K   G++ ++        GS RA R L +L +   PQ+VFL ET+++ T  + ++ +LG+  C  VD NG    R+GGLAL+W  
Subjt:  ISNREVPMESVGGKRKGEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNG----RSGGLALLWTT

Query:  EVKFSLISYSSNHIDGWVDGGAS--PWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDL
         +  ++ S+S NHI G  +   S   W  T +YG+P+   K +TW L++ L   ++  WL  GDFN +L  HEK GG  +   +    R+AV    L+DL
Subjt:  EVKFSLISYSSNHIDGWVDGGAS--PWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDL

Query:  GYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVG
        G+ G  FTW N R   + V  R+DR MGN  + + FS  +V HL    SDH  +++ L     S     RR+ RFEE+W      +E + +NW  S    
Subjt:  GYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVG

Query:  SSTALALATKRCMNGLSKWGRP----GMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHAS
              L+  + +  L   G       +GN +  +    D+++       S+      +  E+    LL  +E  W+QRSR  WLK GDRNT++FHN AS
Subjt:  SSTALALATKRCMNGLSKWGRP----GMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHAS

Query:  SRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSH
         R   N +K ++DEDGVW    + +  +   YF++L++TS PS+
Subjt:  SRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSH

XP_023911628.1 uncharacterized protein LOC112023240 [Quercus suber]9.9e-9931.1Show/hide
Query:  MEDLV-SKWRSWKLTEEEAEKVKPALSEGSRE------LCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWA
        M+D+V  K    KLT EE E +     EG  E      L +VGK L+ K  N  A +N +R AW  + + +I   G NLF  +F+S+ +  R++  GPW+
Subjt:  MEDLV-SKWRSWKLTEEEAEKVKPALSEGSRE------LCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWA

Query:  FDRSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPV
        FD  L++L+  +   +V ++ F+  S WVQI + P    +  +AK +G ++G VEEV   +  +     +R+RV L + +P+RRG  +   +G + W   
Subjt:  FDRSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPV

Query:  LYERLPNFCYECVCLGHSRRECGVVGESSSEQGGD----QYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEV
         YERLP FC+ C  LGH  R C      S E+ GD    QYG++LRA      G  R    KV G    S        G+A   +P++ G+ +      +
Subjt:  LYERLPNFCYECVCLGHSRRECGVVGESSSEQGGD----QYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEV

Query:  LET--------QVGEASTEVAPQS---GEVGE-----KGPDEGMAN-KGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVE--RAGKTGSSRGWK
          +        + G  S+ +  ++   G V E     K      AN K      VG          + +D++  + + G  S   E      T  ++G  
Subjt:  LET--------QVGEASTEVAPQS---GEVGE-----KGPDEGMAN-KGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVE--RAGKTGSSRGWK

Query:  RRARDALADISNREVPMESVGGKRKGEEGELPIQGGKKA---RGLG---SARAF---------RRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYP
        R     L+   N    ++  G   +    +L ++G  KA    GLG   S  A+         R L K+V++  P + FL ET++    F+ L   L + 
Subjt:  RRARDALADISNREVPMESVGGKRKGEEGELPIQGGKKA---RGLG---SARAF---------RRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYP

Query:  CCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHI-------DGWVDGGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKD
            V      GGLA LW  EV+  +I++++NH+       DG+V      W  T  YG P  + K  +W L+K+L+     PW++ GDFNA L   EK 
Subjt:  CCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHI-------DGWVDGGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKD

Query:  GGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRF
          R     ++  FREA+  C+L DLGY G  +TW N+RPGE     R+DR + N  W D F    V HL    SDH P+LL +   +++   +GR   +F
Subjt:  GGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRF

Query:  EETWLLSQGFKEAVSANWGPSGA-VGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSV----GSRSDLQSAEERLESLLVEEEM
        EE+WLL     + +   WG +G   G   A+    K C   L  WG     +      VA  + Q  L+RL        S++D  S  ++++ LL ++E+
Subjt:  EETWLLSQGFKEAVSANWGPSGA-VGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSV----GSRSDLQSAEERLESLLVEEEM

Query:  YWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGLPT
        YW QRSR  WL+ GDRNT++FH  AS RR +N + G+ +  G W ++ + +  +  +YF+DL+  S  + +++D+ L  V S VT++M   L T
Subjt:  YWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGLPT

XP_030939698.1 uncharacterized protein LOC115964550 [Quercus lobata]6.4e-9829.99Show/hide
Query:  DLVSKWRSWKLTEEEAEKVKPALSEGSR-----ELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRS
        D + + +  +LTEEE E VK  L+   +      L ++G+ L+++  N  A ++++R+ W+     RI   G+ LF  +F+ + +   V+ +GPW+FD +
Subjt:  DLVSKWRSWKLTEEEAEKVKPALSEGSR-----ELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRS

Query:  LVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYER
        L++L+  +   +   V F     WVQ+  +P     +     +G  +G V EV            +R+RV + +++P+RRG  + + EG +      YER
Subjt:  LVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYER

Query:  LPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQVGEA
        L   CY+C  LGH  ++C V G  SS+Q    YG+WL+AG          + + +     ++   P AP  +     P +     +    EV  +     
Subjt:  LPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQVGEA

Query:  STEVAPQSGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSE--GVERAGKTGSSRGWKRRARDALADISNREVPMESVGGKRK
        + E      E+        ++ K +  ++V    +       G      D + G  +   G+     TG            L +     VP+     +  
Subjt:  STEVAPQSGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSE--GVERAGKTGSSRGWKRRARDALADISNREVPMESVGGKRK

Query:  GEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWV-D
         ++     Q  +K+  L +      L  LV+   P+++FL ETK        ++  L Y C F V S  RSGGLALLW  E+   + +++ NHID  + D
Subjt:  GEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWV-D

Query:  GGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWE
          A+ WR T  YG P+ + K  +W L+KHL      PWL  GDFN +L   EK GG PKP   +L FREA+  C L+DLGY G +FTW N R  +D+V E
Subjt:  GGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWE

Query:  RIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRI-QRFEETWLLSQGFKEAVSANWGPSGAVGSSTA-LALATKRCMNGLSKW
        R+DR    + WRD F+  +V HL+ S SDH PIL+T  +  R      ++I  RFEE W      +  +   W      GS  A L    KRC   L  W
Subjt:  RIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRI-QRFEETWLLSQGFKEAVSANWGPSGAVGSSTA-LALATKRCMNGLSKW

Query:  GRPGMGNFRYRIRVATDQVQGALERLGSVGSRSD----LQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQ
         R   G  + +++    + Q  LE L  + +R++    ++S +  + +++ ++E++W+QRSR  WL  GD+NT++FHN AS RR +N + G+ D D  W 
Subjt:  GRPGMGNFRYRIRVATDQVQGALERLGSVGSRSD----LQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQ

Query:  QDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL
           ++I  + E YF++L++T+ P  + ++  L  V   VT  MN  L
Subjt:  QDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL

XP_042958006.1 uncharacterized protein LOC122293492 [Carya illinoinensis]2.8e-9328.03Show/hide
Query:  MEDLVSKWRSWKLTEEEAEKVK-----PALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFD
        MEDL  KW+  KL +EE   ++     P+      +  ++ K  S++ ++ +   N +   WR     +      N+FA+   +   K +V S  PW FD
Subjt:  MEDLVSKWRSWKLTEEEAEKVK-----PALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFD

Query:  RSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLY
          L++LK+    + +  V F   SFWV+ HN+P+    +   + +G  +G VE V   +     G  LR+++ + +++PL RG  +   +GK+ W P  Y
Subjt:  RSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLY

Query:  ERLPNFCYECVCLGHSRRECGVVGESSSEQGGD-QYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQV
        E++P  C+ C C+ H + EC    E  S+ GGD QYG+WLRA   +                N  + VP             +EG +  +E         
Subjt:  ERLPNFCYECVCLGHSRRECGVVGESSSEQGGD-QYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQV

Query:  GEASTEVAPQSGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVERAGKTGSSRGWKRRARDALADISNREVPMESVGGKR
        GE S     + G    +G  EGM N GK ++ V   S     T  GED + +D +G     G E+          + +  + L ++   EV +E +G K 
Subjt:  GEASTEVAPQSGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVERAGKTGSSRGWKRRARDALADISNREVPMESVGGKR

Query:  KGEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWV-
          E      + G++ +     +  + L  + ++ +P +VF+ ETK++  + + +K R+G   CF V++ G+ GGL LLW + +   +++YS  HI  WV 
Subjt:  KGEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWV-

Query:  DGGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVW
        +  AS W     YG P+   +  TW+++K L+   +  W++ GDFN +L + EK GG+ +P  ++  FRE +   +L DLG+ G  FT  N         
Subjt:  DGGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVW

Query:  ERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQ---RFEETWLLSQGFKEAVSANWGPSGAVGSST-ALALATKRCMNGL
        ER+DR + N  W +M++   V  L    SDH+P+L+ L    +     GR+     ++E +W L    +  +   WG +G++  +   L   +KR     
Subjt:  ERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQ---RFEETWLLSQGFKEAVSANWGPSGAVGSST-ALALATKRCMNGL

Query:  SKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEE---RLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGV
        SK  +   G          ++    L+RL +  S  +++   +    L +LL +E+++WKQR++  W K GDRNT++FH  A+ RR RN +  +ED++  
Subjt:  SKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEE---RLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGV

Query:  WQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL
              ++     +YF +++ ++QPS  EI+  L+ +   VT+ MN  L
Subjt:  WQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]6.2e-10129.44Show/hide
Query:  EDLVSKWRSWKLTEEEAEKV----KPALSEGSR-ELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDR
        ED+ ++W+S KLTEEE +++    +  LS  ++ ELC++ K+ + +  N +AFR  M   W T G    +    N++ ++F+   +K +V+   PW+FDR
Subjt:  EDLVSKWRSWKLTEEEAEKV----KPALSEGSR-ELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDR

Query:  SLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEV-GGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLY
         LV +K+ + + S+ +V F    FW+Q+HN+P    +K M   +G  IG V EV    +G  W G  LR++  + V + L RG  ++    K+ W    Y
Subjt:  SLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEV-GGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLY

Query:  ERLPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRA-----GVSFNEGSRRVQEEKVEGGM---NQSVGVPTAPVGDAQVSAPEKEGVRVVREGG
        ERLP FC++C    H +  C   G  +     DQYG+WLRA         N     ++E++  G     +Q  G   +       S P  E   +V E  
Subjt:  ERLPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRA-----GVSFNEGSRRVQEEKVEGGM---NQSVGVPTAPVGDAQVSAPEKEGVRVVREGG

Query:  EVLETQVGEASTEVAPQSGEVGEKGPDEGMANKG-------------KGKQKVGAASVEPRATYVG------EDVMTVDEVGGGGSEGVERAG-------
        E LE  +G   TE      + G    D    +KG             + +      ++ P+  +        + VM   E  G  S   E  G       
Subjt:  EVLETQVGEASTEVAPQSGEVGEKGPDEGMANKG-------------KGKQKVGAASVEPRATYVG------EDVMTVDEVGGGGSEGVERAG-------

Query:  ---------KTGSSRGWKRRAR---DALADISNREVPMESVGGKRKGEEGELPIQGG---KKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFD
                      + WKR+AR     L+D++N      S    ++        +     KK +      + + LH LV+  +P +VFL ETK    R D
Subjt:  ---------KTGSSRGWKRRAR---DALADISNREVPMESVGGKRKGEEGELPIQGG---KKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFD

Query:  VLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVDG--GASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFD
         +K+ LGY  CF V+S G+SG LALLW   VK  +I+Y++ HI   +      + W+ T  YGHP    +  +W L++ L+ +   PWL  GDFN +   
Subjt:  VLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVDG--GASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFD

Query:  HEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGR-
         EK G   +P  +++ FR ++  C+L DLG+ G  FTW N R G+    ER+DR  GN  W  +F+ + V HLD ++SDH+ +L    V    +++ G+ 
Subjt:  HEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGR-

Query:  -RIQRFEETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRS-DLQSAEERLESLLVEE
         R+ RFE  W      +E +   W  S              +C   L  W R    + R  ++  T+ ++   ER  + G  S +++   + +  ++  E
Subjt:  -RIQRFEETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRS-DLQSAEERLESLLVEE

Query:  EMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEM
         +  +QR++  WLK GDRNT++FH  +S R+  N +  ++ + G+  QDP  I   + E+F DL+T+S PS   ID  L  +   +TD+M
Subjt:  EMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEM

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein1.8e-10631.84Show/hide
Query:  KLTEEEAEKV-----KPALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRSLVILKDAQL
        KL+ +EA ++       A SE   +  V+ K+L+ K  N DA ++ +R  W + G   I    DNLF   FR++ +  R+    PW FD+ L+ +     
Subjt:  KLTEEEAEKV-----KPALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRSLVILKDAQL

Query:  VSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRG--FHMQDEE----GKEFWCPVLYERLPN
              V+F   +FWV++ N+P++  T+ + + +G  +G   EV         G  LR+RV + + +PL RG     +DEE       FW    YE LP 
Subjt:  VSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRG--FHMQDEE----GKEFWCPVLYERLPN

Query:  FCYECVCLGHSRREC--GVVGESSSEQGGDQYGEWLRAGVSFNEGSRRVQEEKVEGG-----MNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQ
        FCY C  LGHS  +C  G       E G  +YG WLRA  S      RV       G           V    V     SA EKEG    R+  E  E+ 
Subjt:  FCYECVCLGHSRREC--GVVGESSSEQGGDQYGEWLRAGVSFNEGSRRVQEEKVEGG-----MNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQ

Query:  VGEASTEVAPQSGEVG-EKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVER-----AGKTGSSRGWKRRARDALADISN-----
               VA Q  EVG  +   EG         +V   +       V ED + V ++ G  +E V         KT  +   K +    + D++      
Subjt:  VGEASTEVAPQSGEVG-EKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVER-----AGKTGSSRGWKRRARDALADISN-----

Query:  -REVPMESVGGKRKGEEG---------ELPIQGGKKA-----RGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSG
         R      V  +R  +EG           P  G  +A     RGLG+      LH  V++  PQ+VFL ET++   + + +++RLG   CF V+ +G  G
Subjt:  -REVPMESVGGKRKGEEG---------ELPIQGGKKA-----RGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSG

Query:  GLALLWTTEVKFSLISYSSNHIDGWVDG-GASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVD
        GLALLW   V   + SYS +HID WV+      WRFT  YGHP    +  +W L++ L+G S+ PWL+ GDFN ++   EK G   +   ++  FREA++
Subjt:  GLALLWTTEVKFSLISYSSNHIDGWVDG-GASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVD

Query:  VCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILL-TLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSAN
         C L+DLG+ G  FTW N R   + V ER+DR +    W D+F    ++H+ F+ SDH  ++L + +V      N+ +R   FE  WL  +G +E +S  
Subjt:  VCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILL-TLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSAN

Query:  WGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDL---QSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTR
        W       +   L+   K+C  GL  W +    N R   ++  D+ +   E  G  GS  +    ++    L SLL +EE+YW+QRSR  WL+ GDRNT 
Subjt:  WGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDL---QSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTR

Query:  WFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL
        +FH+ AS R+  N + G+ D   VWQ++   I  +++ YF ++YTT+ P    ID  +  V   V+ +MN  L
Subjt:  WFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL

A0A2N9GLU2 Reverse transcriptase domain-containing protein3.1e-10630.14Show/hide
Query:  MEDLVSKWRSWKLTEEEAEKVKPALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRSLVI
        ME++ S W  + LT +E  +V  +  +G++   +  K  + + +N+D+    ++  WR     +    GDN     F S  E   VM++ PW+FD+ L++
Subjt:  MEDLVSKWRSWKLTEEEAEKVKPALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRSLVI

Query:  LKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYERLPN
        L+      S   V F  C+FWVQIH++P R  T  + + + G +G VE V         G  +R R+ L VD+PL RG  +     +E W    +ERLPN
Subjt:  LKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYERLPN

Query:  FCYECVCLGHSRRECGVVGESSSEQG--GDQYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQVGEAS
        FCY C  + H  R+C +  +S         QYG WLR     N     +      GG   +    T P   A+  A    G++                 
Subjt:  FCYECVCLGHSRRECGVVGESSSEQG--GDQYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQVGEAS

Query:  TEVAPQSGEVGEKGPDEGMANK----------------GKGKQKVGA----------ASVEPRATYVGEDVMTVDEVGGGGSEGVERAGKTGSSRGWKRR
         EV P  G    K  D+G+ +                  +GK    A           +  P  +   E  +    +G    +  +    TGSS  WKR 
Subjt:  TEVAPQSGEVGEKGPDEGMANK----------------GKGKQKVGA----------ASVEPRATYVGEDVMTVDEVGGGGSEGVERAGKTGSSRGWKRR

Query:  ARDALADIS------------NREVPMESVGGKRKGEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCV
         R+ ++  S                PM  +  +R  ++ +  +  G       SA     L  +V++  P ++FL+ETK+   + +VL+ +  +   F V
Subjt:  ARDALADIS------------NREVPMESVGGKRKGEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCV

Query:  DSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVD-GGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELL
         S G+SGGL + W+  V  S+ SYS +HID  +D    + WRFT  YG P  E K   W++++ LR     PWL  GDFN +L   EK G RP+P  ++ 
Subjt:  DSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVD-GGASPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELL

Query:  GFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFK
         FR+ VD C  MDLG++GP +TWCN + G+  V ER+DR +    W   F    + HL    SDH  +   L+   R+     RR  RFEE W +    +
Subjt:  GFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFK

Query:  EAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDR
        + +   W           ++   K     L  W +   G+ R  I V T Q++     + ++ +   ++S  + L SL  +EE  WKQRSR  WL+ GDR
Subjt:  EAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDR

Query:  NTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL
        NTR+FH  A+ R+ RN ++G+ DEDG+WQQ+ D+I   I  Y++ L+T++ P   ++D+ L  VS  V++EMN+ L
Subjt:  NTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL

A0A7N2LIH6 Uncharacterized protein2.4e-10631.74Show/hide
Query:  EDLVSKWRSWKLTEEEAEKVK--PALSEGSREL---CVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDR
        E+L   W+   +TE E E +K     +  ++EL   CVV K+L+ + V ++A +  MR  W+     +I   G++LF V+F    +K +VM   PW++++
Subjt:  EDLVSKWRSWKLTEEEAEKVK--PALSEGSREL---CVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDR

Query:  SLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVG-GAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLY
         L+++++ +      +++ K   FWVQI N+P++  T+     +G KIG V EV    +G +W G  LR+R+R      L RG  +  E G+  W    Y
Subjt:  SLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVG-GAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLY

Query:  ERLPNFCYECVCLGHSRREC--GVVGESSSEQGGDQYGEWLRAGVSFNEG--------------------SRRVQEEKVEGGMNQSVGVPTAPVGDAQVS
        ERLPNFCY+C  L H  ++C     GE+  ++   QYG WLR     + G                    +R   + +V+  + +S  V    V +  + 
Subjt:  ERLPNFCYECVCLGHSRREC--GVVGESSSEQGGDQYGEWLRAGVSFNEG--------------------SRRVQEEKVEGGMNQSVGVPTAPVGDAQVS

Query:  APEKEGVR--VVREGGEVLETQVGEASTEVAPQSGEVG---EKGPDE-GMANKGKGKQKVGAASVEPRATY--VGEDVMTV-DEVGGGGSEGVERAGKTG
          +K+G R  +V  GG V +  V     E+  ++G+     EK  D+  ++ K      V A  +E +  +  V +DV  V  E G    +GV+  G   
Subjt:  APEKEGVR--VVREGGEVLETQVGEASTEVAPQSGEVG---EKGPDE-GMANKGKGKQKVGAASVEPRATY--VGEDVMTV-DEVGGGGSEGVERAGKTG

Query:  -------------------------------SSRGWKRRARDALADISNREVPMESVGGKRKGEEGELPIQGGKKA-------------RGLGSARAFRR
                                        S  WKR AR A     +      SV G +K ++G     GG  A             RGLG++ A R 
Subjt:  -------------------------------SSRGWKRRARDALADISNREVPMESVGGKRKGEEGELPIQGGKKA-------------RGLGSARAFRR

Query:  LHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWT--TEVKFSLISYSSNHIDGWVDGGAS--PWRFTWMYGHPQGELKAR
        L   V++  P +VFL ETK    +    + +LG+     V S+GRSGGLALLW   T+++F   S+S  HID  V G  S  PWR T  YGHP    +  
Subjt:  LHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWT--TEVKFSLISYSSNHIDGWVDGGAS--PWRFTWMYGHPQGELKAR

Query:  TWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRH
        +W L++ L    E PWL+ GDFN ++   EK G + + A ++  FRE +  C L+DLG+ GP FTWCN R G+     R+DR + N AW  MF   +V H
Subjt:  TWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRH

Query:  LDFSRSDHRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALE
        +  S SDH   LL L +   +    G++   FEE W   +  KE V   W P     S+  +    +RC   L +W +   GN    I+   +++Q  LE
Subjt:  LDFSRSDHRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALE

Query:  RLGSV-GSRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEE
         L  +  +  ++Q+ ++ +  L   EE+ WKQRSR  WL++GD+N+++FH  AS RR +N + GL D+ GVW +D +    LI +YF+D+Y+++QP+   
Subjt:  RLGSV-GSRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEE

Query:  IDQALLHVSSSVTDEMNNGL
         D +L  +   VT EMN+ L
Subjt:  IDQALLHVSSSVTDEMNNGL

A0A7N2M6Y1 CCHC-type domain-containing protein2.5e-10831.38Show/hide
Query:  EDLVSKWRSWKLTEEEAEKVKPA--LSEGSRELC---VVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDR
        +++V+   + KLT EE E +  A   S G+ E C   + G++LS +  N+ A +N +RAAW+     RI   G+++   +F SK +   V  SGPW F+ 
Subjt:  EDLVSKWRSWKLTEEEAEKVKPA--LSEGSRELC---VVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDR

Query:  SLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYE
        +L++L   +   +  ++ F    FWVQI  +P     +   + +G KIG V EV            LR+RV +++D+PLRRG  ++++E    W    YE
Subjt:  SLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPVLYE

Query:  RLPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRAGVSFNEGS---RRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQ
        RLP FCY+C  LGH  + C V         G+QYGEWL+AG +  +G    +  Q+   E     S+G+ +        +  E+ G      G       
Subjt:  RLPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRAGVSFNEGS---RRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEKEGVRVVREGGEVLETQ

Query:  VGEASTEVAPQ----SGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEV-------------------GGG-----GSEGVERAGKTGS
        VG  S    P      G  G KG    M        ++      PR    GE V T D+V                   G G     G E V    + G 
Subjt:  VGEASTEVAPQ----SGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEV-------------------GGG-----GSEGVERAGKTGS

Query:  S----RGWKRRARD-ALADISNREVPMESVGGKRKG---------EEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVR
        +    R +KR ARD   A  S      + V  KRK          E  +  +  G   RGLG+ R+ R L +LVQ+++P +VFL+ETK++  + + +K +
Subjt:  S----RGWKRRARD-ALADISNREVPMESVGGKRKG---------EEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVR

Query:  LGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVDGGAS--PWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDG
        +G      V S GRSGGLA+LW  ++   + SYS   ID  V    S   WR T  YG+P+   +  +W  ++ L      PWL  GDFN V+   EK G
Subjt:  LGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVDGGAS--PWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDG

Query:  GRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRFE
        G P+   ++  FRE +  C   DLG+ GP FTWCN + G+  V  R+DR +    W D +   +V HL  S SDH  +LLT +   + +  + +R   FE
Subjt:  GRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRFE

Query:  ETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEERLESLLVEEEMYWKQRS
          W      K+ +   W  S  + S + +A   + C   LSKW +   G    +I+   + +   + R        D+    + +  LL  EE+ W+QRS
Subjt:  ETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRSDLQSAEERLESLLVEEEMYWKQRS

Query:  RDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL
        +  WL  GDRNT++FH+ AS RR +N +  + D+ GVW   PD I  +   YF++LY+T+ P+   I + L  + + V+++MN  L
Subjt:  RDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSVTDEMNNGL

A0A7N2R0C3 Reverse transcriptase domain-containing protein2.4e-11432.86Show/hide
Query:  EDLVSKWRSWKLTEEEAEKV-------KPALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAF
        E+L   W+  K+TEEE E +       + A+  G +  CV  KV+S K + ++A R  +R  W+ + + ++   G+ LF V+F  + +K RVM   PW +
Subjt:  EDLVSKWRSWKLTEEEAEKV-------KPALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAWRTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAF

Query:  DRSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQ-GSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPV
        ++ LV+ K+ +   S  D+  K   FWVQI+N+P++ +TK   K +G  IG   EV   + G +W G  LR+RV + V R L RG  +  E+G+  W   
Subjt:  DRSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQ-GSEWTGPVLRLRVRLRVDRPLRRGFHMQDEEGKEFWCPV

Query:  LYERLPNFCYECVCLGHSRRECGVVGESSSEQGGD----QYGEWLRA------------------GVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQ
         YERLPNFCY C  L H  ++C  + E   ++ G+    QYG WLR                   G   N+ + +  E K    + + V   T  +    
Subjt:  LYERLPNFCYECVCLGHSRRECGVVGESSSEQGGD----QYGEWLRA------------------GVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQ

Query:  VSAPEKEGVRVVREGGEVLETQVGE---ASTEVAPQSG---EVGEKGPDEGMANKGKGKQKVGAAS--VEPRATY---VGEDVMTVDEVGGGG-------
        +    +E    +  GGEV   + GE   A +     SG   EVGE+  + G  N GK   + G A    +P   +   +G + +  D V G G       
Subjt:  VSAPEKEGVRVVREGGEVLETQVGE---ASTEVAPQSG---EVGEKGPDEGMANKGKGKQKVGAAS--VEPRATY---VGEDVMTVDEVGGGG-------

Query:  ---------SEGVERAGKTGSSRG-WKR------------------RARDALADISNREVPMESVGGKRKGEEGELPIQGGKKARGLGSARAFRRLHKLV
                  E    A K G S G WKR                  R RD   D++ RE+       KR+              RG+GSA A R L   V
Subjt:  ---------SEGVERAGKTGSSRG-WKR------------------RARDALADISNREVPMESVGGKRKGEEGELPIQGGKKARGLGSARAFRRLHKLV

Query:  QQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVDG--GASPWRFTWMYGHPQGELKARTWALMKH
        +   P +VFLAETK    R   L+ +LG      V S+GRSGGLA+LW   V  SL S S++HID  V G  GA PWR T  YGHP   ++  +W L++ 
Subjt:  QQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVDG--GASPWRFTWMYGHPQGELKARTWALMKH

Query:  LRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSD
        L      PW++ GDFN +L   EK G   + A ++  FRE +  C L+DLG+ G  FTWCN R GE     R+DR + N  W ++F   +V H   + SD
Subjt:  LRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMFSGYEVRHLDFSRSD

Query:  HRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSV-G
        H   LL+LS+  R      RR   FEE W   +G +E +   W P G       +    K C   L  W R   GN    ++    ++Q  LE L  +  
Subjt:  HRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSV-G

Query:  SRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLH
        S  ++Q  ++ +  +++ EE+ W QRSR  W+K+GDRNTR+FH  A++RR +N+++G+ D +G W+++ + +  +I EYF+++Y+++ P+  E    L  
Subjt:  SRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLH

Query:  VSSSVTDEMNNGL
        V   VT++MN  L
Subjt:  VSSSVTDEMNNGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.1e-1324.38Show/hide
Query:  LMGGDFN--AVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMF-SGYEVRHLDFSRSDHRPILL
        ++ GDF+  A   DH        P   L  F+  +   +L+D+   G  +TW N +    ++  ++DR + N  W   F S   V  L    SDH P ++
Subjt:  LMGGDFN--AVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRDMF-SGYEVRHLDFSRSDHRPILL

Query:  TLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVGSST----ALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRS
         L     ++    ++  R+         F  +++  W     VGS          A K+C   L+   R G GN +++ + A D ++    +L +  S S
Subjt:  TLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVGSST----ALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALERLGSVGSRS

Query:  DLQ---SAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDL
          +    A ++        E +++Q+SR  WL+ GD NTR+FH    + + +N +K L  +D V  ++  ++  +I  Y+  L
Subjt:  DLQ---SAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDL

AT3G42140.1 zinc ion binding;nucleic acid binding1.6e-0624.02Show/hide
Query:  FRSKGEKVRVMSSGPWAFDRSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLR
        F+S+     ++  GPW+F+  + +++    + S  D EFK   FW+QI  +P+R+ T  +   +G ++G+  E    +       VL+ +          
Subjt:  FRSKGEKVRVMSSGPWAFDRSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRVRLRVDRPLR

Query:  RGFHMQDEEGKEFWCPVLYERLPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRAGVSFNEGSRRV---QEEKVEGGMNQSVGVPTAPVGDAQVS
                          YE+L NFC  C  L H   EC   G        D   +   A    NE    V     ++V G MNQ    P     D   +
Subjt:  RGFHMQDEEGKEFWCPVLYERLPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRAGVSFNEGSRRV---QEEKVEGGMNQSVGVPTAPVGDAQVS

Query:  APEK
        AP++
Subjt:  APEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCGGTGGTGGCGTGGTTCGTGGCTTGTTGCAGGCGGTGGTTTGCGGTGGTGTTGGTGTTGTCGTTTCGGTCTAGTTTCTGGTGGAGCGCCGTTTTTGTTGCAGGCGGTGC
TTCGGTGGTCTTGTGGGAGGGGTACGGTGGGGTGGGAGGAGGAAGGACGATGGAGGATTTGGTGAGTAAGTGGAGGTCCTGGAAGTTGACGGAGGAGGAAGCAGAGAAGG
TGAAGCCGGCCTTATCTGAGGGGTCGAGAGAGCTTTGTGTGGTGGGGAAGGTGCTCTCGTCGAAGCGTGTGAATATGGATGCTTTCCGAAATGTGATGAGAGCGGCATGG
AGAACCCATGGGGCGACTCGGATAGAAGCGGCGGGGGATAATTTGTTTGCGGTCCAGTTTCGGTCCAAAGGGGAGAAGGTAAGAGTTATGTCGTCGGGGCCATGGGCGTT
CGACAGATCCCTTGTGATTCTTAAGGATGCACAGTTGGTATCGTCAGTTCTGGATGTGGAGTTTAAAGACTGCTCTTTCTGGGTCCAGATCCACAATGTTCCCATGCGTT
GGCAGACTAAGGGAATGGCGAAGCATTTAGGAGGGAAGATTGGTGTTGTGGAAGAGGTAGGTGGGGCGCAGGGGAGCGAGTGGACGGGACCAGTCTTAAGACTGAGAGTG
CGCCTGAGGGTGGATCGGCCCCTACGAAGAGGTTTTCATATGCAGGATGAGGAGGGCAAGGAATTCTGGTGTCCAGTGTTGTATGAACGGTTGCCGAATTTCTGTTATGA
GTGTGTCTGCTTAGGACACTCGAGGAGGGAGTGTGGGGTGGTGGGAGAAAGTTCTTCGGAGCAGGGAGGGGATCAGTATGGGGAGTGGTTGCGGGCGGGGGTGAGTTTTA
ATGAGGGTAGCCGGAGGGTCCAAGAGGAGAAGGTGGAGGGGGGAATGAATCAGAGCGTCGGGGTCCCGACAGCGCCTGTCGGGGATGCACAAGTGTCGGCCCCTGAGAAG
GAAGGGGTAAGGGTGGTGAGGGAGGGAGGAGAGGTTCTGGAGACCCAGGTTGGGGAAGCTAGTACGGAAGTGGCTCCCCAGTCGGGCGAGGTTGGGGAGAAGGGGCCTGA
TGAGGGGATGGCAAATAAGGGGAAGGGGAAGCAGAAGGTGGGGGCTGCCTCTGTGGAGCCTAGGGCCACCTATGTGGGAGAAGACGTGATGACAGTAGATGAGGTGGGAG
GTGGGGGGTCTGAGGGGGTTGAGAGGGCAGGTAAAACAGGTAGTAGTAGGGGTTGGAAGAGAAGGGCGAGGGATGCACTTGCAGACATCTCAAATAGGGAAGTCCCGATG
GAGTCTGTTGGGGGCAAAAGGAAAGGGGAGGAGGGGGAGTTGCCAATTCAGGGAGGGAAGAAGGCGAGAGGTTTGGGGTCGGCTCGTGCATTCCGTAGGTTGCACAAGTT
GGTGCAACAATACCGACCTCAGATGGTGTTCCTAGCAGAAACGAAAGTGCAGACGACGAGGTTTGATGTGCTTAAAGTTAGGTTGGGGTATCCGTGCTGCTTCTGTGTGG
ATAGTAATGGGAGAAGTGGAGGTCTGGCGTTGCTGTGGACTACGGAGGTTAAGTTCAGTCTTATCTCGTATTCCTCTAATCATATAGACGGATGGGTGGATGGAGGGGCG
AGCCCATGGAGATTCACATGGATGTACGGGCATCCTCAAGGGGAACTAAAGGCTAGAACGTGGGCATTGATGAAGCATCTCCGTGGGAGTAGTGAGACCCCGTGGCTTAT
GGGGGGTGATTTTAATGCAGTGTTGTTTGATCATGAGAAAGATGGGGGAAGGCCTAAGCCGGCGGGGGAGCTGCTTGGTTTCCGGGAGGCGGTGGATGTGTGTGAGCTGA
TGGATCTTGGATACAGTGGGCCTGTATTTACCTGGTGCAATAGGAGACCTGGGGAGGATGTTGTTTGGGAAAGAATTGATAGGTGTATGGGTAATGTGGCATGGCGAGAT
ATGTTCTCTGGTTATGAGGTGAGGCATCTGGATTTTAGTCGGTCGGATCATAGACCTATTCTCCTGACGTTGTCAGTGTGTGCTCGTTCGGTTCATAATGCAGGCCGTAG
AATCCAAAGGTTTGAGGAAACGTGGCTTCTATCACAGGGGTTTAAGGAAGCAGTGTCAGCTAATTGGGGTCCGAGTGGTGCAGTAGGATCGTCGACAGCATTAGCCTTGG
CGACGAAACGATGTATGAACGGGCTTAGCAAATGGGGGAGGCCAGGAATGGGGAATTTTAGGTATCGTATCCGGGTGGCGACGGATCAGGTTCAGGGGGCGTTGGAACGG
CTGGGCAGTGTGGGATCTCGATCTGATTTACAGAGTGCAGAAGAGCGGTTGGAGTCTTTGCTGGTTGAGGAGGAGATGTACTGGAAGCAGAGATCCAGAGATTGTTGGCT
GAAATGGGGGGACAGGAACACTCGATGGTTTCATAATCATGCCTCGAGTAGGAGAATGAGGAATGAGGTGAAGGGTTTGGAGGATGAGGATGGGGTCTGGCAGCAGGATC
CGGACAGAATCTTGGGGCTTATTGAGGAATATTTTGAGGACTTGTATACGACATCGCAGCCTTCGCATGAGGAAATAGACCAGGCCCTGTTACATGTGTCTTCCTCTGTT
ACGGATGAGATGAACAACGGCTTGCCCACGATTGATAATTTGGGTATTCGGGGCTGTGACGTTTTGAATGTTTGTGGCCTCTTTAGGCGAGGTGGGGAGTCTAGTCTGCA
TGTCTTCTGGCATTGCAAGTTCATGAGGGCAGTGTTGATGGAGTCCGAGTTTGGAAGTTTGATACATAAGGTGCAGGCTGGGTCTATGTTTGATCTTCTTAGGGAAGTGA
GGGACAAGGGTGATCAGATTGAGCAGCGGCAGTTACTACCAAATAACCCTCGTGCAGATATCAGGGGAGGTTTTCTGTTCTTCGCACTCTTGTCCCTTATGTTGCGCATG
TCTTGTGATCTGTTGAGGACTGTTAGTGAGGCTGGTGTTGCTAATGTTGAAACGTTAAAGCTTCCTTTGAGGGCCTATGTGTCTAGTGTAGGGGGAAGATGGGTGGAGGG
TCTTATAAATCAGCTCAGGGCTAGTAGGGGCTGCTTTAGGATTGTTGGGAATTGGGAGTGCAACCAAAGTCCCACATTGGCTAGACAAGGGGATGATCATGGGTTTATAA
GGGAGGACAAGTATCTCCATTGGGTGGAAACCTATGGGCTATTGGGAATACATGATGAAGACGATGCTCACAACTCAGACTCAATAACAGCAGTCCATATTATATGGTCA
ATTTGGAATTCGAGGAATAAGGTGAAAAATAGTGGGAAGCCTCCAGTCTTATCTCAGATTTTCAATGAAATTAACGCCAAGCTCGAAGATGTAATCGACAAAGAATATGC
AAACCTCTCGTCGGACAAGTCAGAGAACCTTGAGTCTGGATCTCTCGGAAGAGCTAGTAGGCGGGGCCAACACAACACTCATTCGAAGCAAGAGCAAGAGCAAGAGCCGA
CCCCAGGCTTTTGCCCGGGGGCTCGTCCATTGCTTCTGGCTCTGGTCACGTATTCCTCGGTTTCTCTACAAATTCATTTGTTGGTGTCACGTGAAGGTCATATAAGTGTC
GTCAAAGCGAACATAGTTTCGTGGTTGTATCGTGTGAAGTTTGAGGTGATGTTGGGTATGGCGAGTTATTGTTTGCGTGTTTGTTTGAAGAACATCCCGGAAAGTTGGGT
TTGCGAGTCGTGTACTTGTCTATTGGGGGAGGCTAACTTTTCAGCAATGGTTCGTGAGGACATTGCCCCATTGGCTTGTCAGCCCAACACCAGGCTTGTTGATGAAACAA
CAAATGTTGCTTCTGTTAGACAACAGCCTCATCAAGCATCAACTGAATCTCAAGGTATAAATATGGAGATGGTAGCAAAGATTAGGCCCGAAGGTTACTTTCGTGCACAA
TTGACCTGTTGCTCGCACCTAACGAAAACAATAGAAACTTTGCACAAGAGATTGACCCACACCCAATTAGATATGTTTAGGCAAACCATATTTGGGCCTATAGTGGATAG
TAACATCATATTTAATGGTCAGTTAATCCACCATCTATTGCTTAGGGAGGTTGAGGATCCTAGACAGGATGTCATTAGTTTCAATATATTTGGAAATAGGGTGTCCTTTG
GCAAGCAAGAATTTGACCTAATCACAGGATTTAGACACCATAGAAGGACATTTGATAGAAATAAGTCAGGGGTTAGATTGAGGCGTCTGTACTTTAAAGATAGTGTCAAA
GATACAGTAGCAGATGCCGAAAAAAGGTTCTTAGAAATACAATTTGAGACCGATGAAGATGCGGTGAAGGTAGCTCTCGCATATTTCATTGAGCTAGCAATGTTTGGGCG
GGAGAGGAAGCAGAAGTTCAATTGGTTTCTATTGGGTATTGTGGATGATTGGGAGATATTCTGCAACTATGACTGGAGCAAGGTAATTTTTGAGATGACCATCAGGAGCT
TGAAAGAAGCACTTAGGGCTGCCACCCAAAGAGACGACGTGGTTGGAGAGACTGGTCGAGTGGAAAGATATAATCTTTATGGCTTTCCACATGCTTTTCAGGTATGGGCG
TATGAGACTATATCATCTCTGACGAACCGTGTTGCGAACCAACTGAACCAGGATGCGATCCCACGCTTTTCTCGGTGGTCATGCTCCCATTCTCCTTCGTACACCCAACT
TAGCAACGAGATATTTGGCTTGACGGAGGTAGCAAAGGTGATAGTACAATTGGTGTCGAGCGATGCAGAGCTCGAACACATGCGTCGCATTGTTGGTGCACCAGAGGTCT
CTGTTTTGTCGCCACAATCAGAGGCTCCTATTTTTCTGCCACAACCAGAACTAGTGAATAATGCAAACGTAGATCCTGTCGTGAGTGATAGAGGGTCAGATGAGGCTGGT
TGGGATAGGGGTTCTCCAATAAAGGAGGCAGATATGGTTGGGCTCGATGAAGAATTGACACATGAGAGTCTATCTGAAGGCGTGGGCAAGACCTGTCGGTGTGACTGCAA
GCATTCATACGAGTCACTAGACCGACAGATGAAGGAGATAAAAAACGATTTAAAAGAGATGAAAGTTGACCTAAAGTCAATTAAGAAGTACTTGCGTCGATTATCGAAGG
TGATGAACTCTGATAAAGGAAAGGACCCTGTCATGGAGGAGGGGCATGGTGGGGTTTCAGAAGATGCGATGGTAGAGGATCGTGATATGCATAAGGTCATTGACTCGCTT
TTTATGTTCATTCGGAAGAAACTGCAACAACGGTCGGACTTGTGTCGTTGGAAATTTGTCATTGCAGATATTGTTGTTACCGAGTTTTTGAGACGTCACGACGATATATC
TGAAGAGTTCAAGAAGGTGCAAGATCCTTCATTGATTACGTTCGACTGGAGTACAACCAAGACTTTGATGGACTACGTTATGGGTCGACACTCGGACCATGATGCACATT
GGAGTACAGTTGACGCGATCTACAACCATTCAACCTCGGAGGGAAACCATTGGGTTATGGAGTTGAGCACTTTAGCCATGCCAGTGCTACTGTTCAAGTGCGATGTCATG
AAAAGCCGTCTCTCCCAGTTCACGAATGGGAAATACACAGAGATAGTTTTGTTCCTCAACAGACGAACGGTGGGGACTGTGGTTGGCACTCAAGTTAAGTGTTAG
mRNA sequenceShow/hide mRNA sequence
GCGGTGGTGGCGTGGTTCGTGGCTTGTTGCAGGCGGTGGTTTGCGGTGGTGTTGGTGTTGTCGTTTCGGTCTAGTTTCTGGTGGAGCGCCGTTTTTGTTGCAGGCGGTGC
TTCGGTGGTCTTGTGGGAGGGGTACGGTGGGGTGGGAGGAGGAAGGACGATGGAGGATTTGGTGAGTAAGTGGAGGTCCTGGAAGTTGACGGAGGAGGAAGCAGAGAAGG
TGAAGCCGGCCTTATCTGAGGGGTCGAGAGAGCTTTGTGTGGTGGGGAAGGTGCTCTCGTCGAAGCGTGTGAATATGGATGCTTTCCGAAATGTGATGAGAGCGGCATGG
AGAACCCATGGGGCGACTCGGATAGAAGCGGCGGGGGATAATTTGTTTGCGGTCCAGTTTCGGTCCAAAGGGGAGAAGGTAAGAGTTATGTCGTCGGGGCCATGGGCGTT
CGACAGATCCCTTGTGATTCTTAAGGATGCACAGTTGGTATCGTCAGTTCTGGATGTGGAGTTTAAAGACTGCTCTTTCTGGGTCCAGATCCACAATGTTCCCATGCGTT
GGCAGACTAAGGGAATGGCGAAGCATTTAGGAGGGAAGATTGGTGTTGTGGAAGAGGTAGGTGGGGCGCAGGGGAGCGAGTGGACGGGACCAGTCTTAAGACTGAGAGTG
CGCCTGAGGGTGGATCGGCCCCTACGAAGAGGTTTTCATATGCAGGATGAGGAGGGCAAGGAATTCTGGTGTCCAGTGTTGTATGAACGGTTGCCGAATTTCTGTTATGA
GTGTGTCTGCTTAGGACACTCGAGGAGGGAGTGTGGGGTGGTGGGAGAAAGTTCTTCGGAGCAGGGAGGGGATCAGTATGGGGAGTGGTTGCGGGCGGGGGTGAGTTTTA
ATGAGGGTAGCCGGAGGGTCCAAGAGGAGAAGGTGGAGGGGGGAATGAATCAGAGCGTCGGGGTCCCGACAGCGCCTGTCGGGGATGCACAAGTGTCGGCCCCTGAGAAG
GAAGGGGTAAGGGTGGTGAGGGAGGGAGGAGAGGTTCTGGAGACCCAGGTTGGGGAAGCTAGTACGGAAGTGGCTCCCCAGTCGGGCGAGGTTGGGGAGAAGGGGCCTGA
TGAGGGGATGGCAAATAAGGGGAAGGGGAAGCAGAAGGTGGGGGCTGCCTCTGTGGAGCCTAGGGCCACCTATGTGGGAGAAGACGTGATGACAGTAGATGAGGTGGGAG
GTGGGGGGTCTGAGGGGGTTGAGAGGGCAGGTAAAACAGGTAGTAGTAGGGGTTGGAAGAGAAGGGCGAGGGATGCACTTGCAGACATCTCAAATAGGGAAGTCCCGATG
GAGTCTGTTGGGGGCAAAAGGAAAGGGGAGGAGGGGGAGTTGCCAATTCAGGGAGGGAAGAAGGCGAGAGGTTTGGGGTCGGCTCGTGCATTCCGTAGGTTGCACAAGTT
GGTGCAACAATACCGACCTCAGATGGTGTTCCTAGCAGAAACGAAAGTGCAGACGACGAGGTTTGATGTGCTTAAAGTTAGGTTGGGGTATCCGTGCTGCTTCTGTGTGG
ATAGTAATGGGAGAAGTGGAGGTCTGGCGTTGCTGTGGACTACGGAGGTTAAGTTCAGTCTTATCTCGTATTCCTCTAATCATATAGACGGATGGGTGGATGGAGGGGCG
AGCCCATGGAGATTCACATGGATGTACGGGCATCCTCAAGGGGAACTAAAGGCTAGAACGTGGGCATTGATGAAGCATCTCCGTGGGAGTAGTGAGACCCCGTGGCTTAT
GGGGGGTGATTTTAATGCAGTGTTGTTTGATCATGAGAAAGATGGGGGAAGGCCTAAGCCGGCGGGGGAGCTGCTTGGTTTCCGGGAGGCGGTGGATGTGTGTGAGCTGA
TGGATCTTGGATACAGTGGGCCTGTATTTACCTGGTGCAATAGGAGACCTGGGGAGGATGTTGTTTGGGAAAGAATTGATAGGTGTATGGGTAATGTGGCATGGCGAGAT
ATGTTCTCTGGTTATGAGGTGAGGCATCTGGATTTTAGTCGGTCGGATCATAGACCTATTCTCCTGACGTTGTCAGTGTGTGCTCGTTCGGTTCATAATGCAGGCCGTAG
AATCCAAAGGTTTGAGGAAACGTGGCTTCTATCACAGGGGTTTAAGGAAGCAGTGTCAGCTAATTGGGGTCCGAGTGGTGCAGTAGGATCGTCGACAGCATTAGCCTTGG
CGACGAAACGATGTATGAACGGGCTTAGCAAATGGGGGAGGCCAGGAATGGGGAATTTTAGGTATCGTATCCGGGTGGCGACGGATCAGGTTCAGGGGGCGTTGGAACGG
CTGGGCAGTGTGGGATCTCGATCTGATTTACAGAGTGCAGAAGAGCGGTTGGAGTCTTTGCTGGTTGAGGAGGAGATGTACTGGAAGCAGAGATCCAGAGATTGTTGGCT
GAAATGGGGGGACAGGAACACTCGATGGTTTCATAATCATGCCTCGAGTAGGAGAATGAGGAATGAGGTGAAGGGTTTGGAGGATGAGGATGGGGTCTGGCAGCAGGATC
CGGACAGAATCTTGGGGCTTATTGAGGAATATTTTGAGGACTTGTATACGACATCGCAGCCTTCGCATGAGGAAATAGACCAGGCCCTGTTACATGTGTCTTCCTCTGTT
ACGGATGAGATGAACAACGGCTTGCCCACGATTGATAATTTGGGTATTCGGGGCTGTGACGTTTTGAATGTTTGTGGCCTCTTTAGGCGAGGTGGGGAGTCTAGTCTGCA
TGTCTTCTGGCATTGCAAGTTCATGAGGGCAGTGTTGATGGAGTCCGAGTTTGGAAGTTTGATACATAAGGTGCAGGCTGGGTCTATGTTTGATCTTCTTAGGGAAGTGA
GGGACAAGGGTGATCAGATTGAGCAGCGGCAGTTACTACCAAATAACCCTCGTGCAGATATCAGGGGAGGTTTTCTGTTCTTCGCACTCTTGTCCCTTATGTTGCGCATG
TCTTGTGATCTGTTGAGGACTGTTAGTGAGGCTGGTGTTGCTAATGTTGAAACGTTAAAGCTTCCTTTGAGGGCCTATGTGTCTAGTGTAGGGGGAAGATGGGTGGAGGG
TCTTATAAATCAGCTCAGGGCTAGTAGGGGCTGCTTTAGGATTGTTGGGAATTGGGAGTGCAACCAAAGTCCCACATTGGCTAGACAAGGGGATGATCATGGGTTTATAA
GGGAGGACAAGTATCTCCATTGGGTGGAAACCTATGGGCTATTGGGAATACATGATGAAGACGATGCTCACAACTCAGACTCAATAACAGCAGTCCATATTATATGGTCA
ATTTGGAATTCGAGGAATAAGGTGAAAAATAGTGGGAAGCCTCCAGTCTTATCTCAGATTTTCAATGAAATTAACGCCAAGCTCGAAGATGTAATCGACAAAGAATATGC
AAACCTCTCGTCGGACAAGTCAGAGAACCTTGAGTCTGGATCTCTCGGAAGAGCTAGTAGGCGGGGCCAACACAACACTCATTCGAAGCAAGAGCAAGAGCAAGAGCCGA
CCCCAGGCTTTTGCCCGGGGGCTCGTCCATTGCTTCTGGCTCTGGTCACGTATTCCTCGGTTTCTCTACAAATTCATTTGTTGGTGTCACGTGAAGGTCATATAAGTGTC
GTCAAAGCGAACATAGTTTCGTGGTTGTATCGTGTGAAGTTTGAGGTGATGTTGGGTATGGCGAGTTATTGTTTGCGTGTTTGTTTGAAGAACATCCCGGAAAGTTGGGT
TTGCGAGTCGTGTACTTGTCTATTGGGGGAGGCTAACTTTTCAGCAATGGTTCGTGAGGACATTGCCCCATTGGCTTGTCAGCCCAACACCAGGCTTGTTGATGAAACAA
CAAATGTTGCTTCTGTTAGACAACAGCCTCATCAAGCATCAACTGAATCTCAAGGTATAAATATGGAGATGGTAGCAAAGATTAGGCCCGAAGGTTACTTTCGTGCACAA
TTGACCTGTTGCTCGCACCTAACGAAAACAATAGAAACTTTGCACAAGAGATTGACCCACACCCAATTAGATATGTTTAGGCAAACCATATTTGGGCCTATAGTGGATAG
TAACATCATATTTAATGGTCAGTTAATCCACCATCTATTGCTTAGGGAGGTTGAGGATCCTAGACAGGATGTCATTAGTTTCAATATATTTGGAAATAGGGTGTCCTTTG
GCAAGCAAGAATTTGACCTAATCACAGGATTTAGACACCATAGAAGGACATTTGATAGAAATAAGTCAGGGGTTAGATTGAGGCGTCTGTACTTTAAAGATAGTGTCAAA
GATACAGTAGCAGATGCCGAAAAAAGGTTCTTAGAAATACAATTTGAGACCGATGAAGATGCGGTGAAGGTAGCTCTCGCATATTTCATTGAGCTAGCAATGTTTGGGCG
GGAGAGGAAGCAGAAGTTCAATTGGTTTCTATTGGGTATTGTGGATGATTGGGAGATATTCTGCAACTATGACTGGAGCAAGGTAATTTTTGAGATGACCATCAGGAGCT
TGAAAGAAGCACTTAGGGCTGCCACCCAAAGAGACGACGTGGTTGGAGAGACTGGTCGAGTGGAAAGATATAATCTTTATGGCTTTCCACATGCTTTTCAGGTATGGGCG
TATGAGACTATATCATCTCTGACGAACCGTGTTGCGAACCAACTGAACCAGGATGCGATCCCACGCTTTTCTCGGTGGTCATGCTCCCATTCTCCTTCGTACACCCAACT
TAGCAACGAGATATTTGGCTTGACGGAGGTAGCAAAGGTGATAGTACAATTGGTGTCGAGCGATGCAGAGCTCGAACACATGCGTCGCATTGTTGGTGCACCAGAGGTCT
CTGTTTTGTCGCCACAATCAGAGGCTCCTATTTTTCTGCCACAACCAGAACTAGTGAATAATGCAAACGTAGATCCTGTCGTGAGTGATAGAGGGTCAGATGAGGCTGGT
TGGGATAGGGGTTCTCCAATAAAGGAGGCAGATATGGTTGGGCTCGATGAAGAATTGACACATGAGAGTCTATCTGAAGGCGTGGGCAAGACCTGTCGGTGTGACTGCAA
GCATTCATACGAGTCACTAGACCGACAGATGAAGGAGATAAAAAACGATTTAAAAGAGATGAAAGTTGACCTAAAGTCAATTAAGAAGTACTTGCGTCGATTATCGAAGG
TGATGAACTCTGATAAAGGAAAGGACCCTGTCATGGAGGAGGGGCATGGTGGGGTTTCAGAAGATGCGATGGTAGAGGATCGTGATATGCATAAGGTCATTGACTCGCTT
TTTATGTTCATTCGGAAGAAACTGCAACAACGGTCGGACTTGTGTCGTTGGAAATTTGTCATTGCAGATATTGTTGTTACCGAGTTTTTGAGACGTCACGACGATATATC
TGAAGAGTTCAAGAAGGTGCAAGATCCTTCATTGATTACGTTCGACTGGAGTACAACCAAGACTTTGATGGACTACGTTATGGGTCGACACTCGGACCATGATGCACATT
GGAGTACAGTTGACGCGATCTACAACCATTCAACCTCGGAGGGAAACCATTGGGTTATGGAGTTGAGCACTTTAGCCATGCCAGTGCTACTGTTCAAGTGCGATGTCATG
AAAAGCCGTCTCTCCCAGTTCACGAATGGGAAATACACAGAGATAGTTTTGTTCCTCAACAGACGAACGGTGGGGACTGTGGTTGGCACTCAAGTTAAGTGTTAG
Protein sequenceShow/hide protein sequence
AVVAWFVACCRRWFAVVLVLSFRSSFWWSAVFVAGGASVVLWEGYGGVGGGRTMEDLVSKWRSWKLTEEEAEKVKPALSEGSRELCVVGKVLSSKRVNMDAFRNVMRAAW
RTHGATRIEAAGDNLFAVQFRSKGEKVRVMSSGPWAFDRSLVILKDAQLVSSVLDVEFKDCSFWVQIHNVPMRWQTKGMAKHLGGKIGVVEEVGGAQGSEWTGPVLRLRV
RLRVDRPLRRGFHMQDEEGKEFWCPVLYERLPNFCYECVCLGHSRRECGVVGESSSEQGGDQYGEWLRAGVSFNEGSRRVQEEKVEGGMNQSVGVPTAPVGDAQVSAPEK
EGVRVVREGGEVLETQVGEASTEVAPQSGEVGEKGPDEGMANKGKGKQKVGAASVEPRATYVGEDVMTVDEVGGGGSEGVERAGKTGSSRGWKRRARDALADISNREVPM
ESVGGKRKGEEGELPIQGGKKARGLGSARAFRRLHKLVQQYRPQMVFLAETKVQTTRFDVLKVRLGYPCCFCVDSNGRSGGLALLWTTEVKFSLISYSSNHIDGWVDGGA
SPWRFTWMYGHPQGELKARTWALMKHLRGSSETPWLMGGDFNAVLFDHEKDGGRPKPAGELLGFREAVDVCELMDLGYSGPVFTWCNRRPGEDVVWERIDRCMGNVAWRD
MFSGYEVRHLDFSRSDHRPILLTLSVCARSVHNAGRRIQRFEETWLLSQGFKEAVSANWGPSGAVGSSTALALATKRCMNGLSKWGRPGMGNFRYRIRVATDQVQGALER
LGSVGSRSDLQSAEERLESLLVEEEMYWKQRSRDCWLKWGDRNTRWFHNHASSRRMRNEVKGLEDEDGVWQQDPDRILGLIEEYFEDLYTTSQPSHEEIDQALLHVSSSV
TDEMNNGLPTIDNLGIRGCDVLNVCGLFRRGGESSLHVFWHCKFMRAVLMESEFGSLIHKVQAGSMFDLLREVRDKGDQIEQRQLLPNNPRADIRGGFLFFALLSLMLRM
SCDLLRTVSEAGVANVETLKLPLRAYVSSVGGRWVEGLINQLRASRGCFRIVGNWECNQSPTLARQGDDHGFIREDKYLHWVETYGLLGIHDEDDAHNSDSITAVHIIWS
IWNSRNKVKNSGKPPVLSQIFNEINAKLEDVIDKEYANLSSDKSENLESGSLGRASRRGQHNTHSKQEQEQEPTPGFCPGARPLLLALVTYSSVSLQIHLLVSREGHISV
VKANIVSWLYRVKFEVMLGMASYCLRVCLKNIPESWVCESCTCLLGEANFSAMVREDIAPLACQPNTRLVDETTNVASVRQQPHQASTESQGINMEMVAKIRPEGYFRAQ
LTCCSHLTKTIETLHKRLTHTQLDMFRQTIFGPIVDSNIIFNGQLIHHLLLREVEDPRQDVISFNIFGNRVSFGKQEFDLITGFRHHRRTFDRNKSGVRLRRLYFKDSVK
DTVADAEKRFLEIQFETDEDAVKVALAYFIELAMFGRERKQKFNWFLLGIVDDWEIFCNYDWSKVIFEMTIRSLKEALRAATQRDDVVGETGRVERYNLYGFPHAFQVWA
YETISSLTNRVANQLNQDAIPRFSRWSCSHSPSYTQLSNEIFGLTEVAKVIVQLVSSDAELEHMRRIVGAPEVSVLSPQSEAPIFLPQPELVNNANVDPVVSDRGSDEAG
WDRGSPIKEADMVGLDEELTHESLSEGVGKTCRCDCKHSYESLDRQMKEIKNDLKEMKVDLKSIKKYLRRLSKVMNSDKGKDPVMEEGHGGVSEDAMVEDRDMHKVIDSL
FMFIRKKLQQRSDLCRWKFVIADIVVTEFLRRHDDISEEFKKVQDPSLITFDWSTTKTLMDYVMGRHSDHDAHWSTVDAIYNHSTSEGNHWVMELSTLAMPVLLFKCDVM
KSRLSQFTNGKYTEIVLFLNRRTVGTVVGTQVKC