CuGenDBv2

Lag0040831 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040831
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr13:8810675..8816333
RNA-Seq Expression: Lag0040831
Synteny: Lag0040831
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0003824 - catalytic activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR005135 - Endonuclease/exonuclease/phosphatase
    IPR025558 - Domain of unknown function DUF4283
    IPR025836 - Zinc knuckle CX2CX4HX4C
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | % identity
KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis] | 5.8e-107 | 32
Query:  MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFD
        MD L   W N+ L+E E     +  ++    +   +  ++GK+ S + V  +        +W + +S  +     N F++ F +  +K R+ S  PW FD
Subjt:  MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFD

Query:  KSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQY
          L V+    GS   S + F    FWV    +PL          LGS +G V EV  +      G  +RV+I+L++ +PL R  R +  +G  +W P++Y
Subjt:  KSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQY

Query:  ERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFLFGDWLRA---VPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASP--
        E++P FCF CGRI H    C      V  D    FG WLRA   +  R  +        G     G           AD    +  SG VL  +  S   
Subjt:  ERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFLFGDWLRA---VPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASP--

Query:  -VPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSKV--GTVPLAPLVATHTVSSGAGSVSAGKG----KAVANENSEI-----TMTDVHDGPV-----
            G    +D     A  D  KE     V  E    +  G +    + A+ T    AG  S   G    +A  NE   +         VH  P+     
Subjt:  -VPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSKV--GTVPLAPLVATHTVSSGAGSVSAGKG----KAVANENSEI-----TMTDVHDGPV-----

Query:  --KKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVS--KRLKEVES---------GSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASA
          + SWK+ AR S      +  S+V    KRS      D+  + +  K L    S         G+P   Q L+ +V+ K P +LFL ETKL++  M   
Subjt:  --KKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVS--KRLKEVES---------GSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASA

Query:  KRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWD---VYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQH
        K  LGF+ C  V  +G+SGG+AL W++     + +FS  HI   ++ +   V  W LTGFYG     KR ++W+LL  L   SD  WLI GDFN +L   
Subjt:  KRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWD---VYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQH

Query:  EKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEG-TIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRI
        EK GGRDK   ++ AF++VID C L DLGF GN FTWCNRR     I ERLDR  S++ WH  YP   V H     SDH PI L L+   G  +   +++
Subjt:  EKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEG-TIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRI

Query:  LRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQAEAQLEDV-------LQEEE
         RF+  W+   + +Q+++D+W        LS   R  Q    C   +  W ++  G+  + ++ A + L      +PLS    +L++        L   E
Subjt:  LRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQAEAQLEDV-------LQEEE

Query:  LYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLF-STSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTE
        + WKQRS+ +WL EGD+N+R+FH +A+ R++ N I  + D  G W Q+     +++ DYF  LF +  E     F   L+ L   +  +M   L  PF+E
Subjt:  LYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLF-STSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTE

Query:  EEILRALKQSHPHKAPGPDGLS
        +E+ RAL + HP KAPGPDG+S
Subjt:  EEILRALKQSHPHKAPGPDGLS

KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis] | 8.3e-106 | 31.63
Query:  DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQ--LCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTF
        D L   + ++ L+E E+   SV  +I  L +   +   C++ K+L  K  N +AF++ M  +W   +  +         ++ F  + +K ++M  GPW+F
Subjt:  DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQ--LCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTF

Query:  DKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVP-GEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPL
        DK L++L    G      ++     FWV +  +PL      + R +G  +G V+EV    G  +W G  MRVR+++N+++PL R  R++   G   W   
Subjt:  DKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVP-GEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPL

Query:  QYERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFL-FGDWLRAVPFRHGVANATEEGG----------------------GRPDIQGGGDQVSEVSMP-
         YERLPD CF CG +GHS +EC E  +    D+Q L +G WLRA     G  N+   GG                       + D  G G     V+ P 
Subjt:  QYERLPDFCFRCGRIGHSHRECSEEGEGVGADNQFL-FGDWLRAVPFRHGVANATEEGG----------------------GRPDIQGGGDQVSEVSMP-

Query:  ADRVVDLVDSGVVLEGTT-ASPVPSGTPPSVDPDIGV--ASADKG---KEVADP---GVAPEASSKVGTVPLAPLVATHTV------------------S
         ++  DL   GV + G   +S VP+ TP +++ ++ +  +S D G   KEV++     V      + G + +  L A+H                    S
Subjt:  ADRVVDLVDSGVVLEGTT-ASPVPSGTPPSVDPDIGV--ASADKG---KEVADP---GVAPEASSKVGTVPLAPLVATHTV------------------S

Query:  SGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGS--PRAFQRLAKVVQEK
        S   SV    G+ V +       T+ ++G +      +A +S+     +L S+    H R  +G        +S    E + G+  P    R + +++E+
Subjt:  SGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGS--PRAFQRLAKVVQEK

Query:  RPLVLFLSETKLSSNR---MASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWI-SWDVYHWRLTGFYGFPAADKRDQTWSLLSKL
        R L++    +   SN    + S       EY F   + GRSGGLAL W  ++   ++S+S NHI   I + D   W LTG YG P + +R + W LL  L
Subjt:  RPLVLFLSETKLSSNR---MASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWI-SWDVYHWRLTGFYGFPAADKRDQTWSLLSKL

Query:  RGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDH
          G + PWL+ GDFN +L   EK GG  +  S++  F+ V+  C L DLG+ G  FTW NRR  EG + ERLDR  ++  W +IY N  V+H     SDH
Subjt:  RGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDH

Query:  RPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMG-------NFPQKVQLAIEGLRGA
         P  L L  +   +RR ++R+ RF+  W+ + E   ++  +W        +S  + + ++S RC   +  W ++  G       N  +K+Q       G+
Subjt:  RPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMG-------NFPQKVQLAIEGLRGA

Query:  GSREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLR
         S E   QA  +++  L+ +EL WKQRSR  WL+EGD N+R+FH +AS R+R N I  L D+ G W++   M   L+T+YF  LF+ ++  D   D+ L 
Subjt:  GSREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLR

Query:  DLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS
         ++  V +EMN DLLKP+  EE+  ALKQ HP KAPGPDG+S
Subjt:  DLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS

XP_022841874.1 uncharacterized protein LOC111365549 [Olea europaea var. sylvestris] | 2.2e-106 | 30.36
Query:  DIPLLDESTVQL------CVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLD
        D+ L++E  V +      C++ K+LSSK  N +AF+  M  +W  +R+  I     N+ +  F    +K R+   GPW F K L++     G +    + 
Subjt:  DIPLLDESTVQL------CVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLD

Query:  FSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRE
        FS   FWV I  + +      M   +G  +G V+EV  +      G  + VR+ L++++PL R  ++  G+    W    YERLP+FC+ CG +GH H++
Subjt:  FSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRE

Query:  CSEEGEGVGA--DNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGG---DQVSEVSMPAD----------RVVDLVDSGVVLEGTTASPVPSGTPPSV
               + A   + + +G WLRA        N+++  G   D Q      D  + ++ P+            +V+L +   +LE     P   G P  +
Subjt:  CSEEGEGVGA--DNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGG---DQVSEVSMPAD----------RVVDLVDSGVVLEGTTASPVPSGTPPSV

Query:  DPDIGVASADKGKEVADPGVAPEASSKVGTVPLAP---------LVATHTVSSGAGSVSAGKGKAV-----ANENSEITMTDVHDGPVKKSWKRL-ARSS
           + + ++ + ++    G++  AS     + L P         L A +T  SG           +      + NS   +T        + WKRL    +
Subjt:  DPDIGVASADKGKEVADPGVAPEASSKVGTVPLAP---------LVATHTVSSGAGSVSAGKGKAV-----ANENSEITMTDVHDGPVKKSWKRL-ARSS

Query:  LKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALL
        +  I     +S+       ++  PP    L+S   +     +PR    L+ +++++ P VLFL ETKL++  M   +  L F  C  V + GRSGG+ALL
Subjt:  LKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALL

Query:  WSSSVSFSLLSFSNNHIDGWISWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLD
        W S+V+ S+L +S  HID  I   ++ W LTG YG P   KR +TW+LL +L+  S   WL+ GDFN +L   EK GGRD+P  +L  FQ++   C L D
Subjt:  WSSSVSFSLLSFSNNHIDGWISWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLD

Query:  LGFVGNRFTWCN-RRPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEG
        LGF G  +TW N R     I+ERLDR  ++ +W  ++   +V H     SDHRP+ + L  Q         +  RF+  W+ +     +V + W      
Subjt:  LGFVGNRFTWCN-RRPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEG

Query:  PGLSAPERLAQVSRR---CMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREP-------LSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHR
           +  + L  + R    C   +  W RSK G    ++  A   L     ++P       L  +  +++  +  EEL WKQ+ R  WL++GDQNTR+FH 
Subjt:  PGLSAPERLAQVSRR---CMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREP-------LSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHR

Query:  QASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGL
        +AS R++ N I  L+DDQG W QD     +L+T+YF  LFS++    ++    L +L+  +  +MN+DL++ FTE E+ +ALK+ HP KAPGPD +
Subjt:  QASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGL

XP_030939698.1 uncharacterized protein LOC115964550 [Quercus lobata] | 2.1e-104 | 29.88
Query:  ESTVQLC---VVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVH
        E T++ C   ++G+ L+++P N  A + ++ SVW +    RI   G+ +F  RF   ++   ++  GPW+FD +LLVL         + + F     WV 
Subjt:  ESTVQLC---VVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVH

Query:  ITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEGEGVG
        +  +P +         +G  +G V EV  +         +R+R+ + + +P+RR   +    G  +    +YERL   C++CG++GH  ++CS +G    
Subjt:  ITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEGEGVG

Query:  ADNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTP-PSVDPDIGVASADK---GKEVADPGV
        A+    +GDWL+A             G  R D            M ADR              T +P P+  P PS    + + S D+      + D   
Subjt:  ADNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTP-PSVDPDIGVASADK---GKEVADPGV

Query:  APEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRL
          +  S++        V+     S   +     G   +  +S   +++   G        L  ++L  +++ L +  ++  KR    D      +  ++ 
Subjt:  APEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRL

Query:  KEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWD-VYHWRLTGFY
        +E  S        L+ +V+ K P +LFL ETK S   M   +  L +   F V S  RSGGLALLW   +   + +F+ NHID  I  D   HWRLTGFY
Subjt:  KEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWD-VYHWRLTGFY

Query:  GFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIYERLDRCFSSVAWHD
        G+P   ++ ++W LL  L      PWL  GDFN +L   EK+GG  KPL+ +  F+  +  CGL+DLG+ GN FTW N R +  + ERLDR  +++ W D
Subjt:  GFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIYERLDRCFSSVAWHD

Query:  IYPNCVVNHLDYHQSDHRPIELV--LSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMG-NFP
         +    V HL+   SDH PI +   + P P   ++      RF+E W    + + +++ +W S    P  S   +L +  +RC  ++  W R   G + P
Subjt:  IYPNCVVNHLDYHQSDHRPIELV--LSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMG-NFP

Query:  --QKVQLAIEGL---RGAGSREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQ
          Q+ Q  +E L     A +   +   +A++ +++ ++EL+W+QRSR +WL  GD+NT++FH +AS R+R N I G+ D    W      + ++   YFQ
Subjt:  --QKVQLAIEGL---RGAGSREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQ

Query:  QLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS
        +LFST+ P  Q+ +  L+ +QR V   MN  L +P+T +E+  AL Q HP K+PGPDG+S
Subjt:  QLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis] | 1.5e-107 | 30.53
Query:  DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDK
        +D+  +W+++ L+E E     +  +  L   +  +LC++ K+ + +  N +AFR  M  +W+       +    NV++I F  V++K +++   PW+FD+
Subjt:  DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDK

Query:  SLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYE
         L+ +    G    S + F    FW+ +  +P    +  M   +GSV+G V+EV         G  +R++  +N+ + L R  R +K      W   +YE
Subjt:  SLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYE

Query:  RLPDFCFRCGRIGHSHRECSEEGEGVGADNQFL--FGDWLRAV----PFRH-----GVANATEEGGGRPDIQGGGDQVS-----EVSMPADRVVDLVDSG
        RLP FCF+CGR  H    C E     GADN     +G WLRA      F+H     G+      G      Q  GD  S       S P     +LV+S 
Subjt:  RLPDFCFRCGRIGHSHRECSEEGEGVGADNQFL--FGDWLRAV----PFRH-----GVANATEEGGGRPDIQGGGDQVS-----EVSMPADRVVDLVDSG

Query:  VVLE---GTTASPVP---SGTPPSVDPDIGVASADKGKEV----ADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSV----SAGKGKAVANENSEITMT
          LE   G T   +P    GT  + +P     + DKG +       P    E  S+  T     ++  H+++S    V     A    ++  EN  +  T
Subjt:  VVLE---GTTASPVP---SGTPPSVDPDIGVASADKGKEV----ADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSV----SAGKGKAVANENSEITMT

Query:  DV--------HDGPVK-KSWKRLAR---SSLKDISNVLSS-SVVSGHKRS-AQGDPPDEDGLVSKRLKEVESGSP-RAFQRLAKVVQEKRPLVLFLSETK
        +          + PVK K+WKR AR   + L D++N++   S  +  KRS  Q      DG    + K+ ++ +P  + Q L  +V+ K+P ++FL+ETK
Subjt:  DV--------HDGPVK-KSWKRLAR---SSLKDISNVLSS-SVVSGHKRS-AQGDPPDEDGLVSKRLKEVESGSP-RAFQRLAKVVQEKRPLVLFLSETK

Query:  LSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWIS--WDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGD
         ++ R+   K  LG+E CF V+SKG+SG LALLW  SV   +++++  HI   I+   D   W+LTGFYG P + KR ++W LL  L+   + PWL  GD
Subjt:  LSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWIS--WDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGD

Query:  FNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIY--ERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPG
        FN + +Q EK G  ++P  ++  F+N +  C L DLGF G++FTW N R EG  +  ERLDR   +  W  ++ N  V+HLD  QSDH+ + +  +    
Subjt:  FNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIY--ERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPG

Query:  CWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSR------EPLSQAEAQL
          ++   R+ RF+  W K+ E +++++  W  S    G S   R  Q   +C   +  W R+K  +  + ++   E L+    R      E + +    +
Subjt:  CWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSR------EPLSQAEAQL

Query:  EDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVD
          ++  E L  +QR+++ WLK GD+NT++FH+ +S R+R N I  +    G   QD   + Q + ++F  LF++S PS    D  L  LQ+ +  +M   
Subjt:  EDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVD

Query:  LLKPFTEEEILRALKQSHPHKAPGPDG
        L   FTE E+  A+   +P  +PGPDG
Subjt:  LLKPFTEEEILRALKQSHPHKAPGPDG

TrEMBL top hits | e value | % identity
A0A2N9GF83 CCHC-type domain-containing protein | 3.9e-117 | 31.72
Query:  MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFD
        +++LV  W    L+E E+  F++  +     ++    C++GK+L SK  N  A +  ML +W V      +  G N+F+ +F + ++ +R+    PW FD
Subjt:  MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFD

Query:  KSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQY
         +LLVL    GS   + + F+ C FWV +  VPL Y T      +G  +G V +V         G  +RVRI +++ +P++R   +  G+   +W   +Y
Subjt:  KSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQY

Query:  ERLPDFCFRCGRIGHSHRECSEE--GEGVGADNQFLFGDWLRAV--PFRHGVANATEEGGGRPD--IQGGGDQVSE-----VSMPAD-------------
        ERLP FCF CG++GH  REC  +  G  +   +   +G WLRA    FRH   +  +   G P   +  GG QV++      S PA              
Subjt:  ERLPDFCFRCGRIGHSHRECSEE--GEGVGADNQFLFGDWLRAV--PFRHGVANATEEGGGRPD--IQGGGDQVSE-----VSMPAD-------------

Query:  --RVVDLV---------------DSGVVLEGTTASPVPS----------GTPPSVDPDIGVASADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSG
          R VD V               D    L+  TA  VP+            P S D   GV   D    +A  G        V T      +  H  +S 
Subjt:  --RVVDLV---------------DSGVVLEGTTASPVPS----------GTPPSVDPDIGVASADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSG

Query:  AGSVSAGKGKAVANENSEITMTDVHDGPV--KKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRP
          S+         +     T+T  H+G V  K  WKRLAR+  K     ++S    G KR     PPD   L+S   + +  G+P   + L  +++EK P
Subjt:  AGSVSAGKGKAVANENSEITMTDVHDGPV--KKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRP

Query:  LVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYH--WRLTGFYGFPAADKRDQTWSLLSKLRGGS
         +LFLSET+L    +   +  L F   FCV   G  GGLALLW++ V   + S+S NHID  +   + H  +R+TGFYG     KR ++W+LL  L   +
Subjt:  LVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYH--WRLTGFYGFPAADKRDQTWSLLSKLRGGS

Query:  DTP-WLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPE---------GTIYERLDRCFSSVAWHDIYPNCVVNHLDY
         +P WL  GDFN +L   E+ G   +P  ++  F+  I  CGL D+GFVG+ FTW  +R           G    RLDR   S +W   +    V+HL  
Subjt:  DTP-WLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPE---------GTIYERLDRCFSSVAWHDIYPNCVVNHLDY

Query:  HQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRR---CMRSMAGWGRSKMGNFPQKVQLAIEGLRG
          SDH P+ + L    G      +++ RF+  W K  +   ++  +W S      + A  ++ QV  +   C  ++  W + + G+    ++     L+ 
Subjt:  HQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRR---CMRSMAGWGRSKMGNFPQKVQLAIEGLRG

Query:  AGSREPLS------QAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQ
          +  PL       + +  L  +L++EE+YW+QRSR  W+KEGD+NT++FH Q S R+  N++ GL D+ G W  D+A V  +  DYF+ +FS+S P+ +
Subjt:  AGSREPLS------QAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQ

Query:  DFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS
            S+  ++R V  EMN  LL  FT +EIL ALKQ +P KAPGPDG+S
Subjt:  DFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS

A0A2N9GJ35 Uncharacterized protein | 3.4e-113 | 31.38
Query:  MGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPR
        M LSE E    S+  D  L  +   Q  ++ K+L++KP + +AF+  + ++WS      I     N+F+  F    +  RI    PWTFDK L+ +V   
Subjt:  MGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPR

Query:  GSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRL---VKGNGSVLWCPLQYERLPDFC
        G   P+ + FS   FW+ +  +P+      +   +G  +G ++EV    +    G  +R+R+ +++AQPL R   L       G + W   +YE LP FC
Subjt:  GSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRL---VKGNGSVLWCPLQYERLPDFC

Query:  FRCGRIGHSHREC-----SEEGEGVGADNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTPP
        +RCGR+GH   EC         EGV  +    +G WLRA+  R      + EG  +PD +G      E +MP DR           E  T +        
Subjt:  FRCGRIGHSHREC-----SEEGEGVGADNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTPP

Query:  SVDPDIGVASADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVV
                          DP  +P  S                     G      G  +     EI   ++H        +R A SS KD          
Subjt:  SVDPDIGVASADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVV

Query:  SGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFS
                  PP     +S   + +  G+P+    L  +V+++ P ++FL ET+L+   +   +  LG + C  V+  G+ GGLALLW SSV  ++ S+S
Subjt:  SGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFS

Query:  NNHIDG-WISWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCN
         +HIDG  +  D   WRLTGFYG+P A  R ++WSLL  LR  SD PW+I GDFN +    EK G  D+  +++AAF+  +  C L D+GF G  FTW N
Subjt:  NNHIDG-WISWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCN

Query:  RRPEGTIYE-RLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWR----RSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPER
         R  G +   RLDR  +  AW  ++P+  +NHL    SDH  + L+L  +    R    +  +R+ RF+++WLK++  +++++ +W     G   +A  +
Subjt:  RRPEGTIYE-RLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWR----RSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPER

Query:  LAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSRE-------PLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRI
        +AQ  ++C   +  W +S +   P+ +   ++ L+    +E        ++  +  L  + ++ E+ W+QRSR VWL EGD+NT++FH  AS R+++N I
Subjt:  LAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSRE-------PLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRI

Query:  GGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS
         GL D Q  WR +   V Q+  DYF  LF++S P  +  D  L +++  V   MN  L++PFT+EEI RAL Q HP K+PGPDG+S
Subjt:  GGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS

A0A2N9H1U1 Reverse transcriptase domain-containing protein | 8.4e-112 | 31.68
Query:  MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFD
        MD L   WEN+ LSE E   F +  D     ES     +  + L+ +P+N +A  R    +W   +  R++  GDNV  I F    +  R+++ GPW++D
Subjt:  MDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFD

Query:  KSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVV-EVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQ
        K  ++          + L F   + WV I  +P  +  A + R +GS +G ++  V  E  + W G  +RV + +N+++PL R  ++  G G  +    Q
Subjt:  KSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVV-EVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQ

Query:  YERLPDFCFRCGRIGHSHRECSE--EGEGVGADNQFLFGDWLRAVP---------------FRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLV
        YE+L +FC+ CG I HS ++CS     +G     +  FG W+RA P               FR   A +T    GR   + G D  +   M +  +   +
Subjt:  YERLPDFCFRCGRIGHSHRECSE--EGEGVGADNQFLFGDWLRAVP---------------FRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLV

Query:  DSGVVLEGTTASPVPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSK--VGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKK
           + +E    S   +G    ++P +  A  +  KE  +    P   S+    T+P +P   T       G V            + +    V  GP +K
Subjt:  DSGVVLEGTTASPVPSGTPPSVDPDIGVASADKGKEVADPGVAPEASSK--VGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKK

Query:  SWKRLARSSLK-DISNVLSSSVV-SGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVD
        SWKR   SS+K    + +S  +  +G + +    PP     ++   + +  G+P   Q L ++V+E+ PLVLF+ ET L   R+   +  L F     V 
Subjt:  SWKRLARSSLK-DISNVLSSSVV-SGHKRSAQGDPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVD

Query:  SKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYH-WRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAA
         + + GGL L W    + ++ S+S +HID  I+  +   WR TGFYG P   +R  +W+LL  L      PWL  GDFN LL   EK+GG  +   ++  
Subjt:  SKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYH-WRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAA

Query:  FQNVIDSCGLLDLGFVGNRFTWCNRRPE-GTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQ
        F++ ID CG +DLG+ G  FTWCN R + GT++ERLDR  +++ W + +P   + HL    SDH PI LV  P      R+  R  RF+E WL     ++
Subjt:  FQNVIDSCGLLDLGFVGNRFTWCNRRPE-GTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQ

Query:  LVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGA--GSREPLSQAEA-----QLEDVLQEEELYWKQRSREVWLKEG
         V  +W +   G   S   R+    R C   +  W R K GN  Q++++    LR A   S   +S + A     +++ +L +EE  W QR+R  WLK G
Subjt:  LVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGA--GSREPLSQAEA-----QLEDVLQEEELYWKQRSREVWLKEG

Query:  DQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAP
        D+NTR+FH+ AS R+R N I  L D+QG        + +L  +YF  LF TS P   DF+  L  +   V  +MN  L +PF  +E+  A+KQ  P KAP
Subjt:  DQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAP

Query:  GPDGLS
        GPDG+S
Subjt:  GPDGLS

A0A7N2LIH6 Uncharacterized protein | 4.8e-115 | 30.07
Query:  HSSAFLVLIGS--CDLGWVVSSLGARM--------------DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLS
        H  +FL ++    CDL    +SLG  +              ++L   W+ + ++EAE     + ++     +   + CVV K+L+ + V  +A ++ M  
Subjt:  HSSAFLVLIGS--CDLGWVVSSLGARM--------------DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLS

Query:  VWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGH
        +W   +  +I   G+++F++ F    +K+++M + PW+++K L+++    G   P  +      FWV I  +PL   T      +G+ +G V+EV     
Subjt:  VWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGH

Query:  SDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSE--EGEGVGADNQFLFGDWLRAVPFRH------------
            G  +RVRI  +    L R  ++    G   W   +YERLP+FC++CGR+ H  ++C E  +GE  G + +  +G WLR  P R             
Subjt:  SDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSE--EGEGVGADNQFLFGDWLRAVPFRH------------

Query:  -----------------------------GVANATEEGGGRPD------IQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTPPSVDPDIGVA
                                     G  + +E+  G+ D      ++ GG     VS    + V+LV      +          T    D   G+ 
Subjt:  -----------------------------GVANATEEGGGRPD------IQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTPPSVDPDIGVA

Query:  SADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHD--------GPVKKSWKRLARSSLKDISNVLSSSVVS
        +A   K++ D     E    V  V             G G  +        NE+S + MT   +        GP    WKRLAR + KD S    S  VS
Subjt:  SADKGKEVADPGVAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHD--------GPVKKSWKRLARSSLKDISNVLSSSVVS

Query:  GHKRSAQG---------DPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSV
        G ++   G          PP    +++   + +  G+  A + L   V++K P+++FL ETK S  +M   +  LGF     V S GRSGGLALLW    
Subjt:  GHKRSAQG---------DPPDEDGLVSKRLKEVESGSPRAFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSV

Query:  SFSLLSFSNNHIDGWI--SWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGF
             S S++HID  +  +     WR TGFYG P   KR  +W LL  L    + PWL+ GDFN +++  EK G +D+  +++ AF+ V+  CGL+DLGF
Subjt:  SFSLLSFSNNHIDGWI--SWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGF

Query:  VGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGL
        VG RFTWCN R  +     RLDR  ++ AW  ++P   V+H+    SDH  + L L+ +    RR  +R   F+E W +  E +++V  +W    E   +
Subjt:  VGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGL

Query:  SAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQE-------EELYWKQRSREVWLKEGDQNTRWFHRQASYRQ
           ERL     RC + +  W ++  GN  + ++     L+   S   L +   +++ + +E       EE+ WKQRSR  WL+ GD+N+++FH  AS R+
Subjt:  SAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQE-------EELYWKQRSREVWLKEGDQNTRWFHRQASYRQ

Query:  RLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS
        + NRIGGLMDD G W +D+    +L+ DYF+ ++S+++P+   FDVSL  +   V  EMN +L K F   E+ +AL+Q HP KAPGPDG+S
Subjt:  RLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS

A0A7N2R0C3 Reverse transcriptase domain-containing protein | 6.6e-117 | 30.12
Query:  DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDK
        ++L V W+ + ++E E  +  +  +         + CV  KV+S K +  +A R+ +  +W  ++S ++   G+ +F++ F    +KRR+M + PW ++K
Subjt:  DDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDK

Query:  SLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYE
         L++     G + P  +      FWV I  +PL   T    +A+G  +G  +EV  E      GT +RVR+ +++ + L R  ++    G   W   +YE
Subjt:  SLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYE

Query:  RLPDFCFRCGRIGHSHRECSEE--GEGVGADNQFLFGDWLRAVPFRHG------------------VANATEEGGGRPDIQGGG-------DQVSEVSMP
        RLP+FC+RCG + H  ++C EE   +  G ++   +G WLR  P R G                        E  GR ++Q G        + +S     
Subjt:  RLPDFCFRCGRIGHSHRECSEE--GEGVGADNQFLFGDWLRAVPFRHG------------------VANATEEGGGRPDIQGGG-------DQVSEVSMP

Query:  ADRVVDLVDSGVVLEGTTASPVPSGTPPSV------------DPDIGVASADKGKEVADPGVAPEASSKVGTVPLA----PLVATHTVSSGAGSVSAGKG
         +R  DL+  G   E TT     +G   S+            + +  V + + GKE    G A   +  +           +   + V  G G      G
Subjt:  ADRVVDLVDSGVVLEGTTASPVPSGTPPSV------------DPDIGVASADKGKEVADPGVAPEASSKVGTVPLA----PLVATHTVSSGAGSVSAGKG

Query:  KAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGD----PPDEDGLVSKRLKEVES------------GSPRAFQRLAKVV
              + E        GP    WKR+ R+      +      VS  +R   GD      D++   SKR K   S            GS  A + L   V
Subjt:  KAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGD----PPDEDGLVSKRLKEVES------------GSPRAFQRLAKVV

Query:  QEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWI--SWDVYHWRLTGFYGFPAADKRDQTWSLLSK
        +   P+++FL+ETK S  R+   +R LG      V S GRSGGLA+LW   V  SL S SN+HID  +  S     WR TGFYG P A  R  +W LL  
Subjt:  QEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWI--SWDVYHWRLTGFYGFPAADKRDQTWSLLSK

Query:  LRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSD
        L    + PW++ GDFN +L   EK G  ++   ++  F+  + +CGLLDLGFVG RFTWCN R  E     RLDR  ++  W +++P   V H     SD
Subjt:  LRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRR-PEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSD

Query:  HRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGN---FPQKVQLAIEGLRGAG--
        H  + L +  +    R+ ++R   F+E W ++   ++++  +W   G  P L+   RL    + C   +  W R   GN     ++ Q  ++ L      
Subjt:  HRPIELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGN---FPQKVQLAIEGLRGAG--

Query:  --SREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSL
          S E + + + ++ +V+  EE+ W QRSR +W+K GD+NTR+FH  A+ R+R N+I G++D +G WR++   V +++ +YF++++S++ P+  +F   L
Subjt:  --SREPLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSL

Query:  RDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS
          + R V  +MN DLL+ F EEE+ +AL Q HP K+PGPDG+S
Subjt:  RDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDGLS

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    2.2e-16    22.54
Query:  SDTPWLIGGDFN--ALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRP
        +D   ++ GDF+  A    H        P+  L  FQN +    L+D+   G  +TW N + +  I  +LDR  ++  W   +P+ +        SDH P
Subjt:  SDTPWLIGGDFN--ALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRP

Query:  IELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSS-GEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQ
          ++L   P    + S++  R+             +  +W      G  + +     + +++C + +    R   GN   K + A++ L    S+   + 
Subjt:  IELVLSPQPGCWRRSSQRILRFDETWLKQAELQQLVRDSWGSS-GEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQ

Query:  AEA--QLEDVLQEE--------ELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLF-STSEPSDQDFDV
        +++  ++E V +++        E +++Q+SR  WL++GD NTR+FH+     Q  N I  L  D     ++   V +++  Y+  L  S S+    D   
Subjt:  AEA--QLEDVLQEE--------ELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLF-STSEPSDQDFDV

Query:  SLRDLQRSVDSEMNVDLLKPF-TEEEILRALKQSHPHKAPGPDGLS
         ++D+     ++     L    +++EI  A+     +KAPGPD  +
Subjt:  SLRDLQRSVDSEMNVDLLKPF-TEEEILRALKQSHPHKAPGPDGLS

AT3G42140.1 zinc ion binding;nucleic acid binding    1.1e-07    27.15
Query:  FHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLR
        F S      I+  GPW+F+  + V+   R +   S  +F R  FW+ I  +PL + TA +  ++G  +G  +E                         L 
Subjt:  FHSVTEKRRIMSLGPWTFDKSLLVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLR

Query:  RVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEG-EGVGADN
        R V ++K          QYE+L +FC  CG + H   EC   G +G  AD+
Subjt:  RVVRLVKGNGSVLWCPLQYERLPDFCFRCGRIGHSHRECSEEG-EGVGADN


Sequences
CDS sequence
ATGTTTTATACCTTTCGACCATCGTTGTCAGATTCGAAGCTAGAGGCGAGCAGACAAGAGATGAGGCTGAGGAAGACAAGATTGAAGATCAGAGAAGAAGACACAACGTT
TCTTGTTATTGTCATGAGTAGAAACATAAATGGCAAGCTTAAGATCAGGGAATTTGAAGAAAGGGTTCACAGCGGCGGCGTGGTCTGCTTGACGGTGAGGGCTTCTTGCC
GGCGAGGTCTCTTTGAGGTGTGGGTCTTCTGCAGCGGTTTTCTTCACGGGGTTGAGGTATTCTCCGACGGTTTCAGAGTCTCGAAGGGGTCGGCGGCTTTCGATCTCCTT
CACGGATTTTGGGTTGTCGTCAGCGGCGGCGGAGTTTCATGGGTTCGACGGTTGATCTCCGGCGTTGGGCGGTGTCGTCTCCGGTTCCATTCGGGGTCGAGGTTGACTTC
GGCTTTGTGTGCTTTGCGATTTTGGTCTTGGGGCTGGTCTCAGTCTGTAGCTGTCGGTTTCTTGCGTGTTTGTTCTGTTTTAGGTGGTCGCGGGTTTAGTTCTGTTTGTG
TTTTATTTTGCTTGCAAGTTTCGGTTGTATTTTGGTGTTTGGTTGCTGGTTTGGGTCTCCACAGCTCTGCGTTTCTGGTCCTAATTGGTTCCTGCGATTTGGGCTGGGTT
GTGAGTTCCCTCGGTGCACGTATGGATGATCTTGTGGTTCAATGGGAAAATATGGGGTTATCAGAGGCAGAGTCGACGACGTTCTCTGTGCCAGCAGATATTCCTCTCTT
GGATGAATCAACAGTTCAACTGTGTGTAGTTGGTAAGGTTCTTTCTTCCAAACCGGTGAATCCTGATGCCTTTCGTCGAGTAATGTTATCGGTTTGGAGTGTTCATCGTT
CTACCCGGATTGAACCTTGGGGGGATAATGTCTTCGTAATTCGGTTTCATTCCGTCACTGAAAAGCGGAGAATTATGAGTTTGGGCCCTTGGACCTTCGATAAGTCTTTA
CTGGTTCTGGTGTCTCCTCGAGGGTCGGATGACCCATCTCTTCTGGACTTTTCTCGTTGTGAGTTTTGGGTTCATATCACGAAAGTTCCGTTGAACTATCATACAGCGGC
TATGGCTCGTGCTCTTGGTAGTGTGGTGGGTCACGTGGTTGAGGTGCCAGGGGAAGGCCACAGTGACTGGCTTGGCACAGTAATGAGAGTTCGTATTGTTCTTAACATGG
CTCAACCCCTTCGTCGTGTTGTCCGGCTTGTAAAAGGGAATGGGTCCGTTCTTTGGTGTCCGTTGCAGTATGAACGATTGCCGGATTTCTGTTTTAGGTGTGGGCGTATT
GGCCATTCACATAGGGAGTGCTCGGAAGAGGGGGAAGGCGTGGGTGCTGACAATCAGTTTCTGTTTGGTGACTGGTTGCGGGCTGTTCCATTTCGGCATGGGGTTGCTAA
TGCGACAGAAGAGGGTGGTGGGCGTCCGGATATTCAGGGGGGCGGGGATCAGGTGTCTGAGGTGTCTATGCCAGCTGACCGGGTGGTAGATCTGGTTGATTCAGGAGTTG
TTCTTGAGGGGACTACGGCTTCCCCGGTTCCTTCGGGTACTCCCCCGTCAGTCGATCCTGATATTGGGGTGGCTTCTGCGGACAAGGGTAAGGAGGTGGCCGATCCGGGT
GTTGCTCCAGAAGCTAGCTCTAAGGTGGGTACGGTGCCTTTGGCTCCGTTAGTGGCAACACATACAGTCTCTTCTGGGGCAGGGTCGGTTTCTGCAGGTAAAGGCAAGGC
TGTGGCTAATGAAAATTCTGAGATTACTATGACTGATGTGCATGATGGTCCGGTGAAGAAGAGTTGGAAGCGGCTGGCCAGAAGCTCTTTGAAGGACATTTCCAATGTCT
TATCTTCCTCAGTTGTTAGTGGGCACAAGCGATCAGCCCAGGGGGACCCGCCTGATGAGGATGGGTTAGTCTCCAAGCGACTGAAGGAGGTGGAGTCTGGGTCTCCCCGG
GCTTTCCAGCGCTTGGCCAAGGTGGTTCAAGAGAAAAGACCCCTGGTGCTCTTCCTGTCTGAAACAAAGCTGTCGTCAAACAGGATGGCATCAGCGAAGCGAGTTCTGGG
TTTCGAGTACTGTTTTTGTGTTGATAGCAAAGGTAGGAGTGGTGGTTTGGCTCTGTTGTGGAGTTCGTCTGTCTCCTTCAGCCTCTTGTCATTTTCGAATAACCACATTG
ATGGGTGGATCTCGTGGGACGTTTATCATTGGCGACTCACGGGTTTCTATGGTTTCCCTGCCGCCGATAAGCGGGATCAAACGTGGTCCCTTCTCTCTAAGTTAAGGGGG
GGTTCTGATACTCCTTGGCTTATAGGAGGGGACTTTAATGCCCTGTTGTACCAGCATGAGAAGGAAGGTGGCAGAGATAAACCCCTCTCAGAGCTAGCGGCCTTTCAGAA
TGTGATTGACTCATGTGGGCTTCTTGATTTGGGATTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCGGAAGGAACGATCTATGAGCGCTTGGATAGGTGTTTTA
GCTCAGTTGCTTGGCATGATATCTACCCCAACTGTGTAGTTAACCATCTTGATTATCACCAGTCCGATCACCGACCGATTGAGCTGGTTCTCTCTCCGCAGCCTGGTTGT
TGGAGACGCTCGAGCCAGCGAATCTTACGGTTTGATGAGACTTGGCTGAAGCAAGCAGAGCTGCAGCAGCTGGTCAGGGACTCATGGGGGTCGAGTGGGGAGGGTCCTGG
TTTGTCAGCTCCCGAAAGGTTGGCTCAAGTTTCCAGAAGGTGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAAAATGGGGAACTTCCCTCAGAAGGTACAGCTGGCCA
TTGAGGGATTGAGAGGGGCTGGGTCCCGTGAGCCACTTTCCCAGGCAGAGGCCCAGTTGGAAGATGTGTTACAGGAGGAGGAACTTTACTGGAAGCAAAGATCCAGAGAG
GTGTGGTTGAAGGAAGGGGATCAGAATACTCGGTGGTTTCATCGTCAAGCCTCGTATAGGCAAAGGCTCAATCGTATTGGGGGCCTCATGGACGATCAGGGGGAATGGCG
CCAGGACAGAGCTATGGTTCTTCAGTTGGTGACTGATTATTTCCAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCTCTCCGGGACCTTCAGC
GATCTGTGGATAGTGAAATGAATGTGGATCTGTTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGG
TTATCTGATAGCGAGCATAGCTAG
mRNA sequence
ATGTTTTATACCTTTCGACCATCGTTGTCAGATTCGAAGCTAGAGGCGAGCAGACAAGAGATGAGGCTGAGGAAGACAAGATTGAAGATCAGAGAAGAAGACACAACGTT
TCTTGTTATTGTCATGAGTAGAAACATAAATGGCAAGCTTAAGATCAGGGAATTTGAAGAAAGGGTTCACAGCGGCGGCGTGGTCTGCTTGACGGTGAGGGCTTCTTGCC
GGCGAGGTCTCTTTGAGGTGTGGGTCTTCTGCAGCGGTTTTCTTCACGGGGTTGAGGTATTCTCCGACGGTTTCAGAGTCTCGAAGGGGTCGGCGGCTTTCGATCTCCTT
CACGGATTTTGGGTTGTCGTCAGCGGCGGCGGAGTTTCATGGGTTCGACGGTTGATCTCCGGCGTTGGGCGGTGTCGTCTCCGGTTCCATTCGGGGTCGAGGTTGACTTC
GGCTTTGTGTGCTTTGCGATTTTGGTCTTGGGGCTGGTCTCAGTCTGTAGCTGTCGGTTTCTTGCGTGTTTGTTCTGTTTTAGGTGGTCGCGGGTTTAGTTCTGTTTGTG
TTTTATTTTGCTTGCAAGTTTCGGTTGTATTTTGGTGTTTGGTTGCTGGTTTGGGTCTCCACAGCTCTGCGTTTCTGGTCCTAATTGGTTCCTGCGATTTGGGCTGGGTT
GTGAGTTCCCTCGGTGCACGTATGGATGATCTTGTGGTTCAATGGGAAAATATGGGGTTATCAGAGGCAGAGTCGACGACGTTCTCTGTGCCAGCAGATATTCCTCTCTT
GGATGAATCAACAGTTCAACTGTGTGTAGTTGGTAAGGTTCTTTCTTCCAAACCGGTGAATCCTGATGCCTTTCGTCGAGTAATGTTATCGGTTTGGAGTGTTCATCGTT
CTACCCGGATTGAACCTTGGGGGGATAATGTCTTCGTAATTCGGTTTCATTCCGTCACTGAAAAGCGGAGAATTATGAGTTTGGGCCCTTGGACCTTCGATAAGTCTTTA
CTGGTTCTGGTGTCTCCTCGAGGGTCGGATGACCCATCTCTTCTGGACTTTTCTCGTTGTGAGTTTTGGGTTCATATCACGAAAGTTCCGTTGAACTATCATACAGCGGC
TATGGCTCGTGCTCTTGGTAGTGTGGTGGGTCACGTGGTTGAGGTGCCAGGGGAAGGCCACAGTGACTGGCTTGGCACAGTAATGAGAGTTCGTATTGTTCTTAACATGG
CTCAACCCCTTCGTCGTGTTGTCCGGCTTGTAAAAGGGAATGGGTCCGTTCTTTGGTGTCCGTTGCAGTATGAACGATTGCCGGATTTCTGTTTTAGGTGTGGGCGTATT
GGCCATTCACATAGGGAGTGCTCGGAAGAGGGGGAAGGCGTGGGTGCTGACAATCAGTTTCTGTTTGGTGACTGGTTGCGGGCTGTTCCATTTCGGCATGGGGTTGCTAA
TGCGACAGAAGAGGGTGGTGGGCGTCCGGATATTCAGGGGGGCGGGGATCAGGTGTCTGAGGTGTCTATGCCAGCTGACCGGGTGGTAGATCTGGTTGATTCAGGAGTTG
TTCTTGAGGGGACTACGGCTTCCCCGGTTCCTTCGGGTACTCCCCCGTCAGTCGATCCTGATATTGGGGTGGCTTCTGCGGACAAGGGTAAGGAGGTGGCCGATCCGGGT
GTTGCTCCAGAAGCTAGCTCTAAGGTGGGTACGGTGCCTTTGGCTCCGTTAGTGGCAACACATACAGTCTCTTCTGGGGCAGGGTCGGTTTCTGCAGGTAAAGGCAAGGC
TGTGGCTAATGAAAATTCTGAGATTACTATGACTGATGTGCATGATGGTCCGGTGAAGAAGAGTTGGAAGCGGCTGGCCAGAAGCTCTTTGAAGGACATTTCCAATGTCT
TATCTTCCTCAGTTGTTAGTGGGCACAAGCGATCAGCCCAGGGGGACCCGCCTGATGAGGATGGGTTAGTCTCCAAGCGACTGAAGGAGGTGGAGTCTGGGTCTCCCCGG
GCTTTCCAGCGCTTGGCCAAGGTGGTTCAAGAGAAAAGACCCCTGGTGCTCTTCCTGTCTGAAACAAAGCTGTCGTCAAACAGGATGGCATCAGCGAAGCGAGTTCTGGG
TTTCGAGTACTGTTTTTGTGTTGATAGCAAAGGTAGGAGTGGTGGTTTGGCTCTGTTGTGGAGTTCGTCTGTCTCCTTCAGCCTCTTGTCATTTTCGAATAACCACATTG
ATGGGTGGATCTCGTGGGACGTTTATCATTGGCGACTCACGGGTTTCTATGGTTTCCCTGCCGCCGATAAGCGGGATCAAACGTGGTCCCTTCTCTCTAAGTTAAGGGGG
GGTTCTGATACTCCTTGGCTTATAGGAGGGGACTTTAATGCCCTGTTGTACCAGCATGAGAAGGAAGGTGGCAGAGATAAACCCCTCTCAGAGCTAGCGGCCTTTCAGAA
TGTGATTGACTCATGTGGGCTTCTTGATTTGGGATTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCGGAAGGAACGATCTATGAGCGCTTGGATAGGTGTTTTA
GCTCAGTTGCTTGGCATGATATCTACCCCAACTGTGTAGTTAACCATCTTGATTATCACCAGTCCGATCACCGACCGATTGAGCTGGTTCTCTCTCCGCAGCCTGGTTGT
TGGAGACGCTCGAGCCAGCGAATCTTACGGTTTGATGAGACTTGGCTGAAGCAAGCAGAGCTGCAGCAGCTGGTCAGGGACTCATGGGGGTCGAGTGGGGAGGGTCCTGG
TTTGTCAGCTCCCGAAAGGTTGGCTCAAGTTTCCAGAAGGTGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAAAATGGGGAACTTCCCTCAGAAGGTACAGCTGGCCA
TTGAGGGATTGAGAGGGGCTGGGTCCCGTGAGCCACTTTCCCAGGCAGAGGCCCAGTTGGAAGATGTGTTACAGGAGGAGGAACTTTACTGGAAGCAAAGATCCAGAGAG
GTGTGGTTGAAGGAAGGGGATCAGAATACTCGGTGGTTTCATCGTCAAGCCTCGTATAGGCAAAGGCTCAATCGTATTGGGGGCCTCATGGACGATCAGGGGGAATGGCG
CCAGGACAGAGCTATGGTTCTTCAGTTGGTGACTGATTATTTCCAGCAGCTTTTCTCGACATCAGAGCCGAGTGATCAGGATTTCGATGTATCTCTCCGGGACCTTCAGC
GATCTGTGGATAGTGAAATGAATGTGGATCTGTTGAAACCTTTTACTGAGGAGGAGATTCTTCGGGCTTTGAAGCAGTCTCATCCTCATAAGGCCCCGGGTCCAGATGGG
TTATCTGATAGCGAGCATAGCTAG
Protein sequence
MFYTFRPSLSDSKLEASRQEMRLRKTRLKIREEDTTFLVIVMSRNINGKLKIREFEERVHSGGVVCLTVRASCRRGLFEVWVFCSGFLHGVEVFSDGFRVSKGSAAFDLL
HGFWVVVSGGGVSWVRRLISGVGRCRLRFHSGSRLTSALCALRFWSWGWSQSVAVGFLRVCSVLGGRGFSSVCVLFCLQVSVVFWCLVAGLGLHSSAFLVLIGSCDLGWV
VSSLGARMDDLVVQWENMGLSEAESTTFSVPADIPLLDESTVQLCVVGKVLSSKPVNPDAFRRVMLSVWSVHRSTRIEPWGDNVFVIRFHSVTEKRRIMSLGPWTFDKSL
LVLVSPRGSDDPSLLDFSRCEFWVHITKVPLNYHTAAMARALGSVVGHVVEVPGEGHSDWLGTVMRVRIVLNMAQPLRRVVRLVKGNGSVLWCPLQYERLPDFCFRCGRI
GHSHRECSEEGEGVGADNQFLFGDWLRAVPFRHGVANATEEGGGRPDIQGGGDQVSEVSMPADRVVDLVDSGVVLEGTTASPVPSGTPPSVDPDIGVASADKGKEVADPG
VAPEASSKVGTVPLAPLVATHTVSSGAGSVSAGKGKAVANENSEITMTDVHDGPVKKSWKRLARSSLKDISNVLSSSVVSGHKRSAQGDPPDEDGLVSKRLKEVESGSPR
AFQRLAKVVQEKRPLVLFLSETKLSSNRMASAKRVLGFEYCFCVDSKGRSGGLALLWSSSVSFSLLSFSNNHIDGWISWDVYHWRLTGFYGFPAADKRDQTWSLLSKLRG
GSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCGLLDLGFVGNRFTWCNRRPEGTIYERLDRCFSSVAWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGC
WRRSSQRILRFDETWLKQAELQQLVRDSWGSSGEGPGLSAPERLAQVSRRCMRSMAGWGRSKMGNFPQKVQLAIEGLRGAGSREPLSQAEAQLEDVLQEEELYWKQRSRE
VWLKEGDQNTRWFHRQASYRQRLNRIGGLMDDQGEWRQDRAMVLQLVTDYFQQLFSTSEPSDQDFDVSLRDLQRSVDSEMNVDLLKPFTEEEILRALKQSHPHKAPGPDG
LSDSEHS