; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035328 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035328
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr3:18988768..18993446
RNA-Seq ExpressionLag0035328
SyntenyLag0035328
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]1.2e-23544.81Show/hide
Query:  LTELARKHINEERGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPAIPLLAETVRALNYCKSNKG---------GKLRCCVPLLYIWVHSHFK
        L ++ + +I+E++GL LLALCIYG V++P+ +GYVD  V+KLFF +E+GV+P IP+LAET R+LNYC+ NKG         GKL CCVPLLYIW+HSH K
Subjt:  LTELARKHINEERGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPAIPLLAETVRALNYCKSNKG---------GKLRCCVPLLYIWVHSHFK

Query:  FPEDFGCPKINFSSTWNRTRSTIAEFGTAPWNSSFPRKEAWVSFFSNSKLEHVIWKAHWMPLKALIYRCGSFHTVPLVGPWGCVHYAPLLALRQSWFKQF
        FP +F CP+++FSS WN  R+TI+EFG A W+ ++PRKEAW+SFF+    E+VIWKA WMPLKA+IYRCG FH+VPL+GPWG V+Y PLL LRQ W KQF
Subjt:  FPEDFGCPKINFSSTWNRTRSTIAEFGTAPWNSSFPRKEAWVSFFSNSKLEHVIWKAHWMPLKALIYRCGSFHTVPLVGPWGCVHYAPLLALRQSWFKQF

Query:  IPTTHGLHKLEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIP-TRDNVGESSSRAVDKPSQLATERNELVGRNQTLKLEN
        IP TH L                           +I    H EGVT  Y  W+  R K  I  +R+ V      + ++P+Q   +  EL  +N+ L+ EN
Subjt:  IPTTHGLHKLEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIP-TRDNVGESSSRAVDKPSQLATERNELVGRNQTLKLEN

Query:  EKLQQEVKRLIDQL-------------------VGTSAPSISKGKRTIVTRERTIRGE------------------------------------------
        EKL++E  + +D                     + T   ++ K  R +    R+++ E                                          
Subjt:  EKLQQEVKRLIDQL-------------------VGTSAPSISKGKRTIVTRERTIRGE------------------------------------------

Query:  -------------ILLIKGDYSTMRDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRR--ARIMEEKG-EQ
                      + +K DY     D Q+++ +V +T+  ++++++RA GF+EWA      T                IR     R  ++IMEEK  + 
Subjt:  -------------ILLIKGDYSTMRDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRR--ARIMEEKG-EQ

Query:  EKTKRDIEEIREKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFKEEVKNELATPSNE
        +K +++I  + E+V  I+  L   KGK   DTT    PI +    + +PP F     T    ++P+    N ++                   L   S +
Subjt:  EKTKRDIEEIREKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFKEEVKNELATPSNE

Query:  KLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTW
        KL+VLEERLRA+E T V+GN+DATQLCLVP ++IP KFK+PEF KYDG+TCP+SH++MYCRKMA H++NDKLL+HCFQDSL  PA+RWY+QLD++H+  W
Subjt:  KLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTW

Query:  KNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGR
        K+LAD+FLKQYK NID+APDRLDLQRMEKKS+E FK+YAQRWRD AA+VQPPLTDKE+++MF+NTLR+PFYE M+G+ASTNFSDI+ IGERIEYGIKHGR
Subjt:  KNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGR

Query:  IAK-TAESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNAS-AQYSPFYGQNTRPQMNQNFQSRRQQQPVTPGQQNNR
        +A+ T E    KK   SKKKEGEV  IG  +    +    + +Y  N       +P  +NN S   Y+ +   +T  +  +   S    +P   G Q ++
Subjt:  IAK-TAESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNAS-AQYSPFYGQNTRPQMNQNFQSRRQQQPVTPGQQNNR

Query:  APYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLP---
              +FDPIPMTYTELLPQLIQN QLA  P+ P+QPPYP+WYD NARCDYHAG VGHSTENC ALK  VQ LI AGWL+FKK  E  NV+ NPLP   
Subjt:  APYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLP---

Query:  NHQVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQE
        N +VN +D   +  K  +V EI  P E +F  L   GYVS EY   ++  E
Subjt:  NHQVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQE

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]1.7e-22161.3Show/hide
Query:  MPQYTTYNPLYDIPVGQY--PFPSFKEEVKNELA-----------------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDV
        MPQYTTYNPLYD+PVGQY  PF    +++   +                               PSNEK EVL+ERLRA+E T VFGN+DA+QLC V  +
Subjt:  MPQYTTYNPLYDIPVGQY--PFPSFKEEVKNELA-----------------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDV

Query:  VIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKST
        VIPPK K+PEFEKY+G++CPK+H+ MYCRKMAA+V NDKLLIHCFQDSL GPA+RWYMQLDSSHV +WKNLADSFLKQYKHNID+APDRLDLQRMEKKST
Subjt:  VIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKST

Query:  EDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDK
        + FK+YAQRWRDTAAQVQPPL DKELSAMFINTL+ PFY+ M+GSASTNFSDIMTIGERIEYG++HGRI  T  E  + KK++ SKKKEGEVQM+G    
Subjt:  EDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDK

Query:  S-HRQIHQPITQYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLP
        S  +Q ++   QY   Y PT YGY    VNNA++ Y P+  QN RP           +QNFQ R QQ     T GQQNNR   KQ QFDPIPMTYTELLP
Subjt:  S-HRQIHQPITQYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLP

Query:  QLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTE
        QL QN+QLAP PV P+QPPYPRWYD NARCDYHAGA+ HSTENCT LKYRVQ LIKAGW NFKKE G +V    L NH   Q+NAI +CQ I  K+ V +
Subjt:  QLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTE

Query:  IKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKP
        I TP  +LFEILL +GY+S+EY       + +D++LTC FH GAKGHSLEQC+ F  ++QEL+DSK L  A +H +     VVE++L  E  + S K KP
Subjt:  IKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKP

Query:  LTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGTGGVTRSGKSYTPDQL
        LTI+YREK    +   K   I VP PF+YKSSKAVPW YE KVT+  +  +PSLPVDNI+G GG+TR+G+ YTPD L
Subjt:  LTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGTGGVTRSGKSYTPDQL

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]5.9e-23058.94Show/hide
Query:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFK------
        + K EQEKT++DIEE+REK+DAI+ ALEKGK IA+T+ P      PQ    F PSF       R   E  M QYTTYNPLYDIP GQ+P P  +      
Subjt:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFK------

Query:  -----EEVKNELA--------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRK
             +E +  +A                     PS+EKLEVLEERLRAVEGT VFGN+DA+QLCL   +VIPPKFK+PEFEKY+G++CPK+H++MYCRK
Subjt:  -----EEVKNELA--------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRK

Query:  MAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMF
        MAA++ NDKLLIHCFQDSL GP + WYM LDS HV +WKNLADSFLKQYKHNID+  DRLDLQ MEKK+ E FK+Y QRWRDTAAQ QPP TDKELS+MF
Subjt:  MAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMF

Query:  INTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTAESSSTKKSNT-SKKKEGEVQMIGKVDKSHRQ-IHQPITQYPSNYLPT-YGYYPHQVN
        INTL+ PFY+ M+GSAST+FSDI+TIGERIEYG+ HGRI  T   SST K  T SKKKEGEVQM+G      RQ  +     Y   Y P+ YGY    VN
Subjt:  INTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTAESSSTKKSNT-SKKKEGEVQMIGKVDKSHRQ-IHQPITQYPSNYLPT-YGYYPHQVN

Query:  NASAQYSPFYGQNTRPQMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGH
         A+ QY+    QN RP  +Q FQ R  QQ    T GQQNNR+  KQ+QFDPIPMTYTELLPQL QN+QLAP PV P+QPPYP WYD N RCDYHAGA+GH
Subjt:  NASAQYSPFYGQNTRPQMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGH

Query:  STENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCL
        STENCTALKYRVQ LIKAG L FKKE  P+V NNPLPNH   Q+NA+ +CQ I  ++ V+EI TP + LFEIL  +GY+S+E+   D+  E +D+NLTC 
Subjt:  STENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCL

Query:  FHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSY
        +HAGA+GH LEQC  F +++QEL+D K L V Q+ HQ+  I VVE +   E+ + ++K KPLT+ YREK    ++  + ITIQVP PF+Y SSKAVPW Y
Subjt:  FHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSY

Query:  EYKVTI--NSETPSLPVDNISGTGGVTRSGKSYTPDQL
        E KVT+   +++ SLPVDNI+  GG+TR+G+ YTP+ L
Subjt:  EYKVTI--NSETPSLPVDNISGTGGVTRSGKSYTPDQL

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]6.4e-22466.34Show/hide
Query:  PSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSH
        PSNEK EVLEERLRA+EGT VFGN+DA+QLCLV  +VIPPKFK+PEFEKYDG++CPK+H++MYCRKMAA+V NDKLLIHCFQDSL GPA+RWYMQLDSS+
Subjt:  PSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSH

Query:  VVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGI
        V +WKNLADSFLKQYKHNID+APDRLDLQRMEKKSTE FK+YAQRWRDTAAQVQPPLTDKELSAMFINTL+ PFY+ M+G+ASTNFSDIMTIGERIEYG+
Subjt:  VVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGI

Query:  KHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPIT---QYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQ
        +HGRI  T  E  + KK++ SKKKEGEVQM+G    S +Q  QP +   +Y   Y PT YGY    VNNA++ YSP+  QN RP           +QNFQ
Subjt:  KHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPIT---QYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQ

Query:  SRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNF
         R QQ     T  QQ NR   KQ QFDPIPMTYTELLPQL QN+QLAP PV P+QPPYPRWYD NARCDYHAGA+GHSTENCTALKYRVQ LIKAGWLNF
Subjt:  SRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNF

Query:  KKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQEL
        KKE GP+V  NPLPNH   Q+NAI +CQ+I  K+ V +I+TP  +LFEILL +GYVS+EY   +L  + +D++LTC FHAGAKGHSLEQC+ F  ++QEL
Subjt:  KKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQEL

Query:  VDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREKTTTHD--PKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGT
        +DSK L VA + HQ   I +VE++   E  + + K K LTI+Y EK    +   K ITI VP PF+YKSSKAVPW Y+ KVT+  +  +P LP+DNI+G 
Subjt:  VDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREKTTTHD--PKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGT

Query:  GGVTRSGKSYTPDQL
        GG+TR+G+ YTPD L
Subjt:  GGVTRSGKSYTPDQL

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]4.6e-21441.66Show/hide
Query:  FHTVPLVGPWGCVHYAPLLALRQSWFKQFIPTTHGLHKLEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIPT-RDNVGES
        FH+VPL+GPWG V+Y PLL LRQ W KQFIP TH L   +F+Y  ED   KK + + +W++VR+I    H EGVT  Y  W+  R K  I T R+ V   
Subjt:  FHTVPLVGPWGCVHYAPLLALRQSWFKQFIPTTHGLHKLEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIPT-RDNVGES

Query:  SSRAVDKPSQLATERNELVGRNQTLKLENEKLQQEVKRLID-----------------------------------------------------------
           + ++P Q   +  +L  +N+ L+ ENEKLQ+E  + ID                                                           
Subjt:  SSRAVDKPSQLATERNELVGRNQTLKLENEKLQQEVKRLID-----------------------------------------------------------

Query:  -------------QLVGTSAPSISKGKRTIVTRE----------------------------------------------------RTIRGEILLIKGDY
                     +LV     SI K +  IV  E                                                    + ++ + + +K DY
Subjt:  -------------QLVGTSAPSISKGKRTIVTRE----------------------------------------------------RTIRGEILLIKGDY

Query:  STMRDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRRARIMEEKGEQEKTKRDIEEIREKVDAIIAALEKG
           R D Q ++ +V +T+  ++I++RRA GF+EWA DLR N   +  ++++L  FL MI R+LG                                  KG
Subjt:  STMRDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRRARIMEEKGEQEKTKRDIEEIREKVDAIIAALEKG

Query:  KMIADTTAPDTPIGNPQAGLPFPPSFA-SHVRTTAEASMPQYTTYNPLYDIP-----VGQYPFPSFKEEVKNELATPSNEKLEVLEERLRAVEGTSVFGN
        K++ +T     P+ +    + +PP F   H+  T   +   Y   NPL+D+P     + Q    +  +++     TP+ +KL+VLEERLRA+EGT V+GN
Subjt:  KMIADTTAPDTPIGNPQAGLPFPPSFA-SHVRTTAEASMPQYTTYNPLYDIP-----VGQYPFPSFKEEVKNELATPSNEKLEVLEERLRAVEGTSVFGN

Query:  MDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPD
        +DATQLCLVP ++IP KFK+P F+KYDG++CP+SH++MYCRKMAAH+ NDKLLIHCFQDSL GPA RWY+QLD++H+  WK+LAD+FLKQYKHNID+APD
Subjt:  MDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPD

Query:  RLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKT-AESSSTKKSNTSKKK
        RLDLQRMEKKS+E FK+YAQRWRD AA+VQPPLTDKE++ MF+NTLR+PFY+ M+G+A+TNFSDI+ IGERIEYGIKHGR+ +T AE    KK  T KKK
Subjt:  RLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKT-AESSSTKKSNTSKKK

Query:  EGEVQMIG-------KVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNASAQYSPFYGQNTRPQM-NQNFQSRRQQQPVTPGQQNNRAPYKQAQFDPIPM
        EGEV  IG       K     R+  Q    Y SN      + P+  NN    Y P +  +  P++ N NF      +P   G Q ++   +  +FDPIPM
Subjt:  EGEVQMIG-------KVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNASAQYSPFYGQNTRPQM-NQNFQSRRQQQPVTPGQQNNRAPYKQAQFDPIPM

Query:  TYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLPNHQ---VNAIDQCQDI
        TYTELLPQL+ N QLAP P+ P+QPPYP+WYDPNARCDYHAG VGHSTENC ALK +VQ LI AGWL+FKK  E P+V+NNPLPNH+   VNAID C   
Subjt:  TYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLPNHQ---VNAIDQCQDI

Query:  IKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEI-GVVEELLPKEN
          K +V EI+ P E LFE L   GYVS+EY   ++  E +D+   C+FH G  GH             +++ + +    +   +DN+I  ++ E+  KEN
Subjt:  IKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEI-GVVEELLPKEN

Query:  LNPSFKLKPLTIYYRE---KTTTHDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSETPSLPVDNISGTGGVTRSGKSYTPDQL
            F  +PLT++Y+E   K+++ +PK + ++VP+PFK+K  KAVPW Y+ +V      PS  VDNI+G  G+TRSG+ Y PD L
Subjt:  LNPSFKLKPLTIYYRE---KTTTHDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSETPSLPVDNISGTGGVTRSGKSYTPDQL

TrEMBL top hitse value%identityAlignment
A0A5A7SUT0 Reverse transcriptase2.2e-20637.97Show/hide
Query:  LTELARKHINEERGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPAIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGCPK
        L  LAR  +  ++GL L+ALCIYG V++PR+KGYV+ +V+K+F GIE+GV+P IP++AET R+LN+C+    GK   C P+L+IW+ SH ++P +FG P+
Subjt:  LTELARKHINEERGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPAIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGCPK

Query:  INFSSTWNRTRSTIAEFGTAPWNSSFPRKEAWVSFFSNSKLEHVIWKAHWMPLKALIYRCGSFHTVPLVGPWGCVHYAPLLALRQSWFKQFIPTTHGLHK
        I F+S WN+TR+TI EF +A W+   P   A+                                        G + Y+PLL LRQSW KQF P   GL  
Subjt:  INFSSTWNRTRSTIAEFGTAPWNSSFPRKEAWVSFFSNSKLEHVIWKAHWMPLKALIYRCGSFHTVPLVGPWGCVHYAPLLALRQSWFKQFIPTTHGLHK

Query:  LEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIPTRDNVGESSSRAVD--KPSQLATERNELVGRNQTLKLENEKLQQEVK
         EFSY    +N+K  + + +W+ V+++    H EG T +Y  WR  R+ I   T + + ES    ++  K      E   L   N  L  ENEKL++EVK
Subjt:  LEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIPTRDNVGESSSRAVD--KPSQLATERNELVGRNQTLKLENEKLQQEVK

Query:  RLIDQLVGT--------------------------------------------------SAPSISKGKRTIVT--------RERTIRGEILLIKGDYSTM
        + + Q V T                                                  S   I +  R ++T        + + +  + ++ K +Y T+
Subjt:  RLIDQLVGT--------------------------------------------------SAPSISKGKRTIVT--------RERTIRGEILLIKGDYSTM

Query:  RDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRRARIMEEKGEQEKTKRDIEEIREKVDAIIAALEKGK--
        R      +G++ R    ++ MAR A  FS  AR LR+   P     +EL  FLG+I + LG R              +D+E +++++  I+  L  G+  
Subjt:  RDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRRARIMEEKGEQEKTKRDIEEIREKVDAIIAALEKGK--

Query:  -MIADTTAPDTPIGNPQAGLP-FPPSFASH-----------VRTTAEASMPQYTTYNPLY-DIPVGQYPFPSFKEEVKNELATPSNEKLEVLEERLRAVE
         ++  ++  +  +      +P +PP F                T+  A  P  TT    + + P+        K+  + +    S  +LE LEERLR +E
Subjt:  -MIADTTAPDTPIGNPQAGLP-FPPSFASH-----------VRTTAEASMPQYTTYNPLY-DIPVGQYPFPSFKEEVKNELATPSNEKLEVLEERLRAVE

Query:  GTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKH
                                    +FEKY+GT+CPKSH+VMYCRKM+A+ H+DKLLIHCFQDSL  PA+RWYMQLD S V  WK+LADSFLKQYK+
Subjt:  GTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKH

Query:  NIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTAESSSTKKS
        NID+APDRLDLQRMEKK+ E FK+YAQRWR+ AAQVQPPLTDKEL+AMFINTLR+P+Y+ MVGSASTNFSD++TIGERIE+G+K+GRI+  A  S T++ 
Subjt:  NIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTAESSSTKKS

Query:  NTSKKKEGEVQMIGKVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNASAQYSPFYGQNTRPQMNQNFQSRRQQQPVTPGQQNNRAPYKQAQFDPIPMTY
         T KKKEGEV  +    +   ++  PI    +N+ P+Y          +   SPF GQ+T                    Q+N R  +KQ +FDPIPM+Y
Subjt:  NTSKKKEGEVQMIGKVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNASAQYSPFYGQNTRPQMNQNFQSRRQQQPVTPGQQNNRAPYKQAQFDPIPMTY

Query:  TELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLPNHQ---VNAIDQCQDIIK
        TELLP LI++HQ+A  P  PLQPPYP+WYDPNA+C+YHAGAVGHSTENC  LK +VQ L+KAGWL FKK  E P+V+ NPLPNH+   +NA+D       
Subjt:  TELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLPNHQ---VNAIDQCQDIIK

Query:  KADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLN-
        K  V+++ T  + LF+IL   GY+S  + + D  +    +N  CL H     HS+E C  F   +Q+L+DSK L++ Q   Q  E+ ++      E  + 
Subjt:  KADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLN-

Query:  -PSFKLKPLTIYYREKTT----THDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSETPSLPVDNISGTGGVTRSGKSYTPDQL
          +F  KPL I+Y EK +       PK++T+++P                                    GG+TRSG+ YTPD L
Subjt:  -PSFKLKPLTIYYREKTT----THDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSETPSLPVDNISGTGGVTRSGKSYTPDQL

A0A5A7T1W2 Retrotrans_gag domain-containing protein6.0e-23644.81Show/hide
Query:  LTELARKHINEERGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPAIPLLAETVRALNYCKSNKG---------GKLRCCVPLLYIWVHSHFK
        L ++ + +I+E++GL LLALCIYG V++P+ +GYVD  V+KLFF +E+GV+P IP+LAET R+LNYC+ NKG         GKL CCVPLLYIW+HSH K
Subjt:  LTELARKHINEERGLVLLALCIYGMVLYPRVKGYVDGDVLKLFFGIEQGVDPAIPLLAETVRALNYCKSNKG---------GKLRCCVPLLYIWVHSHFK

Query:  FPEDFGCPKINFSSTWNRTRSTIAEFGTAPWNSSFPRKEAWVSFFSNSKLEHVIWKAHWMPLKALIYRCGSFHTVPLVGPWGCVHYAPLLALRQSWFKQF
        FP +F CP+++FSS WN  R+TI+EFG A W+ ++PRKEAW+SFF+    E+VIWKA WMPLKA+IYRCG FH+VPL+GPWG V+Y PLL LRQ W KQF
Subjt:  FPEDFGCPKINFSSTWNRTRSTIAEFGTAPWNSSFPRKEAWVSFFSNSKLEHVIWKAHWMPLKALIYRCGSFHTVPLVGPWGCVHYAPLLALRQSWFKQF

Query:  IPTTHGLHKLEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIP-TRDNVGESSSRAVDKPSQLATERNELVGRNQTLKLEN
        IP TH L                           +I    H EGVT  Y  W+  R K  I  +R+ V      + ++P+Q   +  EL  +N+ L+ EN
Subjt:  IPTTHGLHKLEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIP-TRDNVGESSSRAVDKPSQLATERNELVGRNQTLKLEN

Query:  EKLQQEVKRLIDQL-------------------VGTSAPSISKGKRTIVTRERTIRGE------------------------------------------
        EKL++E  + +D                     + T   ++ K  R +    R+++ E                                          
Subjt:  EKLQQEVKRLIDQL-------------------VGTSAPSISKGKRTIVTRERTIRGE------------------------------------------

Query:  -------------ILLIKGDYSTMRDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRR--ARIMEEKG-EQ
                      + +K DY     D Q+++ +V +T+  ++++++RA GF+EWA      T                IR     R  ++IMEEK  + 
Subjt:  -------------ILLIKGDYSTMRDDMQIILGKVSRTMNTIKIMARRARGFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRR--ARIMEEKG-EQ

Query:  EKTKRDIEEIREKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFKEEVKNELATPSNE
        +K +++I  + E+V  I+  L   KGK   DTT    PI +    + +PP F     T    ++P+    N ++                   L   S +
Subjt:  EKTKRDIEEIREKVDAIIAALE--KGKMIADTTAPDTPIGNPQAGLPFPPSFASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFKEEVKNELATPSNE

Query:  KLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTW
        KL+VLEERLRA+E T V+GN+DATQLCLVP ++IP KFK+PEF KYDG+TCP+SH++MYCRKMA H++NDKLL+HCFQDSL  PA+RWY+QLD++H+  W
Subjt:  KLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTW

Query:  KNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGR
        K+LAD+FLKQYK NID+APDRLDLQRMEKKS+E FK+YAQRWRD AA+VQPPLTDKE+++MF+NTLR+PFYE M+G+ASTNFSDI+ IGERIEYGIKHGR
Subjt:  KNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGR

Query:  IAK-TAESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNAS-AQYSPFYGQNTRPQMNQNFQSRRQQQPVTPGQQNNR
        +A+ T E    KK   SKKKEGEV  IG  +    +    + +Y  N       +P  +NN S   Y+ +   +T  +  +   S    +P   G Q ++
Subjt:  IAK-TAESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNAS-AQYSPFYGQNTRPQMNQNFQSRRQQQPVTPGQQNNR

Query:  APYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLP---
              +FDPIPMTYTELLPQLIQN QLA  P+ P+QPPYP+WYD NARCDYHAG VGHSTENC ALK  VQ LI AGWL+FKK  E  NV+ NPLP   
Subjt:  APYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPNVDNNPLP---

Query:  NHQVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQE
        N +VN +D   +  K  +V EI  P E +F  L   GYVS EY   ++  E
Subjt:  NHQVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQE

A0A6J1D099 Ribonuclease H6.4e-22261.3Show/hide
Query:  MPQYTTYNPLYDIPVGQY--PFPSFKEEVKNELA-----------------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDV
        MPQYTTYNPLYD+PVGQY  PF    +++   +                               PSNEK EVL+ERLRA+E T VFGN+DA+QLC V  +
Subjt:  MPQYTTYNPLYDIPVGQY--PFPSFKEEVKNELA-----------------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDV

Query:  VIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKST
        VIPPK K+PEFEKY+G++CPK+H+ MYCRKMAA+V NDKLLIHCFQDSL GPA+RWYMQLDSSHV +WKNLADSFLKQYKHNID+APDRLDLQRMEKKST
Subjt:  VIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKST

Query:  EDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDK
        + FK+YAQRWRDTAAQVQPPL DKELSAMFINTL+ PFY+ M+GSASTNFSDIMTIGERIEYG++HGRI  T  E  + KK++ SKKKEGEVQM+G    
Subjt:  EDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDK

Query:  S-HRQIHQPITQYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLP
        S  +Q ++   QY   Y PT YGY    VNNA++ Y P+  QN RP           +QNFQ R QQ     T GQQNNR   KQ QFDPIPMTYTELLP
Subjt:  S-HRQIHQPITQYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLP

Query:  QLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTE
        QL QN+QLAP PV P+QPPYPRWYD NARCDYHAGA+ HSTENCT LKYRVQ LIKAGW NFKKE G +V    L NH   Q+NAI +CQ I  K+ V +
Subjt:  QLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTE

Query:  IKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKP
        I TP  +LFEILL +GY+S+EY       + +D++LTC FH GAKGHSLEQC+ F  ++QEL+DSK L  A +H +     VVE++L  E  + S K KP
Subjt:  IKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKP

Query:  LTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGTGGVTRSGKSYTPDQL
        LTI+YREK    +   K   I VP PF+YKSSKAVPW YE KVT+  +  +PSLPVDNI+G GG+TR+G+ YTPD L
Subjt:  LTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGTGGVTRSGKSYTPDQL

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222311.7e-23059.08Show/hide
Query:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFK------
        + K EQEKT++DIEE+REK+DAI+ ALEKGK IA+T+ P      PQ    F PSF       R   E  M QYTTYNPLYDIP GQ+P P  +      
Subjt:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAEASMPQYTTYNPLYDIPVGQYPFPSFK------

Query:  -----EEVKNELA--------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRK
             +E +  +A                     PS+EKLEVLEERLRAVEGT VFGN+DA+QLCL   +VIPPKFK+PEFEKYDG++CPK+H++MYCRK
Subjt:  -----EEVKNELA--------------------TPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRK

Query:  MAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMF
        MAA++ NDKLLIHCFQDSL GP + WYM LDS HV +WKNLADSFLKQYKHNID+  DRLDLQ MEKK+ E FK+Y QRWRDTAAQ QPP TDKELS+MF
Subjt:  MAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMF

Query:  INTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTAESSSTKKSNT-SKKKEGEVQMIGKVDKSHRQ-IHQPITQYPSNYLPT-YGYYPHQVN
        INTL+ PFY+ M+GSAST+FSDI+TIGERIEYG+ HGRI  T   SST K  T SKKKEGEVQM+G      RQ  +     Y   Y P+ YGY    VN
Subjt:  INTLRSPFYEHMVGSASTNFSDIMTIGERIEYGIKHGRIAKTAESSSTKKSNT-SKKKEGEVQMIGKVDKSHRQ-IHQPITQYPSNYLPT-YGYYPHQVN

Query:  NASAQYSPFYGQNTRPQMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGH
         A+ QY+    QN RP  +Q FQ R  QQ    T GQQNNR+  KQ+QFDPIPMTYTELLPQL QN+QLAP PV P+QPPYP WYD N RCDYHAGA+GH
Subjt:  NASAQYSPFYGQNTRPQMNQNFQSRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGH

Query:  STENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCL
        STENCTALKYRVQ LIKAG L FKKE  P+V NNPLPNH   Q+NA+ +CQ I  ++ V+EI TP + LFEIL  +GY+S+E+   D+  E +D+NLTC 
Subjt:  STENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCL

Query:  FHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSY
        +HAGA+GH LEQC  F +++QEL+D K L V Q+ HQ+  I VVE +   E+ + ++K KPLT+ YREK    ++  + ITIQVP PF+Y SSKAVPW Y
Subjt:  FHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREK--TTTHDPKSITIQVPTPFKYKSSKAVPWSY

Query:  EYKVTI--NSETPSLPVDNISGTGGVTRSGKSYTPDQL
        E KVT+   +++ SLPVDNI+  GG+TR+G+ YTP+ L
Subjt:  EYKVTI--NSETPSLPVDNISGTGGVTRSGKSYTPDQL

A0A6J1E2J7 Ribonuclease H3.1e-22466.34Show/hide
Query:  PSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSH
        PSNEK EVLEERLRA+EGT VFGN+DA+QLCLV  +VIPPKFK+PEFEKYDG++CPK+H++MYCRKMAA+V NDKLLIHCFQDSL GPA+RWYMQLDSS+
Subjt:  PSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIHCFQDSLKGPAARWYMQLDSSH

Query:  VVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGI
        V +WKNLADSFLKQYKHNID+APDRLDLQRMEKKSTE FK+YAQRWRDTAAQVQPPLTDKELSAMFINTL+ PFY+ M+G+ASTNFSDIMTIGERIEYG+
Subjt:  VVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDIMTIGERIEYGI

Query:  KHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPIT---QYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQ
        +HGRI  T  E  + KK++ SKKKEGEVQM+G    S +Q  QP +   +Y   Y PT YGY    VNNA++ YSP+  QN RP           +QNFQ
Subjt:  KHGRIAKTA-ESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPIT---QYPSNYLPT-YGYYPHQVNNASAQYSPFYGQNTRP---------QMNQNFQ

Query:  SRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNF
         R QQ     T  QQ NR   KQ QFDPIPMTYTELLPQL QN+QLAP PV P+QPPYPRWYD NARCDYHAGA+GHSTENCTALKYRVQ LIKAGWLNF
Subjt:  SRRQQQPV--TPGQQNNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNF

Query:  KKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQEL
        KKE GP+V  NPLPNH   Q+NAI +CQ+I  K+ V +I+TP  +LFEILL +GYVS+EY   +L  + +D++LTC FHAGAKGHSLEQC+ F  ++QEL
Subjt:  KKEEGPNVDNNPLPNH---QVNAIDQCQDIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQEL

Query:  VDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREKTTTHD--PKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGT
        +DSK L VA + HQ   I +VE++   E  + + K K LTI+Y EK    +   K ITI VP PF+YKSSKAVPW Y+ KVT+  +  +P LP+DNI+G 
Subjt:  VDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKPLTIYYREKTTTHD--PKSITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPSLPVDNISGT

Query:  GGVTRSGKSYTPDQL
        GG+TR+G+ YTPD L
Subjt:  GGVTRSGKSYTPDQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCCAACGTACCCATAGAGTCGATACGGTCATCTTCCTCTGAGTATGATGAGTTGAGTACTGTGCTACAATGGGCAGAGCAGACACAACTAAAGCATGGGGATAA
TCTGCCTTTTAAAAGCCCCGTGTCGTCATCCTTCTCAAACCTCATGTATATTCCAGTTAATCGGTTCACCGTCCAGGCTGTGTTAGAATTCTGGGACCCGGCCTACAGAT
TATTAACAGAACTCGCTCGCAAACACATAAATGAAGAAAGGGGCCTGGTGCTGTTAGCTTTGTGCATCTACGGGATGGTCTTGTATCCCAGAGTTAAAGGATATGTGGAC
GGTGATGTTTTGAAGTTATTCTTCGGCATTGAGCAAGGGGTGGATCCAGCCATACCTCTTCTAGCGGAAACGGTTCGGGCGCTCAATTACTGTAAGTCCAACAAAGGCGG
GAAGTTGAGATGCTGTGTGCCCTTACTCTATATTTGGGTGCATAGTCATTTTAAGTTCCCAGAAGATTTTGGCTGTCCAAAAATCAACTTCAGCAGCACGTGGAATCGAA
CAAGGAGTACAATTGCAGAATTTGGCACGGCCCCATGGAACTCAAGTTTTCCGAGGAAAGAAGCATGGGTGTCTTTCTTTTCGAATTCCAAATTAGAGCATGTGATTTGG
AAGGCACATTGGATGCCCCTAAAAGCCTTAATCTACAGATGCGGGAGCTTCCACACAGTTCCTCTAGTAGGCCCTTGGGGATGCGTCCATTATGCTCCGTTGTTGGCCTT
GCGACAAAGTTGGTTTAAACAATTCATACCAACAACTCATGGTCTGCATAAATTAGAATTTTCCTACAGTTGCGAGGATTCTAACGAGAAGAAGCGACAAGTGCTGACTT
CCTGGAGAACAGTTAGAAGGATAAATGGCAATAGTCACTCTGAGGGAGTTACTCCAGAATATCTGCAATGGCGCATTAGGAGGAGTAAGATCCCAATTCCGACTCGTGAC
AATGTGGGAGAATCATCTAGTAGAGCAGTAGATAAGCCTAGCCAGCTAGCGACAGAGCGGAATGAGTTAGTGGGAAGAAATCAAACATTGAAACTAGAAAATGAAAAATT
ACAGCAAGAGGTCAAACGTTTGATAGACCAGCTCGTTGGAACGTCAGCTCCAAGCATTTCAAAGGGCAAGCGAACAATTGTCACTAGAGAGAGGACAATTAGAGGAGAAA
TACTCCTCATTAAGGGAGATTATTCCACTATGAGAGATGACATGCAGATAATTCTTGGGAAAGTAAGTCGAACCATGAACACTATCAAGATCATGGCTAGGAGAGCCCGA
GGATTTTCAGAATGGGCAAGGGATCTGCGAGAGAATACATCACCTATGGCCTCTAATGCGGAGGAGTTGTTTGAGTTTTTAGGGATGATTCGTAGAGACCTTGGGCGTAG
AGCAAGGATTATGGAAGAAAAGGGTGAACAAGAGAAGACTAAGCGGGACATCGAGGAAATCAGGGAAAAGGTTGATGCAATCATTGCCGCTTTGGAAAAGGGCAAAATGA
TTGCAGATACAACTGCACCAGACACTCCGATTGGAAACCCTCAAGCTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCGTACGACAGCAGAAGCGTCCATGCCA
CAATATACTACCTATAACCCCTTATATGACATACCTGTTGGGCAATACCCTTTTCCATCATTTAAAGAAGAAGTAAAGAACGAGTTAGCCACACCCTCTAATGAAAAACT
TGAAGTTCTAGAGGAAAGGTTAAGGGCGGTTGAGGGCACTAGTGTTTTTGGAAACATGGATGCCACTCAATTATGTTTAGTTCCAGACGTTGTGATCCCACCAAAATTCA
AACTTCCTGAATTTGAAAAATATGACGGGACAACATGCCCTAAAAGTCACATTGTTATGTACTGTAGGAAGATGGCAGCGCACGTTCATAATGATAAGCTGCTCATACAT
TGCTTTCAGGATAGCTTAAAGGGTCCAGCTGCCCGTTGGTATATGCAGTTGGATAGTTCACATGTTGTCACATGGAAGAACTTGGCTGATTCCTTCCTCAAACAGTACAA
GCACAATATTGATTTGGCTCCCGATCGACTGGACTTACAGAGGATGGAAAAGAAGAGTACTGAAGACTTTAAAGATTATGCTCAAAGGTGGCGGGATACGGCAGCTCAAG
TCCAACCCCCTTTAACGGATAAGGAGCTTTCTGCCATGTTCATCAATACCTTAAGGTCCCCATTCTATGAACATATGGTTGGAAGTGCGTCAACCAATTTCTCTGATATC
ATGACAATTGGAGAAAGAATTGAGTACGGGATCAAACATGGTAGGATAGCCAAAACTGCTGAATCCTCAAGCACGAAGAAGAGCAATACTTCAAAAAAGAAGGAGGGAGA
AGTACAGATGATAGGGAAGGTGGATAAGTCCCATCGACAAATCCACCAACCCATCACACAATATCCTTCGAATTATTTACCTACGTACGGATATTATCCACACCAGGTGA
ATAATGCATCAGCGCAGTACTCCCCATTTTATGGCCAAAATACCAGACCCCAAATGAATCAAAACTTTCAGTCTCGAAGGCAACAACAACCAGTCACTCCAGGTCAGCAA
AATAATCGAGCACCTTACAAGCAAGCTCAATTTGATCCGATTCCGATGACATATACTGAGCTCTTGCCTCAGTTAATTCAGAACCATCAATTGGCACCCAAGCCTGTTGC
TCCACTGCAACCCCCTTATCCAAGATGGTATGATCCCAATGCAAGGTGTGATTACCATGCAGGTGCTGTTGGGCATTCTACAGAAAATTGCACTGCTTTGAAGTATAGGG
TGCAAGAGCTCATTAAGGCCGGATGGTTAAATTTCAAAAAGGAGGAGGGGCCCAATGTTGATAATAATCCTTTACCAAACCACCAGGTGAATGCAATAGATCAGTGTCAG
GATATTATCAAGAAGGCAGATGTTACAGAGATCAAGACTCCCAAGGAGAAGCTTTTTGAGATTCTCCTCGTTAACGGATATGTATCAATTGAGTATGCACACAAGGACCT
TGTTCAAGAAGAATTTGATGATAATCTGACTTGCCTATTTCATGCTGGGGCAAAGGGACATTCTTTAGAACAATGTCATCGTTTTCATAAAAGGATCCAAGAACTGGTGG
ATTCAAAGTTTCTTGTGGTTGCCCAAGCCCATCACCAGGATAATGAAATAGGCGTCGTGGAAGAATTGTTGCCTAAAGAAAATTTGAATCCATCTTTCAAACTAAAGCCA
CTCACGATCTATTACCGTGAGAAAACTACTACTCATGATCCAAAGTCGATCACCATTCAGGTGCCGACTCCTTTCAAGTACAAGAGTTCTAAGGCAGTACCATGGAGTTA
TGAGTACAAAGTAACTATTAACTCAGAAACGCCCTCACTTCCAGTTGACAACATTTCCGGAACGGGAGGCGTAACACGGAGTGGGAAATCCTACACTCCAGATCAGTTAT
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCAACCAAATA
TGGCCGAAATTTTTCAAAGGCAAACACTACAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGCCAACGTACCCATAGAGTCGATACGGTCATCTTCCTCTGAGTATGATGAGTTGAGTACTGTGCTACAATGGGCAGAGCAGACACAACTAAAGCATGGGGATAA
TCTGCCTTTTAAAAGCCCCGTGTCGTCATCCTTCTCAAACCTCATGTATATTCCAGTTAATCGGTTCACCGTCCAGGCTGTGTTAGAATTCTGGGACCCGGCCTACAGAT
TATTAACAGAACTCGCTCGCAAACACATAAATGAAGAAAGGGGCCTGGTGCTGTTAGCTTTGTGCATCTACGGGATGGTCTTGTATCCCAGAGTTAAAGGATATGTGGAC
GGTGATGTTTTGAAGTTATTCTTCGGCATTGAGCAAGGGGTGGATCCAGCCATACCTCTTCTAGCGGAAACGGTTCGGGCGCTCAATTACTGTAAGTCCAACAAAGGCGG
GAAGTTGAGATGCTGTGTGCCCTTACTCTATATTTGGGTGCATAGTCATTTTAAGTTCCCAGAAGATTTTGGCTGTCCAAAAATCAACTTCAGCAGCACGTGGAATCGAA
CAAGGAGTACAATTGCAGAATTTGGCACGGCCCCATGGAACTCAAGTTTTCCGAGGAAAGAAGCATGGGTGTCTTTCTTTTCGAATTCCAAATTAGAGCATGTGATTTGG
AAGGCACATTGGATGCCCCTAAAAGCCTTAATCTACAGATGCGGGAGCTTCCACACAGTTCCTCTAGTAGGCCCTTGGGGATGCGTCCATTATGCTCCGTTGTTGGCCTT
GCGACAAAGTTGGTTTAAACAATTCATACCAACAACTCATGGTCTGCATAAATTAGAATTTTCCTACAGTTGCGAGGATTCTAACGAGAAGAAGCGACAAGTGCTGACTT
CCTGGAGAACAGTTAGAAGGATAAATGGCAATAGTCACTCTGAGGGAGTTACTCCAGAATATCTGCAATGGCGCATTAGGAGGAGTAAGATCCCAATTCCGACTCGTGAC
AATGTGGGAGAATCATCTAGTAGAGCAGTAGATAAGCCTAGCCAGCTAGCGACAGAGCGGAATGAGTTAGTGGGAAGAAATCAAACATTGAAACTAGAAAATGAAAAATT
ACAGCAAGAGGTCAAACGTTTGATAGACCAGCTCGTTGGAACGTCAGCTCCAAGCATTTCAAAGGGCAAGCGAACAATTGTCACTAGAGAGAGGACAATTAGAGGAGAAA
TACTCCTCATTAAGGGAGATTATTCCACTATGAGAGATGACATGCAGATAATTCTTGGGAAAGTAAGTCGAACCATGAACACTATCAAGATCATGGCTAGGAGAGCCCGA
GGATTTTCAGAATGGGCAAGGGATCTGCGAGAGAATACATCACCTATGGCCTCTAATGCGGAGGAGTTGTTTGAGTTTTTAGGGATGATTCGTAGAGACCTTGGGCGTAG
AGCAAGGATTATGGAAGAAAAGGGTGAACAAGAGAAGACTAAGCGGGACATCGAGGAAATCAGGGAAAAGGTTGATGCAATCATTGCCGCTTTGGAAAAGGGCAAAATGA
TTGCAGATACAACTGCACCAGACACTCCGATTGGAAACCCTCAAGCTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCGTACGACAGCAGAAGCGTCCATGCCA
CAATATACTACCTATAACCCCTTATATGACATACCTGTTGGGCAATACCCTTTTCCATCATTTAAAGAAGAAGTAAAGAACGAGTTAGCCACACCCTCTAATGAAAAACT
TGAAGTTCTAGAGGAAAGGTTAAGGGCGGTTGAGGGCACTAGTGTTTTTGGAAACATGGATGCCACTCAATTATGTTTAGTTCCAGACGTTGTGATCCCACCAAAATTCA
AACTTCCTGAATTTGAAAAATATGACGGGACAACATGCCCTAAAAGTCACATTGTTATGTACTGTAGGAAGATGGCAGCGCACGTTCATAATGATAAGCTGCTCATACAT
TGCTTTCAGGATAGCTTAAAGGGTCCAGCTGCCCGTTGGTATATGCAGTTGGATAGTTCACATGTTGTCACATGGAAGAACTTGGCTGATTCCTTCCTCAAACAGTACAA
GCACAATATTGATTTGGCTCCCGATCGACTGGACTTACAGAGGATGGAAAAGAAGAGTACTGAAGACTTTAAAGATTATGCTCAAAGGTGGCGGGATACGGCAGCTCAAG
TCCAACCCCCTTTAACGGATAAGGAGCTTTCTGCCATGTTCATCAATACCTTAAGGTCCCCATTCTATGAACATATGGTTGGAAGTGCGTCAACCAATTTCTCTGATATC
ATGACAATTGGAGAAAGAATTGAGTACGGGATCAAACATGGTAGGATAGCCAAAACTGCTGAATCCTCAAGCACGAAGAAGAGCAATACTTCAAAAAAGAAGGAGGGAGA
AGTACAGATGATAGGGAAGGTGGATAAGTCCCATCGACAAATCCACCAACCCATCACACAATATCCTTCGAATTATTTACCTACGTACGGATATTATCCACACCAGGTGA
ATAATGCATCAGCGCAGTACTCCCCATTTTATGGCCAAAATACCAGACCCCAAATGAATCAAAACTTTCAGTCTCGAAGGCAACAACAACCAGTCACTCCAGGTCAGCAA
AATAATCGAGCACCTTACAAGCAAGCTCAATTTGATCCGATTCCGATGACATATACTGAGCTCTTGCCTCAGTTAATTCAGAACCATCAATTGGCACCCAAGCCTGTTGC
TCCACTGCAACCCCCTTATCCAAGATGGTATGATCCCAATGCAAGGTGTGATTACCATGCAGGTGCTGTTGGGCATTCTACAGAAAATTGCACTGCTTTGAAGTATAGGG
TGCAAGAGCTCATTAAGGCCGGATGGTTAAATTTCAAAAAGGAGGAGGGGCCCAATGTTGATAATAATCCTTTACCAAACCACCAGGTGAATGCAATAGATCAGTGTCAG
GATATTATCAAGAAGGCAGATGTTACAGAGATCAAGACTCCCAAGGAGAAGCTTTTTGAGATTCTCCTCGTTAACGGATATGTATCAATTGAGTATGCACACAAGGACCT
TGTTCAAGAAGAATTTGATGATAATCTGACTTGCCTATTTCATGCTGGGGCAAAGGGACATTCTTTAGAACAATGTCATCGTTTTCATAAAAGGATCCAAGAACTGGTGG
ATTCAAAGTTTCTTGTGGTTGCCCAAGCCCATCACCAGGATAATGAAATAGGCGTCGTGGAAGAATTGTTGCCTAAAGAAAATTTGAATCCATCTTTCAAACTAAAGCCA
CTCACGATCTATTACCGTGAGAAAACTACTACTCATGATCCAAAGTCGATCACCATTCAGGTGCCGACTCCTTTCAAGTACAAGAGTTCTAAGGCAGTACCATGGAGTTA
TGAGTACAAAGTAACTATTAACTCAGAAACGCCCTCACTTCCAGTTGACAACATTTCCGGAACGGGAGGCGTAACACGGAGTGGGAAATCCTACACTCCAGATCAGTTAT
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCAACCAAATA
TGGCCGAAATTTTTCAAAGGCAAACACTACAGCTAA
Protein sequenceShow/hide protein sequence
MNANVPIESIRSSSSEYDELSTVLQWAEQTQLKHGDNLPFKSPVSSSFSNLMYIPVNRFTVQAVLEFWDPAYRLLTELARKHINEERGLVLLALCIYGMVLYPRVKGYVD
GDVLKLFFGIEQGVDPAIPLLAETVRALNYCKSNKGGKLRCCVPLLYIWVHSHFKFPEDFGCPKINFSSTWNRTRSTIAEFGTAPWNSSFPRKEAWVSFFSNSKLEHVIW
KAHWMPLKALIYRCGSFHTVPLVGPWGCVHYAPLLALRQSWFKQFIPTTHGLHKLEFSYSCEDSNEKKRQVLTSWRTVRRINGNSHSEGVTPEYLQWRIRRSKIPIPTRD
NVGESSSRAVDKPSQLATERNELVGRNQTLKLENEKLQQEVKRLIDQLVGTSAPSISKGKRTIVTRERTIRGEILLIKGDYSTMRDDMQIILGKVSRTMNTIKIMARRAR
GFSEWARDLRENTSPMASNAEELFEFLGMIRRDLGRRARIMEEKGEQEKTKRDIEEIREKVDAIIAALEKGKMIADTTAPDTPIGNPQAGLPFPPSFASHVRTTAEASMP
QYTTYNPLYDIPVGQYPFPSFKEEVKNELATPSNEKLEVLEERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHVHNDKLLIH
CFQDSLKGPAARWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGSASTNFSDI
MTIGERIEYGIKHGRIAKTAESSSTKKSNTSKKKEGEVQMIGKVDKSHRQIHQPITQYPSNYLPTYGYYPHQVNNASAQYSPFYGQNTRPQMNQNFQSRRQQQPVTPGQQ
NNRAPYKQAQFDPIPMTYTELLPQLIQNHQLAPKPVAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPNVDNNPLPNHQVNAIDQCQ
DIIKKADVTEIKTPKEKLFEILLVNGYVSIEYAHKDLVQEEFDDNLTCLFHAGAKGHSLEQCHRFHKRIQELVDSKFLVVAQAHHQDNEIGVVEELLPKENLNPSFKLKP
LTIYYREKTTTHDPKSITIQVPTPFKYKSSKAVPWSYEYKVTINSETPSLPVDNISGTGGVTRSGKSYTPDQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNQI
WPKFFKGKHYS