; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g26660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g26660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr4:19556056..19563118
RNA-Seq ExpressionMoc04g26660
SyntenyMoc04g26660
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]7.1e-9626.16Show/hide
Query:  ENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNKSLFVLAS
        + ++   +E+ VI      P      VQLC+VG L +    +  A QS +KQVWNV     I  +G N Y+  F    +++R+L   PW F+K +  L  
Subjt:  ENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNKSLFVLAS

Query:  PSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTR-----PYFRLR-VKI-DI-SKSLRRGLKLCGSDGKDMWCPIRF
         +  + P ++  +   FW++IH++P+   + + A  +G KVG     D+ D   W       P+ R   VKI DI SKS   G    GS  K ++ P   
Subjt:  PSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTR-----PYFRLR-VKI-DI-SKSLRRGLKLCGSDGKDMWCPIRF

Query:  EKLPDFCYECGIIGH--SGRECNNRNTLCQNQYGEWLRASLLKKNVGLRGAETGFRGGRGGQSNKGRGGWGDWRRFEEESEENKFP---ADVVRREVSDN
         +  +       + H  SG   +  +              +L    GL   +T                  D      +   +  P   +  V  E S N
Subjt:  EKLPDFCYECGIIGH--SGRECNNRNTLCQNQYGEWLRASLLKKNVGLRGAETGFRGGRGGQSNKGRGGWGDWRRFEEESEENKFP---ADVVRREVSDN

Query:  A-TVDVPARGDSGESGLEEG-SKKVTPV------ERAEDTEISLMRQSFPKETTIHEG--FKETEENETVIITDSLTMDLDLNAQSNKSGQDEEFDMSNQ
        A    +P  G  G SG+    S  V         E  E  +    R     +T  + G    E + N    I  ++  ++D          D+ FD+   
Subjt:  A-TVDVPARGDSGESGLEEG-SKKVTPV------ERAEDTEISLMRQSFPKETTIHEG--FKETEENETVIITDSLTMDLDLNAQSNKSGQDEEFDMSNQ

Query:  GSGLVDIMRSKSNGTQKSWKQAAR-------------ILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISKPHIL
            V    S+ +G ++SW +                 +  + W    LG     R               + K L  D                +P ++
Subjt:  GSGLVDIMRSKSNGTQKSWKQAAR-------------ILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISKPHIL

Query:  FLSETKARRDKVNSVKQWLGFSNCISVDCKG----RSGGLALMWQDTLTFNLISFSQNHIDGGVEFEG-NI-WRLMGFYGYSEEENRHLSWDLMRWLKGS
        FL ETKA+   +  +K  LGF    +VDC G    R+GGLAL W +TL   L SFS NHID  V   G NI WRL G +G+ EE+++H +WDL+R L  +
Subjt:  FLSETKARRDKVNSVKQWLGFSNCISVDCKG----RSGGLALMWQDTLTFNLISFSQNHIDGGVEFEG-NI-WRLMGFYGYSEEENRHLSWDLMRWLKGS

Query:  GSTPWLIGGDFNGILAHSEKIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPL
           PWL  GDFN I+  SEK GG  K D+ +  F  A + C   DMGF G   TW N R +   + ERLDR+     W   +P  +V H+ + +SDH  L
Subjt:  GSTPWLIGGDFNGILAHSEKIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPL

Query:  QLSVVQDVFKGRLMTKRLYRFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRI
        ++S            +RL+RFEEAW  D +C+ ++   W        + +   KL + A+                             NL F+ + T  
Subjt:  QLSVVQDVFKGRLMTKRLYRFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRI

Query:  QTAEYNLEKLLNEEEIYWRKKRNEINGLFDTQGHWETDQSVVAGGKDSVLYMEELVVGKIVAHRGGSLAHWEWGEVPLDT---KVSFLVDSDSRQWKTNL
          A +    ++    +        +   F     WE +            ++  +   +I++ R            PLD+    V+ L++ ++R WK +L
Subjt:  QTAEYNLEKLLNEEEIYWRKKRNEINGLFDTQGHWETDQSVVAGGKDSVLYMEELVVGKIVAHRGGSLAHWEWGEVPLDT---KVSFLVDSDSRQWKTNL

Query:  IRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILR
        I   F   E + ILS+PL      D+ IW FEK+  Y+VKS Y +        S STSS +   + W  +W+L +  KI++FFWRLC + LPT  NL  R
Subjt:  IRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILR

Query:  GVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSW--SEFEE-------WVVFL-WGIWSRRNVKVFPNGRKM
        GV + N C  C    ED +H F  C           ++ L   + PL    +L   +  + W  S+ E+       W+  + W IW+ RN  +F      
Subjt:  GVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSW--SEFEE-------WVVFL-WGIWSRRNVKVFPNGRKM

Query:  VGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNC-------WKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVG
                + +VS          C+ +   ++ +     S   K SL +  C       WK P     K+N DA+  S  + + LG V+R+  G +  V 
Subjt:  VGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNC-------WKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVG

Query:  TKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNML-NREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEH
           ++ +     AEA  +  G+ L +   +   ++ETD L       +  D+ L +T    C++ S   +  H   +   R  N VAH +A+  L  L  
Subjt:  TKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNML-NREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEH

Query:  AVWIEEWPAEI
        AVW+  +P  +
Subjt:  AVWIEEWPAEI

KAF7825238.1 ribonuclease H [Senna tora]4.6e-10325.11Show/hide
Query:  ENEVIVIDK----EKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNKSLFVLASPSAT
        E EV++I++    ++P  N  N  LC+ G +      +  A Q  + QVW       I+ I  N Y+  F    ++ +VL   PW F+K +  L   S  
Subjt:  ENEVIVIDK----EKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNKSLFVLASPSAT

Query:  DQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRFEKLPDFCYECGI
        +QP  +      FW++++++PF   +   A MLG KVG     D+ ++  W +  FR R  ID++K L++G  + G + K +                  
Subjt:  DQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRFEKLPDFCYECGI

Query:  IGHSGRECNNRNTLCQNQ------YGEWLRASLLK-KNVGLRGAETGFRGGRGGQSNKGRGGWGDWRRFEEESEENKFPADVVRREVSDNATVDVPARGD
             +ECN+      ++      YG WLRAS  K KN      +       GG+S K          F+  +++++  + +  +        +      
Subjt:  IGHSGRECNNRNTLCQNQ------YGEWLRASLLK-KNVGLRGAETGFRGGRGGQSNKGRGGWGDWRRFEEESEENKFPADVVRREVSDNATVDVPARGD

Query:  SGESGLEEGSKKVTPVERAEDTEISLMRQSFPKETTIH--EGFKETEEN----ETVIITDSLTMD----LDLNAQSNKSGQDEE-FDMSNQGSGLVDIMR
            G+ E       +  ++ +   ++  SF  + T+   E  +++EE     + VII   L  +    +  N  SN    D++   + NQ +G      
Subjt:  SGESGLEEGSKKVTPVERAEDTEISLMRQSFPKETTIH--EGFKETEEN----ETVIITDSLTMD----LDLNAQSNKSGQDEE-FDMSNQGSGLVDIMR

Query:  SKSNGTQKSWKQAAR---------ILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISK---PHILFLSETKARRD
             + K W +  R           P L  G+++L      R+  P  E+  +I+S  +    +D  S    +      +++   P I+FL ET+  + 
Subjt:  SKSNGTQKSWKQAAR---------ILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISK---PHILFLSETKARRD

Query:  KVNSVKQWLGFSNCISVDCKG----RSGGLALMWQDTLTFNLISFSQNHIDGGVEFEGN--IWRLMGFYGYSEEENRHLSWDLMRWLKGSGSTPWLIGGD
        ++  +K+ L F   I+V C+G    R+GGLAL+W+  +T +L SFS NHID  ++ +     WR  GFYG+ +E  + LSW L++ L  + + PWL  GD
Subjt:  KVNSVKQWLGFSNCISVDCKG----RSGGLALMWQDTLTFNLISFSQNHIDGGVEFEGN--IWRLMGFYGYSEEENRHLSWDLMRWLKGSGSTPWLIGGD

Query:  FNGILAHSEKIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPLQLSVVQDVFK
        FN I+  SEK GG PK+ + + +F  A+D C L D+GF G   TW N R     + ERLD+   +  W SL+P   V+H     S H  + ++   +   
Subjt:  FNGILAHSEKIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPLQLSVVQDVFK

Query:  GRLMTKRLYRFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRIQTAEYNLEKL
             KRL+RFEE+W  +  C  +V  GW       +   +  K+   A+   +   +  G+ +R+++  E+ +          + R  IQ A+  L++L
Subjt:  GRLMTKRLYRFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRIQTAEYNLEKL

Query:  LNEEEIYWRKK-------------------------RNEINGLFDTQGHWETDQSVV-------------------------------------------
        L  EEI WR++                         RN I  + D Q    +D + +                                           
Subjt:  LNEEEIYWRKK-------------------------RNEINGLFDTQGHWETDQSVV-------------------------------------------

Query:  ----------------AGGKDSVL-YMEELVVGKI-----VAHRGGSLAHWE---------WGEVPLDT-----------------KVSFLVDSDSRQWK
                        A G D+ L Y    V   I     V H G   A W          W +  + +                  VS L+  D R+W 
Subjt:  ----------------AGGKDSVL-YMEELVVGKI-----VAHRGGSLAHWE---------WGEVPLDT-----------------KVSFLVDSDSRQWK

Query:  TNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANL
           I ++F   E K I SIPL      DRL+W  EK+G Y+V+S Y   H  S      +SS       W  +W L+LP K+KVF WRLC   + +  N+
Subjt:  TNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANL

Query:  ILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLP-----QDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMVG
          RGV    CC  C  K E   HLF  C         S     P        R      L +D  +I+ W       +  W IW RRN  VF   +K++ 
Subjt:  ILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLP-----QDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMVG

Query:  DLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTKYLEFVSS
          D                   L     ++ D N       +  +P +  W  P     KINTDA+  S +   GLG+V R+  G ++ V          
Subjt:  DLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTKYLEFVSS

Query:  VDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEHAVWIEEWPAE
           AEA  +   + +A  + +     E+D L V N  ++ ++D+S    II    +            TKR+GN VAH +A R L   E   W   +P  
Subjt:  VDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEHAVWIEEWPAE

Query:  ISAALAKECL
        + +    + L
Subjt:  ISAALAKECL

XP_022143319.1 uncharacterized protein LOC111013220 [Momordica charantia]1.5e-9889.69Show/hide
Query:  MFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVD
        MFT DEVKTILSIPLGI LAADRLIWNFEKNGI TVKS YKLAHMQSPDTSASTS  ECLA WWKDVWQLNLPSKIKVFFWR CLDRLPTGANLILRGVD
Subjt:  MFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVD

Query:  VPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMVGDLDGW
        VP+C AFCGKKGEDALHLFWTCKV  NQRQVSKFSHLPQDVRPLSLLHLLRDC+ ILSWS+FEE VVFLWGIWS+RNVKVF NGR+MV DLDGW
Subjt:  VPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMVGDLDGW

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]1.5e-9860.13Show/hide
Query:  MDDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFN
        MD++    E+   T++ENE + ID+ KPI+  DNV+LCVV KL +SKR S EA++SVMK VW VH++TR EP+GMN YV LF+SL EK+RVL+SGPWTFN
Subjt:  MDDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFN

Query:  KSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF
        KSL VL SP+AT+QPL MNFN CAFWIQIHNIPFEC+S EMA +LG+K+G+VE ++ +  +GW  P+ R+RVKID+SK LRRG+KL  SDGKD+WCP+R+
Subjt:  KSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF

Query:  EKLPDFCYECGIIGHSGRECNNRN----TLCQNQYGEWLRASLLKKNVGLRGAETGFRG---GRGGQSNKGRGGWGDWRRFEEESEENKFPADVVRREVS
        EKLPDFCYECG IGHSGREC  R+    T    QYG+WLRA+LLKK+V     E  +RG   GRG Q N GRGG GDWRR +E   +   P    RR V 
Subjt:  EKLPDFCYECGIIGHSGRECNNRN----TLCQNQYGEWLRASLLKKNVGLRGAETGFRG---GRGGQSNKGRGGWGDWRRFEEESEENKFPADVVRREVS

Query:  D
        +
Subjt:  D

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.2e-10829.24Show/hide
Query:  LKLCGSDGKDMWCPIRFEKLPDFCYECGIIGHSGRECNNR----NTLCQNQYGEWLRASLLKKNVGLRGAETGFRGG---RGGQSNKGRGGWGDWRRFEE
        +K+  +DGKD+WCP+R+EKLPDFCYECG +GHSGRE   R          QYG WLRASLLKKN+  +  E   RGG   RG Q   GRGG G+W R  E
Subjt:  LKLCGSDGKDMWCPIRFEKLPDFCYECGIIGHSGRECNNR----NTLCQNQYGEWLRASLLKKNVGLRGAETGFRGG---RGGQSNKGRGGWGDWRRFEE

Query:  ESEENKFPADVVRREVSDNATVDVPARGDSGESGLEEGSKKVTPVERAEDTEISLMRQSFPK------ETT--IHEGFKETE---------ENETVIITD
           E   P     R   D A+  +    D+G        K   P+  A     SL+            ETT  I +G K  +          N+T    +
Subjt:  ESEENKFPADVVRREVSDNATVDVPARGDSGESGLEEGSKKVTPVERAEDTEISLMRQSFPK------ETT--IHEGFKETE---------ENETVIITD

Query:  SLTMDLDLNAQSNKSGQDEEFDMSNQG-SGLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGF
         L +          +  D  F  + +     +  M       +  W+Q +R           L    G R              TN+    +D      +
Subjt:  SLTMDLDLNAQSNKSGQDEEFDMSNQG-SGLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGF

Query:  SNGPAMEISKPHILFLSETKARRDKVNSVKQWLGFSNCIS--VDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVEFEG----NIWRLMGFYGYSEEEN
               +      F     + R  V+ ++Q  G   CI   V        L    ++ +   L     +   G   F      N W ++G         
Subjt:  SNGPAMEISKPHILFLSETKARRDKVNSVKQWLGFSNCIS--VDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVEFEG----NIWRLMGFYGYSEEEN

Query:  RHLSWDLMRWLKGSGS----TPWLIGGDFNGILAHSEK--IGGRPKDDKLINDF-SLAVD---DCDLIDMGFLGEAV--------------TWCNRRPNG
        + +S   +  L  S      +  ++    + +++ S+   I  R   D  I  F SL +D     D ++  FL EAV              T+CN     
Subjt:  RHLSWDLMRWLKGSGS----TPWLIGGDFNGILAHSEK--IGGRPKDDKLINDF-SLAVD---DCDLIDMGFLGEAV--------------TWCNRRPNG

Query:  GTVFERLDRI----LCNPGWASLYPNF--------DVKHLDYR---------TSDHRPLQLSVVQDVFKGRLMTKRL------------------YRFEE
          + + + R+    L       +   F         V  + YR           +++  Q S++Q++    ++  +L                  Y  + 
Subjt:  GTVFERLDRI----LCNPGWASLYPNF--------DVKHLDYR---------TSDHRPLQLSVVQDVFKGRLMTKRL------------------YRFEE

Query:  AW--LEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTC---ANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHN---LPFVSDRTRIQTAEYNLEKLLNEEE
         W  L+  + +L  + G    +  + +Q++     +C      +++   +    +     + +K++ +   N   LP        +  E   + LL ++ 
Subjt:  AW--LEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTC---ANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHN---LPFVSDRTRIQTAEYNLEKLLNEEE

Query:  IYWRKKRNEINGLFD--TQGHWETD----QSVVAGGKD----SVLYMEELVVGKIVAHRGGSLAHWEWGE----------------VPLDTKVSFLVDSD
          WR   N  N +     +G +  D    ++ ++G       S+L+  +L+   +    G   + + +G+                +PL ++VS LVD +
Subjt:  IYWRKKRNEINGLFD--TQGHWETD----QSVVAGGKD----SVLYMEELVVGKIVAHRGGSLAHWEWGE----------------VPLDTKVSFLVDSD

Query:  SRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSA-STSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRL
           W+ +++RD FT DE K ILSIP+G     DRLIWN+EK G+Y+V+SGYK+A + +P   A S+SS E +  WW   W++++P+KIKVF WRLCLDRL
Subjt:  SRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSA-STSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRL

Query:  PTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMV
        PTG NL  RGV++ NCC FCG+ GED++HLFW CK        SKF         LS   +LR+  + LS ++FEE  V +WG+W++RN + F +  K V
Subjt:  PTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMV

Query:  ----GDLDGWTSSYVSTYRAANS-PST-CLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTK
             +L  W + Y   +R A S P T  + +  E+L                    W+ P EG +KINTDASF + +Q AGLGI+I N RGQVMA  TK
Subjt:  ----GDLDGWTSSYVSTYRAANS-PST-CLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTK

Query:  YLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEHAVW
        YLE + SVD AEA   +EG++LA + G+HP                  EDLSETG I+  AK+F  + LH  F F KR+GN  AH+LARRALL  E ++W
Subjt:  YLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEHAVW

Query:  IEEWPAEISAALAKECLDLLV
        +E+WP E+ + L  ECL+ L+
Subjt:  IEEWPAEISAALAKECLDLLV

TrEMBL top hitse value%identityAlignment
A0A2N9I239 RNase H domain-containing protein3.1e-9721.18Show/hide
Query:  MDDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFN
        M+++ +  + L+LT++E + + +     +         +  K L+ +  + EA+   +K +W      +   +G N  + +F    +  RV+ + PW+F+
Subjt:  MDDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFN

Query:  KSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF
        K L +L           ++F+ C FW+QIH++P   ++  +   LGS +G+VE V+  D+      + R+RV +D+S+ L RG K+     +D W  +++
Subjt:  KSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF

Query:  EKLPDFCYECGIIGHSGREC----NNRNTLC--QNQYGEWLRASLLKKNVGLRGAETGFRG---------GRGGQSNKGRGGWGDWRRFEEESEENKFPA
        E+LP FCY CG+I H  R+C     +R TL     QYG W+R  +++     R     FRG          RGG +  G+ G           +   FP 
Subjt:  EKLPDFCYECGIIGHSGREC----NNRNTLC--QNQYGEWLRASLLKKNVGLRGAETGFRG---------GRGGQSNKGRGGWGDWRRFEEESEENKFPA

Query:  DVVRREVSDNATVDVPARGDSGESGLEEGSKKVTPVERAEDTEISLMRQSFPKETTIHEGFKETEENETV-IITDSLTMDLDLNAQSNKSGQDEEFDMSN
            + V+     +   R    + GL+ G   V P+   +++ +  +R+                 N+ V  + ++   D+ L       G  E+F +S+
Subjt:  DVVRREVSDNATVDVPARGDSGESGLEEGSKKVTPVERAEDTEISLMRQSFPKETTIHEGFKETEENETV-IITDSLTMDLDLNAQSNKSGQDEEFDMSN

Query:  QGS----GLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGP---SC----GKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISKPHILF
        + S    G    +  + NG++ +   +      +  G  +  P   SC     +  G+PA   + TI+   QK                      P  LF
Subjt:  QGS----GLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGP---SC----GKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISKPHILF

Query:  LSETKARRDKVNSVKQWLGFSNCISVDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVEFE-GNIWRLMGFYGYSEEENRHLSWDLMRWLKGSGSTPWL
        +SETK    K+  ++ + GF   + V  +G+SGGL L W+  ++  + S+SQ+HID  VE +    WR+ GFYG      + ++WD++R L      PWL
Subjt:  LSETKARRDKVNSVKQWLGFSNCISVDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVEFE-GNIWRLMGFYGYSEEENRHLSWDLMRWLKGSGSTPWL

Query:  IGGDFNGILAHSEKIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPLQLSVVQ
         GGDFN +L   EK G   + +  +  F   VDDC  +D+GF G   TW N+R     V ERLDR   N  W  L+P+  V HL    SDHRPL +  + 
Subjt:  IGGDFNGILAHSEKIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPLQLSVVQ

Query:  DVFKGRLMTKRLYRFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRIQTAEYN
         V       ++ +RFEE W  D  CE  V   W+   H +S   +  K+ T    + +W     G+ +  ++   +QLQ     +P   +   ++     
Subjt:  DVFKGRLMTKRLYRFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRIQTAEYN

Query:  LEKLLNEEEIYW-------------------------RKKRNEINGLFDTQGHWETDQS-------------------------------VVAGGKDSVL
        L  L  +EE  W                         R++RN I  L D  G W + ++                               V+ G  + +L
Subjt:  LEKLLNEEEIYW-------------------------RKKRNEINGLFDTQGHWETDQS-------------------------------VVAGGKDSVL

Query:  YME--------------------------ELVVGKIVAH---------------RGGSLAH---------------WE----------------------
          E                            V G++++                +GG L +               WE                      
Subjt:  YME--------------------------ELVVGKIVAH---------------RGGSLAH---------------WE----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------WG--------------------------
                                                                                WG                          
Subjt:  ------------------------------------------------------------------------WG--------------------------

Query:  ---------------------------------------------------------------------------------------------------E
                                                                                                           E
Subjt:  ---------------------------------------------------------------------------------------------------E

Query:  VPLDTKVSFLVDSDSRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSK
        +P D +VS L+   +  W   L+  +F   E   ILS  L      D L+W  EKNG Y+V+S Y++      D ++  S+ E   L+WK +W + +P K
Subjt:  VPLDTKVSFLVDSDSRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSK

Query:  IKVFFWRLCLDRLPTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSR
        I+ F WR+C+D LPT   L  R +    CC+FC  + ED +H  WTC ++       K +      R  SLL ++    +  S +   E+   LW +W R
Subjt:  IKVFFWRLCLDRLPTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSR

Query:  RNVKVFPNGRKMVGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQ
        RN  ++ +    + D+       +  Y A N                      +    +     W+ PT   FK N DA+    +    +G+VIR+  G 
Subjt:  RNVKVFPNGRKMVGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQ

Query:  VMAVGTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRAL-
         +A   K    + +VD+AEA  V E ++LA D G+   ++E DSL +FN L ++ E  +  G II            V F   +R GN VAH+LAR A+ 
Subjt:  VMAVGTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRAL-

Query:  LQLEHAVWIEEWPAEISAALAKE
        L  +  VW+E+ P  ++  +  E
Subjt:  LQLEHAVWIEEWPAEISAALAKE

A0A6J1CNZ5 uncharacterized protein LOC1110132207.4e-9989.69Show/hide
Query:  MFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVD
        MFT DEVKTILSIPLGI LAADRLIWNFEKNGI TVKS YKLAHMQSPDTSASTS  ECLA WWKDVWQLNLPSKIKVFFWR CLDRLPTGANLILRGVD
Subjt:  MFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVD

Query:  VPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMVGDLDGW
        VP+C AFCGKKGEDALHLFWTCKV  NQRQVSKFSHLPQDVRPLSLLHLLRDC+ ILSWS+FEE VVFLWGIWS+RNVKVF NGR+MV DLDGW
Subjt:  VPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMVGDLDGW

A0A6J1D765 uncharacterized protein LOC1110179027.4e-9960.13Show/hide
Query:  MDDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFN
        MD++    E+   T++ENE + ID+ KPI+  DNV+LCVV KL +SKR S EA++SVMK VW VH++TR EP+GMN YV LF+SL EK+RVL+SGPWTFN
Subjt:  MDDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFN

Query:  KSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF
        KSL VL SP+AT+QPL MNFN CAFWIQIHNIPFEC+S EMA +LG+K+G+VE ++ +  +GW  P+ R+RVKID+SK LRRG+KL  SDGKD+WCP+R+
Subjt:  KSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF

Query:  EKLPDFCYECGIIGHSGRECNNRN----TLCQNQYGEWLRASLLKKNVGLRGAETGFRG---GRGGQSNKGRGGWGDWRRFEEESEENKFPADVVRREVS
        EKLPDFCYECG IGHSGREC  R+    T    QYG+WLRA+LLKK+V     E  +RG   GRG Q N GRGG GDWRR +E   +   P    RR V 
Subjt:  EKLPDFCYECGIIGHSGRECNNRN----TLCQNQYGEWLRASLLKKNVGLRGAETGFRG---GRGGQSNKGRGGWGDWRRFEEESEENKFPADVVRREVS

Query:  D
        +
Subjt:  D

A0A6J1DAR4 uncharacterized protein LOC1110189546.1e-10929.24Show/hide
Query:  LKLCGSDGKDMWCPIRFEKLPDFCYECGIIGHSGRECNNR----NTLCQNQYGEWLRASLLKKNVGLRGAETGFRGG---RGGQSNKGRGGWGDWRRFEE
        +K+  +DGKD+WCP+R+EKLPDFCYECG +GHSGRE   R          QYG WLRASLLKKN+  +  E   RGG   RG Q   GRGG G+W R  E
Subjt:  LKLCGSDGKDMWCPIRFEKLPDFCYECGIIGHSGRECNNR----NTLCQNQYGEWLRASLLKKNVGLRGAETGFRGG---RGGQSNKGRGGWGDWRRFEE

Query:  ESEENKFPADVVRREVSDNATVDVPARGDSGESGLEEGSKKVTPVERAEDTEISLMRQSFPK------ETT--IHEGFKETE---------ENETVIITD
           E   P     R   D A+  +    D+G        K   P+  A     SL+            ETT  I +G K  +          N+T    +
Subjt:  ESEENKFPADVVRREVSDNATVDVPARGDSGESGLEEGSKKVTPVERAEDTEISLMRQSFPK------ETT--IHEGFKETE---------ENETVIITD

Query:  SLTMDLDLNAQSNKSGQDEEFDMSNQG-SGLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGF
         L +          +  D  F  + +     +  M       +  W+Q +R           L    G R              TN+    +D      +
Subjt:  SLTMDLDLNAQSNKSGQDEEFDMSNQG-SGLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGF

Query:  SNGPAMEISKPHILFLSETKARRDKVNSVKQWLGFSNCIS--VDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVEFEG----NIWRLMGFYGYSEEEN
               +      F     + R  V+ ++Q  G   CI   V        L    ++ +   L     +   G   F      N W ++G         
Subjt:  SNGPAMEISKPHILFLSETKARRDKVNSVKQWLGFSNCIS--VDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVEFEG----NIWRLMGFYGYSEEEN

Query:  RHLSWDLMRWLKGSGS----TPWLIGGDFNGILAHSEK--IGGRPKDDKLINDF-SLAVD---DCDLIDMGFLGEAV--------------TWCNRRPNG
        + +S   +  L  S      +  ++    + +++ S+   I  R   D  I  F SL +D     D ++  FL EAV              T+CN     
Subjt:  RHLSWDLMRWLKGSGS----TPWLIGGDFNGILAHSEK--IGGRPKDDKLINDF-SLAVD---DCDLIDMGFLGEAV--------------TWCNRRPNG

Query:  GTVFERLDRI----LCNPGWASLYPNF--------DVKHLDYR---------TSDHRPLQLSVVQDVFKGRLMTKRL------------------YRFEE
          + + + R+    L       +   F         V  + YR           +++  Q S++Q++    ++  +L                  Y  + 
Subjt:  GTVFERLDRI----LCNPGWASLYPNF--------DVKHLDYR---------TSDHRPLQLSVVQDVFKGRLMTKRL------------------YRFEE

Query:  AW--LEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTC---ANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHN---LPFVSDRTRIQTAEYNLEKLLNEEE
         W  L+  + +L  + G    +  + +Q++     +C      +++   +    +     + +K++ +   N   LP        +  E   + LL ++ 
Subjt:  AW--LEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTC---ANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHN---LPFVSDRTRIQTAEYNLEKLLNEEE

Query:  IYWRKKRNEINGLFD--TQGHWETD----QSVVAGGKD----SVLYMEELVVGKIVAHRGGSLAHWEWGE----------------VPLDTKVSFLVDSD
          WR   N  N +     +G +  D    ++ ++G       S+L+  +L+   +    G   + + +G+                +PL ++VS LVD +
Subjt:  IYWRKKRNEINGLFD--TQGHWETD----QSVVAGGKD----SVLYMEELVVGKIVAHRGGSLAHWEWGE----------------VPLDTKVSFLVDSD

Query:  SRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSA-STSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRL
           W+ +++RD FT DE K ILSIP+G     DRLIWN+EK G+Y+V+SGYK+A + +P   A S+SS E +  WW   W++++P+KIKVF WRLCLDRL
Subjt:  SRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSA-STSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRL

Query:  PTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMV
        PTG NL  RGV++ NCC FCG+ GED++HLFW CK        SKF         LS   +LR+  + LS ++FEE  V +WG+W++RN + F +  K V
Subjt:  PTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKMV

Query:  ----GDLDGWTSSYVSTYRAANS-PST-CLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTK
             +L  W + Y   +R A S P T  + +  E+L                    W+ P EG +KINTDASF + +Q AGLGI+I N RGQVMA  TK
Subjt:  ----GDLDGWTSSYVSTYRAANS-PST-CLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTK

Query:  YLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEHAVW
        YLE + SVD AEA   +EG++LA + G+HP                  EDLSETG I+  AK+F  + LH  F F KR+GN  AH+LARRALL  E ++W
Subjt:  YLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEHAVW

Query:  IEEWPAEISAALAKECLDLLV
        +E+WP E+ + L  ECL+ L+
Subjt:  IEEWPAEISAALAKECLDLLV

A0A803QI56 Uncharacterized protein9.6e-11526.14Show/hide
Query:  NLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNKSLFVLASP
        N+ +  EE + ++   +   + E + + C+VG+ L++K    +A+Q+ M  +W       ++ +  N ++  F    +  RV+   PWTF++   +    
Subjt:  NLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNKSLFVLASP

Query:  SATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRFEKLPDFCYE
           + P  +  N   FW+QIHN+    M + +   LG+ VG     D  +  G  R Y R+RVKI+I   L+R  K+    G   +   ++E L  FC+ 
Subjt:  SATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRFEKLPDFCYE

Query:  CGIIGHSGREC----NNRNTLCQNQYGEWLRASLLKKN--VGLRGAETGFRGGRGGQSNKGRGGWGD----------WRRFEEESEENKFPADVVRREVS
         GI+GHS R C    +      Q  Y   L+A+  +KN  +G R   +G     GG +       G+                +S  N  P +     + 
Subjt:  CGIIGHSGREC----NNRNTLCQNQYGEWLRASLLKKN--VGLRGAETGFRGGRGGQSNKGRGGWGD----------WRRFEEESEENKFPADVVRREVS

Query:  DNATVDVPARGDSGESG-----LEEGSKKVTPVERAEDT--EISLMRQSFPKETTIHEGFKETEENETVIITDSLTMDLDLNAQSNKSGQDEEFDMSNQG
         ++   +  RG+S         L+EG   +      E    + +L      KET+I     +T+   T I+ + L +D +L+  ++    +EEF      
Subjt:  DNATVDVPARGDSGESG-----LEEGSKKVTPVERAEDT--EISLMRQSFPKETTIHEGFKETEENETVIITDSLTMDLDLNAQSNKSGQDEEFDMSNQG

Query:  SGLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISKPHILFLSETKARRDKVNS
         G   IM                   VL W    LG                     NQ+ ++              +    P+ +FL ETK  + ++ S
Subjt:  SGLVDIMRSKSNGTQKSWKQAARILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISKPHILFLSETKARRDKVNS

Query:  VKQWLGFSNCISVDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVE-FEGNIWRLMGFYGYSEEENRHLSWDLMRWLKGSGSTPWLIGGDFNGILAHSE
        + + L F     V+ +G SGGLAL+W++    +++ +S NHID  VE  E   W+L GFYG  E  NRH SW L+R L    + PW + GD N I+ H +
Subjt:  VKQWLGFSNCISVDCKGRSGGLALMWQDTLTFNLISFSQNHIDGGVE-FEGNIWRLMGFYGYSEEENRHLSWDLMRWLKGSGSTPWLIGGDFNGILAHSE

Query:  KIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPLQLSVVQDVFKGRLMTKRLY
        K GGRP   +LI+ F  A++DC+LIDM  +G   TW   R +   +  RLDR L N  W+ ++    + +L+  +SDH P+ L   +        T R +
Subjt:  KIGGRPKDDKLINDFSLAVDDCDLIDMGFLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPLQLSVVQDVFKGRLMTKRLY

Query:  RFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRIQTAEYNLEKLLNEEEIYWR
        +FE AWL++  C  +V + W SE  D    +L  KL  CA+ +  WG+   GN+K R+   + +++  + N    +   R +  +  L  +L++ E +W+
Subjt:  RFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHTCANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRIQTAEYNLEKLLNEEEIYWR

Query:  KKRNEINGLFDTQGHWETDQSVVAGGKDSVLYMEELVVGK----IVAHRGGSLAHWEWGEVPLDTKVSFLVDSDSRQWKTNLIRDMFTEDEVKTILSIPL
        ++  ++        H  + +  V  G++  +  E  +  +    + +H      H   GE     KV  L+  +   W  +++ D+F E + K IL IPL
Subjt:  KKRNEINGLFDTQGHWETDQSVVAGGKDSVLYMEELVVGK----IVAHRGGSLAHWEWGEVPLDTKVSFLVDSDSRQWKTNLIRDMFTEDEVKTILSIPL

Query:  GISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVDVPNCCAFCGKKGEDA
         IS   D+L W++E +GIY+VKSGY L          +      L  +W   W+  +P K+K   WR   + LPT   L ++ VDV   C  C  + E  
Subjt:  GISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVDVPNCCAFCGKKGEDA

Query:  LHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSW-------SEFEEWV---VFLWGIWSRRNVKVFPNGRKMVGDLDGWTSSYVSTYRAA
        +H   TC  +   +QV        D   + +     + ++ L W       +E E+ +   V  W IWS RN  V+         +    + Y+  ++ A
Subjt:  LHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSW-------SEFEEWV---VFLWGIWSRRNVKVFPNGRKMVGDLDGWTSSYVSTYRAA

Query:  NSPSTCLQHENEVLTDNNRFLSHRHKDSLPD-SNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTKYLEFVSSVDEAEAFGVLEGIRL
                        N+R  +        D +  W  P+    K+N DA+        G G+V R+  G ++   TK      S + AE+ G+ E +  
Subjt:  NSPSTCLQHENEVLTDNNRFLSHRHKDSLPD-SNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTKYLEFVSSVDEAEAFGVLEGIRL

Query:  ALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRAL
                  LETD L V   +  E + +S  G II   K+ + E   +   F KR  N+VAH  AR ++
Subjt:  ALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRAL

SwissProt top hitse value%identityAlignment
O04244 Uncharacterized protein At4g020009.3e-0624.06Show/hide
Query:  VLNSGPWTFNKSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSD
        VL   PW +N + FV       +    +  ++   W+Q+  IP   + +E AL +  ++G++ T+D  D       Y R+R++  I+  LR   ++    
Subjt:  VLNSGPWTFNKSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSD

Query:  GKDMWCPIRFEKLPDFCYECGIIGHSGRECNNR
        G+      ++E+L   C  C  + H    C  R
Subjt:  GKDMWCPIRFEKLPDFCYECGIIGHSGRECNNR

P0C2F6 Putative ribonuclease H protein At1g657501.2e-1623.4Show/hide
Query:  KLLNEEEIYWR---KKRNEINGLFDT-----QGHW-ETDQSVVAGGKDSVLYMEELVVGKIVAHR--------GGSLAHWEWGEVP--LDTKVSFLVDSD
        +LL E+   W    +K+  +  + D+     +G W  T +S+  G +D V +    + G     R        G  L   + GE P   DT V+  +   
Subjt:  KLLNEEEIYWR---KKRNEINGLFDT-----QGHW-ETDQSVVAGGKDSVLYMEELVVGKIVAHR--------GGSLAHWEWGEVP--LDTKVSFLVDSD

Query:  SRQWKTNLIRDMFTEDEVKTILSIPLG-ISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRL
         R W    I    T +    + ++ L  ++ A DRL W F ++G ++V+S Y++  +         S   CL       W++ +P ++K F W +    +
Subjt:  SRQWKTNLIRDMFTEDEVKTILSIPLG-ISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRL

Query:  PTGANLILRGVDVPNCCAFCGKKGEDALHLFWTC--------KVINNQRQVSKFSHLPQDVRPLSLLHLLRD-------CKDILSWSEFEEWVVFLWGIW
         T      R +   N C  C    E  LH+   C        +V+  +RQ   FS         SL   L D       C+DI  WS    + V +W  W
Subjt:  PTGANLILRGVDVPNCCAFCGKKGEDALHLFWTC--------KVINNQRQVSKFSHLPQDVRPLSLLHLLRD-------CKDILSWSEFEEWVVFLWGIW

Query:  SRRNVKVFPNGRKMVGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFR
          R   +F    K    +       V  YRA         H   VL    +    R          W +P  G  K+NTD +       A  G V+R+  
Subjt:  SRRNVKVFPNGRKMVGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFR

Query:  GQVMAVGTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRA
        G     G        S  +AE +GV  G+  A +  +   +LE DS  +   L     D      ++     F+ +   V      R+ N +A  LA  A
Subjt:  GQVMAVGTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRA

Arabidopsis top hitse value%identityAlignment
AT2G13450.1 unknown protein4.0e-1224.43Show/hide
Query:  DDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMN-TYV-CLFRSLGEKTRVLNSGPWTF
        D+L    ++L L   E+  + I  E   + E   +L ++ + L+ +  +  A+ S + + W +  T R+    +N T+V  +F+S  +   VL   PW +
Subjt:  DDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMN-TYV-CLFRSLGEKTRVLNSGPWTF

Query:  NKSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIR
        N + FV A     +    +  ++   W+Q+  IP   + +E AL +  ++GE+ T+D  D       Y R+R++  I+  LR  L++    G+      +
Subjt:  NKSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIR

Query:  FEKLPDFCYECGIIGHSGREC
        +E+L   C  C  + H    C
Subjt:  FEKLPDFCYECGIIGHSGREC

AT2G17920.1 nucleic acid binding;zinc ion binding2.5e-1427.43Show/hide
Query:  DDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNV--HDTTRIEPIGMNTYV-CLFRSLGEKTRVLNSGPWT
        D+L    +NL L  +E   + I  E  IM     +L ++ + L+ +  + +A+ + + + W +  H   RI     +TYV  LF+S  +   V    PW 
Subjt:  DDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNV--HDTTRIEPIGMNTYV-CLFRSLGEKTRVLNSGPWT

Query:  FNKSLFVLASPSATDQPLAMNF-NVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCP
        FN   + +AS      P A+NF      W+Q+  IPF  +S+E AL +  ++G + ++D  D       Y R+RV++ I+ SLR   ++    G+     
Subjt:  FNKSLFVLASPSATDQPLAMNF-NVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCP

Query:  IRFEKLPDFCYECGIIGHSGRECNNR
         ++E+L   C  C    H+   C  R
Subjt:  IRFEKLPDFCYECGIIGHSGRECNNR

AT2G41590.1 unknown protein4.4e-1123.77Show/hide
Query:  DDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNK
        D+L    +NL L  +E+  + I  E  +M E   +L ++ + L+ +  +  ++   + + W + +      +       LF++  +   V    PW FN 
Subjt:  DDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWNVHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNK

Query:  SLFVLASPSATDQPLAMNF-NVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF
        + FV A  +  +   A NF      W+QI  IP   +S+E  + +   +GEV  +D  D       Y R+RV+  I+  LR   ++    G+      ++
Subjt:  SLFVLASPSATDQPLAMNF-NVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVKIDISKSLRRGLKLCGSDGKDMWCPIRF

Query:  EKLPDFCYECGIIGHSGRECNNR
        E+L   C  C    H+   C  R
Subjt:  EKLPDFCYECGIIGHSGRECNNR

AT3G09510.1 Ribonuclease H-like superfamily protein4.2e-2226.08Show/hide
Query:  WKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGY-KLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTG
        W  + I     + +   I  I L  S   D++IWN+   G YTV+SGY  L H  S +  A       + L  + +W L +  K+K F WR     L T 
Subjt:  WKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGY-KLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTG

Query:  ANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFS----HLPQDVRPLSLLHLLRDCKDILSWSEFEEW--VVFLWGIWSRRNVKVFPNGR
          L  RG+ +   C  C ++ E   H  +TC       ++S  S     L  +    ++ ++L   +D  + S+F +   V  +W IW  RN  VF   R
Subjt:  ANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFS----HLPQDVRPLSLLHLLRDCKDILSWSEFEEW--VVFLWGIWSRRNVKVFPNGR

Query:  KMVGDLDGWTSSYVSTYRAANSPS-TCLQHENEVLTDNNRFLSHRHKDS-----LPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAV
        +                    SPS T L  + E     N   SH+   S       +   W+ P     K N DA F      A  G +IRN  G  ++ 
Subjt:  KMVGDLDGWTSSYVSTYRAANSPS-TCLQHENEVLTDNNRFLSHRHKDS-----LPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAV

Query:  GTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNR---EDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLAR
        G+  L   S+  EAE   +L  ++     G     +E D   + N++N         +    I  +A  F      + F F +R GN +AH+LA+
Subjt:  GTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQLETDSLRVFNMLNR---EDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLAR

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-3025.82Show/hide
Query:  QSVVAGGKDSVLYMEELVVGK--IVAHRGGSLAHWEWGEVPLDTKVSFLVDSDSRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTV
        ++VV  G+D +++  + +  K    A R   +   E+  V    KVS L+D   R+W+ ++I  +F E E K I  +  G     D   W++  +G YTV
Subjt:  QSVVAGGKDSVLYMEELVVGK--IVAHRGGSLAHWEWGEVPLDTKVSFLVDSDSRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTV

Query:  KSGY-KLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFS
        KSGY  L  + +  +S    S   L   ++ +W+     KI+ F W+   + LP    L  R +   + C  C    E   HL + C        +S   
Subjt:  KSGY-KLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIKVFFWRLCLDRLPTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFS

Query:  HLP-----QDVRPLSLLHLLRDCKDILSWSEFEEWVVF-LWGIWSRRNVKVFPNGRKMVGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHR
         +P      D   ++L  +         W +  + V + LW +W  RN  VF        ++       +  +R      +C                  
Subjt:  HLP-----QDVRPLSLLHLLRDCKDILSWSEFEEWVVF-LWGIWSRRNVKVFPNGRKMVGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHR

Query:  HKDSLPDSNC--WKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQ-----LETDSLRV
         K  +  S+C  W+ P     K NTDA+++  N+  G+G V+RN +G+V  +G + L  + SV EAE    LE +R A+ S +  FQ      E+DS  +
Subjt:  HKDSLPDSNC--WKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTKYLEFVSSVDEAEAFGVLEGIRLALDSGIHPFQ-----LETDSLRV

Query:  FNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEH
          +LN  DE        I   +  + +   V F F  R+GN +A  +AR +L  L +
Subjt:  FNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTACGGCGTAACCGGAGCCGGAGTTGGAGGTTCCGCCGTAGATCTGAGTCTGAAATTTTTGACGCTTGGAATGACGGACGGATTTGTTGTATACTGGCGGCAGTT
CCTGGATTGGCTTCTGCAGGTTAGGTCAGTCATGGATGACCTCGTTGCCTGTTGCGAAAACCTCAATCTAACTTCTGAAGAGAATGAGGTGATAGTGATTGACAAGGAAA
AACCAATAATGAATGAGGACAATGTTCAGCTTTGTGTGGTGGGGAAACTCCTATCAAGCAAGCGTACAAGCACGGAGGCATTACAATCGGTAATGAAGCAAGTTTGGAAT
GTTCATGATACCACAAGGATAGAACCTATCGGGATGAATACGTATGTCTGTCTATTTCGATCTCTAGGAGAGAAAACTCGAGTCCTAAACTCCGGACCCTGGACTTTCAA
CAAATCCCTTTTTGTCCTAGCTTCTCCTTCCGCCACAGATCAGCCTCTTGCAATGAATTTCAATGTTTGTGCTTTCTGGATTCAGATCCATAACATACCTTTTGAATGTA
TGTCTCAGGAGATGGCTCTTATGCTTGGGAGTAAGGTAGGTGAGGTTGAAACAGTGGACAGTGAAGACCAGGAGGGTTGGACAAGACCTTACTTTCGGCTGCGAGTGAAG
ATTGATATCTCGAAGTCGTTGCGGAGAGGTTTGAAATTATGTGGCAGCGATGGAAAGGATATGTGGTGCCCAATTCGGTTTGAGAAACTACCAGATTTCTGCTATGAATG
TGGAATAATTGGTCATTCAGGAAGGGAATGCAACAATCGTAACACTCTTTGTCAGAACCAGTATGGGGAGTGGTTACGTGCATCCTTGCTAAAGAAGAATGTGGGCTTGC
GTGGGGCAGAAACTGGGTTTAGGGGCGGCCGTGGAGGGCAAAGTAACAAGGGTCGTGGAGGATGGGGGGACTGGCGACGGTTCGAAGAGGAGAGCGAGGAGAATAAATTT
CCGGCAGATGTTGTCCGGAGGGAGGTGTCGGACAATGCGACGGTGGACGTTCCGGCAAGGGGCGATTCGGGGGAATCCGGTCTGGAAGAGGGGTCGAAAAAGGTAACTCC
TGTCGAGAGGGCGGAAGATACGGAAATCTCATTAATGAGGCAATCATTCCCAAAAGAGACCACTATTCATGAGGGATTTAAGGAAACGGAAGAAAATGAGACCGTCATTA
TTACAGATAGTCTGACCATGGATCTGGATTTAAATGCCCAAAGCAATAAATCTGGGCAGGATGAGGAATTTGATATGTCTAATCAAGGCTCTGGGTTGGTGGACATTATG
AGGAGTAAGTCAAATGGAACCCAAAAGAGTTGGAAACAGGCTGCTAGAATTCTACCGGTTTTAGACTGGGGTTTGGAGTCGCTAGGCCCATCTTGTGGGAAACGGAAAGG
GAGCCCAGCGGAAGAAAGTCAGGGCACTATTGTTTCTACTAATCAGAAACTCTTGAAAATGGATAACGGAAGCAAAATGGGCTTTAGTAATGGGCCGGCGATGGAGATCT
CGAAACCTCATATTTTGTTTCTTTCTGAAACAAAAGCTAGAAGAGACAAAGTTAACAGTGTTAAGCAATGGCTGGGTTTTTCAAACTGTATTTCGGTGGATTGTAAAGGG
AGAAGTGGTGGTTTAGCTTTGATGTGGCAGGATACATTGACTTTCAACTTGATCTCGTTCTCGCAAAATCATATTGATGGGGGAGTTGAATTTGAAGGGAATATTTGGCG
TTTAATGGGATTTTATGGGTATTCGGAGGAAGAGAATCGTCATCTTTCGTGGGACTTAATGCGTTGGTTGAAAGGAAGCGGATCGACGCCATGGCTCATTGGTGGCGACT
TTAATGGAATTTTAGCCCATTCTGAGAAAATTGGGGGGAGACCGAAAGATGACAAGTTGATCAATGATTTTAGTTTGGCGGTGGATGATTGTGATCTAATTGATATGGGA
TTTTTGGGTGAAGCTGTGACTTGGTGTAATAGAAGACCAAATGGTGGAACAGTTTTTGAGCGGCTAGACCGGATTTTGTGCAACCCGGGCTGGGCTTCGTTATATCCAAA
TTTTGACGTGAAACATCTAGACTATCGAACATCCGACCATAGACCTCTGCAACTAAGTGTAGTACAAGATGTCTTTAAGGGTAGATTGATGACAAAAAGGCTTTATCGCT
TTGAAGAAGCATGGCTGGAAGATCATCAATGTGAATTGATGGTAGTTAATGGGTGGAGCAGTGAGGTCCATGACAGCAGCTCTCAATCTTTAAAGGCTAAGCTTCATACA
TGTGCTAACCATGTTCTGAAATGGGGTCGGAATAAGGCGGGTAATTACAAGAGAAGATTACAAGAAGCAGAAAAACAGTTACAATTTGCAATTCATAATCTCCCCTTTGT
ATCTGACCGAACCAGAATTCAGACGGCAGAATATAATTTGGAGAAGCTTTTGAATGAAGAGGAAATTTATTGGAGGAAGAAGAGAAATGAAATAAATGGTTTGTTTGACA
CTCAGGGTCATTGGGAGACTGACCAGTCTGTGGTGGCAGGTGGGAAGGACTCCGTCCTTTATATGGAGGAGCTTGTTGTGGGGAAGATCGTTGCTCACAGAGGGGGGTCG
TTGGCGCATTGGGAATGGGGAGAAGTCCCTCTTGATACAAAGGTTAGTTTCCTGGTGGATAGTGATTCGAGGCAGTGGAAGACAAATTTGATTCGGGATATGTTCACTGA
AGATGAAGTAAAGACCATTCTTTCAATTCCTCTTGGTATCAGCCTTGCAGCTGATAGGCTTATCTGGAATTTTGAGAAGAATGGGATCTATACAGTTAAGAGTGGTTACA
AATTGGCTCATATGCAATCTCCGGATACTAGTGCCTCTACTTCTTCGTGTGAATGTTTAGCTTTATGGTGGAAAGATGTTTGGCAGCTCAATCTTCCTAGTAAGATTAAG
GTTTTCTTCTGGCGTCTTTGTTTGGATAGATTACCAACGGGTGCTAATCTTATTCTTCGAGGAGTAGATGTTCCTAATTGTTGTGCTTTTTGTGGAAAGAAAGGGGAAGA
TGCTTTGCACTTGTTCTGGACCTGCAAAGTCATTAACAACCAGAGGCAAGTGTCGAAATTTAGCCATCTTCCTCAAGATGTTAGGCCCCTTTCTCTGTTGCATCTGCTTA
GAGATTGCAAGGATATCCTGAGTTGGTCTGAATTCGAGGAATGGGTTGTTTTCCTTTGGGGAATTTGGTCTAGACGAAATGTGAAAGTGTTTCCGAATGGAAGAAAGATG
GTGGGGGATCTGGATGGTTGGACAAGTTCTTATGTCTCTACCTACAGAGCTGCAAATTCACCTTCTACATGTTTACAACATGAAAATGAAGTTTTAACTGATAACAATAG
ATTCCTTTCACACAGGCATAAGGATTCGCTACCAGATTCAAATTGTTGGAAGGCTCCAACTGAGGGAGCTTTTAAGATCAACACAGATGCATCATTTTCGTCTATTAATC
AAACAGCGGGTCTGGGTATTGTAATCAGGAATTTTCGTGGTCAGGTCATGGCAGTAGGAACGAAATACCTGGAGTTTGTTTCATCGGTTGACGAAGCTGAGGCCTTTGGC
GTTCTGGAAGGGATTCGTTTGGCTCTGGATTCTGGGATTCATCCGTTTCAGCTAGAAACCGATTCTTTGCGAGTATTTAATATGCTCAATCGAGAGGATGAAGACCTCTC
GGAAACAGGAACAATCATTTGTTATGCAAAGTCTTTTATTCCAGAAGGTTTGCATGTTCCTTTCGAGTTTACAAAGAGAGATGGGAACGTTGTTGCTCACCTTTTGGCAC
GAAGAGCGTTGCTTCAACTGGAGCATGCTGTCTGGATTGAAGAGTGGCCGGCGGAGATTTCGGCTGCGTTAGCGAAGGAGTGCCTAGACCTTTTGGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTACGGCGTAACCGGAGCCGGAGTTGGAGGTTCCGCCGTAGATCTGAGTCTGAAATTTTTGACGCTTGGAATGACGGACGGATTTGTTGTATACTGGCGGCAGTT
CCTGGATTGGCTTCTGCAGGTTAGGTCAGTCATGGATGACCTCGTTGCCTGTTGCGAAAACCTCAATCTAACTTCTGAAGAGAATGAGGTGATAGTGATTGACAAGGAAA
AACCAATAATGAATGAGGACAATGTTCAGCTTTGTGTGGTGGGGAAACTCCTATCAAGCAAGCGTACAAGCACGGAGGCATTACAATCGGTAATGAAGCAAGTTTGGAAT
GTTCATGATACCACAAGGATAGAACCTATCGGGATGAATACGTATGTCTGTCTATTTCGATCTCTAGGAGAGAAAACTCGAGTCCTAAACTCCGGACCCTGGACTTTCAA
CAAATCCCTTTTTGTCCTAGCTTCTCCTTCCGCCACAGATCAGCCTCTTGCAATGAATTTCAATGTTTGTGCTTTCTGGATTCAGATCCATAACATACCTTTTGAATGTA
TGTCTCAGGAGATGGCTCTTATGCTTGGGAGTAAGGTAGGTGAGGTTGAAACAGTGGACAGTGAAGACCAGGAGGGTTGGACAAGACCTTACTTTCGGCTGCGAGTGAAG
ATTGATATCTCGAAGTCGTTGCGGAGAGGTTTGAAATTATGTGGCAGCGATGGAAAGGATATGTGGTGCCCAATTCGGTTTGAGAAACTACCAGATTTCTGCTATGAATG
TGGAATAATTGGTCATTCAGGAAGGGAATGCAACAATCGTAACACTCTTTGTCAGAACCAGTATGGGGAGTGGTTACGTGCATCCTTGCTAAAGAAGAATGTGGGCTTGC
GTGGGGCAGAAACTGGGTTTAGGGGCGGCCGTGGAGGGCAAAGTAACAAGGGTCGTGGAGGATGGGGGGACTGGCGACGGTTCGAAGAGGAGAGCGAGGAGAATAAATTT
CCGGCAGATGTTGTCCGGAGGGAGGTGTCGGACAATGCGACGGTGGACGTTCCGGCAAGGGGCGATTCGGGGGAATCCGGTCTGGAAGAGGGGTCGAAAAAGGTAACTCC
TGTCGAGAGGGCGGAAGATACGGAAATCTCATTAATGAGGCAATCATTCCCAAAAGAGACCACTATTCATGAGGGATTTAAGGAAACGGAAGAAAATGAGACCGTCATTA
TTACAGATAGTCTGACCATGGATCTGGATTTAAATGCCCAAAGCAATAAATCTGGGCAGGATGAGGAATTTGATATGTCTAATCAAGGCTCTGGGTTGGTGGACATTATG
AGGAGTAAGTCAAATGGAACCCAAAAGAGTTGGAAACAGGCTGCTAGAATTCTACCGGTTTTAGACTGGGGTTTGGAGTCGCTAGGCCCATCTTGTGGGAAACGGAAAGG
GAGCCCAGCGGAAGAAAGTCAGGGCACTATTGTTTCTACTAATCAGAAACTCTTGAAAATGGATAACGGAAGCAAAATGGGCTTTAGTAATGGGCCGGCGATGGAGATCT
CGAAACCTCATATTTTGTTTCTTTCTGAAACAAAAGCTAGAAGAGACAAAGTTAACAGTGTTAAGCAATGGCTGGGTTTTTCAAACTGTATTTCGGTGGATTGTAAAGGG
AGAAGTGGTGGTTTAGCTTTGATGTGGCAGGATACATTGACTTTCAACTTGATCTCGTTCTCGCAAAATCATATTGATGGGGGAGTTGAATTTGAAGGGAATATTTGGCG
TTTAATGGGATTTTATGGGTATTCGGAGGAAGAGAATCGTCATCTTTCGTGGGACTTAATGCGTTGGTTGAAAGGAAGCGGATCGACGCCATGGCTCATTGGTGGCGACT
TTAATGGAATTTTAGCCCATTCTGAGAAAATTGGGGGGAGACCGAAAGATGACAAGTTGATCAATGATTTTAGTTTGGCGGTGGATGATTGTGATCTAATTGATATGGGA
TTTTTGGGTGAAGCTGTGACTTGGTGTAATAGAAGACCAAATGGTGGAACAGTTTTTGAGCGGCTAGACCGGATTTTGTGCAACCCGGGCTGGGCTTCGTTATATCCAAA
TTTTGACGTGAAACATCTAGACTATCGAACATCCGACCATAGACCTCTGCAACTAAGTGTAGTACAAGATGTCTTTAAGGGTAGATTGATGACAAAAAGGCTTTATCGCT
TTGAAGAAGCATGGCTGGAAGATCATCAATGTGAATTGATGGTAGTTAATGGGTGGAGCAGTGAGGTCCATGACAGCAGCTCTCAATCTTTAAAGGCTAAGCTTCATACA
TGTGCTAACCATGTTCTGAAATGGGGTCGGAATAAGGCGGGTAATTACAAGAGAAGATTACAAGAAGCAGAAAAACAGTTACAATTTGCAATTCATAATCTCCCCTTTGT
ATCTGACCGAACCAGAATTCAGACGGCAGAATATAATTTGGAGAAGCTTTTGAATGAAGAGGAAATTTATTGGAGGAAGAAGAGAAATGAAATAAATGGTTTGTTTGACA
CTCAGGGTCATTGGGAGACTGACCAGTCTGTGGTGGCAGGTGGGAAGGACTCCGTCCTTTATATGGAGGAGCTTGTTGTGGGGAAGATCGTTGCTCACAGAGGGGGGTCG
TTGGCGCATTGGGAATGGGGAGAAGTCCCTCTTGATACAAAGGTTAGTTTCCTGGTGGATAGTGATTCGAGGCAGTGGAAGACAAATTTGATTCGGGATATGTTCACTGA
AGATGAAGTAAAGACCATTCTTTCAATTCCTCTTGGTATCAGCCTTGCAGCTGATAGGCTTATCTGGAATTTTGAGAAGAATGGGATCTATACAGTTAAGAGTGGTTACA
AATTGGCTCATATGCAATCTCCGGATACTAGTGCCTCTACTTCTTCGTGTGAATGTTTAGCTTTATGGTGGAAAGATGTTTGGCAGCTCAATCTTCCTAGTAAGATTAAG
GTTTTCTTCTGGCGTCTTTGTTTGGATAGATTACCAACGGGTGCTAATCTTATTCTTCGAGGAGTAGATGTTCCTAATTGTTGTGCTTTTTGTGGAAAGAAAGGGGAAGA
TGCTTTGCACTTGTTCTGGACCTGCAAAGTCATTAACAACCAGAGGCAAGTGTCGAAATTTAGCCATCTTCCTCAAGATGTTAGGCCCCTTTCTCTGTTGCATCTGCTTA
GAGATTGCAAGGATATCCTGAGTTGGTCTGAATTCGAGGAATGGGTTGTTTTCCTTTGGGGAATTTGGTCTAGACGAAATGTGAAAGTGTTTCCGAATGGAAGAAAGATG
GTGGGGGATCTGGATGGTTGGACAAGTTCTTATGTCTCTACCTACAGAGCTGCAAATTCACCTTCTACATGTTTACAACATGAAAATGAAGTTTTAACTGATAACAATAG
ATTCCTTTCACACAGGCATAAGGATTCGCTACCAGATTCAAATTGTTGGAAGGCTCCAACTGAGGGAGCTTTTAAGATCAACACAGATGCATCATTTTCGTCTATTAATC
AAACAGCGGGTCTGGGTATTGTAATCAGGAATTTTCGTGGTCAGGTCATGGCAGTAGGAACGAAATACCTGGAGTTTGTTTCATCGGTTGACGAAGCTGAGGCCTTTGGC
GTTCTGGAAGGGATTCGTTTGGCTCTGGATTCTGGGATTCATCCGTTTCAGCTAGAAACCGATTCTTTGCGAGTATTTAATATGCTCAATCGAGAGGATGAAGACCTCTC
GGAAACAGGAACAATCATTTGTTATGCAAAGTCTTTTATTCCAGAAGGTTTGCATGTTCCTTTCGAGTTTACAAAGAGAGATGGGAACGTTGTTGCTCACCTTTTGGCAC
GAAGAGCGTTGCTTCAACTGGAGCATGCTGTCTGGATTGAAGAGTGGCCGGCGGAGATTTCGGCTGCGTTAGCGAAGGAGTGCCTAGACCTTTTGGTGTAA
Protein sequenceShow/hide protein sequence
MVYGVTGAGVGGSAVDLSLKFLTLGMTDGFVVYWRQFLDWLLQVRSVMDDLVACCENLNLTSEENEVIVIDKEKPIMNEDNVQLCVVGKLLSSKRTSTEALQSVMKQVWN
VHDTTRIEPIGMNTYVCLFRSLGEKTRVLNSGPWTFNKSLFVLASPSATDQPLAMNFNVCAFWIQIHNIPFECMSQEMALMLGSKVGEVETVDSEDQEGWTRPYFRLRVK
IDISKSLRRGLKLCGSDGKDMWCPIRFEKLPDFCYECGIIGHSGRECNNRNTLCQNQYGEWLRASLLKKNVGLRGAETGFRGGRGGQSNKGRGGWGDWRRFEEESEENKF
PADVVRREVSDNATVDVPARGDSGESGLEEGSKKVTPVERAEDTEISLMRQSFPKETTIHEGFKETEENETVIITDSLTMDLDLNAQSNKSGQDEEFDMSNQGSGLVDIM
RSKSNGTQKSWKQAARILPVLDWGLESLGPSCGKRKGSPAEESQGTIVSTNQKLLKMDNGSKMGFSNGPAMEISKPHILFLSETKARRDKVNSVKQWLGFSNCISVDCKG
RSGGLALMWQDTLTFNLISFSQNHIDGGVEFEGNIWRLMGFYGYSEEENRHLSWDLMRWLKGSGSTPWLIGGDFNGILAHSEKIGGRPKDDKLINDFSLAVDDCDLIDMG
FLGEAVTWCNRRPNGGTVFERLDRILCNPGWASLYPNFDVKHLDYRTSDHRPLQLSVVQDVFKGRLMTKRLYRFEEAWLEDHQCELMVVNGWSSEVHDSSSQSLKAKLHT
CANHVLKWGRNKAGNYKRRLQEAEKQLQFAIHNLPFVSDRTRIQTAEYNLEKLLNEEEIYWRKKRNEINGLFDTQGHWETDQSVVAGGKDSVLYMEELVVGKIVAHRGGS
LAHWEWGEVPLDTKVSFLVDSDSRQWKTNLIRDMFTEDEVKTILSIPLGISLAADRLIWNFEKNGIYTVKSGYKLAHMQSPDTSASTSSCECLALWWKDVWQLNLPSKIK
VFFWRLCLDRLPTGANLILRGVDVPNCCAFCGKKGEDALHLFWTCKVINNQRQVSKFSHLPQDVRPLSLLHLLRDCKDILSWSEFEEWVVFLWGIWSRRNVKVFPNGRKM
VGDLDGWTSSYVSTYRAANSPSTCLQHENEVLTDNNRFLSHRHKDSLPDSNCWKAPTEGAFKINTDASFSSINQTAGLGIVIRNFRGQVMAVGTKYLEFVSSVDEAEAFG
VLEGIRLALDSGIHPFQLETDSLRVFNMLNREDEDLSETGTIICYAKSFIPEGLHVPFEFTKRDGNVVAHLLARRALLQLEHAVWIEEWPAEISAALAKECLDLLV