; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011004 (gene) of Snake gourd v1 genome

Gene IDTan0011004
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG05:23593733..23599467
RNA-Seq ExpressionTan0011004
SyntenyTan0011004
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]1.2e-4527.04Show/hide
Query:  TEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNC
        TE+E   I VE +++ E       +L+ K  +  P ++  F+Q I + W+    I +  + ENLFL  F    D   VL N PW FD+ +L+     G  
Subjt:  TEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNC

Query:  RFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGR
        +  DL+   VNFW+ +++LP   +S  MA+  GN +G F++VD  D     G  L LK  +++  PLKRG K+K     + +W    YE+LP+FC++CG+
Subjt:  RFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGR

Query:  TGHLDRICNEVD------WSTSSKKQ--FGSGLR-------------HPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQ-TGSKQ
         GH  + C +V+      +S  ++K   +G  LR               SS S  +N   S    +G +   K    D E++       GI  Q   +K 
Subjt:  TGHLDRICNEVD------WSTSSKKQ--FGSGLR-------------HPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQ-TGSKQ

Query:  KTSLDTNNLEKNCRR---SESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKD----RNGKLKGLLEVSR----KLVFDIEGEEDVS
          +LD  ++ ++      S S    + T+ QG  + +K +   N S + V  + + ++ +   +++       +++ L  + R    ++VF +E    V 
Subjt:  KTSLDTNNLEKNCRR---SESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKD----RNGKLKGLLEVSR----KLVFDIEGEEDVS

Query:  -------KKVLNSSSAVEPRGV--EKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETR----IIGL-AEE----FLQTNGLNFQIG------SSIMDM
               K   +S  A++ +GV  E+   LA      + I   S     +  +  D+ET     + G+ AEE     ++T G   +IG      S + D+
Subjt:  -------KKVLNSSSAVEPRGV--EKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETR----IIGL-AEE----FLQTNGLNFQIG------SSIMDM

Query:  WFD-----------------------------------FETIHLGTYSSDHRPILGVTGE----RAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRH
         F+                                      +HL  Y SDH  +L +T E    R   +R+  + L RFE +W +   C  ++   W + 
Subjt:  WFD-----------------------------------FETIHLGTYSSDHRPILGVTGE----RAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRH

Query:  PHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTPDTEETWFQK----RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHI
        P  +  +   RL     EL         G +   I + EK IQ+ + +   D  ET  Q+       L+ LL+E+E  WRQRSR  WLK GD+NTK+FH 
Subjt:  PHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTPDTEETWFQK----RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHI

Query:  KATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTS
        KA+QR+K N+I+ L+  +  W   E+++ +    YF+ LF+S  P++
Subjt:  KATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTS

GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]1.3e-1529.9Show/hide
Query:  DCILPSVTDDTNR-----MLLSDFTHAE--NILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNRRDSIWMALWNANTPSKIKIC
        D ++  + DD  R     ++   FT+ E   I+  P +  +  D+IIW     GV++V+ AY L    +D      S++  DS+W  +W A  P+K+K  
Subjt:  DCILPSVTDDTNR-----MLLSDFTHAE--NILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNRRDSIWMALWNANTPSKIKIC

Query:  CWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKD---FNVFLIILWKIW
          R+  NILPT+ NL +KG+ +   CP C    E + H+   C + ++       S          D N      W+LE  N  D     +F  ILWK W
Subjt:  CWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKD---FNVFLIILWKIW

Query:  TWRN
          RN
Subjt:  TWRN

KAE8800683.1 retrotransposon unclassified [Hordeum vulgare]4.0e-5723.18Show/hide
Query:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF
        M+  L L+EEE + + +     D+  + +    + K LS K  H D     + K+W     I   ++GEN F+ +F      RR +E+ PW FD  +++ 
Subjt:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF

Query:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGL---------KVKIGTMAEEL--
        +E     R  + EF  +  W+ ++NLP    + + A+  GN +G F + D     +  G  L +K+R+ +  PL RG          K K   M +E   
Subjt:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGL---------KVKIGTMAEEL--

Query:  --WCPVTYEKLPDFCYSCGRTGHLDRICNEVDWSTSSKKQFG------SGLRHPSSNSG-----------QQNWARSGVHG----------------RGS
          WC   YE LPDFCY+CG  GH ++ CN        K+QFG       G R  + ++G           Q+N+  S  +G                R S
Subjt:  --WCPVTYEKLPDFCYSCGRTGHLDRICNEVDWSTSSKKQFG------SGLRHPSSNSG-----------QQNWARSGVHG----------------RGS

Query:  RDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSES-EAGDRATERQG--SEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNG
         D  +  E   E  S + ++ G  +Q+G  +K  L  N  EK+    E   AG +     G   EE   L      +    +++ +    +G+  K  + 
Subjt:  RDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSES-EAGDRATERQG--SEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNG

Query:  KLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETRIIGLAEEFLQTNGLNFQIGSSIMDMW
           G     R  +  + G     + +L             G+K  Q  T+G     +  G  ++  +  ++   I     E L     N    +   + +
Subjt:  KLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETRIIGLAEEFLQTNGLNFQIGSSIMDMW

Query:  FDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEK
         +FE I+     SDHRPI+  T      +     G  RFE  W     C E +   W+       G +   +      +++W++  + G L+G + K   
Subjt:  FDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEK

Query:  EIQSLEQYLTPDTEETWFQKRR---ELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFS
        E++   +  +P +E    ++ R    L  L ++  I  +QRS + WL+ G+RNT++F      RKKQN+++ L+  + S + +  E+  +  SYFQ LF+
Subjt:  EIQSLEQYLTPDTEETWFQKRR---ELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFS

Query:  SDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNR-RDSIWMALWNANT
        ++                                   + ++ + G +G                            S +A    N+  +  W  +W    
Subjt:  SDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNR-RDSIWMALWNANT

Query:  PSKIKICCWRILHNILPTKTNLIQKGLDIQPW-CPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLII
        P  +++  WRI H  L   TN+ ++G  +Q   C FC    E   H+   CK+ + +W         +  +   D +A   + W L++  R      L  
Subjt:  PSKIKICCWRILHNILPTKTNLIQKGLDIQPW-CPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLII

Query:  LWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEF--ISSP---------VNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLV--RAG-HHFIRTN
         W  W++RN   + +      E+ R TR  V E+  I +P           PP +G    N D S++      G G   R  DG LV  RAG   +I   
Subjt:  LWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEF--ISSP---------VNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLV--RAG-HHFIRTN

Query:  WSILILELRGIIEGLKAIPNKTIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMW-SQIAFKHIPRSANQTTHKLA
        +   ++ +   +    A     +  V E+DS   ++ ++    D +  +  I + K    MW S+       RSAN   H+LA
Subjt:  WSILILELRGIIEGLKAIPNKTIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMW-SQIAFKHIPRSANQTTHKLA

PWA36168.1 hypothetical protein CTI12_AA602590 [Artemisia annua]1.3e-4721.91Show/hide
Query:  LFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINV
        +FL   G+  DLRRVLE+ PW F++ +++      + +  + +   V FW+ + N+P   +     +    ++G   +VD  D     GS     +R+  
Subjt:  LFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINV

Query:  TIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRIC----NEVDWSTSSKKQFGSGLR-----------------HPSSNSGQQN------
        T       KV++          + YE+LP+FCY CG  GH ++ C     E++  T     F   LR                 HP+ N+ QQ       
Subjt:  TIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRIC----NEVDWSTSSKKQFGSGLR-----------------HPSSNSGQQN------

Query:  -----WARSGVHGRGSRDTKK---------------TVEDDIEEDSSMDSSSGINLQTG----------SKQKTSLDTNNLEKNC-RRSESEAGDRATER
             +  S ++   ++D  +               T++   +++      SGI L  G          +K +  L+  NL K   RR+  E   + T  
Subjt:  -----WARSGVHGRGSRDTKK---------------TVEDDIEEDSSMDSSSGINLQTG----------SKQKTSLDTNNLEKNC-RRSESEAGDRATER

Query:  QGSEEF---QKLINAINFSVE------------SVSKERSLEVTEGHSRKDRNGKLKG----------LLEVSRKLVFDIEGEEDVS-----------KK
          S       + ++ ++++V+            S+ KE S  +      +    ++ G          L+  S +   D   +ED +           ++
Subjt:  QGSEEF---QKLINAINFSVE------------SVSKERSLEVTEGHSRKDRNGKLKG----------LLEVSRKLVFDIEGEEDVS-----------KK

Query:  VLNSSSAVEPRGVEKGEKLAQVPTKGLSIL---------HSSTGPKIMGFKDN----DMETRIIGLAEEFLQTNGL-----------NFQIGSSIMDMWF
             +    R +   ++ A V     + +           S   ++  F++     ++E R   +  +   +NG             F   S   D++ 
Subjt:  VLNSSSAVEPRGVEKGEKLAQVPTKGLSIL---------HSSTGPKIMGFKDN----DMETRIIGLAEEFLQTNGL-----------NFQIGSSIMDMWF

Query:  DFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNW-------GTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGA
        D    +L   +SDH PI+     R     +    + RFE  W       G   D W        +H    I      ++ C   L  WN+ R  G ++ +
Subjt:  DFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNW-------GTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGA

Query:  ISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQH
        I  K++ +Q+L+      T       R ++  LL  +E+ W+QRSR++WL+ GD+NT++FH +A+ R+++N I  L+  +  W+ +  E+ +  +SYF  
Subjt:  ISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQH

Query:  LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAE--NILKLPRTG----------------TMGC--------DEIIWKCHPRGVFTVKSAYQLR
        LFSS  P     E V   I   +T++  + L    T +E  ++L     G                 +GC        D + W  +P G F+ KSAY L 
Subjt:  LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAE--NILKLPRTG----------------TMGC--------DEIIWKCHPRGVFTVKSAYQLR

Query:  LRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDF
        L   +D    +  +N     W  +W A  PSK+K+  WR  +N +PT  NL  +GL+    C  C +  E   H+L+ C V + +WN         FYD 
Subjt:  LRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDF

Query:  REDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFI----------SSPVNPPRQGTWY--------LNTDAS
        +       + Q +LE     ++  F++ILW +WT RN     +    +  +  I +  ++++           +S V+    G W         +N DA+
Subjt:  REDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFI----------SSPVNPPRQGTWY--------LNTDAS

Query:  WSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVV-ESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWSQIAF
        W  +    GLG++ R + G ++ +G        S L  E + +   +  +  K    V+ E++SL  ++ +  + V   + +   +EI +    ++   +
Subjt:  WSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVV-ESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWSQIAF

Query:  KHIPRSANQTTHKLAQRA
          + R  N+  H +A  A
Subjt:  KHIPRSANQTTHKLAQRA

TXG53380.1 hypothetical protein EZV62_022549 [Acer yangbiense]2.4e-4625.59Show/hide
Query:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF
        + E L   E +   ++++ +   E  + +S  L+ K L  K ++ + F+  I ++W +T  + ++K+ +NLF+  F        V    PW+FD  I++ 
Subjt:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF

Query:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPD
        ++  G      +EF+ V+FW+ IH +P    + ++ +I   ++G+  ++   D++  WG  L ++VRI+++ PLKR LKV++    + +   + YE+L +
Subjt:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPD

Query:  FCYSCGRTGHLDRIC-------NEVDWSTS-----SKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTG------
         C++ G+ GH+ R C         +D  T+      K     GL+   S +G++     G+  RG  +      D + + SS  S SG+ ++ G      
Subjt:  FCYSCGRTGHLDRIC-------NEVDWSTS-----SKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTG------

Query:  ---------SKQKT------SLDTNNLEKN--CRR-----SESEAGDRATERQ---------------------------GSEEFQKLINAINFSVESVS
                   QK        +  + LE+N  C        ESE  DR  E+                            G     K I   +   +S +
Subjt:  ---------SKQKT------SLDTNNLEKN--CRR-----SESEAGDRATERQ---------------------------GSEEFQKLINAINFSVESVS

Query:  KERSL----EVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKK-------VLNSSSAVEPRGVEKGEK-----------LAQVPTKGLS--ILHSS
        +++SL    +  +    K  + K +    V RK++  ++ E  V KK       +L+  +A     + +GE            + QV ++G S     SS
Subjt:  KERSL----EVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKK-------VLNSSSAVEPRGVEKGEK-----------LAQVPTKGLS--ILHSS

Query:  TGPKIMGF--------KDNDMETRIIGLAEEFLQT----NGLNFQI-GSSIMDMWFDFETIHLGTYSSDHRP-ILGVTGERAHFQRQHDRGLLRFEPNWG
        T   I           + +++     G   E L++     G + Q+ G +I     D    HLG  +SDHRP IL   G   + ++  D G  + EP W 
Subjt:  TGPKIMGF--------KDNDMETRIIGLAEEFLQT----NGLNFQI-GSSIMDMWFDFETIHLGTYSSDHRP-ILGVTGERAHFQRQHDRGLLRFEPNWG

Query:  TYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQS-LEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVD
        T   C E+V + W      + +  ++ +L++C  +L+ W++ +  G L   IS K +E+++ L +   P   +   +   ELDN+L  +EIYWRQRSR +
Subjt:  TYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQS-LEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVD

Query:  WLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSD
        WL   DRN+++FH KA+ RKK+N I  L+       +DEK I +   +YF  LFSS  P+++
Subjt:  WLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSD

VFQ81500.1 unnamed protein product [Cuscuta campestris]3.4e-4821.92Show/hide
Query:  VERLNLTEEEGRAIVVEDDDVDESAR--LLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILL
        +  ++L  E+   +V  D+  D S +       L+ + L+ +PV+    +  +  +W+    + + +VG  L++  FG   ++ RV+E  PW F+   LL
Subjt:  VERLNLTEEEGRAIVVEDDDVDESAR--LLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILL

Query:  FDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLP
         +    +    ++    +  W+ ++ L     S ++AQ  GN +G F + D  +    W S L ++V+++V  PLK+G ++K     E       YE+LP
Subjt:  FDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLP

Query:  DFCYSCGRTGHLDRICNEV--DWSTSSKKQFGSGLR---HPSSNSGQQNWARSGVHGRG--SRDTKKTVEDDIEEDSSMDS--SSGINLQTGSKQKTSLD
         FC+ CGR GH +R C E    W    +++FG  LR     +SN+    W R  +  RG  ++  +K  + D    +   +  SS +  Q G    T L 
Subjt:  DFCYSCGRTGHLDRICNEV--DWSTSSKKQFGSGLR---HPSSNSGQQNWARSGVHGRG--SRDTKKTVEDDIEEDSSMDS--SSGINLQTGSKQKTSLD

Query:  TNNLEKNCRRSESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNGKLK---------GLLEVSRKLVFDIEGEEDVSKKVLNSS
            ++N   +          R       K  +     + +  K+R   V E +     +G L           LL +SR    D+E E+    +  ++ 
Subjt:  TNNLEKNCRRSESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNGKLK---------GLLEVSRKLVFDIEGEEDVSKKVLNSS

Query:  SAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETR------------IIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPI
            P    + E    +      +  +ST P I G   N +  +            +I    E ++  GL             DF             P+
Subjt:  SAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETR------------IIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPI

Query:  LGVTGERAHFQRQHDRGLLRFEPNWGTYP-------DCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTP
         G    + +  R       RFE  W + P       DCWE+          +   +++ +LN C   L  W     + F   +     + IQ   +    
Subjt:  LGVTGERAHFQRQHDRGLLRFEPNWGTYP-------DCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTP

Query:  DTEETWFQK-RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKI------------------------------------RTLQTLN
              FQ+ ++ L  L    E +WRQ+++  WL  GDRNTK+FH +A++R++ N+I                                      +Q+L 
Subjt:  DTEETWFQK-RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKI------------------------------------RTLQTLN

Query:  ESWISD-----------------------------------------------------------------EKEIGE---------FATSYFQH------
         S +S                                                                  E +IG          FAT    H      
Subjt:  ESWISD-----------------------------------------------------------------EKEIGE---------FATSYFQH------

Query:  ---------------------LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQD
                               ++++P S  +  V+D   P+  D      + +    + I  +P TG+   D +IW     G++TVKSAY  +    D
Subjt:  ---------------------LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQD

Query:  SQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWN
        +    +  +R  ++W  LW      K++   WR  +NILP   NL+ K + +Q  CP C    ET  HI   C   R +W   FL  Y      F  DW 
Subjt:  SQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWN

Query:  AGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVR
               +L   N  D  +   +LW IW+ RN      ++W       + R  +  +   PVN  +     +N DAS        GLG+I R+++GR V 
Subjt:  AGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVR

Query:  AGHHFIRTNWSILILELRGIIEGLKAIPNK-TIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWS-QIAFKHIPRSANQTTHKLAQRASRLQ
        A     R ++     E   + E L  I +K    ++VE+D  E +  +N + +D++  +  I + K +  + +  + F   PRS N+  H LA+ +    
Subjt:  AGHHFIRTNWSILILELRGIIEGLKAIPNK-TIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWS-QIAFKHIPRSANQTTHKLAQRASRLQ

Query:  TNESWLDGP
            W D P
Subjt:  TNESWLDGP

TrEMBL top hitse value%identityAlignment
A0A2N9FNT0 RNase H domain-containing protein1.7e-5026.91Show/hide
Query:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF
        M +  +L+++EG      D D+  +++     L  K L+S+ +++D   +    +WKT     +  +G N     F +  DL RVL NEPW +DK +++F
Subjt:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF

Query:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTI--PLKRGLKVKIGTMAEELWCPVTYEKL
           +G+    D  F   +FW+ +HNLP   ++ + A+  G  +G+ +KV  ++ E   G   C++VRI + I  PL RG  VK     +  W    YE+L
Subjt:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTI--PLKRGLKVKIGTMAEELWCPVTYEKL

Query:  PDFCYSCGRTGHLDRICN----EVDWSTSSKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNL
        P+FCY CG   H ++ C+    +   S   + QFG+ LR  S  +  +       +   SRD     +    +D               + + +++  +L
Subjt:  PDFCYSCGRTGHLDRICN----EVDWSTSSKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNL

Query:  EKNCRRSESEAGDRATERQGSEEFQK-----LINAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGV
         +N R +     +   + +   E ++     + + I+ S +           E H R + +  L   L     L +   G+     +++ SS     R  
Subjt:  EKNCRRSESEAGDRATERQGSEEFQK-----LINAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGV

Query:  EKGEK---LAQVPTKGLSILHSSTGPKIMGFKDNDMETRIIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRG
         + +     A +   G   L     P           T  + L + F+ TN    +  S+ +D        H+   +SDH+PI  +  +     R   + 
Subjt:  EKGEK---LAQVPTKGLSILHSSTGPKIMGFKDNDMETRIIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRG

Query:  LLRFEPNWGTYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTPDT-----EETWFQKRRELDNLLQ
        L RFE  W  +PDC  +V   W  +   + I  ++ ++  C  EL RW+RT+      G I+K  KE   L +    D+      +T    R+E+++LL 
Subjt:  LLRFEPNWGTYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTPDT-----EETWFQKRRELDNLLQ

Query:  EDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDF
        ++E  W+QRSR  WLK GDRNTK+FH +A+ R+++N I TL   +   +++ + IG   T Y+Q LF++ +P  D +E V D I P VT + N+ L+S F
Subjt:  EDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDF

Query:  THAENI
        T AE I
Subjt:  THAENI

A0A2N9FNT0 RNase H domain-containing protein1.3e-2125Show/hide
Query:  AENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTN-RRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCP
        AE ILK+P +  +  D+I W  +  G ++V+S Y+L L+ +   +A +S     D +W  +W A  P+KIK   WR  H+ LPT + L Q+ +   P C 
Subjt:  AENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTN-RRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCP

Query:  FCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFRED---------WNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRI
         C  Q E S H LW C     +W     S  + F  FR           WN   +      D N      +     +IWT     + +      EE    
Subjt:  FCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFRED---------WNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRI

Query:  TRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILEL----RGIIEGLKAIPNKTIPLVVESDSLEAIQ
             T +       P    + +N D +   + + GG+G + R+  G  +      +    ++ ++E     R II   +      + + VE D+   I+
Subjt:  TRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILEL----RGIIEGLKAIPNKTIPLVVESDSLEAIQ

Query:  QINGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNESWLD
         +N +   +T     I + K +   + + +  H  RS N   H LA+RAS   +   WL+
Subjt:  QINGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNESWLD

A0A2N9FNT0 RNase H domain-containing protein1.6e-4821.92Show/hide
Query:  VERLNLTEEEGRAIVVEDDDVDESAR--LLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILL
        +  ++L  E+   +V  D+  D S +       L+ + L+ +PV+    +  +  +W+    + + +VG  L++  FG   ++ RV+E  PW F+   LL
Subjt:  VERLNLTEEEGRAIVVEDDDVDESAR--LLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILL

Query:  FDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLP
         +    +    ++    +  W+ ++ L     S ++AQ  GN +G F + D  +    W S L ++V+++V  PLK+G ++K     E       YE+LP
Subjt:  FDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLP

Query:  DFCYSCGRTGHLDRICNEV--DWSTSSKKQFGSGLR---HPSSNSGQQNWARSGVHGRG--SRDTKKTVEDDIEEDSSMDS--SSGINLQTGSKQKTSLD
         FC+ CGR GH +R C E    W    +++FG  LR     +SN+    W R  +  RG  ++  +K  + D    +   +  SS +  Q G    T L 
Subjt:  DFCYSCGRTGHLDRICNEV--DWSTSSKKQFGSGLR---HPSSNSGQQNWARSGVHGRG--SRDTKKTVEDDIEEDSSMDS--SSGINLQTGSKQKTSLD

Query:  TNNLEKNCRRSESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNGKLK---------GLLEVSRKLVFDIEGEEDVSKKVLNSS
            ++N   +          R       K  +     + +  K+R   V E +     +G L           LL +SR    D+E E+    +  ++ 
Subjt:  TNNLEKNCRRSESEAGDRATERQGSEEFQKLINAINFSVESVSKERSLEVTEGHSRKDRNGKLK---------GLLEVSRKLVFDIEGEEDVSKKVLNSS

Query:  SAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETR------------IIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPI
            P    + E    +      +  +ST P I G   N +  +            +I    E ++  GL             DF             P+
Subjt:  SAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETR------------IIGLAEEFLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPI

Query:  LGVTGERAHFQRQHDRGLLRFEPNWGTYP-------DCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTP
         G    + +  R       RFE  W + P       DCWE+          +   +++ +LN C   L  W     + F   +     + IQ   +    
Subjt:  LGVTGERAHFQRQHDRGLLRFEPNWGTYP-------DCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQSLEQYLTP

Query:  DTEETWFQK-RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKI------------------------------------RTLQTLN
              FQ+ ++ L  L    E +WRQ+++  WL  GDRNTK+FH +A++R++ N+I                                      +Q+L 
Subjt:  DTEETWFQK-RRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKI------------------------------------RTLQTLN

Query:  ESWISD-----------------------------------------------------------------EKEIGE---------FATSYFQH------
         S +S                                                                  E +IG          FAT    H      
Subjt:  ESWISD-----------------------------------------------------------------EKEIGE---------FATSYFQH------

Query:  ---------------------LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQD
                               ++++P S  +  V+D   P+  D      + +    + I  +P TG+   D +IW     G++TVKSAY  +    D
Subjt:  ---------------------LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQD

Query:  SQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWN
        +    +  +R  ++W  LW      K++   WR  +NILP   NL+ K + +Q  CP C    ET  HI   C   R +W   FL  Y      F  DW 
Subjt:  SQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWN

Query:  AGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVR
               +L   N  D  +   +LW IW+ RN      ++W       + R  +  +   PVN  +     +N DAS        GLG+I R+++GR V 
Subjt:  AGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVR

Query:  AGHHFIRTNWSILILELRGIIEGLKAIPNK-TIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWS-QIAFKHIPRSANQTTHKLAQRASRLQ
        A     R ++     E   + E L  I +K    ++VE+D  E +  +N + +D++  +  I + K +  + +  + F   PRS N+  H LA+ +    
Subjt:  AGHHFIRTNWSILILELRGIIEGLKAIPNK-TIPLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWS-QIAFKHIPRSANQTTHKLAQRASRLQ

Query:  TNESWLDGP
            W D P
Subjt:  TNESWLDGP

A0A2N9IXK4 RNase H domain-containing protein4.4e-4624.9Show/hide
Query:  LICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQS
        L  + L+ +PV++D   +    +W+T     +  +G+N+ +  F +  DL RV+ + PW +DK+++LF           + F   + W+ IH LPP    
Subjt:  LICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQS

Query:  LKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRICN----EVDWSTSSKKQF
           A   G  +G   +    + E  WG  + ++V I+V  PL RG K+ IG   +E+     YEKLP+FCY CG   H D+ C+      D   +SK+Q+
Subjt:  LKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRICN----EVDWSTSSKKQF

Query:  GSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSESEAGDRATERQ---GSEEFQKLI---
        G+ LR P               G   R    +V+  +       +SS  N  TG          +  K+C + E++ G           S  F  LI   
Subjt:  GSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSESEAGDRATERQ---GSEEFQKLI---

Query:  ---NAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKG------EKLAQVPTKGLS---------
           NAI    +  S +   E+   + RK  + +      ++       E          + S  V PR  + G      ++ A V  K  S         
Subjt:  ---NAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKG------EKLAQVPTKGLS---------

Query:  -----------------------------ILHSST---------------------GPKIMGFKDNDMETRIIGLAEEFLQTNG-----LNFQIGSSIM-
                                     +LHS +                     GP     +  D    I     E L  NG      N ++GS  + 
Subjt:  -----------------------------ILHSST---------------------GPKIMGFKDNDMETRIIGLAEEFLQTNG-----LNFQIGSSIM-

Query:  ---------DMWFDF----ETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRHPHNT-IGNMQVRLNACLMELKRWN
                   W       +  HL   SSDH PI          + + +R + RFE  W ++P C E + + WQ   H T +  +  +L  C   L++W+
Subjt:  ---------DMWFDF----ETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRHPHNT-IGNMQVRLNACLMELKRWN

Query:  RTRLEGFLKGAISKKEKEIQSLE-QYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEK
        R    G +   + KK + ++  E + +           +RE++ LL  +E  WRQRSR  WL+WGD+NT +FH  ATQR+++N I  +Q  + +  + E+
Subjt:  RTRLEGFLKGAISKKEKEIQSLE-QYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEK

Query:  EIGEFATSYFQHLFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAE
        +I      +F  LFSS  PT    + V   +   VTD+ N  L+ +FT  E
Subjt:  EIGEFATSYFQHLFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAE

A0A2N9IXK4 RNase H domain-containing protein3.4e-1427.53Show/hide
Query:  LPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQP
        +P +     D++IW   P G+F+V+SAY   L   Q    +S++ N    +W  +W+   P KI+   WR   + LPTK NL ++ + + P C  C +  
Subjt:  LPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQP

Query:  ETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRN----LAIRDK
        E   H++W C V+  LWN+  P ++ + +D    +        ++   + +   +F I  W +W  RN    + +RD+
Subjt:  ETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRN----LAIRDK

A0A2U1KHJ0 CCHC-type domain-containing protein6.2e-4821.91Show/hide
Query:  LFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINV
        +FL   G+  DLRRVLE+ PW F++ +++      + +  + +   V FW+ + N+P   +     +    ++G   +VD  D     GS     +R+  
Subjt:  LFLCSFGNTGDLRRVLENEPWYFDKAILLFDEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINV

Query:  TIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRIC----NEVDWSTSSKKQFGSGLR-----------------HPSSNSGQQN------
        T       KV++          + YE+LP+FCY CG  GH ++ C     E++  T     F   LR                 HP+ N+ QQ       
Subjt:  TIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLDRIC----NEVDWSTSSKKQFGSGLR-----------------HPSSNSGQQN------

Query:  -----WARSGVHGRGSRDTKK---------------TVEDDIEEDSSMDSSSGINLQTG----------SKQKTSLDTNNLEKNC-RRSESEAGDRATER
             +  S ++   ++D  +               T++   +++      SGI L  G          +K +  L+  NL K   RR+  E   + T  
Subjt:  -----WARSGVHGRGSRDTKK---------------TVEDDIEEDSSMDSSSGINLQTG----------SKQKTSLDTNNLEKNC-RRSESEAGDRATER

Query:  QGSEEF---QKLINAINFSVE------------SVSKERSLEVTEGHSRKDRNGKLKG----------LLEVSRKLVFDIEGEEDVS-----------KK
          S       + ++ ++++V+            S+ KE S  +      +    ++ G          L+  S +   D   +ED +           ++
Subjt:  QGSEEF---QKLINAINFSVE------------SVSKERSLEVTEGHSRKDRNGKLKG----------LLEVSRKLVFDIEGEEDVS-----------KK

Query:  VLNSSSAVEPRGVEKGEKLAQVPTKGLSIL---------HSSTGPKIMGFKDN----DMETRIIGLAEEFLQTNGL-----------NFQIGSSIMDMWF
             +    R +   ++ A V     + +           S   ++  F++     ++E R   +  +   +NG             F   S   D++ 
Subjt:  VLNSSSAVEPRGVEKGEKLAQVPTKGLSIL---------HSSTGPKIMGFKDN----DMETRIIGLAEEFLQTNGL-----------NFQIGSSIMDMWF

Query:  DFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNW-------GTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGA
        D    +L   +SDH PI+     R     +    + RFE  W       G   D W        +H    I      ++ C   L  WN+ R  G ++ +
Subjt:  DFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNW-------GTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGA

Query:  ISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQH
        I  K++ +Q+L+      T       R ++  LL  +E+ W+QRSR++WL+ GD+NT++FH +A+ R+++N I  L+  +  W+ +  E+ +  +SYF  
Subjt:  ISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQH

Query:  LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAE--NILKLPRTG----------------TMGC--------DEIIWKCHPRGVFTVKSAYQLR
        LFSS  P     E V   I   +T++  + L    T +E  ++L     G                 +GC        D + W  +P G F+ KSAY L 
Subjt:  LFSSDDPTSDMIEGVTDCILPSVTDDTNRMLLSDFTHAE--NILKLPRTG----------------TMGC--------DEIIWKCHPRGVFTVKSAYQLR

Query:  LRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDF
        L   +D    +  +N     W  +W A  PSK+K+  WR  +N +PT  NL  +GL+    C  C +  E   H+L+ C V + +WN         FYD 
Subjt:  LRI-QDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDF

Query:  REDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFI----------SSPVNPPRQGTWY--------LNTDAS
        +       + Q +LE     ++  F++ILW +WT RN     +    +  +  I +  ++++           +S V+    G W         +N DA+
Subjt:  REDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFI----------SSPVNPPRQGTWY--------LNTDAS

Query:  WSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVV-ESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWSQIAF
        W  +    GLG++ R + G ++ +G        S L  E + +   +  +  K    V+ E++SL  ++ +  + V   + +   +EI +    ++   +
Subjt:  WSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVV-ESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWSQIAF

Query:  KHIPRSANQTTHKLAQRA
          + R  N+  H +A  A
Subjt:  KHIPRSANQTTHKLAQRA

A0A5C7H8M7 Uncharacterized protein1.2e-4625.59Show/hide
Query:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF
        + E L   E +   ++++ +   E  + +S  L+ K L  K ++ + F+  I ++W +T  + ++K+ +NLF+  F        V    PW+FD  I++ 
Subjt:  MVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLF

Query:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPD
        ++  G      +EF+ V+FW+ IH +P    + ++ +I   ++G+  ++   D++  WG  L ++VRI+++ PLKR LKV++    + +   + YE+L +
Subjt:  DEPRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPD

Query:  FCYSCGRTGHLDRIC-------NEVDWSTS-----SKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTG------
         C++ G+ GH+ R C         +D  T+      K     GL+   S +G++     G+  RG  +      D + + SS  S SG+ ++ G      
Subjt:  FCYSCGRTGHLDRIC-------NEVDWSTS-----SKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTG------

Query:  ---------SKQKT------SLDTNNLEKN--CRR-----SESEAGDRATERQ---------------------------GSEEFQKLINAINFSVESVS
                   QK        +  + LE+N  C        ESE  DR  E+                            G     K I   +   +S +
Subjt:  ---------SKQKT------SLDTNNLEKN--CRR-----SESEAGDRATERQ---------------------------GSEEFQKLINAINFSVESVS

Query:  KERSL----EVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKK-------VLNSSSAVEPRGVEKGEK-----------LAQVPTKGLS--ILHSS
        +++SL    +  +    K  + K +    V RK++  ++ E  V KK       +L+  +A     + +GE            + QV ++G S     SS
Subjt:  KERSL----EVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKK-------VLNSSSAVEPRGVEKGEK-----------LAQVPTKGLS--ILHSS

Query:  TGPKIMGF--------KDNDMETRIIGLAEEFLQT----NGLNFQI-GSSIMDMWFDFETIHLGTYSSDHRP-ILGVTGERAHFQRQHDRGLLRFEPNWG
        T   I           + +++     G   E L++     G + Q+ G +I     D    HLG  +SDHRP IL   G   + ++  D G  + EP W 
Subjt:  TGPKIMGF--------KDNDMETRIIGLAEEFLQT----NGLNFQI-GSSIMDMWFDFETIHLGTYSSDHRP-ILGVTGERAHFQRQHDRGLLRFEPNWG

Query:  TYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQS-LEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVD
        T   C E+V + W      + +  ++ +L++C  +L+ W++ +  G L   IS K +E+++ L +   P   +   +   ELDN+L  +EIYWRQRSR +
Subjt:  TYPDCWEIVANVW-QRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFLKGAISKKEKEIQS-LEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVD

Query:  WLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSD
        WL   DRN+++FH KA+ RKK+N I  L+       +DEK I +   +YF  LFSS  P+++
Subjt:  WLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDPTSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.8e-0726.7Show/hide
Query:  EGFLKGAISKKEKE----IQSLEQYLTPDTEETWFQ----KRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWIS
        +GF  G I  K KE    ++S++  L  +  ++ F+     R++ +      E ++RQ+SR+ WL+ GD NT++FH      + +N I+ L+  ++  + 
Subjt:  EGFLKGAISKKEKE----IQSLEQYLTPDTEETWFQ----KRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWIS

Query:  DEKEIGEFATSYFQHLFSSDDP--TSDMIEGVTDCILPSVTDDT--NRM--LLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSA
        +  ++ E   +Y+ HL  SD    T D ++ + D I P   +DT  +R+  L SD      +  +PR    G D    +      F VK +
Subjt:  DEKEIGEFATSYFQHLFSSDDP--TSDMIEGVTDCILPSVTDDT--NRM--LLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSA

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.7e-0820.99Show/hide
Query:  CPFCMKQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWNAGTYFQWMLE---DNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIR----
        C  C    ET  H+L+ C   R++W    +P+Y     ++ +   A  Y+   LE       K  N+   +LW++W  RN  +   + ++  E++R    
Subjt:  CPFCMKQPETSCHILWGCKVTRVLWN-HFLPSYTNLFYDFREDWNAGTYFQWMLE---DNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIR----

Query:  -----ITRCHVTEFISSP----------VNPPRQGTWY-LNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTI
              TR  +    S P            PP Q  W   NTDA+W  +    G+GWI R   G ++  G   +    ++L  EL  +   +  +     
Subjt:  -----ITRCHVTEFISSP----------VNPPRQGTWY-LNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTI

Query:  PLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRA
          ++     +A+  +  S   +      + +I+ +   + ++ F+  PR  N+   ++A+ +
Subjt:  PLVVESDSLEAIQQINGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRA

AT3G09510.1 Ribonuclease H-like superfamily protein7.3e-1724.11Show/hide
Query:  DEIIWKCHPRGVFTVKSAYQLRLRIQDSQ-EASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWG
        D+IIW  +  G +TV+S Y L      +   A N  +    +   +WN     K+K   WR L   L T   L  +G+ I P CP C ++ E+  H L+ 
Subjt:  DEIIWKCHPRGVFTVKSAYQLRLRIQDSQ-EASNSTNRRDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWG

Query:  CKVTRVLW---NHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLII--LWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFIS---------
        C    + W   +  L     +  DF E+ +    F   ++D    DF+  L +  +W+IW  RN  + +K   +  + +   +    ++++         
Subjt:  CKVTRVLW---NHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLII--LWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEFIS---------

Query:  SPV-----------NPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLV-VESDSLEAIQQI
        SP            NPP       N DA +   +     GWI R   G  +  G   +    + L  E + ++  L+    +    V +E D    I  I
Subjt:  SPV-----------NPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLV-VESDSLEAIQQI

Query:  NGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNES-------WLD
        NG S  ++  +  + +I   A+ ++ I F  I R  N+  H LA+      T  S       WLD
Subjt:  NGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNES-------WLD

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-0535.71Show/hide
Query:  LWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGC
        +W+     KIK+  W+ L+N LP    L+ + + I+P+C  C +  ET  HIL+ C
Subjt:  LWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGC

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-2022.7Show/hide
Query:  DEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNR--RDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILW
        D   W     G +TVKS Y +  +I + + +    +    + I+  +W + T  KI+   W+ L N LP    L  + L  +  C  C    ET  H+L+
Subjt:  DEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNR--RDSIWMALWNANTPSKIKICCWRILHNILPTKTNLIQKGLDIQPWCPFCMKQPETSCHILW

Query:  GCKVTRVLWNHFLPSYTNLFYDFREDWNAGTY--FQWMLEDNN-----RKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEF--------
         C   R+ W     + +++      +W    Y    W+    N      K   +   +LW++W  RN  +   + +N +E++R     + E+        
Subjt:  GCKVTRVLWNHFLPSYTNLFYDFREDWNAGTY--FQWMLEDNN-----RKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRITRCHVTEF--------

Query:  --ISSPVNPPRQGTW--------YLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVV-ESDSLEAIQQ
              VN    G W          NTDA+W+ D +  G+GW+ R   G +   G   +    S+L  EL  +   + ++       V+ ESDS   I+ 
Subjt:  --ISSPVNPPRQGTW--------YLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVV-ESDSLEAIQQ

Query:  INGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRA
        +N   + +      I +++ + S ++++ F  IPR  N    ++A+ +
Subjt:  INGSSVDYTETSEFINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCCAAGCATCTCAGACTAATGGTCGAACGTCTAAATCTTACAGAGGAGGAAGGCAGAGCTATCGTGGTTGAGGACGATGACGTCGATGAGAGCGCCCGCCTCCT
GTCGATATCCCTGATCTGCAAAAGTCTATCCTCGAAACCAGTTCATATTGATGTTTTCCGGCAAAAAATTCCAAAAATTTGGAAAACCACTGCCCCGATTGGTATCGATA
AAGTTGGGGAAAATCTATTCCTCTGTAGCTTTGGGAATACAGGAGATTTACGAAGAGTCTTGGAAAACGAACCATGGTACTTTGATAAGGCGATTCTCCTATTTGATGAA
CCAAGAGGCAACTGTCGGTTCATAGATCTGGAATTCAGGTTCGTTAATTTTTGGATCCACATACATAATCTGCCTCCTGCTGAACAATCGCTGAAAATGGCACAAATCTT
TGGAAATCGCCTCGGTGTGTTCCAGAAGGTGGACCTGAACGATACAGAAACATGTTGGGGAAGCTCCTTGTGCCTAAAGGTTCGGATCAACGTCACCATTCCTCTAAAAC
GGGGATTAAAGGTCAAAATAGGAACGATGGCAGAGGAGTTATGGTGCCCGGTCACCTACGAGAAACTTCCTGATTTTTGCTACAGCTGCGGGCGAACCGGGCACCTGGAC
AGGATATGCAATGAGGTTGATTGGTCAACTTCGAGTAAAAAGCAATTTGGTTCTGGACTAAGACATCCATCCTCAAATTCTGGACAACAGAATTGGGCACGATCGGGAGT
TCATGGCAGAGGATCGAGAGACACAAAGAAGACAGTGGAGGACGATATCGAAGAAGACTCGTCGATGGATTCGAGTTCAGGGATAAATTTACAGACCGGAAGCAAACAGA
AGACATCCTTGGACACAAATAATTTGGAAAAAAATTGTCGAAGATCGGAGTCGGAGGCCGGAGACAGAGCGACGGAGAGGCAAGGAAGCGAGGAGTTTCAAAAATTAATT
AATGCCATTAATTTCTCCGTTGAATCGGTGTCAAAAGAGAGGTCGTTAGAAGTAACTGAAGGACATTCAAGGAAGGATCGAAATGGAAAGCTAAAAGGTCTACTGGAGGT
TTCAAGAAAGCTGGTTTTCGACATTGAAGGAGAAGAAGACGTTTCGAAAAAGGTTCTTAATTCTTCTTCTGCAGTTGAGCCACGTGGGGTTGAGAAAGGAGAAAAATTAG
CACAGGTACCAACAAAAGGGCTCTCAATTTTGCACTCCTCAACCGGGCCTAAAATAATGGGCTTTAAGGACAACGACATGGAGACTCGGATAATTGGGCTAGCCGAAGAA
TTTTTGCAGACTAATGGGCTAAATTTTCAAATAGGCTCCTCGATAATGGATATGTGGTTTGATTTCGAAACCATTCATCTAGGGACCTACTCTTCTGATCATCGACCAAT
CTTGGGGGTTACAGGTGAACGAGCGCATTTTCAAAGGCAACATGATAGGGGGCTTCTAAGATTCGAACCAAATTGGGGTACATATCCAGACTGTTGGGAGATTGTAGCAA
ATGTGTGGCAACGACATCCTCATAATACCATTGGGAACATGCAGGTTAGACTAAATGCTTGTTTGATGGAATTAAAAAGATGGAACCGAACTCGTTTGGAGGGATTTTTA
AAGGGGGCCATTTCAAAAAAAGAAAAAGAAATCCAATCCTTAGAACAATACTTGACCCCAGACACAGAGGAAACTTGGTTTCAAAAGAGAAGGGAGTTAGACAACCTACT
TCAAGAGGATGAGATATATTGGAGACAAAGGTCAAGGGTGGATTGGTTAAAATGGGGGGATCGAAACACGAAATGGTTTCACATCAAGGCCACACAGCGGAAAAAACAAA
ACAAGATCCGGACACTTCAAACTCTGAACGAATCGTGGATTTCTGATGAAAAAGAAATAGGAGAGTTTGCAACCTCGTACTTCCAACATCTTTTTTCTTCTGATGACCCG
ACTTCTGATATGATTGAAGGCGTAACAGATTGTATCTTACCATCGGTCACAGATGATACTAATAGAATGCTTCTATCGGATTTCACACATGCAGAAAATATTCTTAAGTT
ACCCCGGACTGGGACGATGGGTTGTGACGAGATTATATGGAAATGCCACCCTCGGGGTGTCTTCACAGTTAAAAGTGCCTACCAACTAAGACTTCGTATCCAAGATTCAC
AGGAGGCTTCTAATTCGACCAACAGGAGAGACTCTATTTGGATGGCATTATGGAACGCTAATACTCCATCCAAGATCAAGATTTGTTGTTGGAGAATTCTTCACAATATC
CTCCCCACAAAGACAAATCTGATCCAAAAGGGCCTCGACATTCAACCATGGTGCCCTTTCTGCATGAAACAACCGGAGACGAGCTGCCATATCCTATGGGGATGCAAGGT
AACAAGGGTGCTTTGGAACCATTTTCTACCTTCTTACACGAATTTGTTTTATGATTTCAGGGAAGATTGGAATGCGGGAACTTATTTTCAGTGGATGTTGGAAGACAACA
ATCGAAAAGACTTTAACGTCTTTCTGATCATTTTATGGAAGATCTGGACCTGGAGGAATTTAGCGATTAGGGATAAACAAATTTGGAACCAGGAAGAACTCATCAGAATC
ACACGGTGTCATGTCACGGAGTTCATTTCCTCTCCAGTTAATCCGCCAAGACAAGGAACTTGGTATCTAAACACAGATGCTTCATGGAGTACAGACCGCGATTGTGGCGG
TTTAGGTTGGATATTTCGAGAGTGGGATGGTCGGCTGGTTCGTGCTGGACATCATTTCATCCGCACAAACTGGTCGATCCTGATTCTGGAACTCAGGGGCATTATCGAGG
GTTTGAAGGCAATCCCAAACAAAACTATCCCACTCGTAGTGGAATCAGACTCTTTAGAAGCCATTCAACAAATTAATGGTTCGTCAGTTGACTACACTGAGACCAGTGAG
TTTATTAATGAAATCAAAACGATGGCAAGCATGTGGTCACAGATAGCTTTTAAACATATTCCTAGATCGGCAAACCAGACGACCCACAAACTAGCACAAAGGGCCTCACG
GCTACAAACAAATGAATCTTGGTTGGATGGTCCCTCTTCGAACCTTGATACTTTTCTATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCCAAGCATCTCAGACTAATGGTCGAACGTCTAAATCTTACAGAGGAGGAAGGCAGAGCTATCGTGGTTGAGGACGATGACGTCGATGAGAGCGCCCGCCTCCT
GTCGATATCCCTGATCTGCAAAAGTCTATCCTCGAAACCAGTTCATATTGATGTTTTCCGGCAAAAAATTCCAAAAATTTGGAAAACCACTGCCCCGATTGGTATCGATA
AAGTTGGGGAAAATCTATTCCTCTGTAGCTTTGGGAATACAGGAGATTTACGAAGAGTCTTGGAAAACGAACCATGGTACTTTGATAAGGCGATTCTCCTATTTGATGAA
CCAAGAGGCAACTGTCGGTTCATAGATCTGGAATTCAGGTTCGTTAATTTTTGGATCCACATACATAATCTGCCTCCTGCTGAACAATCGCTGAAAATGGCACAAATCTT
TGGAAATCGCCTCGGTGTGTTCCAGAAGGTGGACCTGAACGATACAGAAACATGTTGGGGAAGCTCCTTGTGCCTAAAGGTTCGGATCAACGTCACCATTCCTCTAAAAC
GGGGATTAAAGGTCAAAATAGGAACGATGGCAGAGGAGTTATGGTGCCCGGTCACCTACGAGAAACTTCCTGATTTTTGCTACAGCTGCGGGCGAACCGGGCACCTGGAC
AGGATATGCAATGAGGTTGATTGGTCAACTTCGAGTAAAAAGCAATTTGGTTCTGGACTAAGACATCCATCCTCAAATTCTGGACAACAGAATTGGGCACGATCGGGAGT
TCATGGCAGAGGATCGAGAGACACAAAGAAGACAGTGGAGGACGATATCGAAGAAGACTCGTCGATGGATTCGAGTTCAGGGATAAATTTACAGACCGGAAGCAAACAGA
AGACATCCTTGGACACAAATAATTTGGAAAAAAATTGTCGAAGATCGGAGTCGGAGGCCGGAGACAGAGCGACGGAGAGGCAAGGAAGCGAGGAGTTTCAAAAATTAATT
AATGCCATTAATTTCTCCGTTGAATCGGTGTCAAAAGAGAGGTCGTTAGAAGTAACTGAAGGACATTCAAGGAAGGATCGAAATGGAAAGCTAAAAGGTCTACTGGAGGT
TTCAAGAAAGCTGGTTTTCGACATTGAAGGAGAAGAAGACGTTTCGAAAAAGGTTCTTAATTCTTCTTCTGCAGTTGAGCCACGTGGGGTTGAGAAAGGAGAAAAATTAG
CACAGGTACCAACAAAAGGGCTCTCAATTTTGCACTCCTCAACCGGGCCTAAAATAATGGGCTTTAAGGACAACGACATGGAGACTCGGATAATTGGGCTAGCCGAAGAA
TTTTTGCAGACTAATGGGCTAAATTTTCAAATAGGCTCCTCGATAATGGATATGTGGTTTGATTTCGAAACCATTCATCTAGGGACCTACTCTTCTGATCATCGACCAAT
CTTGGGGGTTACAGGTGAACGAGCGCATTTTCAAAGGCAACATGATAGGGGGCTTCTAAGATTCGAACCAAATTGGGGTACATATCCAGACTGTTGGGAGATTGTAGCAA
ATGTGTGGCAACGACATCCTCATAATACCATTGGGAACATGCAGGTTAGACTAAATGCTTGTTTGATGGAATTAAAAAGATGGAACCGAACTCGTTTGGAGGGATTTTTA
AAGGGGGCCATTTCAAAAAAAGAAAAAGAAATCCAATCCTTAGAACAATACTTGACCCCAGACACAGAGGAAACTTGGTTTCAAAAGAGAAGGGAGTTAGACAACCTACT
TCAAGAGGATGAGATATATTGGAGACAAAGGTCAAGGGTGGATTGGTTAAAATGGGGGGATCGAAACACGAAATGGTTTCACATCAAGGCCACACAGCGGAAAAAACAAA
ACAAGATCCGGACACTTCAAACTCTGAACGAATCGTGGATTTCTGATGAAAAAGAAATAGGAGAGTTTGCAACCTCGTACTTCCAACATCTTTTTTCTTCTGATGACCCG
ACTTCTGATATGATTGAAGGCGTAACAGATTGTATCTTACCATCGGTCACAGATGATACTAATAGAATGCTTCTATCGGATTTCACACATGCAGAAAATATTCTTAAGTT
ACCCCGGACTGGGACGATGGGTTGTGACGAGATTATATGGAAATGCCACCCTCGGGGTGTCTTCACAGTTAAAAGTGCCTACCAACTAAGACTTCGTATCCAAGATTCAC
AGGAGGCTTCTAATTCGACCAACAGGAGAGACTCTATTTGGATGGCATTATGGAACGCTAATACTCCATCCAAGATCAAGATTTGTTGTTGGAGAATTCTTCACAATATC
CTCCCCACAAAGACAAATCTGATCCAAAAGGGCCTCGACATTCAACCATGGTGCCCTTTCTGCATGAAACAACCGGAGACGAGCTGCCATATCCTATGGGGATGCAAGGT
AACAAGGGTGCTTTGGAACCATTTTCTACCTTCTTACACGAATTTGTTTTATGATTTCAGGGAAGATTGGAATGCGGGAACTTATTTTCAGTGGATGTTGGAAGACAACA
ATCGAAAAGACTTTAACGTCTTTCTGATCATTTTATGGAAGATCTGGACCTGGAGGAATTTAGCGATTAGGGATAAACAAATTTGGAACCAGGAAGAACTCATCAGAATC
ACACGGTGTCATGTCACGGAGTTCATTTCCTCTCCAGTTAATCCGCCAAGACAAGGAACTTGGTATCTAAACACAGATGCTTCATGGAGTACAGACCGCGATTGTGGCGG
TTTAGGTTGGATATTTCGAGAGTGGGATGGTCGGCTGGTTCGTGCTGGACATCATTTCATCCGCACAAACTGGTCGATCCTGATTCTGGAACTCAGGGGCATTATCGAGG
GTTTGAAGGCAATCCCAAACAAAACTATCCCACTCGTAGTGGAATCAGACTCTTTAGAAGCCATTCAACAAATTAATGGTTCGTCAGTTGACTACACTGAGACCAGTGAG
TTTATTAATGAAATCAAAACGATGGCAAGCATGTGGTCACAGATAGCTTTTAAACATATTCCTAGATCGGCAAACCAGACGACCCACAAACTAGCACAAAGGGCCTCACG
GCTACAAACAAATGAATCTTGGTTGGATGGTCCCTCTTCGAACCTTGATACTTTTCTATCTTAA
Protein sequenceShow/hide protein sequence
MEAKHLRLMVERLNLTEEEGRAIVVEDDDVDESARLLSISLICKSLSSKPVHIDVFRQKIPKIWKTTAPIGIDKVGENLFLCSFGNTGDLRRVLENEPWYFDKAILLFDE
PRGNCRFIDLEFRFVNFWIHIHNLPPAEQSLKMAQIFGNRLGVFQKVDLNDTETCWGSSLCLKVRINVTIPLKRGLKVKIGTMAEELWCPVTYEKLPDFCYSCGRTGHLD
RICNEVDWSTSSKKQFGSGLRHPSSNSGQQNWARSGVHGRGSRDTKKTVEDDIEEDSSMDSSSGINLQTGSKQKTSLDTNNLEKNCRRSESEAGDRATERQGSEEFQKLI
NAINFSVESVSKERSLEVTEGHSRKDRNGKLKGLLEVSRKLVFDIEGEEDVSKKVLNSSSAVEPRGVEKGEKLAQVPTKGLSILHSSTGPKIMGFKDNDMETRIIGLAEE
FLQTNGLNFQIGSSIMDMWFDFETIHLGTYSSDHRPILGVTGERAHFQRQHDRGLLRFEPNWGTYPDCWEIVANVWQRHPHNTIGNMQVRLNACLMELKRWNRTRLEGFL
KGAISKKEKEIQSLEQYLTPDTEETWFQKRRELDNLLQEDEIYWRQRSRVDWLKWGDRNTKWFHIKATQRKKQNKIRTLQTLNESWISDEKEIGEFATSYFQHLFSSDDP
TSDMIEGVTDCILPSVTDDTNRMLLSDFTHAENILKLPRTGTMGCDEIIWKCHPRGVFTVKSAYQLRLRIQDSQEASNSTNRRDSIWMALWNANTPSKIKICCWRILHNI
LPTKTNLIQKGLDIQPWCPFCMKQPETSCHILWGCKVTRVLWNHFLPSYTNLFYDFREDWNAGTYFQWMLEDNNRKDFNVFLIILWKIWTWRNLAIRDKQIWNQEELIRI
TRCHVTEFISSPVNPPRQGTWYLNTDASWSTDRDCGGLGWIFREWDGRLVRAGHHFIRTNWSILILELRGIIEGLKAIPNKTIPLVVESDSLEAIQQINGSSVDYTETSE
FINEIKTMASMWSQIAFKHIPRSANQTTHKLAQRASRLQTNESWLDGPSSNLDTFLS