; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016126 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016126
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold9:41394267..41400184
RNA-Seq ExpressionSpg016126
SyntenySpg016126
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]1.5e-4624.33Show/hide
Query:  ELEKLKIT-TAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL
        E EK  I+ T +    + +E E++ +  E  +  +  ++ T    N   FK  + + W  +  + ++ +  N +L  F    + D + R GPW++DR LL
Subjt:  ELEKLKIT-TAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL

Query:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP
        ++    G  + S      VNFWV ++DLP        A+KLGN +G F E D  +   T   LR++  +D+ +PLKRG  ++   K    WV   YERLP
Subjt:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP

Query:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ
         FC+ CG IGH  +ECE + +  E N          YG ++RA+P+  +   P++                +    Q +  +  ++  D++      +  
Subjt:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ

Query:  SGKSNEENVGVLRPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSE---MITR
               N+ V     S G  +      VGK+      S+  +  R K+ K V     +  A E+GK R+L    + + +A   ++  +       M TR
Subjt:  SGKSNEENVGVLRPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSE---MITR

Query:  KE------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ
         +            ++SG   D KG   + +                                      +G+  E  K   +  +  LE    + +++  
Subjt:  KE------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ

Query:  DSLMFRGEGKNVRTWKRRARVENNSVDSESLKGATIANVGRKKPL---------------------------KRKRRSK---KFEEAWIRVSDSKKIVES
        + L F G      TW      ++N    +    ++   + R  P+                           +R+RR K   +FEE+W   +  + ++++
Subjt:  DSLMFRGEGKNVRTWKRRARVENNSVDSESLKGATIANVGRKKPL---------------------------KRKRRSK---KFEEAWIRVSDSKKIVES

Query:  SWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWF
         W + P  SF+D   +L R + N L    D   GSI   I R    I   +  D+S  ++      E  LE LL EEE+ W+ RSR  WL+ GD+NTK+F
Subjt:  SWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWF

Query:  HHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSN
        H KAS+R+K NEIK L +  G+W   E+++  +   YFK+LF+S++P+ V ++   +  ++ L          D +  + E  G   +++   +  L   
Subjt:  HHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSN

Query:  SKMESPSTVTSHDTFWRKFWKI
         K   P  + +   F++K+W I
Subjt:  SKMESPSTVTSHDTFWRKFWKI

GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]2.8e-1630.58Show/hide
Query:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV
        + S    D     N  I+ + F+  +A  I++TP+      D+IIW+ E+ G+++V+ AYHL     +  +  PS+ +  D+ W+K WK     K K  +
Subjt:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV

Query:  WRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI
         R+  + LPTR N+ KKGI ++  CP   +  E+A H+   C +          L   LFA   G   P   D   WL++ L   ++L  ++  TILW  
Subjt:  WRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI

Query:  WEYRNK
        W  RN+
Subjt:  WEYRNK

GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]2.2e-4526.24Show/hide
Query:  WKRRARVENNSV-DSESLKGATIANVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRK
        WKRR  +   +V D  S     I    R    +   R  +FE  W   S  ++IVE  W +    S      KL  C   L +W  D +    +  ID  
Subjt:  WKRRARVENNSV-DSESLKGATIANVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRK

Query:  RDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSS
        +  +  + G     +  S  +A+ +  +LL  +ESYW+ R++E WL+ GD+NTK+FH KA+ R+++N I  L + +G W D    +  + ++YFK++F+S
Subjt:  RDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSS

Query:  S----------------------------------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLF
        +                                                          N  +V   F+  D ++IL+ P+    + D   W  +K+G +
Subjt:  S----------------------------------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLF

Query:  TVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIP
        +V SAY L       + +   T   +  FW+K W I A PK +  +WR +  SLPTR  +  + +P   +CP      ES KH+L +C  ++++W     
Subjt:  TVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIP

Query:  LTNGLFALNRGSWIP--RDYWAWLMDNLA---EEELEIAITILWSIWEYRNKVTHTESKPIYQEI---SRIISSKIDFPKVVSRTYLPKSSGKNQPTVKN
              A + G + P    +  WLM  LA     +      + WSIWE RN V    ++P           +  + D   V  R  +P       PT  +
Subjt:  LTNGLFALNRGSWIP--RDYWAWLMDNLA---EEELEIAITILWSIWEYRNKVTHTESKPIYQEI---SRIISSKIDFPKVVSRTYLPKSSGKNQPTVKN

Query:  LASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLL
        L +   WK P    +KLNVD + N      G+G ++R+  GS +     RM   +S    E   V E L  I          +  +++VESD   VVN+L
Subjt:  LASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLL

Query:  NEED-EDFTEISFLIQEISRLKKNFK-EISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSP
        N  + E+F+ +  ++ +   L      EI F +  R  N  A  LA+   SFP+ +    S P
Subjt:  NEED-EDFTEISFLIQEISRLKKNFK-EISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSP

KAE8800683.1 retrotransposon unclassified [Hordeum vulgare]2.3e-5023.47Show/hide
Query:  RILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKW
        +IL+ KL +P+     + ++W     +  K MG N ++  F     K R    GPW +D  L+V+E+     R+    F  +  WV +++LP+  M  + 
Subjt:  RILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKW

Query:  AEKLGNSLGEFVEADLDEGGSTENT-LRIQVKIDVSEPLKRGLMV------------RIGSKAEET-WVKVTYERLPEFCYGCGIIGHVQQECEKISNEG
        AE +GN +G+FVEAD    GS     LRI++++ + +PL RG  +             +G + + + W +  YE LP+FCY CG++GH +++C     +G
Subjt:  AEKLGNSLGEFVEADLDEGGSTENT-LRIQVKIDVSEPLKRGLMV------------RIGSKAEET-WVKVTYERLPEFCYGCGIIGHVQQECEKISNEG

Query:  EENLYGDFM-------RATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGR--SSPE
        E+  +G ++       RA    GG  K            GRE G +  Y +  +R N +     +    R+D    S +       PEN+  G   +SP 
Subjt:  EENLYGDFM-------RATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGR--SSPE

Query:  KETVGKSQ-GQPTASEVSEQSRMKTGKAVMETNCQAVEIGK-IRDLGQKYVPDKKATKGIVLK--DQTSEMITRKEWASGTEKDKKGPRAK--NSQSGLE
        K T G+ Q G P    + E  + K G    E     V  GK +R+ G         TK   L    Q  +   R    + ++ + K P  K    +  +E
Subjt:  KETVGKSQ-GQPTASEVSEQSRMKTGKAVMETNCQAVEIGK-IRDLGQKYVPDKKATKGIVLK--DQTSEMITRKEWASGTEKDKKGPRAK--NSQSGLE

Query:  KEINKALEREKHLEEKE---ISEMDTNQDS-------LMFRGEGKNVRTWKRRARVENNSVDSESLKGATIAN-----------VGRKKPLKRKRRSK--
               +RE  L +K     +E +  +D+       +M RGE  N    +R  R   N+   E+     + N           +   K   R+R  K  
Subjt:  KEINKALEREKHLEEKE---ISEMDTNQDS-------LMFRGEGKNVRTWKRRARVENNSVDSESLKGATIAN-----------VGRKKPLKRKRRSK--

Query:  ---KFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEES
           +FE  W+      + ++ +W+E         +  + R    + KW+K  + G +   + + R E+ R M    S   +         L  L  ++  
Subjt:  ---KFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEES

Query:  YWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWN
          K RS   WL+ G+RNT++F    + RKK+N +K L    G  + +  E+ N   SYF++LF++             + E+      GG G   + + +
Subjt:  YWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWN

Query:  REKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYL-CPFSRSSEESAKHILWDCKLSK
          K                              +  W++ WK+      ++  WRI H+SL    N+ ++G  +    C F   ++E   H+   CK+ K
Subjt:  REKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYL-CPFSRSSEESAKHILWDCKLSK

Query:  NLW--------KIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEIS-RIISSKIDFPKVVSRTYLPKSS
         +W        +I +     + A+        DY  W +D    + L + +T  W  W YRNKV   E      E++ R  SS +++ ++ +        
Subjt:  NLW--------KIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEIS-RIISSKIDFPKVVSRTYLPKSS

Query:  GKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVES
            P  K +++   WKPP +  +K N+D S+    +  G G   R S G  +     R          E       +I++ +   L   L  +  + E+
Subjt:  GKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVES

Query:  DASSVVNLLNEEDEDFTEISFLIQEIS-RLKKNFKEISFLYCPRDQNVAADLLARVAISFP
        D+  ++  L+    D +  + +I++   +LK  F +     C R  N  A  LA V   +P
Subjt:  DASSVVNLLNEEDEDFTEISFLIQEIS-RLKKNFKEISFLYCPRDQNVAADLLARVAISFP

KAE8813692.1 hypothetical protein D1007_09196 [Hordeum vulgare]8.9e-4722.72Show/hide
Query:  KLKITTAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIED
        K+K++  E+ K+VA +                 ++L+ +L +P+  +  + R+W     +  K +G N ++  FN    K R    GPW +++ L+V++ 
Subjt:  KLKITTAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIED

Query:  TQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLD-EGGSTENTLRIQVKIDVSEPL-----------------KRGLMVRIGSK
             R+    F     WV I +LP+  M  + AE++GN +G FVEAD+  +G +    LR+++++ + +P+                 KR + +    +
Subjt:  TQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLD-EGGSTENTLRIQVKIDVSEPL-----------------KRGLMVRIGSK

Query:  AEE-TWVKVTYERLPEFCYGCGIIGHVQQECEKISNEGEENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWR
         E+  W +  YE LP+FCY CG++GH Q+ C     +GE+  +G  +RA        ++K    + G++ GR RG  GA  F   R  +K     +    
Subjt:  AEE-TWVKVTYERLPEFCYGCGIIGHVQQECEKISNEGEENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWR

Query:  RRDQSGKSNEENVGVLRPENSTGGRSSPEKETVG-KSQGQP----TASEVSEQSRMKTGKAVMETNCQAVEI-GKIRDLGQKYVPDKKATKGIVLKDQTS
        R+D S  +  +     + E  T   S P+    G +  G+P       + +E   +K+       +   VE+     + G + V D +   G    ++  
Subjt:  RRDQSGKSNEENVGVLRPENSTGGRSSPEKETVG-KSQGQP----TASEVSEQSRMKTGKAVMETNCQAVEI-GKIRDLGQKYVPDKKATKGIVLKDQTS

Query:  EMIT---RKEWASGTEKDKKGP--------------------------RAKNSQSGL-EKEINKALEREKHLEEKEISEMDTNQDSLMFRG------EGK
        +M T    ++    T+  KKGP                            K +Q G  E+E +K    E     +  + MD  +++L+  G       G 
Subjt:  EMIT---RKEWASGTEKDKKGP--------------------------RAKNSQSGL-EKEINKALEREKHLEEKEISEMDTNQDSLMFRG------EGK

Query:  NVRTWKRRARVENNSVDSESLKGATIANV-------GRKKPLKRKRRSK-----------------------KFEEAWIRVSDSKKIVESSWKE--IPGR
           TW+   R E +S   E L  AT ANV       G     +R R S                        +FE  W++     ++++ +W+E  + G 
Subjt:  NVRTWKRRARVENNSVDSESLKGATIANV-------GRKKPLKRKRRSK-----------------------KFEEAWIRVSDSKKIVESSWKE--IPGR

Query:  SFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERK
           +    + +  + + +W+K+ I G +   + + + E+ R M    S A +   G       +L  ++ +  K RS   WL+ G+RNT++F   AS R+
Subjt:  SFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERK

Query:  KRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPST
        K+N IKGL+   G+ + + D + N   SYF                                            +GLFT   +  LA LL   +      
Subjt:  KRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPST

Query:  VTSHDTFWRKFWKIKALPK-AKICVWRIIHDSLPTRVNILKKGIPI-NYLCPFSRSSEESAKHILWDCKLSKNLWK-IFIPLTNGLFALNRGSWIPRDYW
                    ++ A P+  ++  WR+ H+SL  R N+ K+GIP+ +  C F   +EE   H+   CK  K  W+ + +    GL           D+ 
Subjt:  VTSHDTFWRKFWKIKALPK-AKICVWRIIHDSLPTRVNILKKGIPI-NYLCPFSRSSEESAKHILWDCKLSKNLWK-IFIPLTNGLFALNRGSWIPRDYW

Query:  AWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQF
         W    L+E +    +T  W  W  RNK+   E     +E++R   +      V+    + ++  K++        H  WKPPPD ++K+N D S+    
Subjt:  AWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQF

Query:  RKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEIS-RLKKNFKE
             G + RD +G  +     R        A E  A+   +        +   L  + ++ E+D+S ++  ++    D +  + +I+++  +LK  F +
Subjt:  RKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEIS-RLKKNFKE

Query:  ISFLYCPRDQNVAADLLA
             C R+ N  A  LA
Subjt:  ISFLYCPRDQNVAADLLA

PWA36168.1 hypothetical protein CTI12_AA602590 [Artemisia annua]8.6e-5022.44Show/hide
Query:  YLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSE
        +L    +  +  R+   GPW+++R L+V++  +   + + T    V FWV + ++P+         ++   +G+ +E  +D+   T+ +  I+VK     
Subjt:  YLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSE

Query:  PLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC---------------------------------------------------------
               VR         V + YERLP FCY CG++GH ++EC                                                         
Subjt:  PLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC---------------------------------------------------------

Query:  EKISNEGE--ENLYGDFMRATPIVGGTPK-----------QKTQENKRGNFWGRERGRRGAYQFQNNRQNQK--YNDEKEETWRRRDQSGKSNEENVGVL
        + I NE +  E+   D      + GGT             QK  +N  G+    + GR+     + + + Q+   N    + W+RR +   + +   G  
Subjt:  EKISNEGE--ENLYGDFMRATPIVGGTPK-----------QKTQENKRGNFWGRERGRRGAYQFQNNRQNQK--YNDEKEETWRRRDQSGKSNEENVGVL

Query:  RPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKA-------VMETNCQAVEIGKIRDLGQKY---VPD--KKATKGIVLKDQTSEMITR-KEWA
           NS    + P   ++     Q   +  + Q      K        +MET     E+   R +  +Y   V D  ++A   +V +D      T    W 
Subjt:  RPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKA-------VMETNCQAVEIGKIRDLGQKY---VPD--KKATKGIVLKDQTSEMITR-KEWA

Query:  SGTEKDKKGPRAKNSQSGLEK---------EINKALEREKHLEEKEISEMDTNQDSLMF----------------RGEGKNVRTWKRRARVENNSVDSES
           EK +     ++ ++  E+         EI  A E+E        +EM   +++  F                 G   N    KR  R   NS   + 
Subjt:  SGTEKDKKGPRAKNSQSGLEK---------EINKALEREKHLEEKEISEMDTNQDSLMF----------------RGEGKNVRTWKRRARVENNSVDSES

Query:  LKGATIANVGR------------KKPLKRKRRSKKFEEAWIRVSDSKKIVESSWK-EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEI
           A+  N+ R               +K+K R  +FE  W+R      +V   W   +      D    ++ C + L  WNK R  G ++ +I  K+  +
Subjt:  LKGATIANVGR------------KKPLKRKRRSKKFEEAWIRVSDSKKIVESSWK-EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEI

Query:  LRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNP-
          ++     +  A      ++++ LL  EE  WK RSR +WL+ GD+NT++FH +AS R++RN I  L    G W+++ +E+  + SSYF DLFSSS+P 
Subjt:  LRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNP-

Query:  ---TIVRE---------------SFSAIDAEVILNTPVGG------------------------EGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMES
           ++VR+                 ++ +   +LNT   G                        +   D + W+    G F+ KSAY LA+      + +
Subjt:  ---TIVRE---------------SFSAIDAEVILNTPVGG------------------------EGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMES

Query:  PSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYW
         +   S   FWR  WK +   K K+ +WR  ++ +PT  N+  +G+     C     + E+  H+L+ C ++K++W        G F   +G+   +D+ 
Subjt:  PSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYW

Query:  AWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKI-DFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQ
          +++     E E  + ILW +W  RN+  H +       +  I  S + D+ K   R  +  +SG     V N+ +  +W  P    +K+N DA+W  +
Subjt:  AWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKI-DFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQ

Query:  FRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKE
          K G+G++ R+  G  +  G +      S    EAKAV   +  ++     H    + SL +     + V LL        +I+ L  EI      F  
Subjt:  FRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKE

Query:  ISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCIKSLLNEVGVLD
         ++ +  R+ N  A  +A +A+S  S    LD S  V      W     + IKS     G+++
Subjt:  ISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCIKSLLNEVGVLD

TrEMBL top hitse value%identityAlignment
A0A2N9GD63 Reverse transcriptase domain-containing protein3.3e-4726.05Show/hide
Query:  EDLQGAIFCRILTPKLINPE-VFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHD
        ED Q  +  R +T + +N E V +TF P +W       ++ +G N  +  F +  + +R+++G PW+YD+ L+  +     + ++     YV+FWV I++
Subjt:  EDLQGAIFCRILTPKLINPE-VFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHD

Query:  LPMVCMRRKWAEKLGNSLGEFVE-ADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQECEK-ISNEG--
        LP+  M+R++A  LG+++GE  + A+ ++    E  +RI+VK+D+S+PL RG   R+ S   ETW+   YERLP FCY CG + H +++CE  + ++G  
Subjt:  LPMVCMRRKWAEKLGNSLGEFVE-ADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQECEK-ISNEG--

Query:  --EENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGRSSPEKETVGKS
          EE  YG ++RA P+    P ++ +    GN    +         ++ R+ Q    + + +W    +   S +EN+G  +  N+    +  E +  G++
Subjt:  --EENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGRSSPEKETVGKS

Query:  QGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEMITRKEWA-SGTEKDKKGPRAKNSQSGLEK-----------
        +G      V       TG   +        IG   ++    V  KK  +G+   +     +   +W  +G        R   S + L+K           
Subjt:  QGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEMITRKEWA-SGTEKDKKGPRAKNSQSGLEK-----------

Query:  --EINKALEREKHLEEKEISEMD-------TNQDSLMFRGEGKNVRTWKR--------RARVENNSVDSESLKGATIANVGR------------------
          + N+ L  ++ L E   S+ +        N+  L+  G      TW+         + R++        +   T+  +                    
Subjt:  --EINKALEREKHLEEKEISEMD-------TNQDSLMFRGEGKNVRTWKR--------RARVENNSVDSESLKGATIANVGR------------------

Query:  KKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIP--GRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRMEGDDSTANLASIGLAEKEL
            + KRR +KFEE W    + +KI+   W +    G        K+ +C  +L +W K           D+       + G+    N  +I   ++E+
Subjt:  KKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIP--GRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRMEGDDSTANLASIGLAEKEL

Query:  ENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEV
          LL  EE +W+ RSR  WL+ GDRNTK+FH  A++RKK N I+G  N + VW D E ++ +IA  YF+++F++S PT + E+ +A+D  V
Subjt:  ENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEV

A0A2U1KHJ0 CCHC-type domain-containing protein4.2e-5022.44Show/hide
Query:  YLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSE
        +L    +  +  R+   GPW+++R L+V++  +   + + T    V FWV + ++P+         ++   +G+ +E  +D+   T+ +  I+VK     
Subjt:  YLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSE

Query:  PLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC---------------------------------------------------------
               VR         V + YERLP FCY CG++GH ++EC                                                         
Subjt:  PLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC---------------------------------------------------------

Query:  EKISNEGE--ENLYGDFMRATPIVGGTPK-----------QKTQENKRGNFWGRERGRRGAYQFQNNRQNQK--YNDEKEETWRRRDQSGKSNEENVGVL
        + I NE +  E+   D      + GGT             QK  +N  G+    + GR+     + + + Q+   N    + W+RR +   + +   G  
Subjt:  EKISNEGE--ENLYGDFMRATPIVGGTPK-----------QKTQENKRGNFWGRERGRRGAYQFQNNRQNQK--YNDEKEETWRRRDQSGKSNEENVGVL

Query:  RPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKA-------VMETNCQAVEIGKIRDLGQKY---VPD--KKATKGIVLKDQTSEMITR-KEWA
           NS    + P   ++     Q   +  + Q      K        +MET     E+   R +  +Y   V D  ++A   +V +D      T    W 
Subjt:  RPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKA-------VMETNCQAVEIGKIRDLGQKY---VPD--KKATKGIVLKDQTSEMITR-KEWA

Query:  SGTEKDKKGPRAKNSQSGLEK---------EINKALEREKHLEEKEISEMDTNQDSLMF----------------RGEGKNVRTWKRRARVENNSVDSES
           EK +     ++ ++  E+         EI  A E+E        +EM   +++  F                 G   N    KR  R   NS   + 
Subjt:  SGTEKDKKGPRAKNSQSGLEK---------EINKALEREKHLEEKEISEMDTNQDSLMF----------------RGEGKNVRTWKRRARVENNSVDSES

Query:  LKGATIANVGR------------KKPLKRKRRSKKFEEAWIRVSDSKKIVESSWK-EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEI
           A+  N+ R               +K+K R  +FE  W+R      +V   W   +      D    ++ C + L  WNK R  G ++ +I  K+  +
Subjt:  LKGATIANVGR------------KKPLKRKRRSKKFEEAWIRVSDSKKIVESSWK-EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEI

Query:  LRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNP-
          ++     +  A      ++++ LL  EE  WK RSR +WL+ GD+NT++FH +AS R++RN I  L    G W+++ +E+  + SSYF DLFSSS+P 
Subjt:  LRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNP-

Query:  ---TIVRE---------------SFSAIDAEVILNTPVGG------------------------EGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMES
           ++VR+                 ++ +   +LNT   G                        +   D + W+    G F+ KSAY LA+      + +
Subjt:  ---TIVRE---------------SFSAIDAEVILNTPVGG------------------------EGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMES

Query:  PSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYW
         +   S   FWR  WK +   K K+ +WR  ++ +PT  N+  +G+     C     + E+  H+L+ C ++K++W        G F   +G+   +D+ 
Subjt:  PSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYW

Query:  AWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKI-DFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQ
          +++     E E  + ILW +W  RN+  H +       +  I  S + D+ K   R  +  +SG     V N+ +  +W  P    +K+N DA+W  +
Subjt:  AWLMDNLAEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKI-DFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQ

Query:  FRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKE
          K G+G++ R+  G  +  G +      S    EAKAV   +  ++     H    + SL +     + V LL        +I+ L  EI      F  
Subjt:  FRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKE

Query:  ISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCIKSLLNEVGVLD
         ++ +  R+ N  A  +A +A+S  S    LD S  V      W     + IKS     G+++
Subjt:  ISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCIKSLLNEVGVLD

A0A2Z6NZV1 Uncharacterized protein7.4e-4724.33Show/hide
Query:  ELEKLKIT-TAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL
        E EK  I+ T +    + +E E++ +  E  +  +  ++ T    N   FK  + + W  +  + ++ +  N +L  F    + D + R GPW++DR LL
Subjt:  ELEKLKIT-TAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL

Query:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP
        ++    G  + S      VNFWV ++DLP        A+KLGN +G F E D  +   T   LR++  +D+ +PLKRG  ++   K    WV   YERLP
Subjt:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP

Query:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ
         FC+ CG IGH  +ECE + +  E N          YG ++RA+P+  +   P++                +    Q +  +  ++  D++      +  
Subjt:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ

Query:  SGKSNEENVGVLRPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSE---MITR
               N+ V     S G  +      VGK+      S+  +  R K+ K V     +  A E+GK R+L    + + +A   ++  +       M TR
Subjt:  SGKSNEENVGVLRPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSE---MITR

Query:  KE------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ
         +            ++SG   D KG   + +                                      +G+  E  K   +  +  LE    + +++  
Subjt:  KE------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ

Query:  DSLMFRGEGKNVRTWKRRARVENNSVDSESLKGATIANVGRKKPL---------------------------KRKRRSK---KFEEAWIRVSDSKKIVES
        + L F G      TW      ++N    +    ++   + R  P+                           +R+RR K   +FEE+W   +  + ++++
Subjt:  DSLMFRGEGKNVRTWKRRARVENNSVDSESLKGATIANVGRKKPL---------------------------KRKRRSK---KFEEAWIRVSDSKKIVES

Query:  SWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWF
         W + P  SF+D   +L R + N L    D   GSI   I R    I   +  D+S  ++      E  LE LL EEE+ W+ RSR  WL+ GD+NTK+F
Subjt:  SWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWF

Query:  HHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSN
        H KAS+R+K NEIK L +  G+W   E+++  +   YFK+LF+S++P+ V ++   +  ++ L          D +  + E  G   +++   +  L   
Subjt:  HHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSN

Query:  SKMESPSTVTSHDTFWRKFWKI
         K   P  + +   F++K+W I
Subjt:  SKMESPSTVTSHDTFWRKFWKI

A0A2Z6NZV1 Uncharacterized protein1.4e-1630.58Show/hide
Query:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV
        + S    D     N  I+ + F+  +A  I++TP+      D+IIW+ E+ G+++V+ AYHL     +  +  PS+ +  D+ W+K WK     K K  +
Subjt:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV

Query:  WRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI
         R+  + LPTR N+ KKGI ++  CP   +  E+A H+   C +          L   LFA   G   P   D   WL++ L   ++L  ++  TILW  
Subjt:  WRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI

Query:  WEYRNK
        W  RN+
Subjt:  WEYRNK

A0A2Z6NZV1 Uncharacterized protein1.8e-4522.83Show/hide
Query:  IEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRY
        I D  +++   D    +  R LT + +N    K  M  +W  E  + +K +G   Y+  F    E +R+    PW+++   L++E    +   S  S  +
Subjt:  IEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRY

Query:  VNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEAD-LDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC-
        +  WV ++ L       + A++LGN +G+FVEAD  +      + LRI+VK+DV +PLK+G  ++   K E   V   YERLP FC+ CG +GH ++ C 
Subjt:  VNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEAD-LDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC-

Query:  EKISNEGE--ENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGRSSPE
        E +   G   E  +G  +RA+         +   N  G  W RE             +     DE ++    +++ GK +       R   +  G SS  
Subjt:  EKISNEGE--ENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGRSSPE

Query:  KETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVL--------KDQTSEMITRKEWASGTEKDKKGP----RAKN
           + K  G P ++E++   R +    +ME++        +++   + + DK+    ++L        K    E       + G     K P        
Subjt:  KETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVL--------KDQTSEMITRKEWASGTEKDKKGP----RAKN

Query:  SQSGLEKEI-NKALEREKHL------EEKEISE-----MDTNQDSLM-----------FRGEGKNVRTWKRRARVE--NNSVDSESLKGATIANVGRKKP
        S+  ++ E+ + AL R +H       ++   SE      D +Q S +                K     K R  ++    +V+   L    +      KP
Subjt:  SQSGLEKEI-NKALEREKHL------EEKEISE-----MDTNQDSLM-----------FRGEGKNVRTWKRRARVE--NNSVDSESLKGATIANVGRKKP

Query:  LKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWN---KDRIGGSIRAAIDRKRDEILRMEGDDSTANLASIGLAEKELEN
           + R   FE AW      + ++   W+     S  +   KLN C + L  W    KD+    I      +R    R  G     +      A+K L  
Subjt:  LKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWN---KDRIGGSIRAAIDRKRDEILRMEGDDSTANLASIGLAEKELEN

Query:  LLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSG---VWIDD-EDEI--------------GNIASS------------------
        L   +E++W+ ++++ WL  GDRNTK+FH +ASER++ N I  L +S+G    W +  ED I              G+I  S                  
Subjt:  LLNEEESYWKLRSREDWLQWGDRNTKWFHHKASERKKRNEIKGLLNSSG---VWIDD-EDEI--------------GNIASS------------------

Query:  -----------------------------------------------YFKD-----------------------------------------------LF
                                                       YF+D                                               L 
Subjt:  -----------------------------------------------YFKD-----------------------------------------------LF

Query:  SSSNPTIVRES--------------------------FSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHD---
         + NP I  E+                          F+A D + I   P+ G  + D +IW  +K G++TVKSAY          +   +TV  H    
Subjt:  SSSNPTIVRES--------------------------FSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHD---

Query:  TFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLA
          W+K W ++ LPK +  +WR  ++ LP   N++ K + +  +CP   +SEE+  HI   C  ++ +W        G +A   GS+   D+   ++    
Subjt:  TFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLA

Query:  EEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWI
        E +  +A  +LW IW  RN                         K V R+ L                   W+ PP   LK+NVDAS     R  G+G+I
Subjt:  EEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWI

Query:  LRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFK-EISFLYCPR
        +RD  G  +     +    +  +  EA AV E L  I        S     +LVE+D   VV+ LN    D++  + +I++  RL+     +++F + PR
Subjt:  LRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFK-EISFLYCPR

Query:  DQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCI
          N AA LLAR              S   ++ +G W+  PP+ I
Subjt:  DQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCI

A0A803P5M6 Uncharacterized protein3.3e-5521.09Show/hide
Query:  MEEAMANELEKLKITTAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIK-AMGRNTYLCNFNNSFEKDRIKRGGPW
        M+  M     K+ +T  E + V    D +       +   ++ +ILT K +     +  M   W  +GR  +K +   + ++  F    +K R+    P+
Subjt:  MEEAMANELEKLKITTAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIK-AMGRNTYLCNFNNSFEKDRIKRGGPW

Query:  NYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEF---VEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEET
        ++    +V+   +     +     +  FWV I+ LP +   R  A  LGN +GE+    E  L+EG      LR++V +DVS+PLKRG M+ +    ++ 
Subjt:  NYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEF---VEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEET

Query:  WVKVTYERLPEFCYGCGIIGHVQQEC----EKISNEGEENL-YGDFMRATPIVGGTPKQKTQENKRGNFWG--------------RERGRRG--------
        WV   YERLPE+C  CG+IGH   +C    EK+ N  E  L Y  F++ + +      +   +  +G+ W                +  +RG        
Subjt:  WVKVTYERLPEFCYGCGIIGHVQQEC----EKISNEGEENL-YGDFMRATPIVGGTPKQKTQENKRGNFWG--------------RERGRRG--------

Query:  AYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEI---GKIRDLGQ
        + +  N    +  N+  ++    R    K   +   V+   ++    +    + +     + + +  S+ S +     V  T            + D+  
Subjt:  AYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEI---GKIRDLGQ

Query:  KYVPDKKATKGIVLKDQTSEMITRKEWASGTEKDKKGPRAKNSQSGLEKEINKALER------------------EKHLEEKEISEMDTNQDS-------
        K      AT  +  K      I+      GTE      ++K    GL   + + L+R                  + HL   ++ +   ++D+       
Subjt:  KYVPDKKATKGIVLKDQTSEMITRKEWASGTEKDKKGPRAKNSQSGLEKEINKALER------------------EKHLEEKEISEMDTNQDS-------

Query:  -------LMFRGE-------GKNVRTWKRR-------------------ARVENNSVDSESLKGATIANVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVE
               + F G+         N    K R                   A ++  S D  ++   TI  +   +    ++   +FE+ W++  D+  I+ 
Subjt:  -------LMFRGE-------GKNVRTWKRR-------------------ARVENNSVDSESLKGATIANVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVE

Query:  SSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME--GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTK
        + W +        + + L  C   L +W+  +  G+++  I   + ++ R+    D S + +  +  +E  L+ LL +EE+YW  RSR DWLQ GD+NT 
Subjt:  SSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME--GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTK

Query:  WFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSS-----------------------------------------------------
        +FH  A+ RK +N IK L+N+ G+ +  + E+ N+   Y+  LF+S                                                      
Subjt:  WFHHKASERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSS-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDT
                                       N  ++   F  ID + IL+ P+      D ++W+    G+++VK+ +HLA  L +    S ST      
Subjt:  -------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDT

Query:  FWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAE
        +W+ FW +K  PK +I  W++  + LPT V + K+ +  +  C    S+ ES  H L+ CK +K++WK+     +   A N       DY   L     +
Subjt:  FWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAE

Query:  EELEIAITILWSIWEYRNKVTH----TESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHA-IWKPPPDLSLKLNVDASWNDQFRKGG
         + E  + +LW IW  RNKV H         I Q  S+      +  K+ +R     SS  + P+      H   W+PP     KLNVDA+ N + +K G
Subjt:  EELEIAITILWSIWEYRNKVTH----TESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHA-IWKPPPDLSLKLNVDASWNDQFRKGG

Query:  VGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLL-VESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFL
        +G ILRD  G+ +    K +   +    +EAKA+   +  +        S S   L  +E+DAS V N LN  + D +  S LI +I  L  +F ++   
Subjt:  VGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLL-VESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFL

Query:  YCPRDQNVAADLLARVAI
        +  R  N AA  LA+ A+
Subjt:  YCPRDQNVAADLLARVAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-2030.14Show/hide
Query:  CPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDY--WAWLMDNLAEEEL-------EIAITILWSIWEYRNKVTHTESKPIYQEIS
        CP SR   E+  H+L+ C  ++ +W I     + + A   G W    Y    W+++   E E+        +   +LW +W+ RN++     +    E+ 
Subjt:  CPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDY--WAWLMDNLAEEEL-------EIAITILWSIWEYRNKVTHTESKPIYQEIS

Query:  RIISSKIDFPKVVSRTYLP-KSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEG
        R   +  DF +  +R  L  K+SG   P V+   S   WK PP   +K N DA+W  +  + G+GWILR+ SG  + +G + +           K VLE 
Subjt:  RIISSKIDFPKVVSRTYLP-KSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEG

Query:  LISILSVADLHPSLSSIS-LLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS
         +  L  A L  S  +   ++ ESDA ++VNLLN  D+ +  +   +++I +L  +F+E+ F + PR  N  AD +AR +ISF +  P L S
Subjt:  LISILSVADLHPSLSSIS-LLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS

AT3G25270.1 Ribonuclease H-like superfamily protein1.8e-2125.55Show/hide
Query:  KFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIF-IP----LTNGLFALNRGSWIPRDYWAWLMDNL
        K WK+K  PK K  +W+++  +L T  N+ ++ I  +  C      +E+++H+ +DC  ++ +W+   IP     T G+    +   +     A    N 
Subjt:  KFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIF-IP----LTNGLFALNRGSWIPRDYWAWLMDNL

Query:  AEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKID-----FPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRK
          +   +AI ILW +W+ RN++   +    +Q   +   + +         V S      SS   QPT+    +   W+ PP   +K N D ++N Q R 
Subjt:  AEEELEIAITILWSIWEYRNKVTHTESKPIYQEISRIISSKID-----FPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRK

Query:  GGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHP-SLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEIS
           GW++RD +G  + +G  +     +  +LE++         L +A  H  S     ++ E D+  V  L+N E  +F   ++ I+E    +K F+E  
Subjt:  GGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHP-SLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEIS

Query:  FLYCPRDQNVAADLLAR
        F + PR  N  AD+LA+
Subjt:  FLYCPRDQNVAADLLAR

AT3G42140.1 zinc ion binding;nucleic acid binding2.5e-0724.82Show/hide
Query:  IKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSK
        I R GPW+++  + VI+  +     S   F+ + FW+ I  +P+  +  +    +G  +G F+E +L                DVS              
Subjt:  IKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSK

Query:  AEETWVKVTYERLPEFCYGCGIIGHVQQECEKISNEG
             +K  YE+L  FC  CG++ H   EC    N+G
Subjt:  AEETWVKVTYERLPEFCYGCGIIGHVQQECEKISNEG

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-3426.75Show/hide
Query:  IVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVT--SHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNI
        ++   F  ++ ++I     GG    D   W+    G +TVKS Y +   + N K  SP  V+  S +  ++K WK +  PK +  +W+ + +SLP    +
Subjt:  IVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVT--SHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNI

Query:  LKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKI-FIPLTNGLFALNRGSWIP----RDYWAWLMDN---LAEEELEIAITILWSIWEYRNKVTHTE
          + +     C    S +E+  H+L+ C  ++  W I  IP+  G      G W        YW + + N     E+  ++   +LW +W+ RN++    
Subjt:  LKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKI-FIPLTNGLFALNRGSWIP----RDYWAWLMDN---LAEEELEIAITILWSIWEYRNKVTHTE

Query:  SKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKAL
         +   QE+ R     ++  ++  RT       K Q    N +S   W+PPP   +K N DA+WN    + G+GW+LR+  G    +G + +         
Subjt:  SKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKAL

Query:  EAKAVLEGLISILSVADLHPSLSSISLLV-ESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS
        + K+VLE  +  +  A L  S    + ++ ESD+  ++ +LN  DE +  +   IQ++ RL   F E+ F++ PR+ N  A+ +AR ++SF +  P L S
Subjt:  EAKAVLEGLISILSVADLHPSLSSISLLV-ESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.6e-0922.06Show/hide
Query:  ILWSIWEYRNKVTHTESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSI
        ++W IW+  N +    ++  +Q  + +  +  D  + +  T   +    N+    + + +  W PP    LK N DAS +++    G+GWILR+S G+ I
Subjt:  ILWSIWEYRNKVTHTESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSI

Query:  CVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLA
          G  +   R + +  E   ++  +                 ++ E D  ++  ++N +  +   +   +  I     +F+ I F +  R+QN  AD LA
Subjt:  CVGFKRMHNRWSIKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLA

Query:  RVAI
        + AI
Subjt:  RVAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCAATCAGGGTGGCACCATGCTACAGAAGGGATCCCCAAGCGAGGAACCTGCGGTTAAAAAAACAGAAACAAGCACACCTCAATTGAATGAAATACAAGCAAT
GGAAATGGAGAAAGATCAGGAAGAAGGAAAAAGTATGGAAGAAGCTATGGCAAACGAGCTAGAAAAACTAAAAATTACCACAGCAGAAAGAGCAAAAGTGGTGGCGATTG
AAGATGAAGACTTGGAAAAAGCTACAGAGGACCTCCAGGGAGCGATCTTTTGCAGAATCCTCACTCCAAAACTCATTAATCCCGAAGTGTTCAAAACCTTCATGCCAAGG
ATTTGGAACAAAGAGGGGAGAGTTAGGATAAAAGCGATGGGAAGAAACACCTACCTTTGCAACTTCAACAACAGTTTTGAAAAGGACAGAATAAAGAGGGGTGGTCCCTG
GAACTATGACAGAGCGCTATTGGTCATTGAAGACACTCAAGGAGCTAGCAGAATATCTCACACCAGCTTCAGGTATGTAAACTTTTGGGTCCATATCCATGACTTACCAA
TGGTTTGTATGCGCAGGAAATGGGCCGAGAAGCTTGGAAATTCCTTGGGAGAATTCGTAGAAGCTGACCTTGACGAAGGAGGAAGCACTGAGAATACGTTACGCATCCAA
GTAAAGATAGATGTCTCAGAACCTCTCAAGAGAGGTCTTATGGTGAGAATTGGGTCGAAAGCCGAAGAAACATGGGTGAAGGTGACTTATGAACGCCTCCCGGAGTTCTG
TTATGGATGCGGTATTATCGGCCATGTTCAACAAGAATGCGAAAAAATCAGCAACGAGGGTGAAGAGAATCTGTATGGAGATTTCATGAGAGCGACTCCAATCGTTGGAG
GAACACCAAAACAGAAAACTCAAGAAAATAAAAGAGGAAATTTCTGGGGTAGAGAACGAGGTAGAAGAGGTGCATACCAATTCCAAAACAACAGACAAAACCAGAAATAC
AATGATGAAAAAGAGGAAACATGGAGACGAAGAGATCAAAGTGGAAAAAGCAACGAAGAAAACGTCGGCGTCCTGAGACCGGAAAATTCCACAGGAGGCAGGAGCTCGCC
GGAGAAGGAGACCGTTGGAAAATCCCAAGGCCAGCCAACGGCTAGTGAAGTGTCAGAGCAAAGCAGAATGAAAACAGGAAAAGCGGTGATGGAGACAAATTGTCAGGCGG
TGGAAATAGGAAAAATCAGGGATCTAGGACAAAAGTACGTGCCAGACAAAAAAGCTACAAAAGGAATAGTATTAAAGGACCAGACCAGTGAAATGATTACTAGAAAAGAA
TGGGCAAGTGGAACAGAAAAGGACAAAAAAGGCCCAAGGGCCAAGAATAGCCAATCTGGGCTGGAGAAGGAAATCAATAAAGCCCTGGAAAGAGAAAAACACCTAGAGGA
AAAAGAAATATCAGAGATGGACACAAACCAAGACAGCCTTATGTTCAGAGGAGAAGGTAAAAATGTTCGAACATGGAAAAGGCGTGCTAGAGTGGAGAATAACAGTGTTG
ATTCTGAAAGCTTAAAAGGAGCTACGATAGCAAATGTAGGAAGGAAAAAACCTCTGAAAAGAAAAAGAAGAAGCAAAAAATTTGAAGAGGCTTGGATAAGAGTCTCGGAC
AGCAAAAAGATTGTGGAGAGCTCTTGGAAAGAAATTCCGGGAAGAAGCTTTACTGATTACTCCAATAAGCTGAATCGATGCCTTTCCAATCTTCTGAAATGGAACAAAGA
CAGAATTGGGGGATCCATTAGAGCTGCTATAGACAGAAAGAGAGATGAGATTTTGAGGATGGAAGGAGATGATTCCACGGCAAACCTTGCTTCCATTGGGTTGGCAGAGA
AGGAATTGGAAAATCTCCTCAACGAAGAAGAGAGCTATTGGAAACTCCGGTCGAGAGAGGATTGGCTTCAATGGGGTGACAGGAACACGAAGTGGTTCCACCATAAAGCT
TCAGAGAGGAAAAAAAGAAACGAGATAAAAGGGCTGTTGAACAGCTCTGGTGTTTGGATTGATGATGAAGATGAAATTGGGAATATAGCCTCCTCATATTTCAAAGATCT
CTTCTCCTCTTCGAACCCAACCATAGTGAGAGAATCCTTCTCAGCCATTGACGCTGAAGTTATTCTCAATACCCCTGTCGGCGGGGAAGGCACGAGGGATGAGATAATCT
GGAATAGAGAAAAGAAAGGCTTATTCACGGTCAAAAGTGCATACCACCTTGCTGTTCTTCTTTCAAATAGCAAGATGGAATCGCCCTCAACCGTAACTAGCCATGACACC
TTCTGGAGGAAATTTTGGAAAATTAAAGCTCTCCCTAAGGCTAAAATCTGTGTTTGGAGGATCATTCATGACTCTCTTCCAACAAGAGTTAATATTCTCAAAAAAGGAAT
CCCTATTAACTACCTCTGCCCTTTTTCCAGGTCCTCCGAGGAGTCAGCAAAGCACATTTTATGGGACTGTAAGTTGTCTAAGAATTTATGGAAAATTTTTATCCCCCTAA
CCAACGGTCTGTTTGCGTTAAATAGGGGATCATGGATTCCTAGAGACTACTGGGCTTGGTTGATGGACAATCTAGCGGAAGAAGAATTGGAGATAGCAATCACCATCCTT
TGGAGTATTTGGGAGTACAGAAACAAAGTTACCCACACAGAGAGCAAACCAATCTATCAAGAAATCTCCAGAATTATCAGCAGCAAAATAGATTTTCCAAAAGTTGTTTC
GAGAACCTACCTGCCGAAGTCGAGTGGAAAAAATCAGCCCACAGTGAAGAACCTTGCGAGTCACGCGATTTGGAAGCCTCCCCCGGATTTATCGCTAAAGCTGAATGTAG
ATGCCTCCTGGAACGACCAATTCAGAAAAGGTGGCGTCGGTTGGATCCTCCGTGACTCCTCAGGGTCTTCGATCTGTGTGGGCTTCAAGCGCATGCATAATAGGTGGTCA
ATCAAAGCGTTGGAGGCGAAGGCAGTCTTGGAAGGCCTGATTAGCATCCTTTCAGTTGCTGATCTTCATCCCTCTCTAAGCTCCATATCTCTGTTGGTGGAGTCGGATGC
TTCCTCTGTGGTGAATCTCCTCAACGAAGAAGATGAAGATTTTACAGAAATCTCCTTCCTCATTCAGGAGATCTCAAGATTGAAGAAGAACTTTAAAGAAATTTCTTTCT
TGTATTGCCCCAGAGATCAAAATGTTGCCGCGGATTTATTGGCGCGCGTGGCGATCTCGTTCCCTTCCCTTGTTCCTGTTTTGGACTCCTCTCCCATGGTGGAAGAGGGT
TTGGGTTTTTGGTATGGGCCTCCCCCATCTTGTATTAAAAGCCTCTTAAATGAGGTTGGTGTTCTTGATTTTTCTTCTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGCAATCAGGGTGGCACCATGCTACAGAAGGGATCCCCAAGCGAGGAACCTGCGGTTAAAAAAACAGAAACAAGCACACCTCAATTGAATGAAATACAAGCAAT
GGAAATGGAGAAAGATCAGGAAGAAGGAAAAAGTATGGAAGAAGCTATGGCAAACGAGCTAGAAAAACTAAAAATTACCACAGCAGAAAGAGCAAAAGTGGTGGCGATTG
AAGATGAAGACTTGGAAAAAGCTACAGAGGACCTCCAGGGAGCGATCTTTTGCAGAATCCTCACTCCAAAACTCATTAATCCCGAAGTGTTCAAAACCTTCATGCCAAGG
ATTTGGAACAAAGAGGGGAGAGTTAGGATAAAAGCGATGGGAAGAAACACCTACCTTTGCAACTTCAACAACAGTTTTGAAAAGGACAGAATAAAGAGGGGTGGTCCCTG
GAACTATGACAGAGCGCTATTGGTCATTGAAGACACTCAAGGAGCTAGCAGAATATCTCACACCAGCTTCAGGTATGTAAACTTTTGGGTCCATATCCATGACTTACCAA
TGGTTTGTATGCGCAGGAAATGGGCCGAGAAGCTTGGAAATTCCTTGGGAGAATTCGTAGAAGCTGACCTTGACGAAGGAGGAAGCACTGAGAATACGTTACGCATCCAA
GTAAAGATAGATGTCTCAGAACCTCTCAAGAGAGGTCTTATGGTGAGAATTGGGTCGAAAGCCGAAGAAACATGGGTGAAGGTGACTTATGAACGCCTCCCGGAGTTCTG
TTATGGATGCGGTATTATCGGCCATGTTCAACAAGAATGCGAAAAAATCAGCAACGAGGGTGAAGAGAATCTGTATGGAGATTTCATGAGAGCGACTCCAATCGTTGGAG
GAACACCAAAACAGAAAACTCAAGAAAATAAAAGAGGAAATTTCTGGGGTAGAGAACGAGGTAGAAGAGGTGCATACCAATTCCAAAACAACAGACAAAACCAGAAATAC
AATGATGAAAAAGAGGAAACATGGAGACGAAGAGATCAAAGTGGAAAAAGCAACGAAGAAAACGTCGGCGTCCTGAGACCGGAAAATTCCACAGGAGGCAGGAGCTCGCC
GGAGAAGGAGACCGTTGGAAAATCCCAAGGCCAGCCAACGGCTAGTGAAGTGTCAGAGCAAAGCAGAATGAAAACAGGAAAAGCGGTGATGGAGACAAATTGTCAGGCGG
TGGAAATAGGAAAAATCAGGGATCTAGGACAAAAGTACGTGCCAGACAAAAAAGCTACAAAAGGAATAGTATTAAAGGACCAGACCAGTGAAATGATTACTAGAAAAGAA
TGGGCAAGTGGAACAGAAAAGGACAAAAAAGGCCCAAGGGCCAAGAATAGCCAATCTGGGCTGGAGAAGGAAATCAATAAAGCCCTGGAAAGAGAAAAACACCTAGAGGA
AAAAGAAATATCAGAGATGGACACAAACCAAGACAGCCTTATGTTCAGAGGAGAAGGTAAAAATGTTCGAACATGGAAAAGGCGTGCTAGAGTGGAGAATAACAGTGTTG
ATTCTGAAAGCTTAAAAGGAGCTACGATAGCAAATGTAGGAAGGAAAAAACCTCTGAAAAGAAAAAGAAGAAGCAAAAAATTTGAAGAGGCTTGGATAAGAGTCTCGGAC
AGCAAAAAGATTGTGGAGAGCTCTTGGAAAGAAATTCCGGGAAGAAGCTTTACTGATTACTCCAATAAGCTGAATCGATGCCTTTCCAATCTTCTGAAATGGAACAAAGA
CAGAATTGGGGGATCCATTAGAGCTGCTATAGACAGAAAGAGAGATGAGATTTTGAGGATGGAAGGAGATGATTCCACGGCAAACCTTGCTTCCATTGGGTTGGCAGAGA
AGGAATTGGAAAATCTCCTCAACGAAGAAGAGAGCTATTGGAAACTCCGGTCGAGAGAGGATTGGCTTCAATGGGGTGACAGGAACACGAAGTGGTTCCACCATAAAGCT
TCAGAGAGGAAAAAAAGAAACGAGATAAAAGGGCTGTTGAACAGCTCTGGTGTTTGGATTGATGATGAAGATGAAATTGGGAATATAGCCTCCTCATATTTCAAAGATCT
CTTCTCCTCTTCGAACCCAACCATAGTGAGAGAATCCTTCTCAGCCATTGACGCTGAAGTTATTCTCAATACCCCTGTCGGCGGGGAAGGCACGAGGGATGAGATAATCT
GGAATAGAGAAAAGAAAGGCTTATTCACGGTCAAAAGTGCATACCACCTTGCTGTTCTTCTTTCAAATAGCAAGATGGAATCGCCCTCAACCGTAACTAGCCATGACACC
TTCTGGAGGAAATTTTGGAAAATTAAAGCTCTCCCTAAGGCTAAAATCTGTGTTTGGAGGATCATTCATGACTCTCTTCCAACAAGAGTTAATATTCTCAAAAAAGGAAT
CCCTATTAACTACCTCTGCCCTTTTTCCAGGTCCTCCGAGGAGTCAGCAAAGCACATTTTATGGGACTGTAAGTTGTCTAAGAATTTATGGAAAATTTTTATCCCCCTAA
CCAACGGTCTGTTTGCGTTAAATAGGGGATCATGGATTCCTAGAGACTACTGGGCTTGGTTGATGGACAATCTAGCGGAAGAAGAATTGGAGATAGCAATCACCATCCTT
TGGAGTATTTGGGAGTACAGAAACAAAGTTACCCACACAGAGAGCAAACCAATCTATCAAGAAATCTCCAGAATTATCAGCAGCAAAATAGATTTTCCAAAAGTTGTTTC
GAGAACCTACCTGCCGAAGTCGAGTGGAAAAAATCAGCCCACAGTGAAGAACCTTGCGAGTCACGCGATTTGGAAGCCTCCCCCGGATTTATCGCTAAAGCTGAATGTAG
ATGCCTCCTGGAACGACCAATTCAGAAAAGGTGGCGTCGGTTGGATCCTCCGTGACTCCTCAGGGTCTTCGATCTGTGTGGGCTTCAAGCGCATGCATAATAGGTGGTCA
ATCAAAGCGTTGGAGGCGAAGGCAGTCTTGGAAGGCCTGATTAGCATCCTTTCAGTTGCTGATCTTCATCCCTCTCTAAGCTCCATATCTCTGTTGGTGGAGTCGGATGC
TTCCTCTGTGGTGAATCTCCTCAACGAAGAAGATGAAGATTTTACAGAAATCTCCTTCCTCATTCAGGAGATCTCAAGATTGAAGAAGAACTTTAAAGAAATTTCTTTCT
TGTATTGCCCCAGAGATCAAAATGTTGCCGCGGATTTATTGGCGCGCGTGGCGATCTCGTTCCCTTCCCTTGTTCCTGTTTTGGACTCCTCTCCCATGGTGGAAGAGGGT
TTGGGTTTTTGGTATGGGCCTCCCCCATCTTGTATTAAAAGCCTCTTAAATGAGGTTGGTGTTCTTGATTTTTCTTCTTTTTAA
Protein sequenceShow/hide protein sequence
MASNQGGTMLQKGSPSEEPAVKKTETSTPQLNEIQAMEMEKDQEEGKSMEEAMANELEKLKITTAERAKVVAIEDEDLEKATEDLQGAIFCRILTPKLINPEVFKTFMPR
IWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQ
VKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQECEKISNEGEENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRERGRRGAYQFQNNRQNQKY
NDEKEETWRRRDQSGKSNEENVGVLRPENSTGGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEMITRKE
WASGTEKDKKGPRAKNSQSGLEKEINKALEREKHLEEKEISEMDTNQDSLMFRGEGKNVRTWKRRARVENNSVDSESLKGATIANVGRKKPLKRKRRSKKFEEAWIRVSD
SKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKA
SERKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDT
FWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFSRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEELEIAITIL
WSIWEYRNKVTHTESKPIYQEISRIISSKIDFPKVVSRTYLPKSSGKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWS
IKALEAKAVLEGLISILSVADLHPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEG
LGFWYGPPPSCIKSLLNEVGVLDFSSF