; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018149 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018149
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold9:30715234..30721007
RNA-Seq ExpressionSpg018149
SyntenySpg018149
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]1.1e-4924.79Show/hide
Query:  ELEKLKIT-TAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL
        E EK  I+ T +    + +E E++ E  E  +  +  ++ T    N   FK  + + W  +  + ++ +  N +L  F    + D + R GPW++DR LL
Subjt:  ELEKLKIT-TAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL

Query:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP
        ++    G  + S      VNFWV ++DLP        A+KLGN +G F E D  +   T   LR++  +D+ +PLKRG  ++   K    WV   YERLP
Subjt:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP

Query:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ
         FC+ CG IGH  +ECE + +  E N          YG ++RA+P+  +   P++                +    Q +  +  ++  D++      +  
Subjt:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ

Query:  SGKSNEENVGVLRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKE-
               N+ V     S    +      VGK+      S+  +  R K+ K V     +  A E+GK R+L    + + +A   ++  +    V + +  
Subjt:  SGKSNEENVGVLRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKE-

Query:  --------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ
                      ++SG   D KG   + +                                      +G+  E  K   +  +  LE    + +++  
Subjt:  --------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ

Query:  DSLMFRG------EGKNVRTWKR-------------------RARHLNFNSSDHRPILATLEVGRKKPLKRKRRSK---KFEEAWIRVSDSKKIVESSWK
        + L F G       G+N    K+                      HL    SDH  ++ TLE    +  +R+RR K   +FEE+W   +  + ++++ W 
Subjt:  DSLMFRG------EGKNVRTWKR-------------------RARHLNFNSSDHRPILATLEVGRKKPLKRKRRSK---KFEEAWIRVSDSKKIVESSWK

Query:  EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHK
        + P  SF+D   +L R + N L    D   GSI   I R    I   +  D+S  ++      E  LE LL EEE+ W+ RSR  WL+ GD+NTK+FH K
Subjt:  EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHK

Query:  ASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKM
        AS+ +K NEIK L +  G+W   E+++  +   YFK+LF+S++P+ V ++   +  ++ L          D +  + E  G   +++   +  L    K 
Subjt:  ASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKM

Query:  ESPSTVTSHDTFWRKFWKI
          P  + +   F++K+W I
Subjt:  ESPSTVTSHDTFWRKFWKI

GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]1.4e-1731.07Show/hide
Query:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV
        + S    D     N  I+ + F+  +A  I++TP+      D+IIW+ E+ G+++V+ AYHL     +  +  PS+ +  D+ W+K WK     K K  +
Subjt:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV

Query:  WRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI
         R+  + LPTR N+ KKGI ++  CP C +  E+A H+   C +          L   LFA   G   P   D   WL++ L   ++L  ++  TILW  
Subjt:  WRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI

Query:  WEYRNK
        W  RN+
Subjt:  WEYRNK

GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]2.4e-4926.88Show/hide
Query:  WKRR-----ARHLNFNSSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAI
        WKRR     A  L+++SSDH PIL  L+V    P +   R  +FE  W   S  ++IVE  W +    S      KL  C   L +W  D +    +  I
Subjt:  WKRR-----ARHLNFNSSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAI

Query:  DRKRDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDL
        D  +  +  + G     +  S  +A+ +  +LL  +ESYW+ R++E WL+ GD+NTK+FH KA+  +++N I  L + +G W D    +  + ++YFK++
Subjt:  DRKRDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDL

Query:  FSSS----------------------------------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKK
        F+S+                                                          N  +V   F+  D ++IL+ P+    + D   W  +K+
Subjt:  FSSS----------------------------------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKK

Query:  GLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKI
        G ++V SAY L       + +   T   +  FW+K W I A PK +  +WR +  SLPTR  +  + +P   +CP C    ES KH+L +C  ++++W  
Subjt:  GLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKI

Query:  FIPLTNGLFALNRGSWIP--RDYWAWLMDNLA---EEELEIAITILWSIWEYRNKVTHTENKPIYQEI---SRIISSKIDFPKVVSRTYLPKSSEKNQPT
                 A + G + P    +  WLM  LA     +      + WSIWE RN V     +P           +  + D   V  R  +P +++   P 
Subjt:  FIPLTNGLFALNRGSWIP--RDYWAWLMDNLA---EEELEIAITILWSIWEYRNKVTHTENKPIYQEI---SRIISSKIDFPKVVSRTYLPKSSEKNQPT

Query:  VKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVV
                 WK P    +KLNVD + N      G+G ++R+  GS +     RM   +S    E   V E L  I          +  +++VESD   VV
Subjt:  VKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVV

Query:  NLLNEED-EDFTEISFLIQEISRLKKNFK-EISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSP
        N+LN  + E+F+ +  ++ +   L      EI F +  R  N  A  LA+   SFP+ +    S P
Subjt:  NLLNEED-EDFTEISFLIQEISRLKKNFK-EISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSP

KAE8800683.1 retrotransposon unclassified [Hordeum vulgare]6.1e-5322.92Show/hide
Query:  RILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKW
        +IL+ KL +P+     + ++W     +  K MG N ++  F     K R    GPW +D  L+V+E+     R+    F  +  WV +++LP+  M  + 
Subjt:  RILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKW

Query:  AEKLGNSLGEFVEADLDEGGSTENT-LRIQVKIDVSEPLKRGLMV------------RIGSKAEET-WVKVTYERLPEFCYGCGIIGHVQQECEKISNEG
        AE +GN +G+FVEAD    GS     LRI++++ + +PL RG  +             +G + + + W +  YE LP+FCY CG++GH +++C     +G
Subjt:  AEKLGNSLGEFVEADLDEGGSTENT-LRIQVKIDVSEPLKRGLMV------------RIGSKAEET-WVKVTYERLPEFCYGCGIIGHVQQECEKISNEG

Query:  EENLYGDFM-------RATPIVGGTPKQKTQENKRGNF-WGRGRGRRGAYQ---------FQNNRQNQKYNDEKEE------TWRRRDQSGKSNEENVGV
        E+  +G ++       RA    GG  K   +   + N+ + R  GR G+           F+ +    +  ++ EE        + R QSG   +    +
Subjt:  EENLYGDFM-------RATPIVGGTPKQKTQENKRGNF-WGRGRGRRGAYQ---------FQNNRQNQKYNDEKEE------TWRRRDQSGKSNEENVGV

Query:  LRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKEWASGTEKDKKGPRA
        L  EN  E   +  +E V   +G        E+  + T +A +    QA +  +    G K   + K   G   + +      R E  SG    ++    
Subjt:  LRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKEWASGTEKDKKGPRA

Query:  -KNSQSGLEKEINKALEREKHLEEKEISEMDTNQDSLMFRGEGKNVRTWKRRARHLNFNS----SDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSD
         K +Q+  E E  K  ++ + +   E  E++      + R    N R  +        NS    SDHRPI+   +   ++   +   S +FE  W+    
Subjt:  -KNSQSGLEKEINKALEREKHLEEKEISEMDTNQDSLMFRGEGKNVRTWKRRARHLNFNS----SDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSD

Query:  SKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWG
          + ++ +W+E         +  + R    + KW+K  + G +   + + R E+ R M    S   +         L  L  ++    K RS   WL+ G
Subjt:  SKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWG

Query:  DRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYH
        +RNT++F    +  KK+N +K L    G  + +  E+ N   SYF++LF++             + E+      GG G   + + +  K           
Subjt:  DRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYH

Query:  LAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYL-CPFCRSSEESAKHILWDCKLSKNLW--------KIF
                           +  W++ WK+      ++  WRI H+SL    N+ ++G  +    C FC  ++E   H+   CK+ K +W        +I 
Subjt:  LAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYL-CPFCRSSEESAKHILWDCKLSKNLW--------KIF

Query:  IPLTNGLFALNRGSWIPRDYWAWLMDNLAEEELEIAITILWSIWEYRNKVTHTENKPIYQEIS-RIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHA
        +     + A+        DY  W +D    + L + +T  W  W YRNKV   E      E++ R  SS +++ ++ +            P  K +++  
Subjt:  IPLTNGLFALNRGSWIPRDYWAWLMDNLAEEELEIAITILWSIWEYRNKVTHTENKPIYQEIS-RIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHA

Query:  IWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDE
         WKPP +  +K N+D S+    +  G G   R S G  +     R          E       +I++ +   L   L  +  + E+D+  ++  L+    
Subjt:  IWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDE

Query:  DFTEISFLIQEIS-RLKKNFKEISFLYCPRDQNVAADLLARVAISFP
        D +  + +I++   +LK  F +     C R  N  A  LA V   +P
Subjt:  DFTEISFLIQEIS-RLKKNFKEISFLYCPRDQNVAADLLARVAISFP

KAE8813692.1 hypothetical protein D1007_09196 [Hordeum vulgare]2.6e-5121.8Show/hide
Query:  KLKITTAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIED
        K+K++  E+ K+VA +                 ++L+ +L +P+  +  + R+W     +  K +G N ++  FN    K R    GPW +++ L+V++ 
Subjt:  KLKITTAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIED

Query:  TQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLD-EGGSTENTLRIQVKIDVSEPL-----------------KRGLMVRIGSK
             R+    F     WV I +LP+  M  + AE++GN +G FVEAD+  +G +    LR+++++ + +P+                 KR + +    +
Subjt:  TQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLD-EGGSTENTLRIQVKIDVSEPL-----------------KRGLMVRIGSK

Query:  AEE-TWVKVTYERLPEFCYGCGIIGHVQQECEKISNEGEENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEE-TW
         E+  W +  YE LP+FCY CG++GH Q+ C     +GE+  +G  +RA        ++K    + G++ GRGRG  GA  F   R  +K     +  +W
Subjt:  AEE-TWVKVTYERLPEFCYGCGIIGHVQQECEKISNEGEENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEE-TW

Query:  RR---RDQSGKSNEENVG----------------VLRPENSTEGRSSPEKETV----GKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLG--
        R+   R   GK  +   G                + RP        + E E +    G  +G     E++  +     + V +T        +++ +   
Subjt:  RR---RDQSGKSNEENVG----------------VLRPENSTEGRSSPEKETV----GKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLG--

Query:  ------------QKYVPDKKATKGI-------VLKDQTSEVIIRKEWASGTEKDKKGPRAKNSQSGLEKEINKALER-EKHLEEKEISEMDTNQDSLMFR
                    QK  P ++  K +       V  +QT++ +I  + A   E +++  +    + G  +     ++R  + L    + ++    D   +R
Subjt:  ------------QKYVPDKKATKGI-------VLKDQTSEVIIRKEWASGTEKDKKGPRAKNSQSGLEKEINKALER-EKHLEEKEISEMDTNQDSLMFR

Query:  GEGKNVRTW--KRRAR-----------------HLNFNSSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKE--IPGRSFTDYSNK
           + + ++  +R  R                 H     SDHRP++        K      R  +FE  W++     ++++ +W+E  + G    +    
Subjt:  GEGKNVRTW--KRRAR-----------------HLNFNSSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKE--IPGRSFTDYSNK

Query:  LNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGL
        + +  + + +W+K+ I G +   + + + E+ R M    S A +   G       +L  ++ +  K RS   WL+ G+RNT++F   AS  +K+N IKGL
Subjt:  LNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILR-MEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGL

Query:  LNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFW
        +   G+ + + D + N   SYF                                            +GLFT   +  LA LL   +              
Subjt:  LNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFW

Query:  RKFWKIKALPK-AKICVWRIIHDSLPTRVNILKKGIPI-NYLCPFCRSSEESAKHILWDCKLSKNLWK-IFIPLTNGLFALNRGSWIPRDYWAWLMDNLA
            ++ A P+  ++  WR+ H+SL  R N+ K+GIP+ +  C FC  +EE   H+   CK  K  W+ + +    GL           D+  W    L+
Subjt:  RKFWKIKALPK-AKICVWRIIHDSLPTRVNILKKGIPI-NYLCPFCRSSEESAKHILWDCKLSKNLWK-IFIPLTNGLFALNRGSWIPRDYWAWLMDNLA

Query:  EEELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWI
        E +    +T  W  W  RNK+   E     +E++R   +      V+    + ++ +K++        H  WKPPPD ++K+N D S+         G +
Subjt:  EEELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWI

Query:  LRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEIS-RLKKNFKEISFLYCPR
         RD +G  +     R        A E  A+   +        +   L  + ++ E+D+S ++  ++    D +  + +I+++  +LK  F +     C R
Subjt:  LRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEIS-RLKKNFKEISFLYCPR

Query:  DQNVAADLLA
        + N  A  LA
Subjt:  DQNVAADLLA

KAF2317147.1 hypothetical protein GH714_012179 [Hevea brasiliensis]3.1e-4926.88Show/hide
Query:  WKRR-----ARHLNFNSSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAI
        WKRR     A  L+++SSDH PIL  L+V    P +   R  +FE  W   S  ++IVE  W +    S      KL  C   L +W  D +    +  I
Subjt:  WKRR-----ARHLNFNSSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAI

Query:  DRKRDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDL
        D  +  +  + G     +  S  +A+ +  +LL  +ESYW+ R++E WL+ GD+NTK+FH KA+  +++N I  L + +G W D    +  + + YFK++
Subjt:  DRKRDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDL

Query:  FSSS----------------------------------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKK
        F+S+                                                          N  +V   F+  D ++IL+ P+    + D   W  +K+
Subjt:  FSSS----------------------------------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKK

Query:  GLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKI
        G ++V SAY L       + +   T   +  FW+K W I A PK +  +WR +  SLPTR  +  + +P   +CP C    ES KH+L +C  ++++W  
Subjt:  GLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKI

Query:  FIPLTNGLFALNRGSWIP--RDYWAWLMDNLA---EEELEIAITILWSIWEYRNKVTHTENKPIYQEI---SRIISSKIDFPKVVSRTYLPKSSEKNQPT
                 A + G + P    +  WLM  LA     +      + WSIWE RN V     +P           +  + D   V  R  +P +++   P 
Subjt:  FIPLTNGLFALNRGSWIP--RDYWAWLMDNLA---EEELEIAITILWSIWEYRNKVTHTENKPIYQEI---SRIISSKIDFPKVVSRTYLPKSSEKNQPT

Query:  VKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVV
                 WK P    +KLNVD + N      G+G ++R+  GS +     RM   +S    E   V E L  I          +  +++VESD   VV
Subjt:  VKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVV

Query:  NLLNEED-EDFTEISFLIQEISRLKKNFK-EISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSP
        N+LN  + E+F+ +  ++ +   L      EI F +  R  N  A  LA+   SFP+ +    S P
Subjt:  NLLNEED-EDFTEISFLIQEISRLKKNFK-EISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSP

TrEMBL top hitse value%identityAlignment
A0A2N9GD63 Reverse transcriptase domain-containing protein1.1e-5027.17Show/hide
Query:  EDLQGAIFCRILTPKLINPE-VFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHD
        ED Q  +  R +T + +N E V +TF P +W       ++ +G N  +  F +  + +R+++G PW+YD+ L+  +     + ++     YV+FWV I++
Subjt:  EDLQGAIFCRILTPKLINPE-VFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHD

Query:  LPMVCMRRKWAEKLGNSLGEFVE-ADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQECEK-ISNEG--
        LP+  M+R++A  LG+++GE  + A+ ++    E  +RI+VK+D+S+PL RG   R+ S   ETW+   YERLP FCY CG + H +++CE  + ++G  
Subjt:  LPMVCMRRKWAEKLGNSLGEFVE-ADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQECEK-ISNEG--

Query:  --EENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTEGRSSPEKETVGKS
          EE  YG ++RA P+    P ++ +    GN              ++ R+ Q    + + +W    +   S +EN+G  +  N+ +  +  E +  G++
Subjt:  --EENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTEGRSSPEKETVGKS

Query:  QGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKEWA-SGTEKDKKGPRAKNSQSGLEK-----------
        +G      V       TG   +        IG   ++    V  KK  +G+   +     +   +W  +G        R   S + L+K           
Subjt:  QGQPTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKEWA-SGTEKDKKGPRAKNSQSGLEK-----------

Query:  --EINKALEREKHLEEKEISEMDTNQ----------DSLMFRG----------EGKNVRTWKRRA---------------RHLNFNSSDHRPILATLEVG
          + N+ L  ++ L E   S+ +  +            L +RG           G NV+    RA                H+  + SDH PIL   ++G
Subjt:  --EINKALEREKHLEEKEISEMDTNQ----------DSLMFRG----------EGKNVRTWKRRA---------------RHLNFNSSDHRPILATLEVG

Query:  RKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIP--GRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRMEGDDSTANLASIGLAEKE
             + KRR +KFEE W    + +KI+   W +    G        K+ +C  +L +W K           D+       + G+    N  +I   ++E
Subjt:  RKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIP--GRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRMEGDDSTANLASIGLAEKE

Query:  LENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEV
        +  LL  EE +W+ RSR  WL+ GDRNTK+FH  A++ KK N I+G  N + VW D E ++ +IA  YF+++F++S PT + E+ +A+D  V
Subjt:  LENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEV

A0A2U1KHJ0 CCHC-type domain-containing protein4.4e-4922.11Show/hide
Query:  YLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSE
        +L    +  +  R+   GPW+++R L+V++  +   + + T    V FWV + ++P+         ++   +G+ +E  +D+   T+ +  I+VK     
Subjt:  YLCNFNNSFEKDRIKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSE

Query:  PLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC---------------------------------------------------------
               VR         V + YERLP FCY CG++GH ++EC                                                         
Subjt:  PLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQEC---------------------------------------------------------

Query:  EKISNEGE--ENLYGDFMRATPIVGGTPK-----------QKTQENKRGNFWGRGRGRRGAYQFQNNRQNQK--YNDEKEETWRRRDQSGKSNEENVGVL
        + I NE +  E+   D      + GGT             QK  +N  G+      GR+     + + + Q+   N    + W+RR +   + +   G  
Subjt:  EKISNEGE--ENLYGDFMRATPIVGGTPK-----------QKTQENKRGNFWGRGRGRRGAYQFQNNRQNQK--YNDEKEETWRRRDQSGKSNEENVGVL

Query:  RPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKA-------VMETNCQAVEIGKIRDLGQKY----VPDKKATKGIVLKDQTS--EVIIRKEWA
           NS    + P   ++     Q   +  + Q      K        +MET     E+   R +  +Y    V   +     V+K+  +         W 
Subjt:  RPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKA-------VMETNCQAVEIGKIRDLGQKY----VPDKKATKGIVLKDQTS--EVIIRKEWA

Query:  SGTEKDKKGPRAKNSQSGLEK---------EINKALEREKHLEEKEISEMDTNQDSLMF----------------RGEGKNVRTWKRRARHLNFN-----
           EK +     ++ ++  E+         EI  A E+E        +EM   +++  F                 G   N    KR  R L  +     
Subjt:  SGTEKDKKGPRAKNSQSGLEK---------EINKALEREKHLEEKEISEMDTNQDSLMF----------------RGEGKNVRTWKRRARHLNFN-----

Query:  ------------SSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWK-EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRK
                    +SDH PI+  L       +K+K R  +FE  W+R      +V   W   +      D    ++ C + L  WNK R  G ++ +I  K
Subjt:  ------------SSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWK-EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRK

Query:  RDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSS
        +  +  ++     +  A      ++++ LL  EE  WK RSR +WL+ GD+NT++FH +AS  ++RN I  L    G W+++ +E+  + SSYF DLFSS
Subjt:  RDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSS

Query:  SNP----TIVRE---------------SFSAIDAEVILNTPVGG------------------------EGTRDEIIWNREKKGLFTVKSAYHLAVLLSNS
        S+P    ++VR+                 ++ +   +LNT   G                        +   D + W+    G F+ KSAY LA+     
Subjt:  SNP----TIVRE---------------SFSAIDAEVILNTPVGG------------------------EGTRDEIIWNREKKGLFTVKSAYHLAVLLSNS

Query:  KMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIP
         + + +   S   FWR  WK +   K K+ +WR  ++ +PT  N+  +G+     C  C  + E+  H+L+ C ++K++W        G F   +G+   
Subjt:  KMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIP

Query:  RDYWAWLMDNLAEEELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKI-DFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDAS
        +D+   +++     E E  + ILW +W  RN+  H +       +  I  S + D+ K   R  +  +S      V N+ +  +W  P    +K+N DA+
Subjt:  RDYWAWLMDNLAEEELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKI-DFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDAS

Query:  WNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKK
        W  +  K G+G++ R+  G  +  G +      S    EAKAV   +  ++          + SL +     + V LL        +I+ L  EI     
Subjt:  WNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKK

Query:  NFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCIKSLLNEVGVLD
         F   ++ +  R+ N  A  +A +A+S  S    LD S  V      W     + IKS     G+++
Subjt:  NFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCIKSLLNEVGVLD

A0A2Z6NZV1 Uncharacterized protein5.2e-5024.79Show/hide
Query:  ELEKLKIT-TAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL
        E EK  I+ T +    + +E E++ E  E  +  +  ++ T    N   FK  + + W  +  + ++ +  N +L  F    + D + R GPW++DR LL
Subjt:  ELEKLKIT-TAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALL

Query:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP
        ++    G  + S      VNFWV ++DLP        A+KLGN +G F E D  +   T   LR++  +D+ +PLKRG  ++   K    WV   YERLP
Subjt:  VIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLP

Query:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ
         FC+ CG IGH  +ECE + +  E N          YG ++RA+P+  +   P++                +    Q +  +  ++  D++      +  
Subjt:  EFCYGCGIIGHVQQECEKISNEGEEN---------LYGDFMRATPI--VGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEETWRRRDQ

Query:  SGKSNEENVGVLRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKE-
               N+ V     S    +      VGK+      S+  +  R K+ K V     +  A E+GK R+L    + + +A   ++  +    V + +  
Subjt:  SGKSNEENVGVLRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQ--AVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKE-

Query:  --------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ
                      ++SG   D KG   + +                                      +G+  E  K   +  +  LE    + +++  
Subjt:  --------------WASGTEKDKKGPRAKNS-------------------------------------QSGLEKEINKA--LEREKHLEEKEISEMDTNQ

Query:  DSLMFRG------EGKNVRTWKR-------------------RARHLNFNSSDHRPILATLEVGRKKPLKRKRRSK---KFEEAWIRVSDSKKIVESSWK
        + L F G       G+N    K+                      HL    SDH  ++ TLE    +  +R+RR K   +FEE+W   +  + ++++ W 
Subjt:  DSLMFRG------EGKNVRTWKR-------------------RARHLNFNSSDHRPILATLEVGRKKPLKRKRRSK---KFEEAWIRVSDSKKIVESSWK

Query:  EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHK
        + P  SF+D   +L R + N L    D   GSI   I R    I   +  D+S  ++      E  LE LL EEE+ W+ RSR  WL+ GD+NTK+FH K
Subjt:  EIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME-GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHK

Query:  ASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKM
        AS+ +K NEIK L +  G+W   E+++  +   YFK+LF+S++P+ V ++   +  ++ L          D +  + E  G   +++   +  L    K 
Subjt:  ASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKM

Query:  ESPSTVTSHDTFWRKFWKI
          P  + +   F++K+W I
Subjt:  ESPSTVTSHDTFWRKFWKI

A0A2Z6NZV1 Uncharacterized protein6.9e-1831.07Show/hide
Query:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV
        + S    D     N  I+ + F+  +A  I++TP+      D+IIW+ E+ G+++V+ AYHL     +  +  PS+ +  D+ W+K WK     K K  +
Subjt:  IASSYFKDLFSSSNPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICV

Query:  WRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI
         R+  + LPTR N+ KKGI ++  CP C +  E+A H+   C +          L   LFA   G   P   D   WL++ L   ++L  ++  TILW  
Subjt:  WRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPR--DYWAWLMDNL-AEEEL--EIAITILWSI

Query:  WEYRNK
        W  RN+
Subjt:  WEYRNK

A0A2Z6NZV1 Uncharacterized protein2.0e-4926.31Show/hide
Query:  SSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRMEGDDST
        SSDH P+L  +E  +K P     R  ++E  W    + K+++   W     ++  +    L +  +   K +K  IGG+ +  + R  D   + E     
Subjt:  SSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRMEGDDST

Query:  ANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEI---------GNIASSYFKDLFSSS---
             I  A++ +  L  +EE YW  R+R  WL+WGDRNT +FH    + + RN I  L ++ G W+  ++ I         G+I +   ++L +SS   
Subjt:  ANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEI---------GNIASSYFKDLFSSS---

Query:  NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMES--PSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTR
        +   ++ SF  + AE I+ TPV      D ++W   K G FT K+ Y++A    N +  +  PS+    +  W   W +KA  K K+ +W+  H+ +  R
Subjt:  NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMES--PSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTR

Query:  VNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLW----KIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEE---LEIAITILWSIWEYRNKVTH
         N+ KK +  N +CP C   EE+ +H L  C+ ++ +W        P +  + ++  G W+ R Y      +  E +     I ITI W+IW+ R     
Subjt:  VNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLW----KIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEE---LEIAITILWSIWEYRNKVTH

Query:  TENKPIYQEISRIISSKIDFPKVVSRTY---LPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRW
          N  +Y  I   + S I+  +++ + Y   + +++ K +   K      IW+PPP   LK+N DA ++   + G    +++D +   I  G   +    
Subjt:  TENKPIYQEISRIISSKIDFPKVVSRTY---LPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRW

Query:  SIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAI
        S  + EA+AV E +I       L  +L+  ++++E+D+  +V  L        +I  +IQ+I  ++K      F + PR+ N  A  +A + +
Subjt:  SIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAI

A0A803P5M6 Uncharacterized protein1.9e-6021.7Show/hide
Query:  MEEAMANELEKLKITTAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIK-AMGRNTYLCNFNNSFEKDRIKRGGPW
        M+  M     K+ +T  E + V    D +       +   ++ +ILT K +     +  M   W  +GR  +K +   + ++  F    +K R+    P+
Subjt:  MEEAMANELEKLKITTAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIK-AMGRNTYLCNFNNSFEKDRIKRGGPW

Query:  NYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEF---VEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEET
        ++    +V+   +     +     +  FWV I+ LP +   R  A  LGN +GE+    E  L+EG      LR++V +DVS+PLKRG M+ +    ++ 
Subjt:  NYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEF---VEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEET

Query:  WVKVTYERLPEFCYGCGIIGHVQQEC----EKISNEGEENL-YGDFMRATPIVGGTPKQKTQENKRGNFWG--------------RGRGRRG--------
        WV   YERLPE+C  CG+IGH   +C    EK+ N  E  L Y  F++ + +      +   +  +G+ W                   +RG        
Subjt:  WVKVTYERLPEFCYGCGIIGHVQQEC----EKISNEGEENL-YGDFMRATPIVGGTPKQKTQENKRGNFWG--------------RGRGRRG--------

Query:  AYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEI---GKIRDLGQ
        + +  N    +  N+  ++    R    K   +   V+   ++    +    + +     + + +  S+ S +     V  T            + D+  
Subjt:  AYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTEGRSSPEKETVGKSQGQPTASEVSEQSRMKTGKAVMETNCQAVEI---GKIRDLGQ

Query:  KYVPDKKATKGIVLKDQTSEVI-IRKEWASGTEKDKKGPRAKNSQSGLEKEINKALER------------------EKHLEEKEISEMDTNQDS------
        K      AT  +  K   + +      W  GTE      ++K    GL   + + L+R                  + HL   ++ +   ++D+      
Subjt:  KYVPDKKATKGIVLKDQTSEVI-IRKEWASGTEKDKKGPRAKNSQSGLEKEINKALER------------------EKHLEEKEISEMDTNQDS------

Query:  --------LMFRGE-------GKNVRTWKRR------------------ARHLNFNSSDHRPILATLEVGRKKPLKRKRRSK-KFEEAWIRVSDSKKIVE
                + F G+         N    K R                    HL++ SSDHR I  T+E       +  R+++ +FE+ W++  D+  I+ 
Subjt:  --------LMFRGE-------GKNVRTWKRR------------------ARHLNFNSSDHRPILATLEVGRKKPLKRKRRSK-KFEEAWIRVSDSKKIVE

Query:  SSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME--GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTK
        + W +        + + L  C   L +W+  +  G+++  I   + ++ R+    D S + +  +  +E  L+ LL +EE+YW  RSR DWLQ GD+NT 
Subjt:  SSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDRKRDEILRME--GDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTK

Query:  WFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSS-----------------------------------------------------
        +FH  A+  K +N IK L+N+ G+ +  + E+ N+   Y+  LF+S                                                      
Subjt:  WFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSS-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDT
                                       N  ++   F  ID + IL+ P+      D ++W+    G+++VK+ +HLA  L +    S ST      
Subjt:  -------------------------------NPTIVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDT

Query:  FWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAE
        +W+ FW +K  PK +I  W++  + LPT V + K+ +  +  C  C S+ ES  H L+ CK +K++WK+     +   A N       DY   L     +
Subjt:  FWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAE

Query:  EELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKI--DF--PKVVSRTYLPKSSEKNQPTVKNLASHA-IWKPPPDLSLKLNVDASWNDQFRKGG
         + E  + +LW IW  RNKV H   +P +       +SK   DF   K+ +R     SS  + P+      H   W+PP     KLNVDA+ N + +K G
Subjt:  EELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKI--DF--PKVVSRTYLPKSSEKNQPTVKNLASHA-IWKPPPDLSLKLNVDASWNDQFRKGG

Query:  VGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLL-VESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFL
        +G ILRD  G+ +    K +   +    +EAKA+   +  +        S S   L  +E+DAS V N LN  + D +  S LI +I  L  +F ++   
Subjt:  VGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLL-VESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFL

Query:  YCPRDQNVAADLLARVAI
        +  R  N AA  LA+ A+
Subjt:  YCPRDQNVAADLLARVAI

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.2e-1623.14Show/hide
Query:  GTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHI
        G RD + W   + G F+V+SAY    +L+  ++  P+      +F+   WK++   + K  +W + + ++ T     ++ +  + +C  C+   ES  H+
Subjt:  GTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHI

Query:  LWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEE-------LEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKIDFPKVVSRT
        L DC     +W   +P         +  +  +  + WL DNL +           I   I+W  W++R      EN      +  +    ++  +  S  
Subjt:  LWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEE-------LEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKIDFPKVVSRT

Query:  YLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSI
         L   +   QP V+ +     W  P    +K+N D +          G +LRD +G + C GF     R S    E   V  GL         R  L   
Subjt:  YLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSI

Query:  SLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFP------SLVPVLDSSPMVEEGLG
            E D+  +V  L     D   +SFL++      +    +  ++  R+ N  AD LA  A S         LVP   SS + E+ LG
Subjt:  SLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFP------SLVPVLDSSPMVEEGLG

Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.8e-2128.87Show/hide
Query:  CPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDY--WAWLMDNLAEEEL-------EIAITILWSIWEYRNKVTHTENKPIYQEIS
        C  C  S E+  H+L+ C  ++ +W I     + + A   G W    Y    W+++   E E+        +   +LW +W+ RN++     +    E+ 
Subjt:  CPFCRSSEESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDY--WAWLMDNLAEEEL-------EIAITILWSIWEYRNKVTHTENKPIYQEIS

Query:  RIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGL
        R   +  DF +  +R  L   +   Q   +NL+    WK PP   +K N DA+W  +  + G+GWILR+ SG  + +G + +           K VLE  
Subjt:  RIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGL

Query:  ISILSVADLRPSLSSIS-LLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS
        +  L  A L  S  +   ++ ESDA ++VNLLN  D+ +  +   +++I +L  +F+E+ F + PR  N  AD +AR +ISF +  P L S
Subjt:  ISILSVADLRPSLSSIS-LLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS

AT3G25270.1 Ribonuclease H-like superfamily protein1.6e-2225.95Show/hide
Query:  KFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIF-IP----LTNGLFALNRGSWIPRDYWAWLMDNL
        K WK+K  PK K  +W+++  +L T  N+ ++ I  +  C  C   +E+++H+ +DC  ++ +W+   IP     T G+    +   +     A    N 
Subjt:  KFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKIF-IP----LTNGLFALNRGSWIPRDYWAWLMDNL

Query:  AEEELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKID-----FPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRK
          +   +AI ILW +W+ RN++   +    +Q   +   + +         V S      SS   QPT+    +   W+ PP   +K N D ++N Q R 
Subjt:  AEEELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKID-----FPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRK

Query:  GGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISF
           GW++RD +G  + +G  +     +  +LE++   + LI  +  A    S     ++ E D+  V  L+N E  +F   ++ I+E    +K F+E  F
Subjt:  GGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISF

Query:  LYCPRDQNVAADLLAR
         + PR  N  AD+LA+
Subjt:  LYCPRDQNVAADLLAR

AT3G42140.1 zinc ion binding;nucleic acid binding2.4e-0724.82Show/hide
Query:  IKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSK
        I R GPW+++  + VI+  +     S   F+ + FW+ I  +P+  +  +    +G  +G F+E +L                DVS              
Subjt:  IKRGGPWNYDRALLVIEDTQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSK

Query:  AEETWVKVTYERLPEFCYGCGIIGHVQQECEKISNEG
             +K  YE+L  FC  CG++ H   EC    N+G
Subjt:  AEETWVKVTYERLPEFCYGCGIIGHVQQECEKISNEG

AT4G29090.1 Ribonuclease H-like superfamily protein4.2e-3627Show/hide
Query:  IVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVT--SHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNI
        ++   F  ++ ++I     GG    D   W+    G +TVKS Y +   + N K  SP  V+  S +  ++K WK +  PK +  +W+ + +SLP    +
Subjt:  IVRESFSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVT--SHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNI

Query:  LKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKI-FIPLTNGLFALNRGSWIP----RDYWAWLMDN---LAEEELEIAITILWSIWEYRNKVTHTE
          + +     C  C S +E+  H+L+ C  ++  W I  IP+  G      G W        YW + + N     E+  ++   +LW +W+ RN++    
Subjt:  LKKGIPINYLCPFCRSSEESAKHILWDCKLSKNLWKI-FIPLTNGLFALNRGSWIP----RDYWAWLMDN---LAEEELEIAITILWSIWEYRNKVTHTE

Query:  NKPIYQEISRIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKAL
         +   QE+ R     ++  ++  RT       K Q    N +S   W+PPP   +K N DA+WN    + G+GW+LR+  G    +G + +         
Subjt:  NKPIYQEISRIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKAL

Query:  EAKAVLEGLISILSVADLRPSLSSISLLV-ESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS
        + K+VLE  +  +  A L  S    + ++ ESD+  ++ +LN  DE +  +   IQ++ RL   F E+ F++ PR+ N  A+ +AR ++SF +  P L S
Subjt:  EAKAVLEGLISILSVADLRPSLSSISLLV-ESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.8e-0922.06Show/hide
Query:  ILWSIWEYRNKVTHTENKPIYQEISRIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSI
        ++W IW+  N +     +  +Q  + +  +  D  + +  T   +    N+    + + +  W PP    LK N DAS +++    G+GWILR+S G+ I
Subjt:  ILWSIWEYRNKVTHTENKPIYQEISRIISSKIDFPKVVSRTYLPKSSEKNQPTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSI

Query:  CVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLA
          G  +   R + +  E   ++  + +       +       ++ E D  ++  ++N +  +   +   +  I     +F+ I F +  R+QN  AD LA
Subjt:  CVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDEDFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLA

Query:  RVAI
        + AI
Subjt:  RVAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGCTATGGCAAACGAACTAGAAAAACTAAAAATTACCACAGCAGAAAGAGCAAAAGTGGTGGCGATTGAAGATGAAGACTTGGAAGAAGCTACAGAGGACCT
CCAGGGAGCGATCTTTTGCAGAATCCTCACTCCAAAACTCATTAATCCCGAAGTGTTCAAAACCTTCATGCCAAGGATTTGGAACAAAGAGGGGAGAGTTAGGATAAAAG
CGATGGGAAGAAACACCTACCTTTGCAACTTCAACAACAGTTTTGAAAAGGACAGAATAAAGAGGGGTGGTCCCTGGAACTATGACAGAGCGCTATTGGTCATTGAAGAC
ACTCAAGGAGCTAGCAGAATATCTCACACCAGCTTCAGGTATGTAAACTTTTGGGTCCATATCCATGACTTACCAATGGTTTGTATGCGCAGGAAATGGGCCGAGAAGCT
TGGAAATTCCTTGGGAGAATTCGTAGAAGCTGACCTTGACGAAGGAGGAAGCACTGAGAATACGTTACGCATCCAAGTAAAGATAGATGTCTCAGAACCTCTCAAGAGAG
GTCTTATGGTGAGAATTGGGTCGAAAGCCGAAGAAACATGGGTGAAGGTGACTTATGAACGCCTCCCGGAGTTCTGTTATGGATGCGGTATTATCGGCCATGTTCAACAA
GAATGCGAAAAAATCAGCAACGAGGGTGAAGAGAATCTGTATGGAGATTTCATGAGAGCGACTCCAATCGTTGGAGGAACACCAAAACAGAAAACTCAAGAAAATAAAAG
AGGAAATTTCTGGGGTAGAGGACGAGGTAGAAGAGGTGCATACCAATTCCAAAACAACAGACAAAACCAGAAATACAATGATGAAAAAGAGGAAACATGGAGACGAAGAG
ATCAAAGTGGAAAAAGCAACGAAGAAAACGTCGGCGTCCTGAGACCGGAAAATTCCACAGAAGGCAGGAGCTCGCCGGAGAAGGAGACCGTTGGAAAATCCCAAGGCCAG
CCAACGGCTAGTGAAGTGTCAGAGCAAAGCAGAATGAAAACAGGAAAAGCGGTGATGGAGACAAATTGTCAGGCGGTGGAAATAGGAAAAATCAGGGATCTAGGACAAAA
GTACGTGCCAGACAAAAAAGCTACAAAAGGAATAGTATTAAAGGACCAGACCAGTGAAGTGATTATTAGAAAAGAATGGGCAAGTGGAACAGAAAAGGACAAAAAAGGCC
CAAGGGCCAAGAATAGCCAATCTGGGCTGGAGAAGGAAATCAATAAAGCCCTGGAAAGAGAAAAACACCTAGAGGAAAAAGAAATATCAGAGATGGACACAAACCAAGAC
AGCCTTATGTTCAGAGGAGAAGGTAAAAATGTTCGAACATGGAAAAGGCGTGCTAGACACCTCAATTTCAACTCTTCGGATCACAGACCAATCTTGGCAACTCTGGAAGT
AGGAAGGAAAAAACCTCTGAAAAGAAAAAGAAGAAGCAAAAAATTTGAAGAGGCTTGGATAAGAGTCTCGGACAGCAAAAAGATTGTGGAGAGCTCTTGGAAAGAAATTC
CGGGAAGAAGCTTTACTGATTATTCCAATAAGCTGAATCGATGCCTTTCCAATCTTCTGAAATGGAACAAAGACAGAATTGGGGGATCCATTAGAGCTGCTATAGACAGA
AAGAGAGATGAGATTTTGAGGATGGAAGGAGATGATTCCACGGCAAACCTTGCTTCCATTGGGTTGGCAGAGAAGGAATTGGAAAATCTCCTCAACGAAGAAGAGAGCTA
TTGGAAACTCCGGTCGAGAGAGGATTGGCTTCAATGGGGTGACAGGAACACGAAGTGGTTCCACCATAAAGCTTCAGAGGGGAAAAAAAGAAACGAGATAAAAGGGCTGT
TGAACAGCTCTGGTGTTTGGATTGATGATGAAGATGAAATTGGGAATATAGCCTCTTCATATTTCAAAGATCTCTTCTCCTCTTCGAACCCAACCATAGTGAGAGAATCC
TTCTCAGCCATTGACGCTGAAGTTATTCTCAATACCCCTGTCGGCGGGGAAGGCACGAGGGATGAGATAATCTGGAATAGAGAAAAGAAAGGCTTATTCACGGTCAAAAG
TGCATACCACCTTGCTGTTCTTCTTTCAAATAGCAAGATGGAATCGCCCTCAACCGTAACTAGCCATGACACCTTCTGGAGGAAATTTTGGAAAATTAAAGCTCTCCCTA
AGGCTAAAATCTGTGTTTGGAGGATCATTCATGACTCTCTTCCAACAAGAGTTAATATTCTCAAAAAAGGAATCCCTATTAACTACCTCTGCCCTTTTTGCAGGTCCTCC
GAGGAGTCAGCAAAGCACATTTTATGGGACTGTAAGTTGTCTAAGAATTTATGGAAAATTTTTATCCCCCTAACCAACGGTCTGTTTGCGTTAAACAGGGGATCATGGAT
TCCTAGAGACTACTGGGCTTGGTTGATGGACAATCTAGCGGAAGAAGAATTGGAGATAGCAATCACCATCCTTTGGAGTATTTGGGAGTACAGAAACAAAGTTACCCACA
CAGAGAACAAACCAATCTATCAAGAAATCTCCAGAATTATCAGCAGCAAAATAGATTTTCCAAAAGTTGTTTCGAGAACCTACCTGCCGAAGTCGAGTGAAAAAAATCAG
CCCACAGTGAAGAACCTTGCGAGTCACGCGATTTGGAAGCCTCCCCCGGATTTATCGCTAAAGCTGAATGTAGATGCCTCCTGGAACGACCAATTCAGAAAAGGTGGCGT
CGGTTGGATCCTCCGTGACTCCTCAGGGTCTTCGATCTGTGTGGGCTTCAAGCGCATGCATAATAGGTGGTCAATCAAAGCGTTGGAGGCGAAGGCAGTCTTGGAAGGCC
TGATTAGCATCCTTTCAGTTGCGGATCTTCGTCCCTCTCTAAGCTCCATATCTCTGTTGGTGGAGTCGGATGCTTCCTCTGTGGTGAATCTCCTCAACGAAGAAGATGAA
GATTTTACAGAAATCTCCTTCCTCATTCAGGAGATCTCAAGATTGAAGAAGAACTTTAAAGAAATTTCTTTCTTGTATTGCCCCAGAGATCAAAATGTTGCCGCGGATTT
ATTGGCGCGCGTGGCGATCTCGTTCCCTTCCCTTGTTCCTGTTTTGGACTCCTCTCCCATGGTGGAAGAGGGTTTGGGTTTTTGGTATGGGCCTCCCCCATCTTGTATTA
AAAGCCTCTTAAATGAGGTTGGTGTTCTTGATTTTTCTTCTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGCTATGGCAAACGAACTAGAAAAACTAAAAATTACCACAGCAGAAAGAGCAAAAGTGGTGGCGATTGAAGATGAAGACTTGGAAGAAGCTACAGAGGACCT
CCAGGGAGCGATCTTTTGCAGAATCCTCACTCCAAAACTCATTAATCCCGAAGTGTTCAAAACCTTCATGCCAAGGATTTGGAACAAAGAGGGGAGAGTTAGGATAAAAG
CGATGGGAAGAAACACCTACCTTTGCAACTTCAACAACAGTTTTGAAAAGGACAGAATAAAGAGGGGTGGTCCCTGGAACTATGACAGAGCGCTATTGGTCATTGAAGAC
ACTCAAGGAGCTAGCAGAATATCTCACACCAGCTTCAGGTATGTAAACTTTTGGGTCCATATCCATGACTTACCAATGGTTTGTATGCGCAGGAAATGGGCCGAGAAGCT
TGGAAATTCCTTGGGAGAATTCGTAGAAGCTGACCTTGACGAAGGAGGAAGCACTGAGAATACGTTACGCATCCAAGTAAAGATAGATGTCTCAGAACCTCTCAAGAGAG
GTCTTATGGTGAGAATTGGGTCGAAAGCCGAAGAAACATGGGTGAAGGTGACTTATGAACGCCTCCCGGAGTTCTGTTATGGATGCGGTATTATCGGCCATGTTCAACAA
GAATGCGAAAAAATCAGCAACGAGGGTGAAGAGAATCTGTATGGAGATTTCATGAGAGCGACTCCAATCGTTGGAGGAACACCAAAACAGAAAACTCAAGAAAATAAAAG
AGGAAATTTCTGGGGTAGAGGACGAGGTAGAAGAGGTGCATACCAATTCCAAAACAACAGACAAAACCAGAAATACAATGATGAAAAAGAGGAAACATGGAGACGAAGAG
ATCAAAGTGGAAAAAGCAACGAAGAAAACGTCGGCGTCCTGAGACCGGAAAATTCCACAGAAGGCAGGAGCTCGCCGGAGAAGGAGACCGTTGGAAAATCCCAAGGCCAG
CCAACGGCTAGTGAAGTGTCAGAGCAAAGCAGAATGAAAACAGGAAAAGCGGTGATGGAGACAAATTGTCAGGCGGTGGAAATAGGAAAAATCAGGGATCTAGGACAAAA
GTACGTGCCAGACAAAAAAGCTACAAAAGGAATAGTATTAAAGGACCAGACCAGTGAAGTGATTATTAGAAAAGAATGGGCAAGTGGAACAGAAAAGGACAAAAAAGGCC
CAAGGGCCAAGAATAGCCAATCTGGGCTGGAGAAGGAAATCAATAAAGCCCTGGAAAGAGAAAAACACCTAGAGGAAAAAGAAATATCAGAGATGGACACAAACCAAGAC
AGCCTTATGTTCAGAGGAGAAGGTAAAAATGTTCGAACATGGAAAAGGCGTGCTAGACACCTCAATTTCAACTCTTCGGATCACAGACCAATCTTGGCAACTCTGGAAGT
AGGAAGGAAAAAACCTCTGAAAAGAAAAAGAAGAAGCAAAAAATTTGAAGAGGCTTGGATAAGAGTCTCGGACAGCAAAAAGATTGTGGAGAGCTCTTGGAAAGAAATTC
CGGGAAGAAGCTTTACTGATTATTCCAATAAGCTGAATCGATGCCTTTCCAATCTTCTGAAATGGAACAAAGACAGAATTGGGGGATCCATTAGAGCTGCTATAGACAGA
AAGAGAGATGAGATTTTGAGGATGGAAGGAGATGATTCCACGGCAAACCTTGCTTCCATTGGGTTGGCAGAGAAGGAATTGGAAAATCTCCTCAACGAAGAAGAGAGCTA
TTGGAAACTCCGGTCGAGAGAGGATTGGCTTCAATGGGGTGACAGGAACACGAAGTGGTTCCACCATAAAGCTTCAGAGGGGAAAAAAAGAAACGAGATAAAAGGGCTGT
TGAACAGCTCTGGTGTTTGGATTGATGATGAAGATGAAATTGGGAATATAGCCTCTTCATATTTCAAAGATCTCTTCTCCTCTTCGAACCCAACCATAGTGAGAGAATCC
TTCTCAGCCATTGACGCTGAAGTTATTCTCAATACCCCTGTCGGCGGGGAAGGCACGAGGGATGAGATAATCTGGAATAGAGAAAAGAAAGGCTTATTCACGGTCAAAAG
TGCATACCACCTTGCTGTTCTTCTTTCAAATAGCAAGATGGAATCGCCCTCAACCGTAACTAGCCATGACACCTTCTGGAGGAAATTTTGGAAAATTAAAGCTCTCCCTA
AGGCTAAAATCTGTGTTTGGAGGATCATTCATGACTCTCTTCCAACAAGAGTTAATATTCTCAAAAAAGGAATCCCTATTAACTACCTCTGCCCTTTTTGCAGGTCCTCC
GAGGAGTCAGCAAAGCACATTTTATGGGACTGTAAGTTGTCTAAGAATTTATGGAAAATTTTTATCCCCCTAACCAACGGTCTGTTTGCGTTAAACAGGGGATCATGGAT
TCCTAGAGACTACTGGGCTTGGTTGATGGACAATCTAGCGGAAGAAGAATTGGAGATAGCAATCACCATCCTTTGGAGTATTTGGGAGTACAGAAACAAAGTTACCCACA
CAGAGAACAAACCAATCTATCAAGAAATCTCCAGAATTATCAGCAGCAAAATAGATTTTCCAAAAGTTGTTTCGAGAACCTACCTGCCGAAGTCGAGTGAAAAAAATCAG
CCCACAGTGAAGAACCTTGCGAGTCACGCGATTTGGAAGCCTCCCCCGGATTTATCGCTAAAGCTGAATGTAGATGCCTCCTGGAACGACCAATTCAGAAAAGGTGGCGT
CGGTTGGATCCTCCGTGACTCCTCAGGGTCTTCGATCTGTGTGGGCTTCAAGCGCATGCATAATAGGTGGTCAATCAAAGCGTTGGAGGCGAAGGCAGTCTTGGAAGGCC
TGATTAGCATCCTTTCAGTTGCGGATCTTCGTCCCTCTCTAAGCTCCATATCTCTGTTGGTGGAGTCGGATGCTTCCTCTGTGGTGAATCTCCTCAACGAAGAAGATGAA
GATTTTACAGAAATCTCCTTCCTCATTCAGGAGATCTCAAGATTGAAGAAGAACTTTAAAGAAATTTCTTTCTTGTATTGCCCCAGAGATCAAAATGTTGCCGCGGATTT
ATTGGCGCGCGTGGCGATCTCGTTCCCTTCCCTTGTTCCTGTTTTGGACTCCTCTCCCATGGTGGAAGAGGGTTTGGGTTTTTGGTATGGGCCTCCCCCATCTTGTATTA
AAAGCCTCTTAAATGAGGTTGGTGTTCTTGATTTTTCTTCTTTTTAA
Protein sequenceShow/hide protein sequence
MEEAMANELEKLKITTAERAKVVAIEDEDLEEATEDLQGAIFCRILTPKLINPEVFKTFMPRIWNKEGRVRIKAMGRNTYLCNFNNSFEKDRIKRGGPWNYDRALLVIED
TQGASRISHTSFRYVNFWVHIHDLPMVCMRRKWAEKLGNSLGEFVEADLDEGGSTENTLRIQVKIDVSEPLKRGLMVRIGSKAEETWVKVTYERLPEFCYGCGIIGHVQQ
ECEKISNEGEENLYGDFMRATPIVGGTPKQKTQENKRGNFWGRGRGRRGAYQFQNNRQNQKYNDEKEETWRRRDQSGKSNEENVGVLRPENSTEGRSSPEKETVGKSQGQ
PTASEVSEQSRMKTGKAVMETNCQAVEIGKIRDLGQKYVPDKKATKGIVLKDQTSEVIIRKEWASGTEKDKKGPRAKNSQSGLEKEINKALEREKHLEEKEISEMDTNQD
SLMFRGEGKNVRTWKRRARHLNFNSSDHRPILATLEVGRKKPLKRKRRSKKFEEAWIRVSDSKKIVESSWKEIPGRSFTDYSNKLNRCLSNLLKWNKDRIGGSIRAAIDR
KRDEILRMEGDDSTANLASIGLAEKELENLLNEEESYWKLRSREDWLQWGDRNTKWFHHKASEGKKRNEIKGLLNSSGVWIDDEDEIGNIASSYFKDLFSSSNPTIVRES
FSAIDAEVILNTPVGGEGTRDEIIWNREKKGLFTVKSAYHLAVLLSNSKMESPSTVTSHDTFWRKFWKIKALPKAKICVWRIIHDSLPTRVNILKKGIPINYLCPFCRSS
EESAKHILWDCKLSKNLWKIFIPLTNGLFALNRGSWIPRDYWAWLMDNLAEEELEIAITILWSIWEYRNKVTHTENKPIYQEISRIISSKIDFPKVVSRTYLPKSSEKNQ
PTVKNLASHAIWKPPPDLSLKLNVDASWNDQFRKGGVGWILRDSSGSSICVGFKRMHNRWSIKALEAKAVLEGLISILSVADLRPSLSSISLLVESDASSVVNLLNEEDE
DFTEISFLIQEISRLKKNFKEISFLYCPRDQNVAADLLARVAISFPSLVPVLDSSPMVEEGLGFWYGPPPSCIKSLLNEVGVLDFSSF