; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014408 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014408
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold3:47283759..47289529
RNA-Seq ExpressionSpg014408
SyntenySpg014408
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]5.7e-4525.96Show/hide
Query:  TEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGG
        TE+E+ C+  ++ EEI + E+  +  L+ K+ T+   N   FK  + + W  + +I +  +  NL+L +F   +    +   GPW FD+ L++L    G 
Subjt:  TEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGG

Query:  NYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGC
            D+D   V+FW+  + LPF   S   A ++G ++G  E++D ++      G  LR+K  +D+ KPLKRG  +  +D  +N  +   YE+LP+FC+ C
Subjt:  NYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGC

Query:  GYLGHTIKECESIDQAGS------SEEELEYGAWLREP----IFLKMRE---AESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERG
        G +GH +KECE ++          +E+   YG WLR      IF + R+   + S  +++   + + +G   EG +      + ++   G++  +   + 
Subjt:  GYLGHTIKECESIDQAGS------SEEELEYGAWLREP----IFLKMRE---AESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERG

Query:  DDRSD---------------SVLAGGKLKTPANSP-------VSPCPETAATATKPEKENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIH
         +  D               S + G   K+   S         S  P  A TA K  KE       G  + ++   SE   L+  I  EN       +  
Subjt:  DDRSD---------------SVLAGGKLKTPANSP-------VSPCPETAATATKPEKENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIH

Query:  I-------------YDSNMEVDQDNDGTNRRGQLA-------------------QGESVNV--IEDIDLQPAMDTIQKAKL---EGKLKVWKRIPRSQQQ
        +             + S + +D    G  R G LA                    G+ V+V   E  DL       +K  L   +G+L++ ++     + 
Subjt:  I-------------YDSNMEVDQDNDGTNRRGQLA-------------------QGESVNV--IEDIDLQPAMDTIQKAKL---EGKLKVWKRIPRSQQQ

Query:  DDSKIIMETTSKTISGAKHSIE----------EIDVTIRSFSKGHIDALVK-ENDFLWRPIIAEWSTDPLIRSFAIANRPRRFEEAWSKYEDCREIVKRV
        +D  +  E    T S  ++S +            D  I  FS  H+  L +  +D +   I  E  T    R      R  RFEE+W+    C  +++  
Subjt:  DDSKIIMETTSKTISGAKHSIE----------EIDVTIRSFSKGHIDALVK-ENDFLWRPIIAEWSTDPLIRSFAIANRPRRFEEAWSKYEDCREIVKRV

Query:  WESQNSQDFNAFMVKAEACLGQLAQWSRIQYGGSLRGAIARLEREIQSFSRSDDQQGGI-AMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFH
        W SQ    F+  + +      +L   S     GS+   I R+E+ IQ+    D+ +  I   +  E  LE LL+++E  W+QR+R  WL  GD+NTK+FH
Subjt:  WESQNSQDFNAFMVKAEACLGQLAQWSRIQYGGSLRGAIARLEREIQSFSRSDDQQGGI-AMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFH

Query:  MKANNRRKRNRIRGLIDDMGIWTEEDNGMEVIATQYFCKE--KSESTTHLFWDCKITKE
         KA+ RRK N I+ L D+ G+W   +  +E +   YF KE   S S +++   C + +E
Subjt:  MKANNRRKRNRIRGLIDDMGIWTEEDNGMEVIATQYFCKE--KSESTTHLFWDCKITKE

KAF4368982.1 hypothetical protein G4B88_011810 [Cannabis sativa]2.3e-3826.66Show/hide
Query:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-----QEQTIISQVGFNLYL-CKFKNGRIKSLIKETGPW
        LV +L+E+ V +E    V  L    I + +KK++  L+ K++  +  N E  +  M  +W      Q + + S+  F  +  C+    R+       GPW
Subjt:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-----QEQTIISQVGFNLYL-CKFKNGRIKSLIKETGPW

Query:  FFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRR
          DK LI   +P G      M+F + SFWI  + +P AC +   A E G  +GK+E I +         G+++V+++I++++PLKRG+ +  +D      
Subjt:  FFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRR

Query:  IPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSE--EELEYGAWLREPI--------FLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEE
        +   YE LPDFC+ CG +GH   +C   D  G +   +   YG+W+  P         + K RE     R I+   G     +    R   R S    +E
Subjt:  IPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSE--EELEYGAWLREPI--------FLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEE

Query:  DGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPETAATATKPEKENSES-----------------KGNGNVSAINENFSEKEELNSNITDENSGA
              ++    D+   S     K  T A +PV     +  T T+  + N +                   G G V A  +N S  E+L    + +  G 
Subjt:  DGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPETAATATKPEKENSES-----------------KGNGNVSAINENFSEKEELNSNITDENSGA

Query:  NGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSIEEID--VTI
        +   K+   D   EV     G  +R              +++    ++++K +LE            +Q D S     T      G  H  E +D     
Subjt:  NGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSIEEID--VTI

Query:  RSFSKGHIDALVKENDFL---WRPIIAEWSTDPLIRSFAIANRPR--RFEEAWSKYEDCREIVKRVW----ESQNSQDFNAFMVKAEACLGQLAQWSRIQ
        + +   +    V   D L    R ++A  S + +IR     +R R  RFE  W K +DC +IV+R W     S    + +  +     C  QL  W++ +
Subjt:  RSFSKGHIDALVKENDFL---WRPIIAEWSTDPLIRSFAIANRPR--RFEEAWSKYEDCREIVKRVW----ESQNSQDFNAFMVKAEACLGQLAQWSRIQ

Query:  YGGSLRGAIARLEREIQSFSRSDDQQGGIA-------MREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMG--IW
        +     G+I RL RE Q   + DD     A       ++  E +L  LL  +E YWK R+R DWL  GDRNTK+FH KA  R+K+N I  ++ + G  + 
Subjt:  YGGSLRGAIARLEREIQSFSRSDDQQGGIA-------MREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMG--IW

Query:  TEEDNGMEV
        TEED   E+
Subjt:  TEEDNGMEV

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]8.5e-4128.13Show/hide
Query:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN
        +++   + ++KG    + E+ L  +LI K +T K IN E FKS +  IW  +  +  + +G N++  +F+N   +  I E GPW FDK L++L+E  G  
Subjt:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN

Query:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGCG
           D+ FRYV FWI  H LP AC +R     +GGL+G+V++ID  E   +  G  +R+++ IDV  PLKRG+ +   D  +   + I YE+LP+FCY CG
Subjt:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGCG

Query:  YLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRG-RGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGK-
         +GH +++C    +  +S    ++G W+R         A S  RS     G G +    EG R G  S   E     + G+ +   G D S  +  G + 
Subjt:  YLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRG-RGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGK-

Query:  -LKTPANSPVSPCPETAATATKPEKEN-----SESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDG--TNRR--GQLAQG
         L T   S  +   +TA + T    +      + S     ++  +   S++ E    +T+   G N  ++     S     +DN G  TN++   +LA+ 
Subjt:  -LKTPANSPVSPCPETAATATKPEKEN-----SESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDG--TNRR--GQLAQG

Query:  ESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSI---EEIDVTIRSFSKGHIDALVKENDFL-WRPIIAEWSTD
        +  +V E              K +G + +       +  D  KI + T  +   G   S+    +I+V+IRSF+KGHIDA++K++D L WR     +  +
Subjt:  ESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSI---EEIDVTIRSFSKGHIDALVKENDFL-WRPIIAEWSTD

Query:  PL----IRSFAIANRPRRFEE-AWSKYEDCREIV----KRVWESQNSQDFNAFMVKAEAC-------LGQLAQWSRIQYGGSL
        P+    + S+++  R  R     W    D  EI+    K+    +++   ++F    + C       +G    WS  Q+ G L
Subjt:  PL----IRSFAIANRPRRFEE-AWSKYEDCREIV----KRVWESQNSQDFNAFMVKAEAC-------LGQLAQWSRIQYGGSL

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]7.9e-3936.05Show/hide
Query:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK
        K T +E   V   +G+ I  ++  ++  ++ K+ T K+I+ E  +S M  +W     T    +G N+Y+  FK+   KS +  +GPW F+K+L++L  P 
Subjt:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK

Query:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFC
          N   DM+F + +FWI  H +PF C S   A  +G  LG VE+I  E +    W G  +RV+++IDVSKPL+RGI L   D  ++   P+ YEKLPDFC
Subjt:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFC

Query:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER
        Y CG +GH+ +ECE   +  ++    +YG WLR  +   ++++ S P   +  + GR GRG    GGRGG   WR   ++E    +DG     R
Subjt:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER

XP_042941294.1 uncharacterized protein LOC122275977 [Carya illinoinensis]2.1e-3925.27Show/hide
Query:  ELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKE
        +L++TEEE A V  L    + + + K + +L+ K+++ +K+N E+ +S M  IW    +     +  NL++  F N + K  + +  PW FD  L+LLK 
Subjt:  ELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKE

Query:  PKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDF
          G    + M+F Y +FW+H + LP AC +    T+IG  +G V+++D+ E+    WG  LRVKI++D+ K + RG   T         +P++YEKLP  
Subjt:  PKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDF

Query:  CYGCGYLGHTIKECESIDQAGSSEEELEYGAWLR----------EPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKER
        C+ CG L H  + C  +D  G      +YG WLR          E      RE  +      N+   G+G + + G     + + E  ED +   ++K +
Subjt:  CYGCGYLGHTIKECESIDQAGSSEEELEYGAWLR----------EPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKER

Query:  GD-----DRSDSVLAGGKLKTPANSPVSPCPETA---ATATKPEKEN---SESKGNGNVSAINEN--FSEKEELNSNITDENSGANGQSKIHIYD----S
        G      + +D V      KT  N  V          A      KE    +ES+ N  +   NE      KE L        +   G+S +         
Subjt:  GD-----DRSDSVLAGGKLKTPANSPVSPCPETA---ATATKPEKEN---SESKGNGNVSAINEN--FSEKEELNSNITDENSGANGQSKIHIYD----S

Query:  NMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKL---------KVWKRI----------------PRSQ-QQDDSKIIMETTSKTIS
        ++  D  +D      + ++ ++    E + L  A     K K   KL         + W  I                PRS+ Q ++ + ++E       
Subjt:  NMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKL---------KVWKRI----------------PRSQ-QQDDSKIIMETTSKTIS

Query:  GAKHSIEEIDVTI--RSFSKGHIDALVKEN----DFLWRPI----IAEWSTDPLI---RSFAIANRPR----RFEEAWSKYEDCREIVKRVWESQNSQD-
        G K S+          +F+K  +D  V  +     F +R +    +A+     L+   R   I  + R    RFE  W + E+   +V   W+   + D 
Subjt:  GAKHSIEEIDVTI--RSFSKGHIDALVKEN----DFLWRPI----IAEWSTDPLI---RSFAIANRPR----RFEEAWSKYEDCREIVKRVWESQNSQD-

Query:  -FNAFMVKAEACLGQLAQWS---RIQYGGSLRGAIARLEREIQSFSRSDDQQGGIAMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANN
               K   C G L +WS      +G +++    RL+ E+ +    + +     +++ +  +  LLE ++L W+QRA+  W   GD+NTK+FH  A+ 
Subjt:  -FNAFMVKAEACLGQLAQWS---RIQYGGSLRGAIARLEREIQSFSRSDDQQGGIAMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANN

Query:  RRKRNRIRGLIDDMGIWTEEDNGMEVIATQYF
        RR++N+I+ L++  G    +   +E +  +YF
Subjt:  RRKRNRIRGLIDDMGIWTEEDNGMEVIATQYF

TrEMBL top hitse value%identityAlignment
A0A5C7H9Y2 CCHC-type domain-containing protein4.1e-4128.13Show/hide
Query:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN
        +++   + ++KG    + E+ L  +LI K +T K IN E FKS +  IW  +  +  + +G N++  +F+N   +  I E GPW FDK L++L+E  G  
Subjt:  EEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQ-VGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGN

Query:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGCG
           D+ FRYV FWI  H LP AC +R     +GGL+G+V++ID  E   +  G  +R+++ IDV  PLKRG+ +   D  +   + I YE+LP+FCY CG
Subjt:  YGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGCG

Query:  YLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRG-RGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGK-
         +GH +++C    +  +S    ++G W+R         A S  RS     G G +    EG R G  S   E     + G+ +   G D S  +  G + 
Subjt:  YLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRG-RGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGK-

Query:  -LKTPANSPVSPCPETAATATKPEKEN-----SESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDG--TNRR--GQLAQG
         L T   S  +   +TA + T    +      + S     ++  +   S++ E    +T+   G N  ++     S     +DN G  TN++   +LA+ 
Subjt:  -LKTPANSPVSPCPETAATATKPEKEN-----SESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDG--TNRR--GQLAQG

Query:  ESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSI---EEIDVTIRSFSKGHIDALVKENDFL-WRPIIAEWSTD
        +  +V E              K +G + +       +  D  KI + T  +   G   S+    +I+V+IRSF+KGHIDA++K++D L WR     +  +
Subjt:  ESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSI---EEIDVTIRSFSKGHIDALVKENDFL-WRPIIAEWSTD

Query:  PL----IRSFAIANRPRRFEE-AWSKYEDCREIV----KRVWESQNSQDFNAFMVKAEAC-------LGQLAQWSRIQYGGSL
        P+    + S+++  R  R     W    D  EI+    K+    +++   ++F    + C       +G    WS  Q+ G L
Subjt:  PL----IRSFAIANRPRRFEE-AWSKYEDCREIV----KRVWESQNSQDFNAFMVKAEAC-------LGQLAQWSRIQYGGSL

A0A6J1D765 uncharacterized protein LOC1110179023.8e-3936.05Show/hide
Query:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK
        K T +E   V   +G+ I  ++  ++  ++ K+ T K+I+ E  +S M  +W     T    +G N+Y+  FK+   KS +  +GPW F+K+L++L  P 
Subjt:  KVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-QEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPK

Query:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFC
          N   DM+F + +FWI  H +PF C S   A  +G  LG VE+I  E +    W G  +RV+++IDVSKPL+RGI L   D  ++   P+ YEKLPDFC
Subjt:  GGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGS-LRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFC

Query:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER
        Y CG +GH+ +ECE   +  ++    +YG WLR  +   ++++ S P   +  + GR GRG    GGRGG   WR   ++E    +DG     R
Subjt:  YGCGYLGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLP-RSIVNQAGR-GRGRIWEGGRGG---WRSSIQEEEEDGLDGTIQKER

A0A6J5WUK8 Uncharacterized protein2.5e-3824.49Show/hide
Query:  LENALICKILTQKKINPEMFKSKMPRIW-GQEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPF
        L N L+ K+L+ + +N E F S   R+W G  +  I ++   L+L +F+N R K+ + +  PW F  AL+LL E   G     +D +   FW+  H +P 
Subjt:  LENALICKILTQKKINPEMFKSKMPRIW-GQEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPF

Query:  ACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGCGYLGHTIKECESIDQAG--SSE
           +   A +IG LLG+V ++D   + +   G   RV+IQ DV++PL RG F+   +   +  I   YE LP++C+ CG LGH  + C  +++AG  SS+
Subjt:  ACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGCGYLGHTIKECESIDQAG--SSE

Query:  ----EELEYGAWLREPIFLKMREAESLPRSIVN---QAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPE
            E L   A L     ++ R  +S+ R +      AG  R R++  G GG    +       +  ++  E  +   ++    GK+      P+ P P 
Subjt:  ----EELEYGAWLREPIFLKMREAESLPRSIVN---QAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPE

Query:  TAATATKPEKENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIE-------------DIDL
           +  K  +E  +S   G        F ++  L +    ++S    +  + + +S   V+   D    + +L   + V ++E              +D+
Subjt:  TAATATKPEKENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIE-------------DIDL

Query:  QPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSIEEIDVTIRSFSKGHID-ALVKENDFLWRPIIAEWSTDPLIRSFA-------
        +       +   EG    W+   ++  +     +   + ++++  +  + ++ +    F+         +EN  +   +    +++P +RS++       
Subjt:  QPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSIEEIDVTIRSFSKGHID-ALVKENDFLWRPIIAEWSTDPLIRSFA-------

Query:  -----------IANRP------RRF--EEAWSKYEDCREIVKRVWES--QNSQDFNAFMVKAEACLGQLAQWSRIQYGGSLRGAIARLEREIQSFSRSDD
                   ++ +P      RRF  +  W +   C+ +V   W+   Q S+  + F+ K         QW RI+ G +    I  L +++QS + ++ 
Subjt:  -----------IANRP------RRF--EEAWSKYEDCREIVKRVWES--QNSQDFNAFMVKAEACLGQLAQWSRIQYGGSLRGAIARLEREIQSFSRSDD

Query:  QQGGIAMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMGIWTEEDNGMEVIATQYF
           G  +R  EQ L+  L ++EL+WKQ++R  WL  G++NTK+FH K   RR++NR+ GL D  G+W +++  +  IA  YF
Subjt:  QQGGIAMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMGIWTEEDNGMEVIATQYF

A0A7J6FE65 CCHC-type domain-containing protein1.1e-3826.66Show/hide
Query:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-----QEQTIISQVGFNLYL-CKFKNGRIKSLIKETGPW
        LV +L+E+ V +E    V  L    I + +KK++  L+ K++  +  N E  +  M  +W      Q + + S+  F  +  C+    R+       GPW
Subjt:  LVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWG-----QEQTIISQVGFNLYL-CKFKNGRIKSLIKETGPW

Query:  FFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRR
          DK LI   +P G      M+F + SFWI  + +P AC +   A E G  +GK+E I +         G+++V+++I++++PLKRG+ +  +D      
Subjt:  FFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRR

Query:  IPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSE--EELEYGAWLREPI--------FLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEE
        +   YE LPDFC+ CG +GH   +C   D  G +   +   YG+W+  P         + K RE     R I+   G     +    R   R S    +E
Subjt:  IPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSE--EELEYGAWLREPI--------FLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEE

Query:  DGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPETAATATKPEKENSES-----------------KGNGNVSAINENFSEKEELNSNITDENSGA
              ++    D+   S     K  T A +PV     +  T T+  + N +                   G G V A  +N S  E+L    + +  G 
Subjt:  DGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPETAATATKPEKENSES-----------------KGNGNVSAINENFSEKEELNSNITDENSGA

Query:  NGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSIEEID--VTI
        +   K+   D   EV     G  +R              +++    ++++K +LE            +Q D S     T      G  H  E +D     
Subjt:  NGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRIPRSQQQDDSKIIMETTSKTISGAKHSIEEID--VTI

Query:  RSFSKGHIDALVKENDFL---WRPIIAEWSTDPLIRSFAIANRPR--RFEEAWSKYEDCREIVKRVW----ESQNSQDFNAFMVKAEACLGQLAQWSRIQ
        + +   +    V   D L    R ++A  S + +IR     +R R  RFE  W K +DC +IV+R W     S    + +  +     C  QL  W++ +
Subjt:  RSFSKGHIDALVKENDFL---WRPIIAEWSTDPLIRSFAIANRPR--RFEEAWSKYEDCREIVKRVW----ESQNSQDFNAFMVKAEACLGQLAQWSRIQ

Query:  YGGSLRGAIARLEREIQSFSRSDDQQGGIA-------MREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMG--IW
        +     G+I RL RE Q   + DD     A       ++  E +L  LL  +E YWK R+R DWL  GDRNTK+FH KA  R+K+N I  ++ + G  + 
Subjt:  YGGSLRGAIARLEREIQSFSRSDDQQGGIA-------MREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMG--IW

Query:  TEEDNGMEV
        TEED   E+
Subjt:  TEEDNGMEV

A0A7N2R0C3 Reverse transcriptase domain-containing protein5.0e-3923.12Show/hide
Query:  QEEEVLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPW
        +E EVL ++   LKVTEEE+  +  L  E +  + ++ +  +  K++++K +  E  +  +  +W   ++I +S +G  L+L +F++ R K  + +  PW
Subjt:  QEEEVLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTI-ISQVGFNLYLCKFKNGRIKSLIKETGPW

Query:  FFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRR
         ++K L+L KE +G    +D+  ++  FW+  + LP    ++     IG  +GK  ++D+EE   Q WG  LRV+++IDV++ L RG  +  E   E R 
Subjt:  FFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRR

Query:  IPITYEKLPDFCYGCGYLGHTIKEC--ESIDQAGSSEEELEYGAWLR-EPI--------FLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEE
        +   YE+LP+FCY CG L H +K+C  E        E +L+YGAWLR EPI        F K +    +      +A   +GR  +  + G     QE E
Subjt:  IPITYEKLPDFCYGCGYLGHTIKEC--ESIDQAGSSEEELEYGAWLR-EPI--------FLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEE

Query:  EDGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPETAATATKPE----KENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQS--KIHIYD
           L  + Q+  GD      L GG   T      +   E+   +   E     E +   GNGN           E LN  I +   G   ++    ++  
Subjt:  EDGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCPETAATATKPE----KENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQS--KIHIYD

Query:  SNMEVDQDNDG-------------TNRRGQLAQGESVNVIE---DIDLQPAMDTIQKAK--------LEGKLKVWKR--IPRSQQ---------------
          +  D++ DG              N+ G  + G    +I    D +++ ++  +Q+ +        ++  +K  KR   P SQ                
Subjt:  SNMEVDQDNDG-------------TNRRGQLAQGESVNVIE---DIDLQPAMDTIQKAK--------LEGKLKVWKR--IPRSQQ---------------

Query:  ----QDDSK-------IIMET--TSKTISGAKHSI----------------------EEIDVTIRSFSKGHIDALV------------------------
             D+ K        + ET  + + I G +  +                      E +DV+++S S  HID +V                        
Subjt:  ----QDDSK-------IIMET--TSKTISGAKHSI----------------------EEIDVTIRSFSKGHIDALV------------------------

Query:  -----------------------------------KEND-------------------------FLW--------RPII--------AEW----------
                                            E D                         F W        R ++         EW          
Subjt:  -----------------------------------KEND-------------------------FLW--------RPII--------AEW----------

Query:  -----STDPLIRSFAIANRPRR--------FEEAWSKYEDCREIVKRVWESQNSQDFNAFMVKAEACLGQLAQWSRIQYGG---SLRGAIARLER--EIQ
             ++D  + S +I  R  R        FEE W++ E CRE+++R W+            + + C  QL  W+R  +G     L+    RL++  E+ 
Subjt:  -----STDPLIRSFAIANRPRR--------FEEAWSKYEDCREIVKRVWESQNSQDFNAFMVKAEACLGQLAQWSRIQYGG---SLRGAIARLER--EIQ

Query:  SFSRSDDQQGGIAMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMGIWTEEDNGMEVIATQYFCKEKSESTTHL
            S ++     +++ ++ +  ++  +E+ W QR+R  W+ +GDRNT++FH  ANNRR++N+I G++D  G W E +  +E I  +YF +  S +    
Subjt:  SFSRSDDQQGGIAMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMGIWTEEDNGMEVIATQYFCKEKSESTTHL

Query:  FWDC
        F  C
Subjt:  FWDC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-1324.59Show/hide
Query:  CKEKSESTTHLFWDCKITKEVW-LKYFPFTNLGSFNGRNGWTVPDYCEKLWRNNNEGSLDDNSLRKCLI--ICWKIWAIRNTICHNGQNHSQAQIKAMVQ
        C +  E+  HL + C   + VW +   P    G       WT   Y    W  N E  +        L+  + W++W  RN +   G+ +   ++     
Subjt:  CKEKSESTTHLFWDCKITKEVW-LKYFPFTNLGSFNGRNGWTVPDYCEKLWRNNNEGSLDDNSLRKCLI--ICWKIWAIRNTICHNGQNHSQAQIKAMVQ

Query:  QQIE--SSINELIGEEGTYQSSPYPNVEHNANPGSPLQTAKQRRWIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLET
        +  E  S+  EL G+      +  P VE N +           +W   P    K + DA+W  +  R G+GWILR+  G  +  G R++ R   +   E 
Subjt:  QQIE--SSINELIGEEGTYQSSPYPNVEHNANPGSPLQTAKQRRWIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLET

Query:  MAITEGLRATSFAISLATQSFAPQIRVESDCLQVVRLINGEDVDGTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAHMACE-ENESKLWVHSFP
              L A  +A+   ++    +I  ESD   +V L+N +D   T L   ++++QQL+           PR  N++A R+A  +    N         P
Subjt:  MAITEGLRATSFAISLATQSFAPQIRVESDCLQVVRLINGEDVDGTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAHMACE-ENESKLWVHSFP

Query:  SWLLS
         WL S
Subjt:  SWLLS

AT3G42140.1 zinc ion binding;nucleic acid binding2.7e-0520.83Show/hide
Query:  IKETGPWFFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPE
        I   GPW F+  + +++  +      D +F+ + FWI    +P    +    T IG                                   + G+FL   
Subjt:  IKETGPWFFDKALILLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPE

Query:  DSNENRRIPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSEEE
           +   +   YEKL +FC  CG L H   EC +    G   ++
Subjt:  DSNENRRIPITYEKLPDFCYGCGYLGHTIKECESIDQAGSSEEE

AT4G29090.1 Ribonuclease H-like superfamily protein2.8e-1021.71Show/hide
Query:  CKEKSESTTHLFWDCKITKEVW-LKYFPFTNLGSFNGRNGWTVPDYCEKLWRNN--NEGSLDDNSLRKCLIICWKIWAIRNTICHNGQNHSQAQIKAMVQ
        C    E+  HL + C   +  W +   P    G       W    Y    W  N  N     + + +    + W++W  RN +   G+  +  ++    +
Subjt:  CKEKSESTTHLFWDCKITKEVW-LKYFPFTNLGSFNGRNGWTVPDYCEKLWRNN--NEGSLDDNSLRKCLIICWKIWAIRNTICHNGQNHSQAQIKAMVQ

Query:  QQIESSINELIGEEGTYQSSPYPNVEHNANPGSPLQTAKQRRWIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLETMA
          +E     +  E  +  + P  N             +   RW P P    K + DA+W+ D +R G+GW+LR+  G+    G R++ +      L+++ 
Subjt:  QQIESSINELIGEEGTYQSSPYPNVEHNANPGSPLQTAKQRRWIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLETMA

Query:  ITEGLRATSFAISLATQSFAPQIRVESDCLQVVRLINGEDVDGTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAH--MACEENESKLWVHSFPS
          E L A  +A+   ++     +  ESD   ++ ++N +++    L   I+++Q+L++         IPR  N +A R+A   ++    + KL+    PS
Subjt:  ITEGLRATSFAISLATQSFAPQIRVESDCLQVVRLINGEDVDGTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAH--MACEENESKLWVHSFPS

Query:  WLLS
        W  S
Subjt:  WLLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGCCGACAAGAAGAAGAGGTCCTAGTCAAACAGTTAACGGAGTTGAAAGTCACAGAAGAGGAAAAAGCATGTGTCTTCAAGCTGAAAGGAGAGGAAATAAATAA
ATCAGAAAAAAAGTTGGAAAATGCCCTCATTTGCAAGATTCTGACACAAAAGAAAATTAACCCAGAGATGTTTAAGTCCAAAATGCCACGGATTTGGGGTCAGGAACAGA
CTATCATTAGTCAAGTGGGATTCAATTTGTATCTGTGCAAGTTCAAGAACGGGCGTATCAAGAGCTTAATCAAAGAAACCGGACCATGGTTTTTCGACAAGGCTCTCATC
CTGTTAAAGGAACCAAAGGGAGGCAATTACGGCGAGGATATGGATTTCAGGTACGTATCGTTTTGGATTCATTTCCATAAACTTCCTTTTGCTTGTTTTTCCAGGAATGC
AGCAACTGAAATAGGAGGTCTTCTTGGAAAAGTGGAACAAATCGATTTGGAGGAGGAGATAGATCAAAATTGGGGAGGATCTTTACGTGTGAAGATCCAAATTGATGTAT
CCAAACCTTTGAAGCGGGGAATTTTTTTGACACCGGAAGATTCTAATGAAAACCGAAGGATTCCGATTACGTATGAAAAGTTACCCGACTTTTGTTATGGTTGTGGATAT
CTTGGTCACACTATAAAGGAATGTGAAAGTATAGATCAAGCTGGGTCATCTGAGGAGGAATTGGAGTATGGTGCTTGGCTTCGTGAACCTATCTTTCTAAAGATGAGAGA
GGCAGAGTCATTGCCCAGATCCATAGTCAATCAAGCTGGTCGGGGTAGAGGTAGAATCTGGGAAGGAGGAAGAGGAGGATGGAGAAGTTCCATCCAAGAAGAAGAAGAAG
ACGGGCTCGACGGTACAATTCAGAAAGAAAGAGGAGATGACAGGTCTGATTCAGTATTGGCCGGCGGGAAGTTGAAGACTCCGGCGAACAGTCCGGTGAGTCCATGTCCA
GAAACGGCAGCAACGGCTACTAAGCCAGAAAAGGAAAATTCGGAAAGCAAGGGTAACGGTAACGTATCTGCCATTAATGAAAATTTCTCAGAAAAAGAGGAATTAAATTC
AAATATTACAGACGAAAATTCTGGTGCAAATGGTCAATCAAAGATTCATATTTATGACTCTAATATGGAGGTGGATCAAGACAACGATGGGACAAATCGAAGGGGACAGC
TGGCGCAAGGGGAGAGTGTTAATGTCATTGAGGATATTGATTTACAACCTGCTATGGATACTATTCAGAAAGCTAAGTTGGAAGGCAAGCTTAAAGTATGGAAAAGGATT
CCACGGTCCCAACAGCAGGATGATTCTAAAATCATTATGGAAACAACTAGCAAGACCATTTCAGGGGCTAAACATTCAATTGAGGAGATTGATGTTACAATTCGCTCTTT
CTCTAAGGGGCACATTGATGCTTTAGTGAAGGAAAATGACTTTCTATGGAGACCTATAATTGCAGAATGGTCCACTGACCCCTTGATTCGAAGTTTTGCTATTGCAAATC
GCCCTAGGAGATTTGAAGAGGCGTGGAGTAAGTATGAGGATTGTAGGGAGATTGTGAAGAGAGTATGGGAATCTCAAAACAGTCAAGATTTTAATGCATTCATGGTCAAA
GCAGAGGCTTGCCTAGGTCAGTTAGCTCAGTGGAGTCGGATACAGTATGGAGGCTCGCTAAGAGGGGCTATTGCAAGATTAGAGCGGGAGATTCAGAGCTTCTCACGTAG
TGATGATCAGCAAGGGGGTATTGCTATGCGAGAAAAGGAGCAACGTCTTGAAGCCCTGCTTGAGGATGATGAGCTTTATTGGAAACAAAGAGCTCGAGAGGATTGGTTGG
TTTGGGGTGACAGAAACACGAAGTGGTTTCATATGAAGGCGAATAACAGAAGAAAACGAAACAGAATTCGAGGCCTAATAGACGATATGGGGATTTGGACAGAAGAGGAT
AATGGAATGGAGGTTATTGCAACTCAATATTTTTGCAAGGAGAAATCGGAATCTACTACTCATCTCTTTTGGGATTGCAAAATCACTAAGGAGGTGTGGCTTAAATATTT
TCCTTTTACTAACTTGGGGAGTTTTAATGGCAGGAATGGATGGACGGTTCCAGACTATTGTGAGAAACTATGGAGGAACAACAATGAAGGATCACTGGATGATAATAGCC
TAAGGAAGTGCCTTATTATTTGTTGGAAAATTTGGGCAATTCGAAACACAATTTGCCACAATGGTCAGAATCACAGTCAAGCACAAATAAAAGCAATGGTTCAGCAGCAG
ATTGAAAGCTCCATCAACGAATTAATCGGAGAAGAAGGTACTTACCAGTCGAGTCCCTACCCGAATGTCGAGCACAACGCGAACCCAGGCTCTCCCTTGCAGACGGCAAA
ACAGAGGCGTTGGATCCCGATTCCTGAAGGCTGTTGGAAGCTGAGTTGTGATGCGTCATGGAGTGAAGATCTTCAACGGGGTGGATTAGGATGGATTCTTCGAGACTGGG
GAGGGAAACCGATAATGGCGGGTTACAGAAGTATTTGCCGGAGGTGGAAGATTAGTTGGCTTGAGACTATGGCGATCACTGAGGGACTGCGTGCGACTTCATTTGCGATC
TCCCTTGCAACCCAGAGCTTCGCCCCTCAAATTCGCGTGGAAAGTGACTGTCTCCAGGTGGTTCGGCTGATTAATGGTGAAGATGTAGATGGTACTGAACTTGACTTTTT
CATAAAGGAAGTCCAACAACTTATTGCCATGAGAAGGATAGATCTGACTTCTCATATCCCCAGGGCTTATAACCAAATGGCCCATAGGTTGGCCCATATGGCTTGTGAGG
AAAATGAGTCAAAATTATGGGTTCACTCCTTTCCAAGTTGGCTTTTATCTTACAATGAAGCTGATGTTGGCTATTTCCTTGACAATAGTGGGGGTTCATGTCCCACTAAT
GTCAATCTTTTGAACTTTTTGCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGCCGACAAGAAGAAGAGGTCCTAGTCAAACAGTTAACGGAGTTGAAAGTCACAGAAGAGGAAAAAGCATGTGTCTTCAAGCTGAAAGGAGAGGAAATAAATAA
ATCAGAAAAAAAGTTGGAAAATGCCCTCATTTGCAAGATTCTGACACAAAAGAAAATTAACCCAGAGATGTTTAAGTCCAAAATGCCACGGATTTGGGGTCAGGAACAGA
CTATCATTAGTCAAGTGGGATTCAATTTGTATCTGTGCAAGTTCAAGAACGGGCGTATCAAGAGCTTAATCAAAGAAACCGGACCATGGTTTTTCGACAAGGCTCTCATC
CTGTTAAAGGAACCAAAGGGAGGCAATTACGGCGAGGATATGGATTTCAGGTACGTATCGTTTTGGATTCATTTCCATAAACTTCCTTTTGCTTGTTTTTCCAGGAATGC
AGCAACTGAAATAGGAGGTCTTCTTGGAAAAGTGGAACAAATCGATTTGGAGGAGGAGATAGATCAAAATTGGGGAGGATCTTTACGTGTGAAGATCCAAATTGATGTAT
CCAAACCTTTGAAGCGGGGAATTTTTTTGACACCGGAAGATTCTAATGAAAACCGAAGGATTCCGATTACGTATGAAAAGTTACCCGACTTTTGTTATGGTTGTGGATAT
CTTGGTCACACTATAAAGGAATGTGAAAGTATAGATCAAGCTGGGTCATCTGAGGAGGAATTGGAGTATGGTGCTTGGCTTCGTGAACCTATCTTTCTAAAGATGAGAGA
GGCAGAGTCATTGCCCAGATCCATAGTCAATCAAGCTGGTCGGGGTAGAGGTAGAATCTGGGAAGGAGGAAGAGGAGGATGGAGAAGTTCCATCCAAGAAGAAGAAGAAG
ACGGGCTCGACGGTACAATTCAGAAAGAAAGAGGAGATGACAGGTCTGATTCAGTATTGGCCGGCGGGAAGTTGAAGACTCCGGCGAACAGTCCGGTGAGTCCATGTCCA
GAAACGGCAGCAACGGCTACTAAGCCAGAAAAGGAAAATTCGGAAAGCAAGGGTAACGGTAACGTATCTGCCATTAATGAAAATTTCTCAGAAAAAGAGGAATTAAATTC
AAATATTACAGACGAAAATTCTGGTGCAAATGGTCAATCAAAGATTCATATTTATGACTCTAATATGGAGGTGGATCAAGACAACGATGGGACAAATCGAAGGGGACAGC
TGGCGCAAGGGGAGAGTGTTAATGTCATTGAGGATATTGATTTACAACCTGCTATGGATACTATTCAGAAAGCTAAGTTGGAAGGCAAGCTTAAAGTATGGAAAAGGATT
CCACGGTCCCAACAGCAGGATGATTCTAAAATCATTATGGAAACAACTAGCAAGACCATTTCAGGGGCTAAACATTCAATTGAGGAGATTGATGTTACAATTCGCTCTTT
CTCTAAGGGGCACATTGATGCTTTAGTGAAGGAAAATGACTTTCTATGGAGACCTATAATTGCAGAATGGTCCACTGACCCCTTGATTCGAAGTTTTGCTATTGCAAATC
GCCCTAGGAGATTTGAAGAGGCGTGGAGTAAGTATGAGGATTGTAGGGAGATTGTGAAGAGAGTATGGGAATCTCAAAACAGTCAAGATTTTAATGCATTCATGGTCAAA
GCAGAGGCTTGCCTAGGTCAGTTAGCTCAGTGGAGTCGGATACAGTATGGAGGCTCGCTAAGAGGGGCTATTGCAAGATTAGAGCGGGAGATTCAGAGCTTCTCACGTAG
TGATGATCAGCAAGGGGGTATTGCTATGCGAGAAAAGGAGCAACGTCTTGAAGCCCTGCTTGAGGATGATGAGCTTTATTGGAAACAAAGAGCTCGAGAGGATTGGTTGG
TTTGGGGTGACAGAAACACGAAGTGGTTTCATATGAAGGCGAATAACAGAAGAAAACGAAACAGAATTCGAGGCCTAATAGACGATATGGGGATTTGGACAGAAGAGGAT
AATGGAATGGAGGTTATTGCAACTCAATATTTTTGCAAGGAGAAATCGGAATCTACTACTCATCTCTTTTGGGATTGCAAAATCACTAAGGAGGTGTGGCTTAAATATTT
TCCTTTTACTAACTTGGGGAGTTTTAATGGCAGGAATGGATGGACGGTTCCAGACTATTGTGAGAAACTATGGAGGAACAACAATGAAGGATCACTGGATGATAATAGCC
TAAGGAAGTGCCTTATTATTTGTTGGAAAATTTGGGCAATTCGAAACACAATTTGCCACAATGGTCAGAATCACAGTCAAGCACAAATAAAAGCAATGGTTCAGCAGCAG
ATTGAAAGCTCCATCAACGAATTAATCGGAGAAGAAGGTACTTACCAGTCGAGTCCCTACCCGAATGTCGAGCACAACGCGAACCCAGGCTCTCCCTTGCAGACGGCAAA
ACAGAGGCGTTGGATCCCGATTCCTGAAGGCTGTTGGAAGCTGAGTTGTGATGCGTCATGGAGTGAAGATCTTCAACGGGGTGGATTAGGATGGATTCTTCGAGACTGGG
GAGGGAAACCGATAATGGCGGGTTACAGAAGTATTTGCCGGAGGTGGAAGATTAGTTGGCTTGAGACTATGGCGATCACTGAGGGACTGCGTGCGACTTCATTTGCGATC
TCCCTTGCAACCCAGAGCTTCGCCCCTCAAATTCGCGTGGAAAGTGACTGTCTCCAGGTGGTTCGGCTGATTAATGGTGAAGATGTAGATGGTACTGAACTTGACTTTTT
CATAAAGGAAGTCCAACAACTTATTGCCATGAGAAGGATAGATCTGACTTCTCATATCCCCAGGGCTTATAACCAAATGGCCCATAGGTTGGCCCATATGGCTTGTGAGG
AAAATGAGTCAAAATTATGGGTTCACTCCTTTCCAAGTTGGCTTTTATCTTACAATGAAGCTGATGTTGGCTATTTCCTTGACAATAGTGGGGGTTCATGTCCCACTAAT
GTCAATCTTTTGAACTTTTTGCATTGA
Protein sequenceShow/hide protein sequence
MARRQEEEVLVKQLTELKVTEEEKACVFKLKGEEINKSEKKLENALICKILTQKKINPEMFKSKMPRIWGQEQTIISQVGFNLYLCKFKNGRIKSLIKETGPWFFDKALI
LLKEPKGGNYGEDMDFRYVSFWIHFHKLPFACFSRNAATEIGGLLGKVEQIDLEEEIDQNWGGSLRVKIQIDVSKPLKRGIFLTPEDSNENRRIPITYEKLPDFCYGCGY
LGHTIKECESIDQAGSSEEELEYGAWLREPIFLKMREAESLPRSIVNQAGRGRGRIWEGGRGGWRSSIQEEEEDGLDGTIQKERGDDRSDSVLAGGKLKTPANSPVSPCP
ETAATATKPEKENSESKGNGNVSAINENFSEKEELNSNITDENSGANGQSKIHIYDSNMEVDQDNDGTNRRGQLAQGESVNVIEDIDLQPAMDTIQKAKLEGKLKVWKRI
PRSQQQDDSKIIMETTSKTISGAKHSIEEIDVTIRSFSKGHIDALVKENDFLWRPIIAEWSTDPLIRSFAIANRPRRFEEAWSKYEDCREIVKRVWESQNSQDFNAFMVK
AEACLGQLAQWSRIQYGGSLRGAIARLEREIQSFSRSDDQQGGIAMREKEQRLEALLEDDELYWKQRAREDWLVWGDRNTKWFHMKANNRRKRNRIRGLIDDMGIWTEED
NGMEVIATQYFCKEKSESTTHLFWDCKITKEVWLKYFPFTNLGSFNGRNGWTVPDYCEKLWRNNNEGSLDDNSLRKCLIICWKIWAIRNTICHNGQNHSQAQIKAMVQQQ
IESSINELIGEEGTYQSSPYPNVEHNANPGSPLQTAKQRRWIPIPEGCWKLSCDASWSEDLQRGGLGWILRDWGGKPIMAGYRSICRRWKISWLETMAITEGLRATSFAI
SLATQSFAPQIRVESDCLQVVRLINGEDVDGTELDFFIKEVQQLIAMRRIDLTSHIPRAYNQMAHRLAHMACEENESKLWVHSFPSWLLSYNEADVGYFLDNSGGSCPTN
VNLLNFLH