CuGenDBv2

Spg031202 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg031202
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CCHC-type domain-containing protein
Genome location: scaffold8:28786838..28792567
RNA-Seq Expression: Spg031202
Synteny: Spg031202
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type
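The genome location field packs the scaffold name and both coordinates into one string. A minimal sketch of splitting it apart and computing the span of the gene is given here; the names `Location` and `parse_location` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'scaffold8:28786838..28792567'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("scaffold8:28786838..28792567")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported span covers 5730 bases of scaffold8.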


Homology
GenBank top hits (hit, e-value, % identity, alignment)
KAF5475845.1 hypothetical protein F2P56_007609 [Juglans regia] (e-value: 6.9e-54; identity: 28.6%)
Query:  ILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYA
        I+  + IN E F + M K+W  EG V+  + G N ++ +F + +D++RV KG PWS+D  +L     +G+  V D++F    FW+  H +P      +  
Subjt:  ILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYA

Query:  IALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSR
          +  ++G       D+     G+ +R++V +  T+ L RG  +       KTW+   YE+LP FC  CG + H  + C+    A    + G    A + 
Subjt:  IALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSR

Query:  EEGDIS-SRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKE-------------WAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIG
        +EGDI+  R     +++TSG   A  +W    +   E+  + G+     G K+             + Q  K++  DL E       +  K       + 
Subjt:  EEGDIS-SRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKE-------------WAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIG

Query:  LKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTKNENSKK--WKRRVREDNNNSESVTCGVL--ETGGKRKA
        + +  ++ D G+ T+ +     +E  + +    K    KR ++ +        +PR    T +   K   W+   RE     E     +L      K K 
Subjt:  LKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTKNENSKK--WKRRVREDNNNSESVTCGVL--ETGGKRKA

Query:  DDEL----EGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIK--MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNL
           +      C   K++ +KR+   +    +  KG SGGL  LW  ++E ++ S+S+ HI  M+K       W  TGFYG+PVT +R  SWKLL+ +  +
Subjt:  DDEL----EGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIK--MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNL

Query:  SSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWS
          L W+  GDFNEIL   +KSGG  R  NQ++ FR  V+ C L DLGF+ N FTWS
Subjt:  SSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWS

KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis] (e-value: 5.8e-53; identity: 27.44%)
Query:  EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCS
        EE    + ++++    + R  + ++  KI + R +   V      KIW L  +V + +   N +I  F    DK+RV  G PW +D  + V +   GS  
Subjt:  EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCS

Query:  VEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKL
        V +++F   SFW+ FH LP +   ++    L ++IG  E+ E+DE++   G +LRVK++++  +PL RG  +    +  K W P+ YEK+P FC+ CG++
Subjt:  VEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKL

Query:  GHVKQECEVEEAAETN-----------------EEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSG------
         H    C+V++                      ++E   +  +S E G   + ++ G A +  G  +           S+ M+ D G  G+G G      
Subjt:  GHVKQECEVEEAAETN-----------------EEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSG------

Query:  -QKEWAQKNKDSDVD--LTEKSEVEKDRDAKGK------------GGPEEIGLKISPMDQDLGIQT--VAKRKVVIKEKVINRGMEEKGRSVKRQ-----
          K      K+  ++  +TE  ++  D +  GK            GGP  +G    P D   G+    +      ++  V    + ++GR  +       
Subjt:  -QKEWAQKNKDSDVD--LTEKSEVEKDRDAKGK------------GGPEEIGLKISPMDQDLGIQT--VAKRKVVIKEKVINRGMEEKGRSVKRQ-----

Query:  ----TTSEGLHK----------VKIDKPRSPAKTK----NENSKKWKRRVREDNNNSESVTCGVLETGGKRKADDELEGCSDSKLDS-----IKRVTSLE
              +E L K          + +D+   PA  K      +   W  R          V    L    + KA D L    ++KL++     +K     +
Subjt:  ----TTSEGLHK----------VKIDKPRSPAKTK----NENSKKWKRRVREDNNNSESVTCGVLETGGKRKADDELEGCSDSKLDS-----IKRVTSLE

Query:  CGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKMN----EGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQ
          L+V  +G SGG+ L W+   +V++ +FS+ HI + +       E WW  TGFYGN   +KR +SW LL  L   S   W++ GDFNEIL   EKSGG+
Subjt:  CGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKMN----EGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQ

Query:  PRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRKVKLDGVIAERM
         + + QM  FR V+D C L DLGF  NPFTW  + +    I+ER+
Subjt:  PRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRKVKLDGVIAERM

OMO61345.1 reverse transcriptase [Corchorus capsularis] (e-value: 2.5e-56; identity: 29.42%)
Query:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY
        M + +    E F L EE    V V+   ++    +    +  K+L+ R +N EV  + M  +W L G +++ + G N++I +F    +KERV +  PW++
Subjt:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY

Query:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI
        + A+LV         VED++    SFW   H LP           +  S G+ E+ +   ++   G+ LR +  +N T+PL+RG  + T     K  I  
Subjt:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI

Query:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETS----GVRRAVEAWEKQTRRSAEMIDDG----GRTGEGSG
         YEKLPDFCY CG L HV+ EC  E+A     ++G        +E     R  + R+K       G+       E++ + S     +G    GR G+   
Subjt:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETS----GVRRAVEAWEKQTRRSAEMIDDG----GRTGEGSG

Query:  QKEWAQKNKDSD--VDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIK------EKVINRGMEEKG------RSVKRQTTSEGLHK
        + + A+ N+DS    D T+           GK    +   K+            AK+KV  K      + V+  G+  KG      ++V   + S G  K
Subjt:  QKEWAQKNKDSD--VDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIK------EKVINRGMEEKG------RSVKRQTTSEGLHK

Query:  VKIDKPRSPAKTKNENSKKWKRRVREDNNNSESVTCGVLETGGKRKADDELEGCSDSKLDSIK------RVTSLECGLSVPSKGCSGGLMLLWDKEIEVD
        +K               KKWKR          +VT   ++ G    ++D+L   +  K +  K      R  S E G  V S   SGGL LLW +E EV 
Subjt:  VKIDKPRSPAKTKNENSKKWKRRVREDNNNSESVTCGVLETGGKRKADDELEGCSDSKLDSIK------RVTSLECGLSVPSKGCSGGLMLLWDKEIEVD

Query:  LISFSEGHIDSMIKMNEGW--WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDN
        ++S+S  H D+++   +G   WRFTGFYGNP+T++R +SW L+  L   SSL W++GGDFNEI+   EK GG  R  +Q+  FR ++  C+L  L  +  
Subjt:  LISFSEGHIDSMIKMNEGW--WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDN

Query:  PFTWSRKVKLDGVI--AERMIISRCWSDEPTSKPDGFMR
          TW R    + V    +R ++S  W D    K +  ++
Subjt:  PFTWSRKVKLDGVI--AERMIISRCWSDEPTSKPDGFMR

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] (e-value: 2.4e-75; identity: 31.63%)
Query:  ADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYD
        +D++  + EK  L ++ G +  ++    E  ++    ++  K +T ++IN E F   +  IW  +  V ME  G NI+  +F+   D++R+ +GGPW +D
Subjt:  ADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYD

Query:  DAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPIS
          +LV  E  GS  V DL+F++V FWI  H LP  C  R+  + L   +G  ++ +  E+ +  G+ +R++V+I+   PLKRG  V  G   +   + I 
Subjt:  DAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPIS

Query:  YEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTG-----EGSGQKEW
        YE+LP+FCY+CGK+GH+ ++C                P N++E   I+S  S         V R       + + S E   +GG +         G  +W
Subjt:  YEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTG-----EGSGQKEW

Query:  AQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKIS-----PMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTK
            KDS V L +  +++   + K  G   E    +S        + L +     ++ + ++   N    E   +V      E +   +          K
Subjt:  AQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKIS-----PMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTK

Query:  NENSKKWKRRVREDNNNSESVTCGVLETG----GKRKADDELEGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKM
          N K+WKR  RE         CG +  G    GK+  D ++E  SD K  S+          +V   G  GGL LLW  +IEV + SF++GHID++IK 
Subjt:  NENSKKWKRRVREDNNNSESVTCGVLETG----GKRKADDELEGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKM

Query:  NEGW-WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRKVKLDGVIAE
        ++   WRFTGFYG P+ + R+ SW LL RLG +S+L WIV GDFNEIL  +EK GG  R    M  FR  VD C LMD+G++ N +TWS +     +I E
Subjt:  NEGW-WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRKVKLDGVIAE

Query:  RMIISRC
        R+  + C
Subjt:  RMIISRC

XP_035544642.1 uncharacterized protein LOC109020982 [Juglans regia] (e-value: 1.3e-55; identity: 28.05%)
Query:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY
        M +E+  Q + F+L E+    + +  D +E      K  +   I+  + IN E F + M K+W  EG V+  + G N ++ +F + +D++RV KG PWS+
Subjt:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY

Query:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI
        D  +L     +G+  V D++F    FW+  H +P      +    +  ++G       D+     G+ +R++V +  T+ L RG  +       KTW+  
Subjt:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI

Query:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDIS-SRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKE----
         YE+LP FC  CG + H  + C+    A    + G    A + +EGDI+  R     +++TSG   A  +W    +   E+  + G+     G K+    
Subjt:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDIS-SRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKE----

Query:  ---------WAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRS
                 + Q  K++  DL E       +  K       + + +  ++ D G+ T+ +     +E  + +    K    KR ++ +        +PR 
Subjt:  ---------WAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRS

Query:  PAKTKNENSKK--WKRRVREDNNNSESVTCGVL--ETGGKRKADDEL----EGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSE
           T +   K   W+   RE     E     +L      K K    +      C   K++ +KR+   +    +  KG SGGL  LW  ++E ++ S+S+
Subjt:  PAKTKNENSKK--WKRRVREDNNNSESVTCGVL--ETGGKRKADDEL----EGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSE

Query:  GHIDSMIK--MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWS
         HI  M+K       W  TGFYG+PVT +R  SWKLL+ +  +  L W+  GDFNEIL   +KSGG  R  NQ++ FR  V+ C L DLGF+ N FTWS
Subjt:  GHIDSMIK--MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWS

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A1R3GTB5 Reverse transcriptase (e-value: 1.2e-56; identity: 29.42%)
Query:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY
        M + +    E F L EE    V V+   ++    +    +  K+L+ R +N EV  + M  +W L G +++ + G N++I +F    +KERV +  PW++
Subjt:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY

Query:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI
        + A+LV         VED++    SFW   H LP           +  S G+ E+ +   ++   G+ LR +  +N T+PL+RG  + T     K  I  
Subjt:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI

Query:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETS----GVRRAVEAWEKQTRRSAEMIDDG----GRTGEGSG
         YEKLPDFCY CG L HV+ EC  E+A     ++G        +E     R  + R+K       G+       E++ + S     +G    GR G+   
Subjt:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETS----GVRRAVEAWEKQTRRSAEMIDDG----GRTGEGSG

Query:  QKEWAQKNKDSD--VDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIK------EKVINRGMEEKG------RSVKRQTTSEGLHK
        + + A+ N+DS    D T+           GK    +   K+            AK+KV  K      + V+  G+  KG      ++V   + S G  K
Subjt:  QKEWAQKNKDSD--VDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIK------EKVINRGMEEKG------RSVKRQTTSEGLHK

Query:  VKIDKPRSPAKTKNENSKKWKRRVREDNNNSESVTCGVLETGGKRKADDELEGCSDSKLDSIK------RVTSLECGLSVPSKGCSGGLMLLWDKEIEVD
        +K               KKWKR          +VT   ++ G    ++D+L   +  K +  K      R  S E G  V S   SGGL LLW +E EV 
Subjt:  VKIDKPRSPAKTKNENSKKWKRRVREDNNNSESVTCGVLETGGKRKADDELEGCSDSKLDSIK------RVTSLECGLSVPSKGCSGGLMLLWDKEIEVD

Query:  LISFSEGHIDSMIKMNEGW--WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDN
        ++S+S  H D+++   +G   WRFTGFYGNP+T++R +SW L+  L   SSL W++GGDFNEI+   EK GG  R  +Q+  FR ++  C+L  L  +  
Subjt:  LISFSEGHIDSMIKMNEGW--WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDN

Query:  PFTWSRKVKLDGVI--AERMIISRCWSDEPTSKPDGFMR
          TW R    + V    +R ++S  W D    K +  ++
Subjt:  PFTWSRKVKLDGVI--AERMIISRCWSDEPTSKPDGFMR

A0A2N9G933 Reverse transcriptase domain-containing protein (e-value: 1.8e-55; identity: 29.46%)
Query:  DRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFWIHFHK
        + D +  +A + +T R +N E  +     +W       +   G N+ +  F    D ERV +G PWSYD  ++ F        V ++E  FVSFW+  H 
Subjt:  DRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFWIHFHK

Query:  LPRVCFCRKYAIALANSIGSFED-AEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEV----EEA
        LP     R++A+AL  ++G  E  AE +E    EG  +R++V I+ ++PL RG   +  S   +TWI   YE+LP FCY CG L H  ++CEV    +  
Subjt:  LPRVCFCRKYAIALANSIGSFED-AEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEV----EEA

Query:  AETNEEEGG-------EEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQ-----TRRSAEMIDDGGRTGEGSGQ---------KEWAQKNKDSDVD
            +++ G       ++P   R E  ++ R ++ R  + S    +V +  K        +  + I   G   E SG+             + N+D + D
Subjt:  AETNEEEGG-------EEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQ-----TRRSAEMIDDGGRTGEGSGQ---------KEWAQKNKDSDVD

Query:  LTEKSEV-----------EKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPA-------
        + E  E             ++  ++  G P+ IG + S +D+         R  V+ + ++     E G ++  Q T  G  K+  D   S A       
Subjt:  LTEKSEV-----------EKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPA-------

Query:  -KTKNENSKKWKRRVREDNNNSESVT--CGVLETG-----------GKRKADD-------ELEG-----------------CSDSKLDSIKRVTSLECGL
         K   E+   W+R  R         T   GV   G           GK+K+ D       E++G                   ++KLD ++R+  +   L
Subjt:  -KTKNENSKKWKRRVREDNNNSESVT--CGVLETG-----------GKRKADD-------ELEG-----------------CSDSKLDSIKRVTSLECGL

Query:  ------SVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIK-MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQ
              +VPS G SGGL LLW+ E+E+ + +FS  HID+ ++   E  WRFTGFYGNPV ++R +SW LLE+L +LS L W++ GDFNEIL  EE+SG  
Subjt:  ------SVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIK-MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQ

Query:  PRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSR--------KVKLDGVIAE---RMIISRCWSDE-PTSKPD
           Q  M  F  V++ C L+DLG+   PFTW          + +LD  +A      I + C  D  PTS  D
Subjt:  PRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSR--------KVKLDGVIAE---RMIISRCWSDE-PTSKPD

A0A2N9J6Y2 Uncharacterized protein (e-value: 7.0e-52; identity: 30.28%)
Query:  DIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFW
        D+ +      G +A K LT R+IN E  M  +  +W           G N  +  F    D ERV   GPWS+D  +++    + + S   + F   SFW
Subjt:  DIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFW

Query:  IHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEVEEA
        +  H LP  C        + N++G  E  E        G  +RV+V ++ T+PL RG  +  G   +  W+   +E+LP FCY CG++ H  ++C +   
Subjt:  IHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEVEEA

Query:  AETNEEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKEWAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEE
               G   P   ++E     R  L R        R  E    + RR   ++   G         E+           T K + E +  AK K    E
Subjt:  AETNEEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKEWAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEE

Query:  IGLKISPMDQDLGI--QTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTKNENSKKWKRRVREDNNN-----------------
        +GL    +++ L I  Q  ++    I E  + R   E  R+        G H   +D+       K      WKR VR+ N                   
Subjt:  IGLKISPMDQDLGI--QTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTKNENSKKWKRRVREDNNN-----------------

Query:  ---SESVTCGVLETGGKRKADDELEGCSDSKLDSIKRVTSLEC------GLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKMNEGWWRFTGFYG
            E VT   L T   RK D      S++KLD  KR+  L C         VPS+G SGGL   W KE+ V + S+S+ HID+++  +E  WR TGFYG
Subjt:  ---SESVTCGVLETGGKRKADDELEGCSDSKLDSIKRVTSLEC------GLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKMNEGWWRFTGFYG

Query:  NPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRK
        +P    +   W LL  L     L WI GGDFNE+L  EEK G   R + QM  FR+VVD C  +DLGF+ +PFTW  K
Subjt:  NPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRK

A0A5C7H9Y2 CCHC-type domain-containing protein (e-value: 1.2e-75; identity: 31.63%)
Query:  ADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYD
        +D++  + EK  L ++ G +  ++    E  ++    ++  K +T ++IN E F   +  IW  +  V ME  G NI+  +F+   D++R+ +GGPW +D
Subjt:  ADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYD

Query:  DAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPIS
          +LV  E  GS  V DL+F++V FWI  H LP  C  R+  + L   +G  ++ +  E+ +  G+ +R++V+I+   PLKRG  V  G   +   + I 
Subjt:  DAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPIS

Query:  YEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTG-----EGSGQKEW
        YE+LP+FCY+CGK+GH+ ++C                P N++E   I+S  S         V R       + + S E   +GG +         G  +W
Subjt:  YEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTG-----EGSGQKEW

Query:  AQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKIS-----PMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTK
            KDS V L +  +++   + K  G   E    +S        + L +     ++ + ++   N    E   +V      E +   +          K
Subjt:  AQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKIS-----PMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTK

Query:  NENSKKWKRRVREDNNNSESVTCGVLETG----GKRKADDELEGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKM
          N K+WKR  RE         CG +  G    GK+  D ++E  SD K  S+          +V   G  GGL LLW  +IEV + SF++GHID++IK 
Subjt:  NENSKKWKRRVREDNNNSESVTCGVLETG----GKRKADDELEGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKM

Query:  NEGW-WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRKVKLDGVIAE
        ++   WRFTGFYG P+ + R+ SW LL RLG +S+L WIV GDFNEIL  +EK GG  R    M  FR  VD C LMD+G++ N +TWS +     +I E
Subjt:  NEGW-WRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRKVKLDGVIAE

Query:  RMIISRC
        R+  + C
Subjt:  RMIISRC

A0A6P9EQ08 uncharacterized protein LOC109020982 (e-value: 6.1e-56; identity: 28.05%)
Query:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY
        M +E+  Q + F+L E+    + +  D +E      K  +   I+  + IN E F + M K+W  EG V+  + G N ++ +F + +D++RV KG PWS+
Subjt:  MADEVFSQMEKFRL-EEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSY

Query:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI
        D  +L     +G+  V D++F    FW+  H +P      +    +  ++G       D+     G+ +R++V +  T+ L RG  +       KTW+  
Subjt:  DDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPLKRGTHVKTGSMAEKTWIPI

Query:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDIS-SRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKE----
         YE+LP FC  CG + H  + C+    A    + G    A + +EGDI+  R     +++TSG   A  +W    +   E+  + G+     G K+    
Subjt:  SYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDIS-SRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGSGQKE----

Query:  ---------WAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRS
                 + Q  K++  DL E       +  K       + + +  ++ D G+ T+ +     +E  + +    K    KR ++ +        +PR 
Subjt:  ---------WAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRS

Query:  PAKTKNENSKK--WKRRVREDNNNSESVTCGVL--ETGGKRKADDEL----EGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSE
           T +   K   W+   RE     E     +L      K K    +      C   K++ +KR+   +    +  KG SGGL  LW  ++E ++ S+S+
Subjt:  PAKTKNENSKK--WKRRVREDNNNSESVTCGVL--ETGGKRKADDEL----EGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSE

Query:  GHIDSMIK--MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWS
         HI  M+K       W  TGFYG+PVT +R  SWKLL+ +  +  L W+  GDFNEIL   +KSGG  R  NQ++ FR  V+ C L DLGF+ N FTWS
Subjt:  GHIDSMIK--MNEGWWRFTGFYGNPVTNKRVDSWKLLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWS

SwissProt top hits (hit, e-value, % identity, alignment)
No hits found
Arabidopsis top hits (hit, e-value, % identity, alignment)
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e-value: 9.1e-04; identity: 31.17%)
Query:  WKISIYLEEYEGRADKKANLAKLENSSS--------HQRWRPPDPNHWKLNLDASWCDEEGAGGIGWFMRDSNGSLI
        W+I++ L + +     +AN     N+ S        H RWR P+    K N D S+ + +     GW +RDSNGS +
Subjt:  WKISIYLEEYEGRADKKANLAKLENSSS--------HQRWRPPDPNHWKLNLDASWCDEEGAGGIGWFMRDSNGSLI

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1) (e-value: 8.8e-07; identity: 25.36%)
Query:  LWNIWNIRNQCLHD---------SSKANSQAAIWKISIYLEEYEGRADKKANLAKLENSSSHQRWRPPDPNHWKLNLDASWCDEEGAGGIGWFMRDSNGS
        +W +W  RN+ L           + K   +A  W  +   +     + ++ N      S   + W PP   + K N D+ +          W +RDSNG 
Subjt:  LWNIWNIRNQCLHD---------SSKANSQAAIWKISIYLEEYEGRADKKANLAKLENSSSHQRWRPPDPNHWKLNLDASWCDEEGAGGIGWFMRDSNGS

Query:  LIGDGCKKIHQKGPIKWFEALALLEGLNSLVRLANERY
        +I  GC K+ Q       EAL  L  L  +V +   RY
Subjt:  LIGDGCKKIHQKGPIKWFEALALLEGLNSLVRLANERY

AT4G29090.1 Ribonuclease H-like superfamily protein (e-value: 1.7e-13; identity: 25.58%)
Query:  VSSRLGVLITFASNLCNKEEASGADDSKVRKLWNAIWNINCIPRAKITIWKIISNSLPTKVNLAKRGMNTNMFC--------------------------
        V S   VL    +   + +E S   +  +  ++  IW     P+ +  +WK +SNSLP    LA R ++    C                          
Subjt:  VSSRLGVLITFASNLCNKEEASGADDSKVRKLWNAIWNINCIPRAKITIWKIISNSLPTKVNLAKRGMNTNMFC--------------------------

Query:  ------LGRKEAMDIWDGMIHVLSASELN--------MVALILWNIWNIRNQCLHDSSKANSQAAIWKISIYLEEYEGRADKKANLAKLE-NSSSHQRWR
              LG + A  I+  +  V +    N        +V  +LW +W  RN+ +    + N+Q  + +    LEE+  R + ++   K + N SS  RWR
Subjt:  ------LGRKEAMDIWDGMIHVLSASELN--------MVALILWNIWNIRNQCLHDSSKANSQAAIWKISIYLEEYEGRADKKANLAKLE-NSSSHQRWR

Query:  PPDPNHW-KLNLDASWCDEEGAGGIGWFMRDSNGSLIGDGCKKIHQKGPIKWFEALAL
        PP P+ W K N DA+W  +    GIGW +R              ++KG +KW  A AL
Subjt:  PPDPNHW-KLNLDASWCDEEGAGGIGWFMRDSNGSLIGDGCKKIHQKGPIKWFEALAL

AT5G36228.1 nucleic acid binding; zinc ion binding (e-value: 1.9e-09; identity: 25.63%)
Query:  KILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKY
        +IL P+  + E  +  +P  WGL   V         +  +FR   D     +  PW +++  +     +     ED    F+  W+H   +P      + 
Subjt:  KILTPRLINPEVFMHFMPKIWGLEGAVKMEKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKY

Query:  AIALANSIGSFEDAEMDENEKLEGET--LRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEP
           +A+++G  E   MD NE+   +   +RVKV ++ TEPL+    V+  S  E+  I   YEKL   C +C ++ H    C        ++EE   EP
Subjt:  AIALANSIGSFEDAEMDENEKLEGET--LRVKVIINCTEPLKRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEP

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e-value: 1.1e-04; identity: 24.51%)
Query:  ILWNIWNIRNQCLHDSSKANSQAAIWKISIYLEEY--EGRADKKANLAKLENSSSHQRWRPPDPNHWKLNLDASWCDEEGAGGIGWFMRDSNGSLIGDGC
        ++W IW   N  + + ++   Q  +       +E+      +++ N  +  + S + +W PP  +  K N DAS  +     G+GW +R+S G++I  G 
Subjt:  ILWNIWNIRNQCLHDSSKANSQAAIWKISIYLEEY--EGRADKKANLAKLENSSSHQRWRPPDPNHWKLNLDASWCDEEGAGGIGWFMRDSNGSLIGDGC

Query:  KK
         K
Subjt:  KK


Sequences
CDS sequence
ATGGACAAAGAGGGCAACGACTCGGTTTGCGAAATGGTTCTTGGGGGCGATTCCGACCAACAGGCGATGGACGAAGCTCTCATCGATAACCAGACAGGTGGAGTAAACGA
AGAAATAGCCATGGCAGATGAAGTCTTCAGTCAGATGGAGAAATTCCGGCTGGAGGAAGCTGGCAGAGTGGTGGAGGTGGAAGACGATGATATAGAAAACATAGATAGGG
ACTTTAAAGGAGCAATTGCCTGCAAGATTTTAACACCAAGACTAATAAATCCAGAGGTTTTTATGCACTTTATGCCCAAGATATGGGGCTTAGAGGGAGCTGTGAAGATG
GAGAAAGCGGGAACCAACATATACATTTGCAAATTCAGGAGAATGAAAGACAAAGAGAGAGTGTCCAAAGGCGGGCCATGGTCGTATGACGACGCAATTTTAGTTTTCGA
CGAACCAAAAGGGAGTTGTAGTGTTGAAGACCTTGAGTTTAAATTTGTTTCGTTTTGGATTCACTTCCACAAATTACCTCGTGTGTGTTTTTGCAGGAAATATGCAATTG
CCCTGGCCAATTCGATCGGAAGTTTTGAAGATGCTGAGATGGATGAAAATGAAAAACTAGAAGGGGAAACCCTTCGGGTAAAAGTCATAATAAACTGCACTGAGCCTTTA
AAGAGAGGAACACACGTAAAAACTGGATCTATGGCAGAAAAAACATGGATTCCAATATCTTACGAGAAATTACCGGACTTTTGTTATCACTGTGGGAAGCTTGGTCACGT
CAAACAAGAGTGTGAAGTTGAGGAAGCTGCCGAAACCAACGAGGAAGAGGGCGGGGAAGAGCCGGCAAATTCCAGAGAGGAGGGAGATATTTCCAGTCGAATGAGTTTGG
GGAGAGCCAAAGAGACCAGTGGGGTGCGCCGAGCAGTAGAAGCGTGGGAGAAACAAACAAGAAGATCGGCAGAGATGATCGACGATGGAGGAAGAACAGGAGAGGGGTCA
GGACAAAAAGAATGGGCTCAGAAGAACAAGGACAGTGATGTGGACTTGACAGAAAAGTCTGAAGTAGAAAAGGACAGGGACGCTAAAGGAAAAGGGGGCCCAGAAGAAAT
TGGTCTAAAAATTAGCCCAATGGACCAAGATCTAGGCATTCAAACGGTGGCAAAAAGGAAAGTAGTTATTAAAGAAAAAGTTATAAATCGGGGCATGGAGGAGAAAGGAA
GATCTGTGAAAAGGCAAACCACCTCTGAAGGTTTGCATAAAGTTAAGATTGATAAACCACGAAGTCCAGCAAAAACAAAGAATGAAAACAGTAAAAAATGGAAAAGGAGA
GTTAGGGAGGACAACAACAATAGTGAATCTGTGACATGTGGAGTTCTGGAAACTGGGGGCAAACGAAAGGCTGATGATGAGTTGGAAGGATGCAGTGACAGCAAGTTGGA
TAGCATCAAGAGAGTTACGAGTCTCGAGTGTGGCCTTTCGGTCCCTAGCAAGGGTTGTAGCGGGGGACTTATGTTACTCTGGGATAAAGAGATTGAAGTTGATTTAATCT
CTTTCTCGGAGGGGCATATTGATTCGATGATTAAGATGAATGAAGGTTGGTGGCGCTTCACCGGGTTTTATGGCAATCCGGTTACGAATAAGAGAGTGGATTCGTGGAAG
TTACTCGAGAGGTTAGGGAATTTGTCCTCCCTCTCTTGGATTGTGGGAGGGGATTTTAATGAGATCCTGTTAGCGGAAGAGAAGAGTGGAGGTCAGCCTAGGAAGCAGAA
TCAGATGGATGGTTTTCGGAGGGTCGTGGACAGCTGCAAGCTTATGGACCTAGGCTTCTTGGATAATCCTTTCACTTGGAGTAGGAAGGTTAAATTGGATGGGGTCATTG
CTGAGAGGATGATTATCAGTAGGTGCTGGTCTGATGAGCCGACTTCAAAGCCTGATGGGTTTATGAGAAAGATGGTTGGTTGTCTGGCCAAGTTGAAAGTGTGGAACAAG
AACAGGCTTAATGGATCTTTAAGAGCTGTTATTTCTAAGAAAGAAGAGGAGCTGAAAGTTCTTAACAGAGAGCAAGTAAAGGATGAGTTTAAAGGGAAAAGAATTGGGGC
TCTGTTGAATGAGGGTGGTGGATGGAATGTTGAGTTAATTAAAGAGGTGTTCAATCCGGATGAAGCTAGTGCCATTGTAAGCATTCTGATTCTATTATTTGGGATAAAAA
GCCTAAGGGTATCTTCTCGATTAGGAGTGCTTATCACCTTTGCTTCAAATCTTTGTAATAAGGAAGAAGCTTCAGGGGCGGATGATTCAAAAGTCAGGAAATTGTGGAAT
GCCATATGGAACATCAATTGCATCCCAAGAGCCAAGATCACTATTTGGAAAATCATCAGCAATTCCCTCCCTACCAAAGTGAATCTCGCTAAAAGAGGGATGAATACTAA
TATGTTCTGTTTGGGAAGAAAGGAAGCTATGGACATCTGGGATGGGATGATTCATGTTCTAAGCGCCTCAGAGTTAAATATGGTTGCCCTAATCCTATGGAATATCTGGA
ATATTAGGAATCAATGCCTTCACGACAGCAGTAAGGCAAATTCTCAAGCGGCGATATGGAAAATCAGTATATACTTGGAAGAGTACGAAGGAAGAGCAGATAAGAAGGCT
AACCTGGCCAAATTGGAGAACTCTTCGAGTCACCAAAGATGGAGACCTCCGGACCCCAACCACTGGAAGCTGAACTTAGACGCCTCTTGGTGCGATGAGGAAGGTGCGGG
TGGAATTGGGTGGTTCATGCGTGACTCTAACGGATCTCTAATTGGAGATGGGTGCAAGAAAATCCATCAGAAAGGACCGATCAAATGGTTCGAAGCCTTGGCGTTGCTGG
AAGGATTGAATTCGCTCGTGAGATTGGCAAATGAGAGATACGCTCCGCCTATTCAACCGCTGATGGTTGAAACAAATGCTTTGGATGTGATAAAGCTGCTAAATGATGAA
GATGAAGATCTAACCGAGATATCCATGGTGGTCGATAACATCAAAGAGATGGCCTCTCTTTTAGTTGTTTCGTTCGAGAATGGAGAATCTGGAAGCCCATCGCCTTGCGC
GCGCTGGAGTGCGTTTTGGTGA
mRNA sequence
ATGGACAAAGAGGGCAACGACTCGGTTTGCGAAATGGTTCTTGGGGGCGATTCCGACCAACAGGCGATGGACGAAGCTCTCATCGATAACCAGACAGGTGGAGTAAACGA
AGAAATAGCCATGGCAGATGAAGTCTTCAGTCAGATGGAGAAATTCCGGCTGGAGGAAGCTGGCAGAGTGGTGGAGGTGGAAGACGATGATATAGAAAACATAGATAGGG
ACTTTAAAGGAGCAATTGCCTGCAAGATTTTAACACCAAGACTAATAAATCCAGAGGTTTTTATGCACTTTATGCCCAAGATATGGGGCTTAGAGGGAGCTGTGAAGATG
GAGAAAGCGGGAACCAACATATACATTTGCAAATTCAGGAGAATGAAAGACAAAGAGAGAGTGTCCAAAGGCGGGCCATGGTCGTATGACGACGCAATTTTAGTTTTCGA
CGAACCAAAAGGGAGTTGTAGTGTTGAAGACCTTGAGTTTAAATTTGTTTCGTTTTGGATTCACTTCCACAAATTACCTCGTGTGTGTTTTTGCAGGAAATATGCAATTG
CCCTGGCCAATTCGATCGGAAGTTTTGAAGATGCTGAGATGGATGAAAATGAAAAACTAGAAGGGGAAACCCTTCGGGTAAAAGTCATAATAAACTGCACTGAGCCTTTA
AAGAGAGGAACACACGTAAAAACTGGATCTATGGCAGAAAAAACATGGATTCCAATATCTTACGAGAAATTACCGGACTTTTGTTATCACTGTGGGAAGCTTGGTCACGT
CAAACAAGAGTGTGAAGTTGAGGAAGCTGCCGAAACCAACGAGGAAGAGGGCGGGGAAGAGCCGGCAAATTCCAGAGAGGAGGGAGATATTTCCAGTCGAATGAGTTTGG
GGAGAGCCAAAGAGACCAGTGGGGTGCGCCGAGCAGTAGAAGCGTGGGAGAAACAAACAAGAAGATCGGCAGAGATGATCGACGATGGAGGAAGAACAGGAGAGGGGTCA
GGACAAAAAGAATGGGCTCAGAAGAACAAGGACAGTGATGTGGACTTGACAGAAAAGTCTGAAGTAGAAAAGGACAGGGACGCTAAAGGAAAAGGGGGCCCAGAAGAAAT
TGGTCTAAAAATTAGCCCAATGGACCAAGATCTAGGCATTCAAACGGTGGCAAAAAGGAAAGTAGTTATTAAAGAAAAAGTTATAAATCGGGGCATGGAGGAGAAAGGAA
GATCTGTGAAAAGGCAAACCACCTCTGAAGGTTTGCATAAAGTTAAGATTGATAAACCACGAAGTCCAGCAAAAACAAAGAATGAAAACAGTAAAAAATGGAAAAGGAGA
GTTAGGGAGGACAACAACAATAGTGAATCTGTGACATGTGGAGTTCTGGAAACTGGGGGCAAACGAAAGGCTGATGATGAGTTGGAAGGATGCAGTGACAGCAAGTTGGA
TAGCATCAAGAGAGTTACGAGTCTCGAGTGTGGCCTTTCGGTCCCTAGCAAGGGTTGTAGCGGGGGACTTATGTTACTCTGGGATAAAGAGATTGAAGTTGATTTAATCT
CTTTCTCGGAGGGGCATATTGATTCGATGATTAAGATGAATGAAGGTTGGTGGCGCTTCACCGGGTTTTATGGCAATCCGGTTACGAATAAGAGAGTGGATTCGTGGAAG
TTACTCGAGAGGTTAGGGAATTTGTCCTCCCTCTCTTGGATTGTGGGAGGGGATTTTAATGAGATCCTGTTAGCGGAAGAGAAGAGTGGAGGTCAGCCTAGGAAGCAGAA
TCAGATGGATGGTTTTCGGAGGGTCGTGGACAGCTGCAAGCTTATGGACCTAGGCTTCTTGGATAATCCTTTCACTTGGAGTAGGAAGGTTAAATTGGATGGGGTCATTG
CTGAGAGGATGATTATCAGTAGGTGCTGGTCTGATGAGCCGACTTCAAAGCCTGATGGGTTTATGAGAAAGATGGTTGGTTGTCTGGCCAAGTTGAAAGTGTGGAACAAG
AACAGGCTTAATGGATCTTTAAGAGCTGTTATTTCTAAGAAAGAAGAGGAGCTGAAAGTTCTTAACAGAGAGCAAGTAAAGGATGAGTTTAAAGGGAAAAGAATTGGGGC
TCTGTTGAATGAGGGTGGTGGATGGAATGTTGAGTTAATTAAAGAGGTGTTCAATCCGGATGAAGCTAGTGCCATTGTAAGCATTCTGATTCTATTATTTGGGATAAAAA
GCCTAAGGGTATCTTCTCGATTAGGAGTGCTTATCACCTTTGCTTCAAATCTTTGTAATAAGGAAGAAGCTTCAGGGGCGGATGATTCAAAAGTCAGGAAATTGTGGAAT
GCCATATGGAACATCAATTGCATCCCAAGAGCCAAGATCACTATTTGGAAAATCATCAGCAATTCCCTCCCTACCAAAGTGAATCTCGCTAAAAGAGGGATGAATACTAA
TATGTTCTGTTTGGGAAGAAAGGAAGCTATGGACATCTGGGATGGGATGATTCATGTTCTAAGCGCCTCAGAGTTAAATATGGTTGCCCTAATCCTATGGAATATCTGGA
ATATTAGGAATCAATGCCTTCACGACAGCAGTAAGGCAAATTCTCAAGCGGCGATATGGAAAATCAGTATATACTTGGAAGAGTACGAAGGAAGAGCAGATAAGAAGGCT
AACCTGGCCAAATTGGAGAACTCTTCGAGTCACCAAAGATGGAGACCTCCGGACCCCAACCACTGGAAGCTGAACTTAGACGCCTCTTGGTGCGATGAGGAAGGTGCGGG
TGGAATTGGGTGGTTCATGCGTGACTCTAACGGATCTCTAATTGGAGATGGGTGCAAGAAAATCCATCAGAAAGGACCGATCAAATGGTTCGAAGCCTTGGCGTTGCTGG
AAGGATTGAATTCGCTCGTGAGATTGGCAAATGAGAGATACGCTCCGCCTATTCAACCGCTGATGGTTGAAACAAATGCTTTGGATGTGATAAAGCTGCTAAATGATGAA
GATGAAGATCTAACCGAGATATCCATGGTGGTCGATAACATCAAAGAGATGGCCTCTCTTTTAGTTGTTTCGTTCGAGAATGGAGAATCTGGAAGCCCATCGCCTTGCGC
GCGCTGGAGTGCGTTTTGGTGA
Protein sequence
MDKEGNDSVCEMVLGGDSDQQAMDEALIDNQTGGVNEEIAMADEVFSQMEKFRLEEAGRVVEVEDDDIENIDRDFKGAIACKILTPRLINPEVFMHFMPKIWGLEGAVKM
EKAGTNIYICKFRRMKDKERVSKGGPWSYDDAILVFDEPKGSCSVEDLEFKFVSFWIHFHKLPRVCFCRKYAIALANSIGSFEDAEMDENEKLEGETLRVKVIINCTEPL
KRGTHVKTGSMAEKTWIPISYEKLPDFCYHCGKLGHVKQECEVEEAAETNEEEGGEEPANSREEGDISSRMSLGRAKETSGVRRAVEAWEKQTRRSAEMIDDGGRTGEGS
GQKEWAQKNKDSDVDLTEKSEVEKDRDAKGKGGPEEIGLKISPMDQDLGIQTVAKRKVVIKEKVINRGMEEKGRSVKRQTTSEGLHKVKIDKPRSPAKTKNENSKKWKRR
VREDNNNSESVTCGVLETGGKRKADDELEGCSDSKLDSIKRVTSLECGLSVPSKGCSGGLMLLWDKEIEVDLISFSEGHIDSMIKMNEGWWRFTGFYGNPVTNKRVDSWK
LLERLGNLSSLSWIVGGDFNEILLAEEKSGGQPRKQNQMDGFRRVVDSCKLMDLGFLDNPFTWSRKVKLDGVIAERMIISRCWSDEPTSKPDGFMRKMVGCLAKLKVWNK
NRLNGSLRAVISKKEEELKVLNREQVKDEFKGKRIGALLNEGGGWNVELIKEVFNPDEASAIVSILILLFGIKSLRVSSRLGVLITFASNLCNKEEASGADDSKVRKLWN
AIWNINCIPRAKITIWKIISNSLPTKVNLAKRGMNTNMFCLGRKEAMDIWDGMIHVLSASELNMVALILWNIWNIRNQCLHDSSKANSQAAIWKISIYLEEYEGRADKKA
NLAKLENSSSHQRWRPPDPNHWKLNLDASWCDEEGAGGIGWFMRDSNGSLIGDGCKKIHQKGPIKWFEALALLEGLNSLVRLANERYAPPIQPLMVETNALDVIKLLNDE
DEDLTEISMVVDNIKEMASLLVVSFENGESGSPSPCARWSAFW
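
The CDS and protein sequences listed for Spg031202 can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the `translate` function and the `CODON_TABLE` name are hypothetical helpers written for illustration and are not part of CuGenDB.

```python
# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[pos:pos + 3], "X") for pos in range(0, len(seq), 3)
    )


# First 24 bases of the CDS shown above.
print(translate("ATGGACAAAGAGGGCAACGACTCG"))
```

Applied to the full CDS with its line breaks removed, the result (minus the trailing `*` for the stop codon) can be compared character by character with the protein sequence above.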