CuGenDBv2

Spg030997 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030997
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold10:23279473..23284451
RNA-Seq Expression: Spg030997
Synteny: Spg030997
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type
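
The genome location field uses a `seqid:start..end` layout. As a minimal sketch, the string can be split into its parts as below. The names `Locus` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the span length assumes 1-based inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Pattern for strings such as "scaffold10:23279473..23284451".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Locus:
    """Split a 'seqid:start..end' string into a Locus."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(match["seqid"], start, end)


locus = parse_location("scaffold10:23279473..23284451")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the reported span covers the whole gene locus, which can be longer than the CDS listed further down the page.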


Homology
GenBank top hits: hit, e-value, % identity, alignment
KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]; e-value 1.7e-64; identity 28.31%
Query:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDI---TIKDPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNS
        E+ G SGGL L+W+  +++ + S+SK HID+   T+ + +  WR TG YG+P+ +K+ ++W L+  L+   ++PW+  GDFNE+  + EK G + K+   
Subjt:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDI---TIKDPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNS

Query:  LNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRLLRLEEGWLAHE
        +  F +++  C LI +GF G+ +TW   +  +   +ERLDR    S    L     V+HLS   SDH P++L   +   + ++ R +R  R E  W+   
Subjt:  LNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRLLRLEEGWLAHE

Query:  GSKEAFKDAWGSSAVITNVNFNRKIQEGL---KAMHIWNRNRLKGTIKGAIQRTEEALILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSA
           E    AW     ++     R  +  L   K    WN                  L++  F P  ++ I +IP  ++   D+  W    KG FSV+SA
Subjt:  GSKEAFKDAWGSSAVITNVNFNRKIQEGL---KAMHIWNRNRLKGTIKGAIQRTEEALILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSA

Query:  YHLAIESRDAQEASQSDKSKQS-------SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEA
        YHL    RD + A+ S  S  S         W+ +W   I P+ K+  WK+  + +P +AN+  + + +  +C +C +  E+  H++  C   ++ W   
Subjt:  YHLAIESRDAQEASQSDKSKQS-------SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEA

Query:  IPKTKSLFNCGRENWNPQDYWSWMRDNL---SKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDWEDTNLEEHHPVSRPRSQASQDSWE
             S      +  +     SW+ + +    +E L    +  WS+W  RN+   S V  +  +  +   K + D+ + N       + P S ++  SW 
Subjt:  IPKTKSLFNCGRENWNPQDYWSWMRDNL---SKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDWEDTNLEEHHPVSRPRSQASQDSWE

Query:  APAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIPLEIESDALEIIRALKGESEDLSESKV
        AP  + +K+N D A +      G+G VV D NG LI + SKRI    S   +EA    EG+K       ++ I   +ESD++  IRAL  + E+ S   +
Subjt:  APAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIPLEIESDALEIIRALKGESEDLSESKV

Query:  ILDE
        +LD+
Subjt:  ILDE

KMS97072.1 hypothetical protein BVRB_7g179330 [Beta vulgaris subsp. vulgaris]; e-value 1.8e-50; identity 23.41%
Query:  KRRAREEQKNWKTEKENENEG-----HSGGLILMWQNHINVVVNSFSKGHIDITIKDPDW-WWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGD
        ++  R+E +    + +   +G      +GGL L+W   +++ + +     ID+T+++     WRFTG YG  + S++  +W ++  L    NL W++ GD
Subjt:  KRRAREEQKNWKTEKENENEG-----HSGGLILMWQNHINVVVNSFSKGHIDITIKDPDW-WWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGD

Query:  FNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSS
        FNEV++  EK    P    S+  F   +    L D+G  G  +TW+  + +    +ERLDR   +   + L  +  V +L +  SDH PI++ +    ++
Subjt:  FNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSS

Query:  QQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNVNFNRKIQEGLKAMHIWNRNRLKGT------IKGAIQRTEEALILQ-------SFPPQVSKD
              ++  R E  WL         K+ W  + ++    ++  ++   + +  W+    K T      +K  ++R  +A +         S+  +V  +
Subjt:  QQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNVNFNRKIQEGLKAMHIWNRNRLKGT------IKGAIQRTEEALILQ-------SFPPQVSKD

Query:  IL---------NIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVL
        +          NI   ++  +DEI W     G+F V+ AY LAIE+ +   +S          W  IW   I P+     W+   D +P   N+  K  +
Subjt:  IL---------NIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVL

Query:  LNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEK
            C  C    E+T H   +C+ +++ W     K  +++ C     N +++ +W+ +   KE+ E  ++ +W +W  RN+      + SA    +    
Subjt:  LNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEK

Query:  NVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQ
         ++     N  +   V R     ++D W  P+    K+N DAA N      G+G V  D NG ++ + S+ +  +W  +  EA  +L      E    + 
Subjt:  NVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQ

Query:  RIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVAR
           + +ESDA  II A+    +   + + +L+++++LV    ++ F  C R  N +AH +A+
Subjt:  RIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVAR

MBA0733287.1 hypothetical protein [Gossypium gossypioides]; e-value 2.1e-51; identity 26.75%
Query:  KAIEKKMGRSWKRRAREEQKNWKTEKEN------ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIKDPDWW--WRFTGFYGNPDQSKRKDSWRLLERL
        K I+  +G S + ++   +K W  +K        E+E   G         I++ + SFSK HID+ I D +    WRFTGFYG+P    R  SW  L+RL
Subjt:  KAIEKKMGRSWKRRAREEQKNWKTEKEN------ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIKDPDWW--WRFTGFYGNPDQSKRKDSWRLLERL

Query:  NNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDH
         + V +PW++  DFNE+M+  EKKGGIP+    +  F  +L  C L DVGF G  +TW +    +   +ERLDR  +N   + +  ++KV+HLS   SDH
Subjt:  NNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDH

Query:  RPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSA-------VITNVNFNRKIQEGLKAMHIWN---RNRLK---GTIKGAIQRTEEA
         P+++  T     + V       + E  WL  +   E  K  W SS         I  +       EG+  + I N   R  +K     +  + ++    
Subjt:  RPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSA-------VITNVNFNRKIQEGLKAMHIWN---RNRLK---GTIKGAIQRTEEA

Query:  LILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLN
        LI  +FP  + + IL IP  +    D   W  +  G FSV+S Y L +++ +   +    +++   F+  +W+ ++  +     W+I  D IPC  N+  
Subjt:  LILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLN

Query:  KGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHK
        + V+ N  CP C   VE + H+  +C    + W   +     + N     W   ++ +W+    + ++      A+W +W  RN+  H + I    +L  
Subjt:  KGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHK

Query:  SFEKNVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYT
        + ++ V + +  N  +   ++  RS   Q+        + +++ DAA++   +    G V  D+ G L+   +   +   S  + EA   LEGVK+    
Subjt:  SFEKNVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYT

Query:  CFRQRIPLEIES-----DALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA
             I L I S     D+  +I+  +  S D S    I+ +I    +    + F++  RS N  AH +A+ A
Subjt:  CFRQRIPLEIES-----DALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA

RYQ92149.1 hypothetical protein Ahy_B09g098304 [Arachis hypogaea]; e-value 2.3e-50; identity 25.23%
Query:  RAREEQKNWKTEKEN----ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNE
        R R  + N K + +N    E  G SGGL L+W++++NV V  +   +I   I  + D  W+    YGNP   KR+  W+ L   N     P +  GDFN+
Subjt:  RAREEQKNWKTEKEN----ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNE

Query:  VMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQV
        ++   EK G +P+    L  F   +    L+DV   G +YTW  N +N   T+ERLDR  +N   L + +++ ++    + SDH  +IL +      Q  
Subjt:  VMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQV

Query:  VRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQR-------TEEALILQSFPPQVSKDILNIPTG
         R +R  + E  W  HE  KE  K  W       N    F RK    ++ +  W++ + K   K   ++        E  +I + FP  +++ I   P  
Subjt:  VRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQR-------TEEALILQSFPPQVSKDILNIPTG

Query:  KKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFV
            KD + W    +G ++VKS Y  A E +DA+E  + +++  S      W +IW   +  + ++  WK +   +P  +N+  +   + P C +C+   
Subjt:  KKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFV

Query:  ESTPHIIWECKIIKKFW-EEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSK------EELERSLIAM----WSVWNFRNKTEHSQVIQSADSLHKSFEKN
        E+  H +  C   +  W   +I  T + +N          +  W+ D + K      ++ ER L  +    W +W  RN+    Q   +      + E+ 
Subjt:  ESTPHIIWECKIIKKFW-EEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSK------EELERSLIAM----WSVWNFRNKTEHSQVIQSADSLHKSFEKN

Query:  VKDWEDT---NLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCF
          ++ +T   +  +H+          + +W  P Q+  K+N+DAA+ +       G V+ +  G  I +G+       S  + EA+   E + +I+    
Subjt:  VKDWEDT---NLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCF

Query:  RQRIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA
           I   IE+D L++++ +K  +  + E+  IL +I+ L+    +V      R  N VAH +A  A
Subjt:  RQRIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA

RYR18269.1 hypothetical protein Ahy_B03g062876 [Arachis hypogaea]; e-value 3.0e-50; identity 25.49%
Query:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLN
        E  G SGGL L+W+++ N+ V      +I   I  + D  W+    YGNP   KR+  W  L   N    +P    GDFN+++  +EK G  P+    L 
Subjt:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLN

Query:  DFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGS
         F   +   DLID+   G++YTW  N +N   T++RLDR  +N K L + +++ +     + SDH  +IL       +QQ VR ++  + E  W+ HE  
Subjt:  DFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGS

Query:  KEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQRTEEALILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHL
        KE  + +W       N    F +K    ++ +  W+  + K   K   ++  E   +Q    + ++ I   P      KD   W   K G ++V++ YH+
Subjt:  KEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQRTEEALILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHL

Query:  AIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKS
        A E +D++E  +  K+  S      W +IW   +  + ++  WK ++  +P   N+  + + + P C +C+K  E+  H +  C   +  W E      S
Subjt:  AIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKS

Query:  LFNCGRENWNPQDYWSWMRDNL------SKEELERSLIAM----WSVWNFRN-----KTE--HSQVIQSADSLHKSFEKNVKDWEDTNLEEHHPVSRPRS
                +N + +  W+ D +      S  E E++L  +    W +W  RN     +TE    +VI  ++ L   F K  ++    N+    P +    
Subjt:  LFNCGRENWNPQDYWSWMRDNL------SKEELERSLIAM----WSVWNFRN-----KTE--HSQVIQSADSLHKSFEKNVKDWEDTNLEEHHPVSRPRS

Query:  QASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIP-LEIESDALEIIRALKGE
           + +W  P +N  K+N+DAA+ +      +   V D  G +I +G+    +  S    EA+   E + +I+      +IP   IE+D+L +++A+K  
Subjt:  QASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIP-LEIESDALEIIRALKGE

Query:  SEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNAC--NLHFRFGQEAPLSREN
        +  ++++  I+ +I+ L+     V      R  N +AH +A  A   NL  ++    P    N
Subjt:  SEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNAC--NLHFRFGQEAPLSREN

TrEMBL top hits: hit, e-value, % identity, alignment
A0A0J8BAU9 Uncharacterized protein; e-value 8.6e-51; identity 23.41%
Query:  KRRAREEQKNWKTEKENENEG-----HSGGLILMWQNHINVVVNSFSKGHIDITIKDPDW-WWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGD
        ++  R+E +    + +   +G      +GGL L+W   +++ + +     ID+T+++     WRFTG YG  + S++  +W ++  L    NL W++ GD
Subjt:  KRRAREEQKNWKTEKENENEG-----HSGGLILMWQNHINVVVNSFSKGHIDITIKDPDW-WWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGD

Query:  FNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSS
        FNEV++  EK    P    S+  F   +    L D+G  G  +TW+  + +    +ERLDR   +   + L  +  V +L +  SDH PI++ +    ++
Subjt:  FNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSS

Query:  QQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNVNFNRKIQEGLKAMHIWNRNRLKGT------IKGAIQRTEEALILQ-------SFPPQVSKD
              ++  R E  WL         K+ W  + ++    ++  ++   + +  W+    K T      +K  ++R  +A +         S+  +V  +
Subjt:  QQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNVNFNRKIQEGLKAMHIWNRNRLKGT------IKGAIQRTEEALILQ-------SFPPQVSKD

Query:  IL---------NIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVL
        +          NI   ++  +DEI W     G+F V+ AY LAIE+ +   +S          W  IW   I P+     W+   D +P   N+  K  +
Subjt:  IL---------NIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVL

Query:  LNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEK
            C  C    E+T H   +C+ +++ W     K  +++ C     N +++ +W+ +   KE+ E  ++ +W +W  RN+      + SA    +    
Subjt:  LNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEK

Query:  NVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQ
         ++     N  +   V R     ++D W  P+    K+N DAA N      G+G V  D NG ++ + S+ +  +W  +  EA  +L      E    + 
Subjt:  NVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQ

Query:  RIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVAR
           + +ESDA  II A+    +   + + +L+++++LV    ++ F  C R  N +AH +A+
Subjt:  RIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVAR

A0A2Z6N4T0 Uncharacterized protein; e-value 4.3e-50; identity 25.83%
Query:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIKD-PDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLN
        + +G  GG+ +MW+  +N  + ++S  HIDI + D     WR TGFYG P+ S+R+DSW  L +L+N   LPW I GDFN+++ S EK+G   +    +N
Subjt:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIKD-PDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLN

Query:  DFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRL--LRLEEGWLAHE
         F +++S   L+D+ + G  +TW K+   + A +E+LDR   N     + +   VE L+   SDH P++L+        + ++ R L   + E  W A  
Subjt:  DFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRL--LRLEEGWLAHE

Query:  GSKEAFKDAWGSSAVITNVNFNRKIQE---------GLK-------AMHIWNRNRLK-GT--------------------IKGAIQRTEEALILQSFPPQ
              K  W +     N    RK+ +         G K        + +W++N L  GT                    +    +  +  LI       
Subjt:  GSKEAFKDAWGSSAVITNVNFNRKIQE---------GLK-------AMHIWNRNRLK-GT--------------------IKGAIQRTEEALILQSFPPQ

Query:  VSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLC
        ++  IL+ P  +    D+I W  +K G+++VKSAY   I +   +     D+ +    W+ IW T++ P+ K   W+I  + +P +A + ++GV     C
Subjt:  VSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLC

Query:  PLCRKFVESTPHIIWECKIIKKFWEE-----AIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIA--MWSVWNFRN-------KTEHSQVIQSA
         LC    E + H+ + C+     W++     +I + ++L    +EN             L   E  R++ A  MWS+W  RN       +   + V + A
Subjt:  PLCRKFVESTPHIIWECKIIKKFWEE-----AIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIA--MWSVWNFRN-------KTEHSQVIQSA

Query:  DSLHKSFEKNVKDWEDT-NLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGV
        +SL   + +N ++  D  N ++H P         +  W  P   +WK N DA+++ + N  GIG  + D  G  + + ++       + + EA  +L  +
Subjt:  DSLHKSFEKNVKDWEDT-NLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGV

Query:  KMIE
        K ++
Subjt:  KMIE

A0A444XQY7 RNase H domain-containing protein; e-value 1.1e-50; identity 25.23%
Query:  RAREEQKNWKTEKEN----ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNE
        R R  + N K + +N    E  G SGGL L+W++++NV V  +   +I   I  + D  W+    YGNP   KR+  W+ L   N     P +  GDFN+
Subjt:  RAREEQKNWKTEKEN----ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNE

Query:  VMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQV
        ++   EK G +P+    L  F   +    L+DV   G +YTW  N +N   T+ERLDR  +N   L + +++ ++    + SDH  +IL +      Q  
Subjt:  VMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQV

Query:  VRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQR-------TEEALILQSFPPQVSKDILNIPTG
         R +R  + E  W  HE  KE  K  W       N    F RK    ++ +  W++ + K   K   ++        E  +I + FP  +++ I   P  
Subjt:  VRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQR-------TEEALILQSFPPQVSKDILNIPTG

Query:  KKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFV
            KD + W    +G ++VKS Y  A E +DA+E  + +++  S      W +IW   +  + ++  WK +   +P  +N+  +   + P C +C+   
Subjt:  KKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFV

Query:  ESTPHIIWECKIIKKFW-EEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSK------EELERSLIAM----WSVWNFRNKTEHSQVIQSADSLHKSFEKN
        E+  H +  C   +  W   +I  T + +N          +  W+ D + K      ++ ER L  +    W +W  RN+    Q   +      + E+ 
Subjt:  ESTPHIIWECKIIKKFW-EEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSK------EELERSLIAM----WSVWNFRNKTEHSQVIQSADSLHKSFEKN

Query:  VKDWEDT---NLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCF
          ++ +T   +  +H+          + +W  P Q+  K+N+DAA+ +       G V+ +  G  I +G+       S  + EA+   E + +I+    
Subjt:  VKDWEDT---NLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCF

Query:  RQRIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA
           I   IE+D L++++ +K  +  + E+  IL +I+ L+    +V      R  N VAH +A  A
Subjt:  RQRIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA

A0A444ZVS3 Uncharacterized protein; e-value 1.5e-50; identity 25.49%
Query:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLN
        E  G SGGL L+W+++ N+ V      +I   I  + D  W+    YGNP   KR+  W  L   N    +P    GDFN+++  +EK G  P+    L 
Subjt:  ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIK-DPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLN

Query:  DFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGS
         F   +   DLID+   G++YTW  N +N   T++RLDR  +N K L + +++ +     + SDH  +IL       +QQ VR ++  + E  W+ HE  
Subjt:  DFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGS

Query:  KEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQRTEEALILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHL
        KE  + +W       N    F +K    ++ +  W+  + K   K   ++  E   +Q    + ++ I   P      KD   W   K G ++V++ YH+
Subjt:  KEAFKDAWGSSAVITNV--NFNRKIQEGLKAMHIWNRNRLKGTIKGAIQRTEEALILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHL

Query:  AIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKS
        A E +D++E  +  K+  S      W +IW   +  + ++  WK ++  +P   N+  + + + P C +C+K  E+  H +  C   +  W E      S
Subjt:  AIESRDAQEASQSDKSKQS----SFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKS

Query:  LFNCGRENWNPQDYWSWMRDNL------SKEELERSLIAM----WSVWNFRN-----KTE--HSQVIQSADSLHKSFEKNVKDWEDTNLEEHHPVSRPRS
                +N + +  W+ D +      S  E E++L  +    W +W  RN     +TE    +VI  ++ L   F K  ++    N+    P +    
Subjt:  LFNCGRENWNPQDYWSWMRDNL------SKEELERSLIAM----WSVWNFRN-----KTE--HSQVIQSADSLHKSFEKNVKDWEDTNLEEHHPVSRPRS

Query:  QASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIP-LEIESDALEIIRALKGE
           + +W  P +N  K+N+DAA+ +      +   V D  G +I +G+    +  S    EA+   E + +I+      +IP   IE+D+L +++A+K  
Subjt:  QASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIP-LEIESDALEIIRALKGE

Query:  SEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNAC--NLHFRFGQEAPLSREN
        +  ++++  I+ +I+ L+     V      R  N +AH +A  A   NL  ++    P    N
Subjt:  SEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNAC--NLHFRFGQEAPLSREN

A0A7J9BAA2 Uncharacterized protein (Fragment); e-value 1.0e-51; identity 26.75%
Query:  KAIEKKMGRSWKRRAREEQKNWKTEKEN------ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIKDPDWW--WRFTGFYGNPDQSKRKDSWRLLERL
        K I+  +G S + ++   +K W  +K        E+E   G         I++ + SFSK HID+ I D +    WRFTGFYG+P    R  SW  L+RL
Subjt:  KAIEKKMGRSWKRRAREEQKNWKTEKEN------ENEGHSGGLILMWQNHINVVVNSFSKGHIDITIKDPDWW--WRFTGFYGNPDQSKRKDSWRLLERL

Query:  NNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDH
         + V +PW++  DFNE+M+  EKKGGIP+    +  F  +L  C L DVGF G  +TW +    +   +ERLDR  +N   + +  ++KV+HLS   SDH
Subjt:  NNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDH

Query:  RPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSA-------VITNVNFNRKIQEGLKAMHIWN---RNRLK---GTIKGAIQRTEEA
         P+++  T     + V       + E  WL  +   E  K  W SS         I  +       EG+  + I N   R  +K     +  + ++    
Subjt:  RPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSA-------VITNVNFNRKIQEGLKAMHIWN---RNRLK---GTIKGAIQRTEEA

Query:  LILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLN
        LI  +FP  + + IL IP  +    D   W  +  G FSV+S Y L +++ +   +    +++   F+  +W+ ++  +     W+I  D IPC  N+  
Subjt:  LILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRAKVCSWKIINDAIPCKANVLN

Query:  KGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHK
        + V+ N  CP C   VE + H+  +C    + W   +     + N     W   ++ +W+    + ++      A+W +W  RN+  H + I    +L  
Subjt:  KGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQVIQSADSLHK

Query:  SFEKNVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYT
        + ++ V + +  N  +   ++  RS   Q+        + +++ DAA++   +    G V  D+ G L+   +   +   S  + EA   LEGVK+    
Subjt:  SFEKNVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYT

Query:  CFRQRIPLEIES-----DALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA
             I L I S     D+  +I+  +  S D S    I+ +I    +    + F++  RS N  AH +A+ A
Subjt:  CFRQRIPLEIES-----DALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA

SwissProt top hits: hit, e-value, % identity, alignment
No hits found

Arabidopsis top hits: hit, e-value, % identity, alignment
AT1G40390.1 DNAse I-like superfamily protein; e-value 1.4e-05; identity 28.87%
Query:  QSKRKDSWRLLERL---NNMVNLPWIIGGDFNEVMFSHEKKGGIPK--SLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMN
        +++R+  W  + RL   + + N PW++ GDFN++    E    +P   SL  L D    +   DL+D+   G  YTW+ ++++    + +LDR  +N
Subjt:  QSKRKDSWRLLERL---NNMVNLPWIIGGDFNEVMFSHEKKGGIPK--SLNSLNDFCDSLSSCDLIDVGFIGDRYTWTKNKKNKEATKERLDRFFMN

AT3G25270.1 Ribonuclease H-like superfamily protein; e-value 5.9e-12; identity 22.49%
Query:  IWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQD-YWSWMRDNLSKEEL
        IW  K  P+ K   WK+++ A+    N+  + +  +P C  C +  E++ H+ ++C   ++ W  +    + L   G       +   S    N   +  
Subjt:  IWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQD-YWSWMRDNLSKEEL

Query:  ERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDWEDTNL------EEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHD
          ++  +W +W  RN+    Q   S  +  +    +V++WEDTN       ++ H     +   ++  W+ P     K N D A+N        GW++ D
Subjt:  ERSLIAMWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDWEDTNL------EEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHD

Query:  SNGSLICSG
         NG  + SG
Subjt:  SNGSLICSG

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e-value 2.0e-07; identity 36.07%
Query:  IWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKK
        IW+ KI P+ K+  WK +N+A+P  A +L++ + + P C  CR F E+  HI++ C   ++
Subjt:  IWNTKILPRAKVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKK

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1); e-value 9.8e-07; identity 24.41%
Query:  MWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDWEDTNLEE---HHPVSRPRSQASQDS--WEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLIC
        MW +W  RN+    Q+ +    + +  E+   +W +T + +    H   +P  + S  S  W  P +   K N D+ + +  +     W++ DSNG +I 
Subjt:  MWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDWEDTNLEE---HHPVSRPRSQASQDS--WEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLIC

Query:  SGSKRIKRNWSIKSLEAKEILEGVKMI
        SG  ++++++S    EA   L  ++M+
Subjt:  SGSKRIKRNWSIKSLEAKEILEGVKMI

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein; e-value 3.2e-05; identity 20.1%
Query:  MWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDW-EDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSK
        MW +W   N    +       +  +    + K+W ++T   E    +R    +    W  P ++  K N DA+ +E     G+GW++ +S G++I  G  
Subjt:  MWSVWNFRNKTEHSQVIQSADSLHKSFEKNVKDW-EDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSK

Query:  RIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA
        + +   + +  E   ++  ++   Y    +++  E ++  +  +   K  +  L      LD I S +    S++F    R  N  A  +A+ A
Subjt:  RIKRNWSIKSLEAKEILEGVKMIEYTCFRQRIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNA
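
In the alignment blocks above, the unlabeled row under each Query row is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below tallies those marks for a single block. `midline_stats` is a hypothetical helper name, and it assumes the match line has already been cut so that its first character sits under the first residue of the Query row. Trailing spaces are often lost when a page is copied, so the length is taken from the Query row instead.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    query   -- the aligned query row (gap characters '-' included)
    midline -- the match line, starting under the first query residue
    """
    length = len(query)
    # Restore trailing spaces that may have been stripped from the match line.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# First 13 columns of the top GenBank hit (KAF8408042.1).
stats = midline_stats("ENEGHSGGLILMW", "E+ G SGGL L+W")
print(stats)
```

Summing the counts over every block of a hit gives a figure over the aligned region only, which need not equal the reported % identity if the database computes that value over a different length.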


Sequences
CDS sequence
ATGCACAAAGGGAAACCGAAGATCAATATGGAAAACAAGAAACCAGAATTGCAAAACAGAGGTAAGGAAAGCAAGGCTACCGGAAAGTTGACAGAAAATAGAGAAGAAAA
TATAAGTTTCAGGCCGGAAAGCTGGCCGAAAAAGTCACCGGGAAAAAAGGAAGAAGACGGTACGGCTAGAAAGGACAGCAGGACAGACACAACGGATAGTTCTGGGTATT
CTGGAAAGGCTGACACGAACATGACAGATCAAAAGCAAATCAGCTCAAATCCCAAAGAAAAAGACAAAGGGAAAGGAAAGGCTCATGAGGCAGAATTATGCACGTTACCT
TCCAACATTCCCCAAAAGATGTTATCAGAGGAAGAAACCAATGGGGAAAAGGGTCAAAAACCAAGGCCCAAGTTTGATCAGCTGGATAAAAACTTGGGCCAAAGTCATCA
AGTTATTAGTAAGTCCAGCAAACACCCAGATATGGAAAAAGGCTTGATGATCACAGAACCTAAACTAAAGGAAAAAGAACCTGAAGACCAAATGCAAATTAAGCAGCTAG
TGAGGAAGGATGAAGTTAATGGTAAAGCAATAGAGAAGAAGATGGGAAGATCTTGGAAAAGAAGGGCTCGGGAAGAACAGAAAAACTGGAAAACAGAAAAGGAGAACGAG
AATGAAGGCCATAGTGGAGGGTTAATCCTCATGTGGCAAAACCATATAAATGTTGTTGTAAATTCGTTTTCGAAGGGGCATATCGATATCACCATCAAGGATCCTGATTG
GTGGTGGAGGTTTACCGGGTTTTACGGGAACCCGGATCAAAGCAAAAGGAAAGACTCCTGGCGTCTTCTCGAGAGGCTTAATAACATGGTCAACCTCCCATGGATAATAG
GAGGGGACTTCAACGAAGTGATGTTCAGTCACGAAAAAAAAGGGGGAATTCCGAAATCTCTAAACTCTCTTAATGATTTTTGTGACTCTTTAAGCTCTTGTGATCTGATT
GATGTTGGCTTTATCGGTGACAGGTACACATGGACAAAAAACAAAAAAAACAAGGAGGCTACAAAGGAAAGGCTTGATAGGTTTTTTATGAACTCCAAAATGCTGCCATT
AGTCAGGGACATTAAAGTGGAGCACTTGAGCTTCCTCCATTCAGACCATCGGCCAATCATTTTGAAGATGACCTGGATTAATTCTTCCCAGCAGGTGGTTAGATCTAGGA
GGCTTTTGAGATTGGAGGAGGGTTGGTTAGCCCATGAAGGTAGCAAAGAAGCTTTTAAAGATGCGTGGGGCTCAAGTGCAGTGATAACGAACGTCAACTTCAATAGGAAG
ATTCAGGAGGGTTTGAAGGCAATGCATATTTGGAACAGAAACAGGCTCAAGGGTACCATTAAGGGGGCTATTCAAAGAACAGAGGAAGCATTAATCCTCCAATCTTTTCC
CCCCCAAGTTTCGAAAGACATACTTAACATTCCTACGGGTAAGAAAGAGACCAAGGACGAGATCTTCTGGGGACCGGACAAAAAAGGGATCTTTTCTGTCAAGAGTGCCT
ACCACCTAGCAATTGAATCTAGAGATGCCCAAGAAGCCTCCCAATCGGATAAAAGCAAGCAGTCCTCCTTCTGGAACAGCATCTGGAACACCAAAATCCTCCCTCGGGCC
AAAGTTTGCTCATGGAAGATCATTAATGACGCTATTCCGTGCAAGGCCAACGTTCTTAACAAGGGAGTTCTTCTCAACCCTCTCTGTCCTTTATGCCGAAAGTTTGTTGA
ATCCACCCCTCATATTATATGGGAGTGCAAAATTATCAAAAAGTTTTGGGAAGAGGCTATTCCGAAAACTAAATCTTTGTTCAATTGTGGCAGGGAAAATTGGAATCCTC
AAGATTATTGGAGCTGGATGAGAGATAACCTTTCAAAAGAAGAGTTGGAAAGAAGCCTTATTGCCATGTGGAGCGTGTGGAATTTTAGAAACAAAACAGAACATTCTCAA
GTCATTCAATCAGCAGATTCTCTCCATAAAAGCTTCGAAAAGAATGTCAAGGATTGGGAAGATACAAACCTGGAAGAGCATCATCCAGTGAGCAGGCCTAGGAGCCAGGC
GAGTCAAGATTCGTGGGAAGCCCCGGCGCAAAATTCGTGGAAATTGAACTCCGATGCAGCGTGGAATGAAGCTGCGAACTGTGGAGGAATTGGGTGGGTCGTTCATGACT
CGAACGGATCCTTGATCTGCAGTGGATCGAAACGCATCAAGAGAAATTGGTCAATAAAATCCTTGGAAGCAAAAGAAATTTTAGAGGGGGTTAAGATGATCGAGTATACA
TGCTTTCGTCAAAGAATCCCCCTCGAAATTGAATCGGATGCGCTCGAAATCATCAGAGCGCTGAAGGGAGAATCTGAAGATCTCTCTGAGTCGAAGGTCATCTTAGATGA
AATCATCAGTCTCGTTTCGCGGCTGGTCTCGGTGGATTTCCGCCATTGTCGAAGGTCTTCGAACACAGTAGCCCACTGTGTAGCGAGAAACGCTTGTAATCTTCATTTTC
GTTTTGGCCAAGAGGCTCCCCTCTCGCGGGAAAATGGGTTTCATTTTGTAGCCCCTGGCATCGTGGTGAACCCGACGTTTTGTAATCCCTCTGGTTCTCTGAGGGTTGGT
GGTTAG
mRNA sequence
ATGCACAAAGGGAAACCGAAGATCAATATGGAAAACAAGAAACCAGAATTGCAAAACAGAGGTAAGGAAAGCAAGGCTACCGGAAAGTTGACAGAAAATAGAGAAGAAAA
TATAAGTTTCAGGCCGGAAAGCTGGCCGAAAAAGTCACCGGGAAAAAAGGAAGAAGACGGTACGGCTAGAAAGGACAGCAGGACAGACACAACGGATAGTTCTGGGTATT
CTGGAAAGGCTGACACGAACATGACAGATCAAAAGCAAATCAGCTCAAATCCCAAAGAAAAAGACAAAGGGAAAGGAAAGGCTCATGAGGCAGAATTATGCACGTTACCT
TCCAACATTCCCCAAAAGATGTTATCAGAGGAAGAAACCAATGGGGAAAAGGGTCAAAAACCAAGGCCCAAGTTTGATCAGCTGGATAAAAACTTGGGCCAAAGTCATCA
AGTTATTAGTAAGTCCAGCAAACACCCAGATATGGAAAAAGGCTTGATGATCACAGAACCTAAACTAAAGGAAAAAGAACCTGAAGACCAAATGCAAATTAAGCAGCTAG
TGAGGAAGGATGAAGTTAATGGTAAAGCAATAGAGAAGAAGATGGGAAGATCTTGGAAAAGAAGGGCTCGGGAAGAACAGAAAAACTGGAAAACAGAAAAGGAGAACGAG
AATGAAGGCCATAGTGGAGGGTTAATCCTCATGTGGCAAAACCATATAAATGTTGTTGTAAATTCGTTTTCGAAGGGGCATATCGATATCACCATCAAGGATCCTGATTG
GTGGTGGAGGTTTACCGGGTTTTACGGGAACCCGGATCAAAGCAAAAGGAAAGACTCCTGGCGTCTTCTCGAGAGGCTTAATAACATGGTCAACCTCCCATGGATAATAG
GAGGGGACTTCAACGAAGTGATGTTCAGTCACGAAAAAAAAGGGGGAATTCCGAAATCTCTAAACTCTCTTAATGATTTTTGTGACTCTTTAAGCTCTTGTGATCTGATT
GATGTTGGCTTTATCGGTGACAGGTACACATGGACAAAAAACAAAAAAAACAAGGAGGCTACAAAGGAAAGGCTTGATAGGTTTTTTATGAACTCCAAAATGCTGCCATT
AGTCAGGGACATTAAAGTGGAGCACTTGAGCTTCCTCCATTCAGACCATCGGCCAATCATTTTGAAGATGACCTGGATTAATTCTTCCCAGCAGGTGGTTAGATCTAGGA
GGCTTTTGAGATTGGAGGAGGGTTGGTTAGCCCATGAAGGTAGCAAAGAAGCTTTTAAAGATGCGTGGGGCTCAAGTGCAGTGATAACGAACGTCAACTTCAATAGGAAG
ATTCAGGAGGGTTTGAAGGCAATGCATATTTGGAACAGAAACAGGCTCAAGGGTACCATTAAGGGGGCTATTCAAAGAACAGAGGAAGCATTAATCCTCCAATCTTTTCC
CCCCCAAGTTTCGAAAGACATACTTAACATTCCTACGGGTAAGAAAGAGACCAAGGACGAGATCTTCTGGGGACCGGACAAAAAAGGGATCTTTTCTGTCAAGAGTGCCT
ACCACCTAGCAATTGAATCTAGAGATGCCCAAGAAGCCTCCCAATCGGATAAAAGCAAGCAGTCCTCCTTCTGGAACAGCATCTGGAACACCAAAATCCTCCCTCGGGCC
AAAGTTTGCTCATGGAAGATCATTAATGACGCTATTCCGTGCAAGGCCAACGTTCTTAACAAGGGAGTTCTTCTCAACCCTCTCTGTCCTTTATGCCGAAAGTTTGTTGA
ATCCACCCCTCATATTATATGGGAGTGCAAAATTATCAAAAAGTTTTGGGAAGAGGCTATTCCGAAAACTAAATCTTTGTTCAATTGTGGCAGGGAAAATTGGAATCCTC
AAGATTATTGGAGCTGGATGAGAGATAACCTTTCAAAAGAAGAGTTGGAAAGAAGCCTTATTGCCATGTGGAGCGTGTGGAATTTTAGAAACAAAACAGAACATTCTCAA
GTCATTCAATCAGCAGATTCTCTCCATAAAAGCTTCGAAAAGAATGTCAAGGATTGGGAAGATACAAACCTGGAAGAGCATCATCCAGTGAGCAGGCCTAGGAGCCAGGC
GAGTCAAGATTCGTGGGAAGCCCCGGCGCAAAATTCGTGGAAATTGAACTCCGATGCAGCGTGGAATGAAGCTGCGAACTGTGGAGGAATTGGGTGGGTCGTTCATGACT
CGAACGGATCCTTGATCTGCAGTGGATCGAAACGCATCAAGAGAAATTGGTCAATAAAATCCTTGGAAGCAAAAGAAATTTTAGAGGGGGTTAAGATGATCGAGTATACA
TGCTTTCGTCAAAGAATCCCCCTCGAAATTGAATCGGATGCGCTCGAAATCATCAGAGCGCTGAAGGGAGAATCTGAAGATCTCTCTGAGTCGAAGGTCATCTTAGATGA
AATCATCAGTCTCGTTTCGCGGCTGGTCTCGGTGGATTTCCGCCATTGTCGAAGGTCTTCGAACACAGTAGCCCACTGTGTAGCGAGAAACGCTTGTAATCTTCATTTTC
GTTTTGGCCAAGAGGCTCCCCTCTCGCGGGAAAATGGGTTTCATTTTGTAGCCCCTGGCATCGTGGTGAACCCGACGTTTTGTAATCCCTCTGGTTCTCTGAGGGTTGGT
GGTTAG
Protein sequence
MHKGKPKINMENKKPELQNRGKESKATGKLTENREENISFRPESWPKKSPGKKEEDGTARKDSRTDTTDSSGYSGKADTNMTDQKQISSNPKEKDKGKGKAHEAELCTLP
SNIPQKMLSEEETNGEKGQKPRPKFDQLDKNLGQSHQVISKSSKHPDMEKGLMITEPKLKEKEPEDQMQIKQLVRKDEVNGKAIEKKMGRSWKRRAREEQKNWKTEKENE
NEGHSGGLILMWQNHINVVVNSFSKGHIDITIKDPDWWWRFTGFYGNPDQSKRKDSWRLLERLNNMVNLPWIIGGDFNEVMFSHEKKGGIPKSLNSLNDFCDSLSSCDLI
DVGFIGDRYTWTKNKKNKEATKERLDRFFMNSKMLPLVRDIKVEHLSFLHSDHRPIILKMTWINSSQQVVRSRRLLRLEEGWLAHEGSKEAFKDAWGSSAVITNVNFNRK
IQEGLKAMHIWNRNRLKGTIKGAIQRTEEALILQSFPPQVSKDILNIPTGKKETKDEIFWGPDKKGIFSVKSAYHLAIESRDAQEASQSDKSKQSSFWNSIWNTKILPRA
KVCSWKIINDAIPCKANVLNKGVLLNPLCPLCRKFVESTPHIIWECKIIKKFWEEAIPKTKSLFNCGRENWNPQDYWSWMRDNLSKEELERSLIAMWSVWNFRNKTEHSQ
VIQSADSLHKSFEKNVKDWEDTNLEEHHPVSRPRSQASQDSWEAPAQNSWKLNSDAAWNEAANCGGIGWVVHDSNGSLICSGSKRIKRNWSIKSLEAKEILEGVKMIEYT
CFRQRIPLEIESDALEIIRALKGESEDLSESKVILDEIISLVSRLVSVDFRHCRRSSNTVAHCVARNACNLHFRFGQEAPLSRENGFHFVAPGIVVNPTFCNPSGSLRVG
G
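
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (the page does not say which translation table was used). `translate` is a hypothetical helper written for illustration, not a CuGenDB or library function.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS listed above.
print(translate("ATGCACAAAGGGAAACCGAAGATCAATATGGAAAAC"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check. A mismatch would point to a different translation table or a copy error rather than to a problem with the record itself.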