; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001706 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001706
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionEnzymatic polyprotein
Genome locationchr4:34519548..34524210
RNA-Seq ExpressionLag0001706
SyntenyLag0001706
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN64330.1 hypothetical protein VITISV_014666 [Vitis vinifera]2.1e-4931.04Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTP-PKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGT
        M T    + L    KG T+L +AN   SS+  P  + W  +T + TW   +   P P K+     I +   G V +QF   P  +     +S + S  G+
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTP-PKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGT

Query:  TSEIESKYYMD-RSNSLRVKSVNIEQNVANVQYENQ----PQSPTQTDM--DNRSVFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQ-
        T+    +Y +D R   +++ +V+   N+   +Y  Q    P SPT + M   +       I+ + + F IDK+ L ++    K++ +R+ FF      Q 
Subjt:  TSEIESKYYMD-RSNSLRVKSVNIEQNVANVQYENQ----PQSPTQTDM--DNRSVFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQ-

Query:  RSEIRSRWYSYMDETERNIPFFTWMK--ESNN-------EINMLQES--WKTSQRGNIHSTHPPLEEIEFDTIYGEKVKASPFKGNISEKDEKNTPTLKD
        R + R  WY  M+    NIP FT+++   SNN       E+NML +   WKT+ + NI + H  LEE    T    +V  SPFK    + DEK + TLKD
Subjt:  RSEIRSRWYSYMDETERNIPFFTWMK--ESNN-------EINMLQES--WKTSQRGNIHSTHPPLEEIEFDTIYGEKVKASPFKGNISEKDEKNTPTLKD

Query:  IKNIQLQNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIP-PIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINV
         KN+Q QNNF+N++L TI +Q++++E ++     +       F+   +D++ P+FKP  + +K+K  + +++ L  +  K++   +S  ++S  T TINV
Subjt:  IKNIQLQNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIP-PIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINV

Query:  INDTFIDQIIQKTRNLSLEEEAIEPPNEIL-KIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFIGQ-
        I +   + + +       ++  I   N+I+ K   +S+       RN++P+P+ PD+Q+EE++Q+ Q+ YDG  I+EWNIDG+SD+ +LN+L EM +   
Subjt:  INDTFIDQIIQKTRNLSLEEEAIEPPNEIL-KIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFIGQ-

Query:  EWKSWNSMD
         +KS  ++D
Subjt:  EWKSWNSMD

KAA0057417.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.2e-4432.02Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT
        M+TN+SP+AL  SPKG T+L+E N+++SS+T+P++L W+++T+NP W+L    TP K++S  A I+E+PDG+VEVQF+   +  ++ + MSSRPST    
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT

Query:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS
        +E   +  + RS S+R  SV+    + +V YE + +  SPTQ++M+ RS    +QINV+    + DKE  +E +           +   + K   +E R 
Subjt:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS

Query:  RWYSYMDETERNIPFFTWMKESNNEINMLQESWKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQLQNNFSNKIL
         + +  D  E  +     M+ + NE ++ +     S++  I S +PP EE  F    I   K+ +SP+K  I  +D+     +++IKNIQ Q NF+NK+L
Subjt:  RWYSYMDETERNIPFFTWMKESNNEINMLQESWKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQLQNNFSNKIL

Query:  STIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPS----TSKTTPTINVINDTFIDQIIQK
        ST++  VE IE     P +    N    IP I+ + P+F+P   +   ++     + LA I ++L++  ++K S      +    IN+I   ++ Q    
Subjt:  STIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPS----TSKTTPTINVINDTFIDQIIQK

Query:  TRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
        T N           ++IL +      ++ V M+N YPQPS PD+ +++     +  YDG ++  WN DG  +  ++N   EM +
Subjt:  TRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.5e-4432.11Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT
        M+TN+SP+AL  SPKG T+L+E N+++SS+T+P++L W+++T+NP W+L     P  ++S  A I E+PDG+VEVQF+   +  ++ + MSSRPST    
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT

Query:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQ--PQSPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS
        SE   +  + RS S+R  SV+    + +V YE +    SPTQ+DM+ RS    +QINV+    + DKE  +E +           +   + K   +E R 
Subjt:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQ--PQSPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS

Query:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEFD--TIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL
         + +  D  E  +       E       ++ ++ M++ S  W T+    + S +PP EE  F   TI   K+ +SP+K  I+E D+     +++IKNIQ 
Subjt:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEFD--TIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL

Query:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEET---LALIEEKLSSFHISKPSTSKTTPTINVINDT
        Q NF+NK LST++  VE++E   S+P +   +     IP I+ + P+F+P    +   I S  E+    LA I  +L++  ++K S         V  + 
Subjt:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEET---LALIEEKLSSFHISKPSTSKTTPTINVINDT

Query:  FIDQIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
           ++I   +  SL + +    ++IL +      ++ + M+N YPQPS PD+ +++     +  YDG ++  WNIDG S+  ++N   EM +
Subjt:  FIDQIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

TYJ98087.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.5e-4432.11Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT
        M+TN+SP+AL  SPKG T+L+E N+++SS+T+P++L W+++T+NP W+L     P K++S  A I E+PDG+VEVQF+   +  ++ + MSSR ST    
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT

Query:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS
        +E   +  + RS S+R  SV+    + +V YE + +  SPTQ+DM+ RS    +QINV+    + +KE  +E +           +   + K   +E R 
Subjt:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS

Query:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL
         + +  D  E  +       E       ++ +I M++ S  W T     + S +PP EE  F    I   K+ +SP+K  I+E D+     +++IKNIQ 
Subjt:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL

Query:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFID
        Q NF+NKILST++  VE+IE     P +    N    IP I+ + P+F+P   +   K+     + LA I ++L++  ++K S + T             
Subjt:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFID

Query:  QIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
           Q  + +++ ++   P    LKI      ++ V M+N YPQPS PD+ +++     +  YDG ++  WNIDG S+  ++N   EM +
Subjt:  QIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo]2.6e-8452.54Show/hide
Query:  MDNRSVFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRSRWYSYMDETERNIPFFTWMKESNNEI-NMLQESWKTSQRGNIHST
        MD +SV+ SQ+NV+  DF IDKE LK DF+S  N ++R  FFQ Y + +R+E+R++WYS+M+  + NIPFF W +E+  +I  + Q SWKT++RG ++S 
Subjt:  MDNRSVFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRSRWYSYMDETERNIPFFTWMKESNNEI-NMLQESWKTSQRGNIHST

Query:  HPPLEEIEFDTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQLQNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDS
        HPPLEE+EFD  YGEKVKASPFK +I E  EK  PTLKDIKNIQ QNN+SNKILSTIA Q+E IEG+ISK S T        +P +DESIP+ +P  ++ 
Subjt:  HPPLEEIEFDTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQLQNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDS

Query:  KIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFIDQIIQKTRNLSLEEEAIEP-PNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEK
          K +SKEE  +A IEEKL    I  P+ +    ++NV+N+   +Q  +      +  E  EP  N I +I  RS+ +     +NWYPQPSFPD+QFEEK
Subjt:  KIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFIDQIIQKTRNLSLEEEAIEP-PNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEK

Query:  AQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
           TQA YDGLAI+EWNIDG+SDYLI+NV+NEM +
Subjt:  AQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

TrEMBL top hitse value%identityAlignment
A0A5A7UF59 Enzymatic polyprotein7.4e-4532.11Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT
        M+TN+SP+AL  SPKG T+L+E N+++SS+T+P++L W+++T+NP W+L     P K++S  A I E+PDG+VEVQF+   +  ++ + MSSR ST    
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT

Query:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS
        +E   +  + RS S+R  SV+    + +V YE + +  SPTQ+DM+ RS    +QINV+    + +KE  +E +           +   + K   +E R 
Subjt:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS

Query:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL
         + +  D  E  +       E       ++ +I M++ S  W T     + S +PP EE  F    I   K+ +SP+K  I+E D+     +++IKNIQ 
Subjt:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL

Query:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFID
        Q NF+NKILST++  VE+IE     P +    N    IP I+ + P+F+P   +   K+     + LA I ++L++  ++K S + T             
Subjt:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFID

Query:  QIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
           Q  + +++ ++   P    LKI      ++ V M+N YPQPS PD+ +++     +  YDG ++  WNIDG S+  ++N   EM +
Subjt:  QIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

A0A5A7URX9 Enzymatic polyprotein5.7e-4532.02Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT
        M+TN+SP+AL  SPKG T+L+E N+++SS+T+P++L W+++T+NP W+L    TP K++S  A I+E+PDG+VEVQF+   +  ++ + MSSRPST    
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT

Query:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS
        +E   +  + RS S+R  SV+    + +V YE + +  SPTQ++M+ RS    +QINV+    + DKE  +E +           +   + K   +E R 
Subjt:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS

Query:  RWYSYMDETERNIPFFTWMKESNNEINMLQESWKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQLQNNFSNKIL
         + +  D  E  +     M+ + NE ++ +     S++  I S +PP EE  F    I   K+ +SP+K  I  +D+     +++IKNIQ Q NF+NK+L
Subjt:  RWYSYMDETERNIPFFTWMKESNNEINMLQESWKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQLQNNFSNKIL

Query:  STIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPS----TSKTTPTINVINDTFIDQIIQK
        ST++  VE IE     P +    N    IP I+ + P+F+P   +   ++     + LA I ++L++  ++K S      +    IN+I   ++ Q    
Subjt:  STIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPS----TSKTTPTINVINDTFIDQIIQK

Query:  TRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
        T N           ++IL +      ++ V M+N YPQPS PD+ +++     +  YDG ++  WN DG  +  ++N   EM +
Subjt:  TRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

A0A5D3BEY3 Enzymatic polyprotein7.4e-4532.11Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT
        M+TN+SP+AL  SPKG T+L+E N+++SS+T+P++L W+++T+NP W+L     P  ++S  A I E+PDG+VEVQF+   +  ++ + MSSRPST    
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT

Query:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQ--PQSPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS
        SE   +  + RS S+R  SV+    + +V YE +    SPTQ+DM+ RS    +QINV+    + DKE  +E +           +   + K   +E R 
Subjt:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQ--PQSPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS

Query:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEFD--TIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL
         + +  D  E  +       E       ++ ++ M++ S  W T+    + S +PP EE  F   TI   K+ +SP+K  I+E D+     +++IKNIQ 
Subjt:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEFD--TIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL

Query:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEET---LALIEEKLSSFHISKPSTSKTTPTINVINDT
        Q NF+NK LST++  VE++E   S+P +   +     IP I+ + P+F+P    +   I S  E+    LA I  +L++  ++K S         V  + 
Subjt:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEET---LALIEEKLSSFHISKPSTSKTTPTINVINDT

Query:  FIDQIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
           ++I   +  SL + +    ++IL +      ++ + M+N YPQPS PD+ +++     +  YDG ++  WNIDG S+  ++N   EM +
Subjt:  FIDQIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

A0A5D3BG41 Enzymatic polyprotein7.4e-4532.11Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT
        M+TN+SP+AL  SPKG T+L+E N+++SS+T+P++L W+++T+NP W+L     P K++S  A I E+PDG+VEVQF+   +  ++ + MSSR ST    
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTT

Query:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS
        +E   +  + RS S+R  SV+    + +V YE + +  SPTQ+DM+ RS    +QINV+    + +KE  +E +           +   + K   +E R 
Subjt:  SEIESKYYMDRSNSLRVKSVNIEQNVANVQYENQPQ--SPTQTDMDNRS-VFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRS

Query:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL
         + +  D  E  +       E       ++ +I M++ S  W T     + S +PP EE  F    I   K+ +SP+K  I+E D+     +++IKNIQ 
Subjt:  RWYSYMDETERNIPFFTWMKE-------SNNEINMLQES--WKTSQRGNIHSTHPPLEEIEF--DTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQL

Query:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFID
        Q NF+NKILST++  VE+IE     P +    N    IP I+ + P+F+P   +   K+     + LA I ++L++  ++K S + T             
Subjt:  QNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFID

Query:  QIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI
           Q  + +++ ++   P    LKI      ++ V M+N YPQPS PD+ +++     +  YDG ++  WNIDG S+  ++N   EM +
Subjt:  QIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFI

A5C8I5 Uncharacterized protein1.0e-4931.04Show/hide
Query:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTP-PKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGT
        M T    + L    KG T+L +AN   SS+  P  + W  +T + TW   +   P P K+     I +   G V +QF   P  +     +S + S  G+
Subjt:  MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTP-PKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGT

Query:  TSEIESKYYMD-RSNSLRVKSVNIEQNVANVQYENQ----PQSPTQTDM--DNRSVFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQ-
        T+    +Y +D R   +++ +V+   N+   +Y  Q    P SPT + M   +       I+ + + F IDK+ L ++    K++ +R+ FF      Q 
Subjt:  TSEIESKYYMD-RSNSLRVKSVNIEQNVANVQYENQ----PQSPTQTDM--DNRSVFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQ-

Query:  RSEIRSRWYSYMDETERNIPFFTWMK--ESNN-------EINMLQES--WKTSQRGNIHSTHPPLEEIEFDTIYGEKVKASPFKGNISEKDEKNTPTLKD
        R + R  WY  M+    NIP FT+++   SNN       E+NML +   WKT+ + NI + H  LEE    T    +V  SPFK    + DEK + TLKD
Subjt:  RSEIRSRWYSYMDETERNIPFFTWMK--ESNN-------EINMLQES--WKTSQRGNIHSTHPPLEEIEFDTIYGEKVKASPFKGNISEKDEKNTPTLKD

Query:  IKNIQLQNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIP-PIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINV
         KN+Q QNNF+N++L TI +Q++++E ++     +       F+   +D++ P+FKP  + +K+K  + +++ L  +  K++   +S  ++S  T TINV
Subjt:  IKNIQLQNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIP-PIDESIPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINV

Query:  INDTFIDQIIQKTRNLSLEEEAIEPPNEIL-KIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFIGQ-
        I +   + + +       ++  I   N+I+ K   +S+       RN++P+P+ PD+Q+EE++Q+ Q+ YDG  I+EWNIDG+SD+ +LN+L EM +   
Subjt:  INDTFIDQIIQKTRNLSLEEEAIEPPNEIL-KIGTRSKASKPVIMRNWYPQPSFPDVQFEEKAQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFIGQ-

Query:  EWKSWNSMD
         +KS  ++D
Subjt:  EWKSWNSMD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-0927.32Show/hide
Query:  ILWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYLDKVESNSAERLKTQTSDAQWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSICVGM
        +LW LW  RN ++    + +   ++R       E     +     S  +++   S  QW  PP    K N DA+    N R G+GWILR+ SG  + +G 
Subjt:  ILWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYLDKVESNSAERLKTQTSDAQWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSICVGM

Query:  RRVIGNWSIKRLEMRAVLEGLR-SIPTLRASRSDLSIPAIEVASDAVGVINLLNRVEEDHSEITFIVSEIERMTFELDNISFIHCPRSQNGEAHMLAR-S
        R +    ++    + A LE LR ++ T+    S  +   I   SDA  ++NLLN  ++    +   + +I+++    + + F   PR  N  A  +AR S
Subjt:  RRVIGNWSIKRLEMRAVLEGLR-SIPTLRASRSDLSIPAIEVASDAVGVINLLNRVEEDHSEITFIVSEIERMTFELDNISFIHCPRSQNGEAHMLAR-S

Query:  ATFGN
         +F N
Subjt:  ATFGN

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.8e-0424.1Show/hide
Query:  KYLDKVESNSAERLKTQTSDA---QWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSICVGMRRVIGNWSIKRLEMRAVLEGLRSIPTLRASRSD
        ++LD    N       QT  +   +W  P A   K N D S        G+ WI+R+S G+ +  G  +  G  +IK  E  A++  ++          D
Subjt:  KYLDKVESNSAERLKTQTSDA---QWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSICVGMRRVIGNWSIKRLEMRAVLEGLRSIPTLRASRSD

Query:  LSIPAIEVASDAVGVINLLNRVEEDHSEITFIVSEIERMTFELDNISFIHCPRSQNGEAHMLARSA
        L    +E   D +  +N L R +E +  + + +  I++ +     + F    R QN    +LA+ A
Subjt:  LSIPAIEVASDAVGVINLLNRVEEDHSEITFIVSEIERMTFELDNISFIHCPRSQNGEAHMLARSA

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)1.7e-0425.98Show/hide
Query:  LWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYL--DKVESNSAERLKTQTS--DAQWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSIC
        +W LW  RN  L          + +  + ++ E V+    D   S+S E+   + S    +W+PPP G  K N D+   +  +     WI+RDS+G  I 
Subjt:  LWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYL--DKVESNSAERLKTQTS--DAQWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSIC

Query:  VGMRRVIGNWSIKRLEMRAVLEGLRSI
         G  ++  ++S  + E    L  L+ +
Subjt:  VGMRRVIGNWSIKRLEMRAVLEGLRSI

AT4G29090.1 Ribonuclease H-like superfamily protein4.8e-1224.79Show/hide
Query:  IGQEWKSWNSMDHWTWVNENLNDKELEEAIQ----ILWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYLDKVESNSAERLKTQTSDAQWTPPPAG
        +G EW     ++ +   N    + + E+A Q    +LW LW +RN ++    + N   ++R  +    E  +   + ES   +    ++S  +W PPP  
Subjt:  IGQEWKSWNSMDHWTWVNENLNDKELEEAIQ----ILWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYLDKVESNSAERLKTQTSDAQWTPPPAG

Query:  AWKLNVDASRDESNNRGGVGWILRDSSGSSICVGMRRVIGNWSIKRLEMRAVLEGLRSIPTLRASRSDLSIPAIEVASDAVGVINLLNRVEEDHSEITFI
          K N DA+ +  N R G+GW+LR+  G    +G R +    S+   E+ A+   + S+     SR   +    E  SD+  +I +LN  +E    +   
Subjt:  AWKLNVDASRDESNNRGGVGWILRDSSGSSICVGMRRVIGNWSIKRLEMRAVLEGLRSIPTLRASRSDLSIPAIEVASDAVGVINLLNRVEEDHSEITFI

Query:  VSEIERMTFELDNISFIHCPRSQNGEAHMLARSA
        + +++R+  +   + F+  PR  N  A  +AR +
Subjt:  VSEIERMTFELDNISFIHCPRSQNGEAHMLARSA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-0922.12Show/hide
Query:  ILWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYLDKVESNSAERLKTQTSDAQWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSICVGM
        ++W +W   N ++ N  +      +    + + E +      E  +  R    + + +W+PP     K N DAS  E N   G+GWILR+S G+ I  GM
Subjt:  ILWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYLDKVESNSAERLKTQTSDAQWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSICVGM

Query:  RRVIGNWSIKRLEMRAVLEGLRSIPTLRASRSDLSIPAIEVASDAVGVINLLNRVEEDHSEITFIVSEIERMTFELDNISFIHCPRSQNGEAHMLARSAT
         +  G  + +  E   ++  ++       +        +    D   +  ++N  +  +  +   +  I+      ++I F    R QNG A  LA+ A 
Subjt:  RRVIGNWSIKRLEMRAVLEGLRSIPTLRASRSDLSIPAIEVASDAVGVINLLNRVEEDHSEITFIVSEIERMTFELDNISFIHCPRSQNGEAHMLARSAT

Query:  FGNTPFSV
          NT +S+
Subjt:  FGNTPFSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATACCAACATATCCCCTAGAGCCTTGAGATCTTCACCCAAAGGATCTACAGTTCTCTTAGAAGCAAATCTTGATAGATCATCAATCACAGTCCCTAAGAGTTTGTC
ATGGGAACAAATCACTAGAAATCCAACGTGGAGACTTACGGAAGCCTTCACTCCACCAAAGAAAAATTCAAACCTTGCACAAATTGTTGAATATCCAGATGGATCGGTAG
AAGTTCAATTTTCTGAAGAACCAGCAACCTCAAAGGTTAAAGACTTTATGTCTTCTCGACCTAGTACATTTGGAACAACTTCAGAAATCGAATCAAAATACTATATGGAT
AGATCTAACTCATTAAGAGTAAAATCTGTCAATATAGAACAAAATGTGGCAAATGTTCAATACGAAAATCAACCACAATCACCCACACAGACAGACATGGATAACCGATC
TGTGTTCGCCAGTCAAATTAATGTCCTTATACAAGATTTCACAATCGATAAAGAAACCTTAAAGGAGGATTTTCTTTCCCTAAAAAATAAGGCAAGAAGAAAAACCTTTT
TCCAAAATTACAACAAAGACCAAAGATCTGAGATAAGATCGAGATGGTATTCATATATGGATGAAACCGAAAGAAATATACCATTCTTCACTTGGATGAAAGAATCCAAT
AACGAAATCAATATGCTTCAAGAATCCTGGAAAACATCTCAAAGAGGAAATATTCATTCTACTCATCCTCCTCTTGAAGAGATCGAGTTTGACACAATTTATGGTGAAAA
AGTCAAAGCTAGTCCATTCAAAGGTAATATCAGTGAAAAGGACGAAAAGAACACCCCTACTCTTAAGGATATAAAGAATATTCAACTTCAAAACAATTTTTCTAATAAAA
TTCTTTCTACTATAGCTAACCAAGTAGAAAAGATTGAAGGAAGAATATCAAAACCTTCCATTACATCATCTTCAAACGGAGGAATCTTTATACCTCCTATAGATGAGTCA
ATTCCGTTGTTCAAACCTACAATTATTGATAGCAAAATTAAAATAATGTCTAAAGAAGAAGAAACTCTAGCTTTAATTGAAGAAAAGCTAAGTAGTTTCCATATTTCAAA
ACCATCAACTTCAAAGACAACCCCTACAATAAACGTCATTAATGATACCTTTATTGATCAAATTATTCAGAAAACAAGAAATCTTTCTTTAGAGGAAGAAGCGATTGAGC
CGCCTAACGAGATCCTAAAAATAGGGACAAGATCAAAAGCTTCAAAACCAGTAATTATGCGAAATTGGTATCCACAACCTTCTTTCCCGGATGTCCAATTTGAAGAAAAG
GCACAAATGACTCAGGCCGTTTATGATGGATTAGCCATCCATGAATGGAATATAGATGGCATATCCGATTATCTTATCCTCAATGTGCTCAATGAAATGTTCATAGGTCA
AGAATGGAAGTCGTGGAATTCGATGGATCATTGGACCTGGGTAAACGAGAACCTCAACGATAAAGAGCTGGAGGAAGCTATTCAAATTCTCTGGGAATTATGGACTCACA
GGAACCACATTTTGCACAACTTAGGGAAGCCTAACATGGACCTCATCATCAGAGCAATAAAATCAAAAAGTCCAGAAATCGTAAAGTACCTGGATAAAGTCGAATCCAAT
TCGGCAGAGAGATTGAAGACTCAGACGAGTGACGCTCAGTGGACTCCCCCTCCCGCTGGTGCCTGGAAGCTAAACGTCGACGCCTCTCGTGATGAATCAAACAATAGAGG
AGGGGTGGGTTGGATTTTGCGTGACTCCTCAGGTTCTTCAATCTGTGTGGGTATGAGAAGAGTCATCGGAAATTGGTCGATAAAAAGGCTTGAGATGAGAGCTGTTCTAG
AAGGCCTTAGAAGCATTCCGACGCTGAGAGCCTCCCGCTCGGACCTCTCAATTCCGGCAATCGAGGTCGCCTCTGACGCCGTCGGAGTGATTAATCTGCTAAACCGTGTT
GAAGAAGACCATTCGGAGATAACCTTCATCGTTTCTGAGATCGAGCGCATGACTTTCGAGCTCGACAATATCTCCTTCATTCACTGCCCTCGTTCGCAAAACGGAGAGGC
ACACATGCTGGCGCGAAGCGCGACTTTTGGCAACACCCCATTCTCTGTAAATTTTTTGGATGCCTCTTCCGCTTCAGAAGAAGGCTGTTTTTTGTTTTTGGGTCGAATTC
ATCCCGATTTCTTTCCCCCTTTTTCTGGGGGGCGCCTACGACTCCTCGAGGCCATCGACGATGGTGGAATCAATCGCGACGAACACAGTGCGCTACTGTGTTCAATACCT
GCTAGGATTGCTTTCGCCTCTATGATTTTGATGGGTGATTTTCTCGTGTTTTTTGCATTCCGAAACAGACCAAGGATCCATCCGAGTCACACACGATCCAGCCTAGCCCT
CTCCGCAGTTTCGTTCCATGAGTTGAGGGGGGAAGACGTGGCCTGCAAGACAGTAAACCTGCACACCGGTGTGGTGCTCGCCACACCGGCTCCGATGCTTAAGTCAGCAA
ACAGAACGGTAGGGCGTGGAAAAGTCAAACCAGGCGAAACCGGGGCAACCAGAGGCGGTGGGGACCAGACGAGACCTAACGGGCCCGGCCTCGGCCATGGGCCGAGGCCG
AGCATGGGGTCAGGTCAAAAACCCGACCCCTTCGGCCTTGGCCCGTCCCGCTTGCCGGCTCCGCCTCCTTGGCCCGTATCCCAGCCCGATTTCTCCCCGATTGTCCTCGT
CAGCTCCTCGTACATCGGGGTGGTCCAAAATTGCCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATACCAACATATCCCCTAGAGCCTTGAGATCTTCACCCAAAGGATCTACAGTTCTCTTAGAAGCAAATCTTGATAGATCATCAATCACAGTCCCTAAGAGTTTGTC
ATGGGAACAAATCACTAGAAATCCAACGTGGAGACTTACGGAAGCCTTCACTCCACCAAAGAAAAATTCAAACCTTGCACAAATTGTTGAATATCCAGATGGATCGGTAG
AAGTTCAATTTTCTGAAGAACCAGCAACCTCAAAGGTTAAAGACTTTATGTCTTCTCGACCTAGTACATTTGGAACAACTTCAGAAATCGAATCAAAATACTATATGGAT
AGATCTAACTCATTAAGAGTAAAATCTGTCAATATAGAACAAAATGTGGCAAATGTTCAATACGAAAATCAACCACAATCACCCACACAGACAGACATGGATAACCGATC
TGTGTTCGCCAGTCAAATTAATGTCCTTATACAAGATTTCACAATCGATAAAGAAACCTTAAAGGAGGATTTTCTTTCCCTAAAAAATAAGGCAAGAAGAAAAACCTTTT
TCCAAAATTACAACAAAGACCAAAGATCTGAGATAAGATCGAGATGGTATTCATATATGGATGAAACCGAAAGAAATATACCATTCTTCACTTGGATGAAAGAATCCAAT
AACGAAATCAATATGCTTCAAGAATCCTGGAAAACATCTCAAAGAGGAAATATTCATTCTACTCATCCTCCTCTTGAAGAGATCGAGTTTGACACAATTTATGGTGAAAA
AGTCAAAGCTAGTCCATTCAAAGGTAATATCAGTGAAAAGGACGAAAAGAACACCCCTACTCTTAAGGATATAAAGAATATTCAACTTCAAAACAATTTTTCTAATAAAA
TTCTTTCTACTATAGCTAACCAAGTAGAAAAGATTGAAGGAAGAATATCAAAACCTTCCATTACATCATCTTCAAACGGAGGAATCTTTATACCTCCTATAGATGAGTCA
ATTCCGTTGTTCAAACCTACAATTATTGATAGCAAAATTAAAATAATGTCTAAAGAAGAAGAAACTCTAGCTTTAATTGAAGAAAAGCTAAGTAGTTTCCATATTTCAAA
ACCATCAACTTCAAAGACAACCCCTACAATAAACGTCATTAATGATACCTTTATTGATCAAATTATTCAGAAAACAAGAAATCTTTCTTTAGAGGAAGAAGCGATTGAGC
CGCCTAACGAGATCCTAAAAATAGGGACAAGATCAAAAGCTTCAAAACCAGTAATTATGCGAAATTGGTATCCACAACCTTCTTTCCCGGATGTCCAATTTGAAGAAAAG
GCACAAATGACTCAGGCCGTTTATGATGGATTAGCCATCCATGAATGGAATATAGATGGCATATCCGATTATCTTATCCTCAATGTGCTCAATGAAATGTTCATAGGTCA
AGAATGGAAGTCGTGGAATTCGATGGATCATTGGACCTGGGTAAACGAGAACCTCAACGATAAAGAGCTGGAGGAAGCTATTCAAATTCTCTGGGAATTATGGACTCACA
GGAACCACATTTTGCACAACTTAGGGAAGCCTAACATGGACCTCATCATCAGAGCAATAAAATCAAAAAGTCCAGAAATCGTAAAGTACCTGGATAAAGTCGAATCCAAT
TCGGCAGAGAGATTGAAGACTCAGACGAGTGACGCTCAGTGGACTCCCCCTCCCGCTGGTGCCTGGAAGCTAAACGTCGACGCCTCTCGTGATGAATCAAACAATAGAGG
AGGGGTGGGTTGGATTTTGCGTGACTCCTCAGGTTCTTCAATCTGTGTGGGTATGAGAAGAGTCATCGGAAATTGGTCGATAAAAAGGCTTGAGATGAGAGCTGTTCTAG
AAGGCCTTAGAAGCATTCCGACGCTGAGAGCCTCCCGCTCGGACCTCTCAATTCCGGCAATCGAGGTCGCCTCTGACGCCGTCGGAGTGATTAATCTGCTAAACCGTGTT
GAAGAAGACCATTCGGAGATAACCTTCATCGTTTCTGAGATCGAGCGCATGACTTTCGAGCTCGACAATATCTCCTTCATTCACTGCCCTCGTTCGCAAAACGGAGAGGC
ACACATGCTGGCGCGAAGCGCGACTTTTGGCAACACCCCATTCTCTGTAAATTTTTTGGATGCCTCTTCCGCTTCAGAAGAAGGCTGTTTTTTGTTTTTGGGTCGAATTC
ATCCCGATTTCTTTCCCCCTTTTTCTGGGGGGCGCCTACGACTCCTCGAGGCCATCGACGATGGTGGAATCAATCGCGACGAACACAGTGCGCTACTGTGTTCAATACCT
GCTAGGATTGCTTTCGCCTCTATGATTTTGATGGGTGATTTTCTCGTGTTTTTTGCATTCCGAAACAGACCAAGGATCCATCCGAGTCACACACGATCCAGCCTAGCCCT
CTCCGCAGTTTCGTTCCATGAGTTGAGGGGGGAAGACGTGGCCTGCAAGACAGTAAACCTGCACACCGGTGTGGTGCTCGCCACACCGGCTCCGATGCTTAAGTCAGCAA
ACAGAACGGTAGGGCGTGGAAAAGTCAAACCAGGCGAAACCGGGGCAACCAGAGGCGGTGGGGACCAGACGAGACCTAACGGGCCCGGCCTCGGCCATGGGCCGAGGCCG
AGCATGGGGTCAGGTCAAAAACCCGACCCCTTCGGCCTTGGCCCGTCCCGCTTGCCGGCTCCGCCTCCTTGGCCCGTATCCCAGCCCGATTTCTCCCCGATTGTCCTCGT
CAGCTCCTCGTACATCGGGGTGGTCCAAAATTGCCTATAA
Protein sequenceShow/hide protein sequence
MNTNISPRALRSSPKGSTVLLEANLDRSSITVPKSLSWEQITRNPTWRLTEAFTPPKKNSNLAQIVEYPDGSVEVQFSEEPATSKVKDFMSSRPSTFGTTSEIESKYYMD
RSNSLRVKSVNIEQNVANVQYENQPQSPTQTDMDNRSVFASQINVLIQDFTIDKETLKEDFLSLKNKARRKTFFQNYNKDQRSEIRSRWYSYMDETERNIPFFTWMKESN
NEINMLQESWKTSQRGNIHSTHPPLEEIEFDTIYGEKVKASPFKGNISEKDEKNTPTLKDIKNIQLQNNFSNKILSTIANQVEKIEGRISKPSITSSSNGGIFIPPIDES
IPLFKPTIIDSKIKIMSKEEETLALIEEKLSSFHISKPSTSKTTPTINVINDTFIDQIIQKTRNLSLEEEAIEPPNEILKIGTRSKASKPVIMRNWYPQPSFPDVQFEEK
AQMTQAVYDGLAIHEWNIDGISDYLILNVLNEMFIGQEWKSWNSMDHWTWVNENLNDKELEEAIQILWELWTHRNHILHNLGKPNMDLIIRAIKSKSPEIVKYLDKVESN
SAERLKTQTSDAQWTPPPAGAWKLNVDASRDESNNRGGVGWILRDSSGSSICVGMRRVIGNWSIKRLEMRAVLEGLRSIPTLRASRSDLSIPAIEVASDAVGVINLLNRV
EEDHSEITFIVSEIERMTFELDNISFIHCPRSQNGEAHMLARSATFGNTPFSVNFLDASSASEEGCFLFLGRIHPDFFPPFSGGRLRLLEAIDDGGINRDEHSALLCSIP
ARIAFASMILMGDFLVFFAFRNRPRIHPSHTRSSLALSAVSFHELRGEDVACKTVNLHTGVVLATPAPMLKSANRTVGRGKVKPGETGATRGGGDQTRPNGPGLGHGPRP
SMGSGQKPDPFGLGPSRLPAPPPWPVSQPDFSPIVLVSSSYIGVVQNCL