; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001910 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001910
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold8:32690376..32697972
RNA-Seq ExpressionSpg001910
SyntenySpg001910
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4383622.1 hypothetical protein F8388_014122 [Cannabis sativa]8.6e-6137.11Show/hide
Query:  DVSIRSYSKGHIDAFIKDP-KGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYI
        +VS++S++ GHIDA +K P + LWRF+G YGNP  + R E+W +L RLKD +++PWI GGDFNEI    EK GG  R    M +F+ A+D C L D G+ 
Subjt:  DVSIRSYSKGHIDAFIKDP-KGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYI

Query:  GPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPS--QRG
        G  +TW NK      + ERLDR+  N          KV +   L+S+HRPI+A  LE  S ++    +R  +FE  W+K  EC++II++ W +       
Subjt:  GPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPS--QRG

Query:  FRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPRLDATSKLELALK-EKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRR
          ++ D   LC  +L  W++ KY GSL   +  T++ +  LL       ++E   + E +L +LL  +E YW+ RSR EWL  GDRNT++FH KAT R+R
Subjt:  FRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPRLDATSKLELALK-EKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRR

Query:  TNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNPQSDSINLIETWNEEIINRCFMEDDAKAILNIPLRPLAEEDEIIWNYDSKG
         N I+ +M E G+ L  + D+      YF  +F + +P                         +AIL+IPL      D   W+Y S G
Subjt:  TNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNPQSDSINLIETWNEEIINRCFMEDDAKAILNIPLRPLAEEDEIIWNYDSKG

PPE00159.1 hypothetical protein GOBAR_DD02812 [Gossypium barbadense]4.9e-6429.94Show/hide
Query:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRD---AGDVKEDE
        +++++ +A+++G+ +G +  ID  +        +R+K++IDV  PL+R I     +   E+   + YE+L DFC+ CG++GH++K C      G ++  +
Subjt:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRD---AGDVKEDE

Query:  LPYGPWMREPI------KIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWR-----KTSVEDEEGYSRSP------KKIEIPANSPPSKPITVKNPSKEI
        L +G WMR P       K   R+ +  F +    +      L    + +W+     K  + +EE  S SP      K+I+       SK    K+    I
Subjt:  LPYGPWMREPI------KIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWR-----KTSVEDEEGYSRSP------KKIEIPANSPPSKPITVKNPSKEI

Query:  EKETVEISVKENINDQAATDVDCGININE----------------TVVPNSTEIAALSSYNVTDVSIRSYSKGHIDAFIKDPKG-LWRFSGIYGNPNWNL
          E+    VK  +    +      ++ NE                +    S+ +A +   +V DV+I++YSK HID+ IK   G + RF+  YG+PN NL
Subjt:  EKETVEISVKENINDQAATDVDCGININE----------------TVVPNSTEIAALSSYNVTDVSIRSYSKGHIDAFIKDPKG-LWRFSGIYGNPNWNL

Query:  RHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFK
        RH  W +L ++KD     WI+GGDFN I   +EK GG  +P  +M DF + +D   L D       +TW N    + +I ERLDRFLI++ +  +     
Subjt:  RHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFK

Query:  VYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIK
           +    S+H  IL D       ++    R   +++  W    E +DIIS+ W   S  G  T+ DKMKL   +L  W   +    L+  I+  E+ I 
Subjt:  VYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIK

Query:  TLLP-RLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNPQ
         L+  ++  +S   L +   +L    E +E YW  R++ +WL+ GDRNT++FH KA++RR  N I  L DE G W  + +++ ++A +YF NLF   N  
Subjt:  TLLP-RLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNPQ

Query:  SDSINLIETWNEEIINRCFMEDDAKAIL
        S   N   T    +I +C  ED +K +L
Subjt:  SDSINLIETWNEEIINRCFMEDDAKAIL

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata]7.3e-6038.12Show/hide
Query:  DVSIRSYSKGHIDAFIKDPKGL-WRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYI
        +V ++S+S  HIDA + + +G  WR +G YGNP  + R E+W +L  L   +++PW+  GDFNEI   +EK GG  R ++ M DFREAID C  +D G+ 
Subjt:  DVSIRSYSKGHIDAFIKDPKGL-WRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYI

Query:  GPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPIL-ADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNP----S
        GPE+TWCN       ++ RLDR L+  +        +V+HL    S+H  +L  D +  QS ++ R       FE  W + DECRDII   WN+     S
Subjt:  GPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPIL-ADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNP----S

Query:  QRGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPR-LDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATT
          G   +   +K C   LS W+ GK  G +   I+     +  L+ R  D ++  E+    KE+ +LL+ +EI W+QRS+ +W+  GDRNT++FH KA+ 
Subjt:  QRGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPR-LDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATT

Query:  RRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNP
        R++ N I  ++DE G W E   ++  VA  YF+ L++TSNP
Subjt:  RRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNP

XP_031116488.1 uncharacterized protein LOC116020112 [Ipomoea triloba]1.5e-6027.98Show/hide
Query:  EIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGD--VKEDELPYGPWMRE-
        ++G  +G    +D +       S  RI++++DV  PLKR I L + K    +WI   YE+L  FC+ CGLLGH+ K C+ A D  ++ +  PYG W+R  
Subjt:  EIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGD--VKEDELPYGPWMRE-

Query:  -----------------PIKIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWRKTSVEDEEGYSRSPKKI---EIP-----------------ANSPPSK
                         P+K     N ++    F        +   +++G+ ++   +  E  S S K I   ++P                 AN+  S 
Subjt:  -----------------PIKIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWRKTSVEDEEGYSRSPKKI---EIP-----------------ANSPPSK

Query:  PITVKNPSKEIEKETVEISVKENINDQAATDVDCGININETVVPNSTEIAALSS------------YNVTDV----------------SIRSYSKGHIDA
         + V +  KE++ +     +K    D        G +I    VP +  +A L              +NV  V                ++  +S  HID 
Subjt:  PITVKNPSKEIEKETVEISVKENINDQAATDVDCGININETVVPNSTEIAALSS------------YNVTDV----------------SIRSYSKGHIDA

Query:  FIKDP-KGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSD
         +  P K  WR +  YG P  + R  +W +L +LKDN  +PW++ GDFN+ITC++EK G  + P  L+  F EA+ +C L+D G IG  +TW     T  
Subjt:  FIKDP-KGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSD

Query:  LIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQDKMKLCLSRLS
         + ERLDR + N +     +   + ++   +S+H  I  +     +  R   S+R  KFE +W+    CR+I+   W       F   QD++ LC   L 
Subjt:  LIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQDKMKLCLSRLS

Query:  AWSRGKYEGSLKGAIERTEEAIKTLLPRLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEE
         W  G+Y       I+     +  L      ++  +    E EL  LL  +EI+W+QRS++ WL+ GD NT++FH  A+ RRR N +L L++  G W+ E
Subjt:  AWSRGKYEGSLKGAIERTEEAIKTLLPRLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEE

Query:  DRDLERVATHYFQNLFQTSNPQSDSINLIETWNEEIIN----RCFMEDDAKAIL
        D ++      YF  +F ++   SD  +L++    E +N    + F  D+ KA L
Subjt:  DRDLERVATHYFQNLFQTSNPQSDSINLIETWNEEIIN----RCFMEDDAKAIL

XP_035545013.1 uncharacterized protein LOC108979776 [Juglans regia]1.0e-6130.81Show/hide
Query:  IGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGDV--KEDELPYGPWMR---
        IG  LG+ME ID+ +     G  +RI++ ID+   L RG  L  G +    W+  +YE+LPDFC+ C  +GH  ++C  A D   K+   PYG W+R   
Subjt:  IGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGDV--KEDELPYGPWMR---

Query:  EPIKIKI---RDNISSFRAHFFQAGRGRGRLGEDIRGN------------WR-------KTSVEDEEGYS----------------RSPKKIEIPANSPP
          +K+ +   R   SS       A R  G + ED+               W        +  VE  EG+S                ++P     P+ S  
Subjt:  EPIKIKI---RDNISSFRAHFFQAGRGRGRLGEDIRGN------------WR-------KTSVEDEEGYS----------------RSPKKIEIPANSPP

Query:  S------KPITVKNPSKEIEKETVE--------ISVKENINDQAATDVDCGININETVVPNSTEIAALSSYNVTDVSIRSYSKGHIDAFI-KDPKGLWRF
        S        + + NP      ET           +++E I  +  T V       +T    S  +A L   ++  V ++S+S  H+D  + +D    WRF
Subjt:  S------KPITVKNPSKEIEKETVE--------ISVKENINDQAATDVDCGININETVVPNSTEIAALSSYNVTDVSIRSYSKGHIDAFI-KDPKGLWRF

Query:  SGIYGNPNWNLRHETWAVLNRL--KDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFL
        +GIYGNP    R  TW ++ +L   D+ ++PW+LGGDFNE+   +EK  G  R E  M  FRE +  C L D G+ GP++TW N    ++ I ERLDRFL
Subjt:  SGIYGNPNWNLRHETWAVLNRL--KDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFL

Query:  INNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVW---NNPSQRGFRTIQDKMKLCLSRLSAWSRGKY
         NN          V H     S+H PI   W + +         R  +FE  W+   +C DII  VW   N+        +  ++KLC  RL++W++  +
Subjt:  INNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVW---NNPSQRGFRTIQDKMKLCLSRLSAWSRGKY

Query:  EGSLKGAIERTEEAIKTLLPRLDATSK-----LELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDR
             G +++ +   +  L  L A ++     + L+  ++ L+  LE +E+ WRQRSR +WL  GD+NT++FH +A  RRR N I GL +E G W+E   
Subjt:  EGSLKGAIERTEEAIKTLLPRLDATSK-----LELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDR

Query:  DLERVATHYFQNLFQTSNPQ
            +   +FQ LF  S  Q
Subjt:  DLERVATHYFQNLFQTSNPQ

TrEMBL top hitse value%identityAlignment
A0A2N9GPY1 Reverse transcriptase domain-containing protein3.0e-6730.07Show/hide
Query:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKEC----RDAGDVKED
        + +N  +A+ IGS LG++ Q+   E N + G+++R+++ +D+  PL RG  +R  K + E WI   YE+LP+FCY CG + H+ K+C    R+   ++ +
Subjt:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKEC----RDAGDVKED

Query:  ELPYGPWMREPIKIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWRKTSVEDEEGYSRSPKKIEIPANSPPS---------KPITVKNPSKEIEKE--TV
        E  +GPW+R P +                               WRK  +           K+E+P  +  +          P TV+  ++ +  +   V
Subjt:  ELPYGPWMREPIKIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWRKTSVEDEEGYSRSPKKIEIPANSPPS---------KPITVKNPSKEIEKE--TV

Query:  EISVKENINDQAATDVDCGINI-NETVVPNSTEIAALSSYNVTDVSIR--SYSKGHIDAFI-KDPKGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIP
           ++   +D     + C +   N+ V  +  +   L  +   ++ +R  S+S  HIDA I ++ + +WR +G YG P    R E+WA+L RL   Y IP
Subjt:  EISVKENINDQAATDVDCGINI-NETVVPNSTEIAALSSYNVTDVSIR--SYSKGHIDAFI-KDPKGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIP

Query:  WILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADW
        W   GDFNE+    EK G   R E+ M  FR+ +D    +D G+ GP +TW N     D+ WERLDR +            +V+HL    S+H+PI   W
Subjt:  WILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADW

Query:  LEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQR-GFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPR-LDATSKLELAL
        +   +++     R+P +FEE W     C + I   W  P       T+ +K+  C   L  WS+  + G++K  I  TE+ +K    + +  +   ++  
Subjt:  LEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQR-GFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPR-LDATSKLELAL

Query:  KEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNP
         + +L  LL  DE  WRQ SR EWL+ GD+NT++FH KAT RRR N +  L D  G W      +  +  +Y+ +LF T+NP
Subjt:  KEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNP

A0A2N9HKV4 Uncharacterized protein9.3e-6931.14Show/hide
Query:  KASAME-IGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKEC----RDAGDVKEDELP
        KA   E IG  LG ++     E     GS +RI++++D   PL R   +R G+     W++  +E+LP FCY CGLL H IK+C    R  G   E    
Subjt:  KASAME-IGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKEC----RDAGDVKEDELP

Query:  YGPWMREPIKIKIRDNISSFRAHFFQAGRGRG------------------RLGEDIRGNWRK--TSVEDEEGYSRSPKKIEIPANSP-------------
        YG W+R   +   +  + S +    +   G G                  +  E +R N  +  ++  D+ G S     + +    P             
Subjt:  YGPWMREPIKIKIRDNISSFRAHFFQAGRGRG------------------RLGEDIRGNWRK--TSVEDEEGYSRSPKKIEIPANSP-------------

Query:  ----PSKPITVKNPS----KEIEKETV-EISVKENINDQAAT-------DVD------CGINI-NETVVPNSTEIAALSSYNVTD--VSIRSYSKGHIDA
             S P  +  P     K++ +E V E++    I D +A        D D      C ++  N+ VVP       L  +   D  ++I+S+S  HID 
Subjt:  ----PSKPITVKNPS----KEIEKETV-EISVKENINDQAAT-------DVD------CGINI-NETVVPNSTEIAALSSYNVTD--VSIRSYSKGHIDA

Query:  FIKDPKGL-WRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSD
         I +   L WRF+G YG P    R  +W VL  L   + +PW   GDFNE+    EK GG  RP+  M  FR  +D C   D G+ GPE+TWCN      
Subjt:  FIKDPKGL-WRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSD

Query:  LIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQ--DKMKLCLSR
         IW RLDRF++N E  +R    +V+H+    S+H P+   WL       S   R+  +FE  W+  + C+  +   W N +  G   +Q  +++K C  R
Subjt:  LIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQ--DKMKLCLSR

Query:  LSAWSRGKYEGSLKGAIERTEEAIKTLLPRLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWL
        L  WSR  +    +   E+ ++  K  L  +       +     EL  LLE +E  W QRSR  WLQ GDRNT++FH +A+ RRR N I+GL D+ G W 
Subjt:  LSAWSRGKYEGSLKGAIERTEEAIKTLLPRLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWL

Query:  EEDRDLERVATHYFQNLFQTSNP
        ++   +  +A  YFQNLF T  P
Subjt:  EEDRDLERVATHYFQNLFQTSNP

A0A2N9I921 Reverse transcriptase domain-containing protein3.8e-7027.95Show/hide
Query:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGD----VKED
        + +++  AM++GSL+G +      EE     +  RIK+++D+  PL RG  ++ GK +   WIA  YE+LP+FCY CGLL H  K+C +          D
Subjt:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGD----VKED

Query:  ELPYGPWMREPIKIKIRDNISSF--RAHFFQ----------------------AGRGRGRL-------------------GEDIRGNWRKTSVEDEEGYS
        +  +GPW+R  ++   R +  +   R + FQ                      AG     +                    +D   N R+       G++
Subjt:  ELPYGPWMREPIKIKIRDNISSF--RAHFFQ----------------------AGRGRGRL-------------------GEDIRGNWRKTSVEDEEGYS

Query:  RSPKKIEI----------------PAN-----------------------------SPPSKPITVKNPSKEIEKE-------TVEISVKENINDQAAT--
             I +                P N                             SPP KP   +   K I ++        + +  K  ++  A T  
Subjt:  RSPKKIEI----------------PAN-----------------------------SPPSKPITVKNPSKEIEKE-------TVEISVKENINDQAAT--

Query:  ------DVDCGINI-------------------------------------NETVVPNSTEIAALSSY--NVTDVSIRSYSKGHIDAFIKDPKGL-WRFS
              +V C  ++                                     N+ VVP   +   L+ +     D+ I SYS  HID  +    G+ WRF+
Subjt:  ------DVDCGINI-------------------------------------NETVVPNSTEIAALSSY--NVTDVSIRSYSKGHIDAFIKDPKGL-WRFS

Query:  GIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINN
          YG P  + R  +W +L  L   +++PW  GGDFNEI    EK G +++PE  ML FREA+D C  +D GYIG  +TWCN  F+   +WERLDR + + 
Subjt:  GIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINN

Query:  EMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQ-RGFRTIQDKMKLCLSRLSAWSRGKYEGSLK
            R    +VYHL    S+H+P+   WL + +  R+R   +P +FEE W+    C + IS  W +PS       +  K+  C ++L  WS+  + GS++
Subjt:  EMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQ-RGFRTIQDKMKLCLSRLSAWSRGKYEGSLK

Query:  GAIERTEEAIKTLLPR-LDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHY
          ++   E +K    + +   S   +     E+  LL  +E  WRQRSR +WL+ GDRNT +FH +AT R+R N I+GL D  G+W  +   ++ +   Y
Subjt:  GAIERTEEAIKTLLPR-LDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHY

Query:  FQNLFQTSNPQS
        FQN+F +SNP S
Subjt:  FQNLFQTSNPQS

A0A2N9IXK4 RNase H domain-containing protein2.7e-7630.91Show/hide
Query:  MVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKEC----RDAGDVKEDE
        +++ A+A +IG  +G + Q   +E+    G  +R+++ IDVH PL RG  +  G  N E  ++  YEKLP+FCY CGL+ H  K+C    R+   ++  +
Subjt:  MVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKEC----RDAGDVKEDE

Query:  LPYGPWMREPIKIKIRDNISSFRAHFFQAGRGRGRLGEDI---RGNWRKTSVEDEEGYSRSPKKIEIPANSPPSKP-ITVKNPSKEIEKET-------VE
          YG W+R P  +  R    S +   F++ R      ++    + +  K+  + E       +  ++  NS P  P I  KNP   I  E         E
Subjt:  LPYGPWMREPIKIKIRDNISSFRAHFFQAGRGRGRLGEDI---RGNWRKTSVEDEEGYSRSPKKIEIPANSPPSKP-ITVKNPSKEIEKET-------VE

Query:  ISVKE--------------------NINDQAATDVDCGININETVVPNSTEIAALSSY--NVTDVSIRSYSKGHIDAFIKDPK-GLWRFSGIYGNPNWNL
        I V                      + N+    ++    + ++ VVP   +   L+ +      VSI+S+S  HIDA I + +   WRF+G YG P  + 
Subjt:  ISVKE--------------------NINDQAATDVDCGININETVVPNSTEIAALSSY--NVTDVSIRSYSKGHIDAFIKDPK-GLWRFSGIYGNPNWNL

Query:  RHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFK
        RHE+W++L  L     +PW   GDFNE+    EK GG  R  + M DFR+AID C   D G+ GP +TWCN    S  +WERLDR L          L +
Subjt:  RHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFK

Query:  VYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQ-RGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAI
        V HL  +SS+H PI   +  + S +    S R  +FEE W+ +  C++ I+  W           + DK++ C + L  WSR  +        ++T+   
Subjt:  VYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWNNPSQ-RGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAI

Query:  KTLLPRLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNP-
        +     +      +    ++E+  LL  +E  WRQRSR++WL+WGD+NT +FH  AT RRR N I  + D  G     + D+ R    +F +LF +S+P 
Subjt:  KTLLPRLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNP-

Query:  QSDSI
        + DS+
Subjt:  QSDSI

A0A2N9J109 Uncharacterized protein3.3e-7430.77Show/hide
Query:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGD----VKED
        + +++  A+++GS LG +     +EE     +  RIK+++D+   L RG  ++ GK N+  WI+  YE+LP+FCY CGLL H  K+C + GD    V  D
Subjt:  KMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKLPDFCYGCGLLGHTIKECRDAGD----VKED

Query:  ELPYGPWMR-EPIKIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWRKTSVEDEEG----YSRSPKKIEIPANSPPS----KPITVKNPSKEIEKETVEI
        +  +GPW+R EP K   +  +S           GR           + +SV  E G        P+  ++     P      PI ++   K    +  EI
Subjt:  ELPYGPWMR-EPIKIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWRKTSVEDEEG----YSRSPKKIEIPANSPPS----KPITVKNPSKEIEKETVEI

Query:  SVKENINDQAATDV-----------------------------------DCGINI--------NETVVPNSTEIAALSSY--NVTDVSIRSYSKGHIDAF
            N       D                                    D  + +        N+ VVP   +   L+ +     D+ I SYS  HID  
Subjt:  SVKENINDQAATDV-----------------------------------DCGINI--------NETVVPNSTEIAALSSY--NVTDVSIRSYSKGHIDAF

Query:  I-KDPKGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDL
        +       W F+G YG P+ + R E+W +L  LK  Y++PW  GGDFNEI    EK G +++P+  M  FR+A+D+C  ID GYIG  +TWCN  F+   
Subjt:  I-KDPKGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNEITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDL

Query:  IWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWN-NPSQRGFRTIQDKMKLCLSRLS
        +WERLD+ +  +E        +V+HL    S+H+P+   WL  +S+   R + +P  FEE W+    C + I+  W  + S      +  K+  C   L 
Subjt:  IWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFEESWIKYDECRDIISQVWN-NPSQRGFRTIQDKMKLCLSRLS

Query:  AWSRGKYEGSLKGAIERTEEAIKTLLPR-LDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLE
         WS+  + GS++  ++     +K      +   S + +    +E+  LL  +E  WRQRSR +WL++GDRNT +FH +AT R+R N I+GL D  G W  
Subjt:  AWSRGKYEGSLKGAIERTEEAIKTLLPR-LDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLE

Query:  EDRDLERVATHYFQNLFQTSNPQS
        E   ++ + T YFQ++F TSNP S
Subjt:  EDRDLERVATHYFQNLFQTSNPQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.5e-0723.67Show/hide
Query:  ILGGDFNEITCEAEKSG--GVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNE-MQVRSSLFKVYHLALLSSNHRP--I
        IL GDF++I   ++       + P + + +F+  +   +L+D    G  YTW N H   + I  +LDR + N +      S   V+ L+ + S+H P  I
Subjt:  ILGGDFNEITCEAEKSG--GVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNE-MQVRSSLFKVYHLALLSSNHRP--I

Query:  LADWLEAQSDQRSR----MSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPR-LDAT
        + + L  +S +  R    +S  P       + ++E   + S +++              K C   L+    G  +   K A++  E     LL    D+ 
Subjt:  LADWLEAQSDQRSR----MSRRPKKFEESWIKYDECRDIISQVWNNPSQRGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPR-LDAT

Query:  SKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNL
         ++E  +  K+        E ++RQ+SR +WLQ GD NT++FH      +  N I  L  +    +E    ++ +   Y+ +L
Subjt:  SKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRNTQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGCCCACAAATCCTGATGCACTTGCAAGGCAACTTGCAGATTTGAAAGTTATTGCAGCAGAAAGATCTAGCGTTTATCAACTCAAAGAGGAAGAGGTTGATCA
AGCGGAGAAGAAAATGGTCAATAAGGCTTCGGCAATGGAGATTGGCAGCTTGTTGGGAAACATGGAGCAAATTGATATAGATGAGGAGAATGATCAGTGTGGCAGTTCCT
TGCGAATTAAAATTCAGATAGATGTTCATATGCCATTAAAGAGAGGGATCTTCTTACGAAAAGGTAAGGCAAACGCAGAAAAATGGATTGCAGTTACTTATGAGAAACTC
CCGGATTTTTGTTATGGCTGTGGATTGCTCGGCCACACGATTAAAGAGTGTAGAGATGCTGGGGATGTGAAAGAGGACGAGCTTCCGTACGGCCCATGGATGCGTGAACC
TATCAAAATTAAGATCAGGGACAATATTTCTTCTTTTAGGGCTCATTTCTTTCAAGCGGGAAGAGGAAGAGGACGTCTAGGGGAGGATATCAGAGGAAACTGGAGGAAAA
CGTCGGTTGAAGATGAAGAGGGGTATTCTCGATCACCGAAAAAAATCGAGATTCCGGCGAACAGCCCTCCGTCTAAGCCGATAACGGTCAAAAATCCAAGCAAGGAAATT
GAGAAGGAAACGGTCGAGATCTCTGTTAAAGAAAATATTAATGATCAAGCGGCTACGGATGTTGATTGCGGTATCAATATTAATGAAACGGTGGTCCCAAATAGCACTGA
AATTGCGGCTCTCAGCAGTTATAACGTGACTGATGTGTCAATACGATCGTACTCTAAAGGTCATATTGATGCGTTCATTAAGGATCCTAAGGGGTTATGGCGGTTCTCAG
GCATTTACGGTAATCCCAATTGGAATCTCAGGCATGAGACGTGGGCTGTATTAAATAGACTAAAGGATAATTATGAGATTCCTTGGATCCTAGGTGGAGACTTTAATGAA
ATCACTTGCGAGGCTGAAAAAAGTGGGGGAGTGGCTCGTCCGGAAAAATTGATGCTCGATTTTCGGGAGGCTATCGATTCTTGTGAGTTGATTGACCCTGGTTATATAGG
TCCGGAATATACATGGTGCAATAAACATTTTACTTCAGATCTTATTTGGGAAAGACTTGACAGATTCCTTATTAATAATGAAATGCAAGTTCGGAGCAGTTTATTCAAAG
TTTACCATCTAGCCTTACTGTCCTCGAATCATAGACCAATTCTTGCAGATTGGTTGGAGGCCCAATCGGATCAAAGAAGTAGGATGAGTAGACGCCCTAAGAAATTTGAG
GAATCTTGGATAAAATATGATGAATGTAGGGATATAATATCGCAAGTATGGAATAACCCAAGCCAAAGAGGATTTCGAACTATCCAAGACAAAATGAAGTTGTGTTTGAG
TCGTCTTTCTGCTTGGAGTCGAGGTAAGTATGAAGGATCTCTGAAGGGTGCTATTGAGAGGACCGAGGAAGCCATTAAAACCCTTTTACCAAGATTAGATGCAACTAGCA
AGCTTGAATTGGCTCTAAAGGAAAAAGAATTGGAGAACCTATTGGAAGATGATGAGATTTACTGGAGGCAAAGATCTAGGGAGGAGTGGCTGCAATGGGGTGATCGAAAC
ACTCAATGGTTTCATATGAAGGCTACTACAAGACGAAGAACTAATAAAATCCTTGGATTAATGGATGAGATGGGAAAATGGCTTGAAGAAGATAGAGATTTGGAGAGAGT
GGCCACTCATTACTTTCAAAATCTTTTCCAAACATCCAACCCTCAGTCGGATTCCATAAACCTCATTGAGACATGGAATGAAGAGATTATCAATCGTTGCTTTATGGAAG
ATGATGCCAAAGCAATCCTTAACATTCCATTGCGGCCTTTGGCAGAGGAAGACGAGATTATATGGAACTATGATTCAAAAGGGAGAAATTGGAGACTCCTATTCAGCTCT
TATGGGAATGCAAAATCACCAAAGGAGTATGGAGAATTTTCCTACCTACAACTGAAGATCTTAGGGTTGAAGACCAGAGAGGCTCGTCAGACATTAAACCCAACACCCTG
TCAGATCCCTGCTATTACCGTTCGGTCCTCTTCTTCAACGAGTGTTGCTGTTTGGTCTCCGTCGCCGGTCGGTACCTGGAAGCTTAACAGCGATGCGACGTGGAGTGAAG
CTCGAGGCCGCAGAGGAATTGGATGGGTTATTAGGCGTTGGGATGGATCGATTGTGGCTGCGGGTTGCAAAAGCATGGAACAAAGATGGCAAGTCAATTGGTTAGAGACC
TTTGCGGTTTGTGAAGGATTGAAAGCCTTGGCAACAACTTCTCCCCCTATCAGAATTGAGATGGATGTCCTTCAGGTTGTGCGACTTCTTTTACGCAAGGAGGAAGATGT
CACAGATTTGGCCAAGTTTATTAATGAAGCTGGAGCCCTAATGGCTGTTAGGAAAATTGATTCTATTAGCCATATATCCAGAACCCAGAATATCTTGGCCCATAAGTTGG
CTCGTGAGGCGTGTGACTTTAATAAAGCTGAGTGTTGGTTTAATGTATTTCCTGATTGGCTACTTTCTTTAAATAGAGATGATATTCAGGGAAGTTGTGTCACTATTGGG
GGTTCTTGTCCCACTAGTGTGTGTTCTTTGGGTAATTTTCCCAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGCCCACAAATCCTGATGCACTTGCAAGGCAACTTGCAGATTTGAAAGTTATTGCAGCAGAAAGATCTAGCGTTTATCAACTCAAAGAGGAAGAGGTTGATCA
AGCGGAGAAGAAAATGGTCAATAAGGCTTCGGCAATGGAGATTGGCAGCTTGTTGGGAAACATGGAGCAAATTGATATAGATGAGGAGAATGATCAGTGTGGCAGTTCCT
TGCGAATTAAAATTCAGATAGATGTTCATATGCCATTAAAGAGAGGGATCTTCTTACGAAAAGGTAAGGCAAACGCAGAAAAATGGATTGCAGTTACTTATGAGAAACTC
CCGGATTTTTGTTATGGCTGTGGATTGCTCGGCCACACGATTAAAGAGTGTAGAGATGCTGGGGATGTGAAAGAGGACGAGCTTCCGTACGGCCCATGGATGCGTGAACC
TATCAAAATTAAGATCAGGGACAATATTTCTTCTTTTAGGGCTCATTTCTTTCAAGCGGGAAGAGGAAGAGGACGTCTAGGGGAGGATATCAGAGGAAACTGGAGGAAAA
CGTCGGTTGAAGATGAAGAGGGGTATTCTCGATCACCGAAAAAAATCGAGATTCCGGCGAACAGCCCTCCGTCTAAGCCGATAACGGTCAAAAATCCAAGCAAGGAAATT
GAGAAGGAAACGGTCGAGATCTCTGTTAAAGAAAATATTAATGATCAAGCGGCTACGGATGTTGATTGCGGTATCAATATTAATGAAACGGTGGTCCCAAATAGCACTGA
AATTGCGGCTCTCAGCAGTTATAACGTGACTGATGTGTCAATACGATCGTACTCTAAAGGTCATATTGATGCGTTCATTAAGGATCCTAAGGGGTTATGGCGGTTCTCAG
GCATTTACGGTAATCCCAATTGGAATCTCAGGCATGAGACGTGGGCTGTATTAAATAGACTAAAGGATAATTATGAGATTCCTTGGATCCTAGGTGGAGACTTTAATGAA
ATCACTTGCGAGGCTGAAAAAAGTGGGGGAGTGGCTCGTCCGGAAAAATTGATGCTCGATTTTCGGGAGGCTATCGATTCTTGTGAGTTGATTGACCCTGGTTATATAGG
TCCGGAATATACATGGTGCAATAAACATTTTACTTCAGATCTTATTTGGGAAAGACTTGACAGATTCCTTATTAATAATGAAATGCAAGTTCGGAGCAGTTTATTCAAAG
TTTACCATCTAGCCTTACTGTCCTCGAATCATAGACCAATTCTTGCAGATTGGTTGGAGGCCCAATCGGATCAAAGAAGTAGGATGAGTAGACGCCCTAAGAAATTTGAG
GAATCTTGGATAAAATATGATGAATGTAGGGATATAATATCGCAAGTATGGAATAACCCAAGCCAAAGAGGATTTCGAACTATCCAAGACAAAATGAAGTTGTGTTTGAG
TCGTCTTTCTGCTTGGAGTCGAGGTAAGTATGAAGGATCTCTGAAGGGTGCTATTGAGAGGACCGAGGAAGCCATTAAAACCCTTTTACCAAGATTAGATGCAACTAGCA
AGCTTGAATTGGCTCTAAAGGAAAAAGAATTGGAGAACCTATTGGAAGATGATGAGATTTACTGGAGGCAAAGATCTAGGGAGGAGTGGCTGCAATGGGGTGATCGAAAC
ACTCAATGGTTTCATATGAAGGCTACTACAAGACGAAGAACTAATAAAATCCTTGGATTAATGGATGAGATGGGAAAATGGCTTGAAGAAGATAGAGATTTGGAGAGAGT
GGCCACTCATTACTTTCAAAATCTTTTCCAAACATCCAACCCTCAGTCGGATTCCATAAACCTCATTGAGACATGGAATGAAGAGATTATCAATCGTTGCTTTATGGAAG
ATGATGCCAAAGCAATCCTTAACATTCCATTGCGGCCTTTGGCAGAGGAAGACGAGATTATATGGAACTATGATTCAAAAGGGAGAAATTGGAGACTCCTATTCAGCTCT
TATGGGAATGCAAAATCACCAAAGGAGTATGGAGAATTTTCCTACCTACAACTGAAGATCTTAGGGTTGAAGACCAGAGAGGCTCGTCAGACATTAAACCCAACACCCTG
TCAGATCCCTGCTATTACCGTTCGGTCCTCTTCTTCAACGAGTGTTGCTGTTTGGTCTCCGTCGCCGGTCGGTACCTGGAAGCTTAACAGCGATGCGACGTGGAGTGAAG
CTCGAGGCCGCAGAGGAATTGGATGGGTTATTAGGCGTTGGGATGGATCGATTGTGGCTGCGGGTTGCAAAAGCATGGAACAAAGATGGCAAGTCAATTGGTTAGAGACC
TTTGCGGTTTGTGAAGGATTGAAAGCCTTGGCAACAACTTCTCCCCCTATCAGAATTGAGATGGATGTCCTTCAGGTTGTGCGACTTCTTTTACGCAAGGAGGAAGATGT
CACAGATTTGGCCAAGTTTATTAATGAAGCTGGAGCCCTAATGGCTGTTAGGAAAATTGATTCTATTAGCCATATATCCAGAACCCAGAATATCTTGGCCCATAAGTTGG
CTCGTGAGGCGTGTGACTTTAATAAAGCTGAGTGTTGGTTTAATGTATTTCCTGATTGGCTACTTTCTTTAAATAGAGATGATATTCAGGGAAGTTGTGTCACTATTGGG
GGTTCTTGTCCCACTAGTGTGTGTTCTTTGGGTAATTTTCCCAGTTAA
Protein sequenceShow/hide protein sequence
MEEPTNPDALARQLADLKVIAAERSSVYQLKEEEVDQAEKKMVNKASAMEIGSLLGNMEQIDIDEENDQCGSSLRIKIQIDVHMPLKRGIFLRKGKANAEKWIAVTYEKL
PDFCYGCGLLGHTIKECRDAGDVKEDELPYGPWMREPIKIKIRDNISSFRAHFFQAGRGRGRLGEDIRGNWRKTSVEDEEGYSRSPKKIEIPANSPPSKPITVKNPSKEI
EKETVEISVKENINDQAATDVDCGININETVVPNSTEIAALSSYNVTDVSIRSYSKGHIDAFIKDPKGLWRFSGIYGNPNWNLRHETWAVLNRLKDNYEIPWILGGDFNE
ITCEAEKSGGVARPEKLMLDFREAIDSCELIDPGYIGPEYTWCNKHFTSDLIWERLDRFLINNEMQVRSSLFKVYHLALLSSNHRPILADWLEAQSDQRSRMSRRPKKFE
ESWIKYDECRDIISQVWNNPSQRGFRTIQDKMKLCLSRLSAWSRGKYEGSLKGAIERTEEAIKTLLPRLDATSKLELALKEKELENLLEDDEIYWRQRSREEWLQWGDRN
TQWFHMKATTRRRTNKILGLMDEMGKWLEEDRDLERVATHYFQNLFQTSNPQSDSINLIETWNEEIINRCFMEDDAKAILNIPLRPLAEEDEIIWNYDSKGRNWRLLFSS
YGNAKSPKEYGEFSYLQLKILGLKTREARQTLNPTPCQIPAITVRSSSSTSVAVWSPSPVGTWKLNSDATWSEARGRRGIGWVIRRWDGSIVAAGCKSMEQRWQVNWLET
FAVCEGLKALATTSPPIRIEMDVLQVVRLLLRKEEDVTDLAKFINEAGALMAVRKIDSISHISRTQNILAHKLAREACDFNKAECWFNVFPDWLLSLNRDDIQGSCVTIG
GSCPTSVCSLGNFPS