CuGenDBv2

Lcy04g002990 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy04g002990
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr04:23579483..23582660
RNA-Seq Expression: Lcy04g002990
Synteny: Lcy04g002990
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits
AAS55787.1 hypothetical protein [Oryza sativa Japonica Group] (e value: 1.2e-42; % identity: 30.43)
Query:  GDFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPL
        G   +     N+S  W  I +G +L K+G  WR+GNG+ + +  DPWIPR  +R P+    +   K V DLI E+  W+   I   F+ IDA  I  I +
Subjt:  GDFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPL

Query:  GNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETP
            ++D I W+ D  GRFSV++AY LA  +        S  S   + W  IW  N+  + +I  WR+  N + T  N   + ++   +C +C  E E  
Subjt:  GNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETP

Query:  GHIFWRCKKASANKVTVDDQKLTQWIHRNFEDQRKRTHCHLAEIRLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRIN
        GH   RC    AN + V+ +K    +  NF       H   AE R         W  P    +KLN D S++ +   GGIG ++ + LG+++ +  + ++
Subjt:  GHIFWRCKKASANKVTVDDQKLTQWIHRNFEDQRKRTHCHLAEIRLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRIN

Query:  RKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA
                E+ A  EGL  +L L       + VE D S VI+ LNH + D S    +  + ++L      IA  K  R  N ++H L+  A
Subjt:  RKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA

AAS55787.1 hypothetical protein [Oryza sativa Japonica Group] (e value: 1.8e-22; % identity: 32.57)
Query:  MKLLCWNARDLGNSRAIRVLRHMVESVKPQLIFIMETKCDVGRSAKIKLALQRDNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLG--R
        M  L WN R LGN+  ++ LR +++    QL+F+ ET+  V + ++++  L       V S+GKSGG  L+W+  + V++   +  +ID +V  S    +
Subjt:  MKLLCWNARDLGNSRAIRVLRHMVESVKPQLIFIMETKCDVGRSAKIKLALQRDNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLG--R

Query:  WRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDAG
        W  T  YG P  + RH  W+LL  ++     PW++ GDFNE ++  E      R   Q+  F +A+  C+L D G
Subjt:  WRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDAG

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense] (e value: 1.8e-54; % identity: 26.75)
Query:  MKLLCWNARDLGNSRAIRVLRHMVESVKPQLIFIMETKCDVGRSAKIKLALQRDNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLGR--
        MK L WN + LGN   IR L ++V+   P  +F+MET+       KIK++L   N   V S G+SGG  L W  D+ +++ SYS  HID  +     R  
Subjt:  MKLLCWNARDLGNSRAIRVLRHMVESVKPQLIFIMETKCDVGRSAKIKLALQRDNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLGR--

Query:  WRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDAG-DGDFFNAPVGVNSSLVWKSIVWGR
        WR TG YG+PE  K+  +W L+  L   +  PW+  GDFNEI  + EK G V +   ++  F EAI  C L+  G +G+ F           W +   G 
Subjt:  WRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDAG-DGDFFNAPVGVNSSLVWKSIVWGR

Query:  ELFKEGFRWRVGNGHHIYV--------------EHDPWIPRQGNRHPL--------------------------------KVHPSLFGKRVKDLILENED
           +E     +      ++              +H P +       P                                 +V   +  +  K  +L ++D
Subjt:  ELFKEGFRWRVGNGHHIYV--------------EHDPWIPRQGNRHPL--------------------------------KVHPSLFGKRVKDLILENED

Query:  ---WNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIR-------RSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWR
           WN  L+  +F+P +A+ I +IPL      D+ +W+  +KG FSV++AYHL + +R        ST +   + S    KW  +W L I  + KI  W+
Subjt:  ---WNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIR-------RSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWR

Query:  IVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKA----------------SANKVTVDDQKLTQ-----------------WIHRN---FE
        +  N++P + NL  + + +  +C +C  E ET  H+   C  A                SA+ ++   +++ +                 W HRN   F 
Subjt:  IVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKA----------------SANKVTVDDQKLTQ-----------------WIHRN---FE

Query:  DQR-------KRTHCHLAEIR-------LESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGL
          +       +R +  LA+          ES+     W  PP +L K+N D + +    S G+G V+ D  G L+ A  KRI+       +E  A +EGL
Subjt:  DQR-------KRTHCHLAEIR-------LESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGL

Query:  QNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLID
        +  L L       +I+E+D+   I++L  +E + S +  +L D
Subjt:  QNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLID

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 4.3e-48; % identity: 29.35)
Query:  FFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGN
        F+NA VG N S +W+SI+WG ++ K+G RWR+G+G  + V  D WIPR     P+          V DLI     W  + +   F+  D + IL I L +
Subjt:  FFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGN

Query:  SRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGH
         +++DE++W+ D KG +SVK+ Y LA N     E   S+ S+  R W+  W L++  + KI  WR +KN++PT  NL  +     P+C  C+ +VET  H
Subjt:  SRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGH

Query:  IFWRCKKAS----------------------------ANKVTVDDQKL-----TQWIHRN-FEDQRKRTHCHLAEIRLESLR------------------
        +   CK A                             +   T + + +       W  RN F  + K++       + +S+                   
Subjt:  IFWRCKKAS----------------------------ANKVTVDDQKL-----TQWIHRN-FEDQRKRTHCHLAEIRLESLR------------------

Query:  --NHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEE
          + + W PP QN+LKLN DA+ +      G+G ++ D+ G ++    K+   + ++   E  AI  GLQ     +Q  SS+LIVE+D  EV++ LN+ +
Subjt:  --NHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEE

Query:  SDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA
           ++   +L DV   + E   + F   PR  N  AH L++ A
Subjt:  SDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 2.0e-21; % identity: 43.8)
Query:  DNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLGR-WRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNR
        +N F V   G  GG  LFW+SD+ V I S+S  HID  V +  G+ WR TG YG+ E  ++H +WALL+ L   +   W   GDFNEI++S+EK+G+ + 
Subjt:  DNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLGR-WRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNR

Query:  NLNQIGLFVEAINRCELMDAG
        + N +  F E+I  C LMD G
Subjt:  NLNQIGLFVEAINRCELMDAG

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 9.3e-43; % identity: 28.15)
Query:  DGDFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENE-DWNEELIRNLFIPIDAKDILAI
        D  F  A +  N S +W+SI+WGR+L K+G RWR+GNG  +++  D W+P Q     L         RV  L+   E  W  +++R+ F P +AK IL+I
Subjt:  DGDFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENE-DWNEELIRNLFIPIDAKDILAI

Query:  PLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIR-KWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEV
        P+G   ++D +IWN +  G +SV++ Y +A       +A  S  S ++R  W   W ++I  + K+  WR+  + +PT  NL  +G++I   C  C    
Subjt:  PLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIR-KWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEV

Query:  ETPGHIFWRCKKASANKVTVDDQKLTQ--------------------------WIHRN---FEDQRKR-----------THCHLAEIRLESLRN------
        E   H+FW CK A A  +     KL+                           W  RN   F D  K             + +  E R E+  N      
Subjt:  ETPGHIFWRCKKASANKVTVDDQKLTQ--------------------------WIHRN---FEDQRKR-----------THCHLAEIRLESLRN------

Query:  ----HECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHE
               W PP + + K+N+DAS+  S    G+G +IH+  G ++ A  K +     +   E  A  EGLQ                  ASE+   ++  
Subjt:  ----HECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHE

Query:  ESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA
          DLS+   +++  +N   ++   +F    R GN+ AH L+R A
Subjt:  ESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis] (e value: 4.3e-48; % identity: 28.19)
Query:  FYGNPETDKRHFSWALLERL-KACFEGPWIIGGDFNEIMFSNEKVGSVN--RNLNQIGLFVEAINRCELMDAGDGDFFNAPVGVNSSLVWKSIVWGRELF
        F+   + DKR   W   E+L +A   G    G  F E    N+ + +    R L      V  + +       +  F  A  G N+S +W+SI+WGR++ 
Subjt:  FYGNPETDKRHFSWALLERL-KACFEGPWIIGGDFNEIMFSNEKVGSVN--RNLNQIGLFVEAINRCELMDAGDGDFFNAPVGVNSSLVWKSIVWGRELF

Query:  KEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYH
        K+G RWR+GNG  I +  D W+PR     P+          V DLI  +  W+E  +R  F+ +D  +IL IPL   + +DE++W+ D +G +SVK+ Y 
Subjt:  KEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYH

Query:  LATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKA---------SANKVTV
        LA  +R       S      + W ++W L +  + KI  WR   NL+P+  NL  + +   P C  C+  VET  H    CK A         SA ++  
Subjt:  LATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKA---------SANKVTV

Query:  DDQKL------------------------TQWIHRN------------------------FEDQRKRTHCHLAEIRLESLRNHECWSPPPQNLLKLNSDA
        + Q +                        + W  RN                        F+  RK    H   I +      + W PPPQN+ K+N DA
Subjt:  DDQKL------------------------TQWIHRN------------------------FEDQRKRTHCHLAEIRLESLRNHECWSPPPQNLLKLNSDA

Query:  SWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRS-SNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEA
        ++N    S G+G VI DS G +V A   +   K      E  A+  GLQ    L++N   S+LI+E+D  EV++ +N+ +   S+    ++ ++N     
Subjt:  SWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRS-SNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEA

Query:  GVIAFVKCPRLGNRVAHNLSRAAAG
          +     PR  N  AH L++ A G
Subjt:  GVIAFVKCPRLGNRVAHNLSRAAAG

TrEMBL top hits
A0A1S8ACU2 Ribonuclease H-like superfamily protein (e value: 2.8e-45; % identity: 27.78)
Query:  GFYGNPETDKRHFSWALLERL-KACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDA---GDGDFFNAPVGVNSSLVWKSIVWGRE
        GF+   + D++   WA  E+L +A   G    G  F +I   N+ +  V +   +I  + E++   +++ A      DF  A +G   S VW+SI+WGR+
Subjt:  GFYGNPETDKRHFSWALLERL-KACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDA---GDGDFFNAPVGVNSSLVWKSIVWGRE

Query:  LFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNA
        +   G RWR+G G  + +    WIPR     P+          V +LI EN+ W E LI+  F   DA+ I  I L  S   D+I+W+ D KG +SVK+ 
Subjt:  LFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNA

Query:  YHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKAS-------------
        Y +A  I+    A  S  S+ + +W  IW L +  + KI  W+  +N +PT  NL  + M   P+C  C+ + E   H    CK A              
Subjt:  YHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKAS-------------

Query:  --------------ANKVTVDDQKL------TQWIHRN---FEDQRKRTHCHLAE----------IRLESLR--------NHECWSPPPQNLLKLNSDAS
                       N+ + D+ +L      T W  RN   FE++R+     +A+          +R+  ++            W PPPQ   K N DA+
Subjt:  --------------ANKVTVDDQKL------TQWIHRN---FEDQRKRTHCHLAE----------IRLESLR--------NHECWSPPPQNLLKLNSDAS

Query:  WNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDV-ENLTVEAG
         N+ +N  G+G VI D  G+++ A          +   E AA+  GLQ  +  S    S LI+E D  +V+  + H +S  ++    + ++ + L     
Subjt:  WNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDV-ENLTVEAG

Query:  VIAFVKCPRLGNRVAHNLSRAA
        ++   K  R  N +AH L++ A
Subjt:  VIAFVKCPRLGNRVAHNLSRAA

A0A803QI56 Uncharacterized protein (e value: 3.8e-50; % identity: 24.19)
Query:  MKLLCWNARDLGNSRAIRVLRHMVESVKPQLIFIMETKCDVGRSAKIKLALQRDNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDS-LGRW
        M +L WN R LGN RAI+ L+ +V    P  IF+ ETKCD  R   +   L+ + VF V ++G SGG  L W +    ++  YS+ HID  V  S  G W
Subjt:  MKLLCWNARDLGNSRAIRVLRHMVESVKPQLIFIMETKCDVGRSAKIKLALQRDNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDS-LGRW

Query:  RFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMD-----------AGDGDFFNAPVGVNSSL
        + TGFYG PE + RH SW LL  L      PW + GD N I+   +K G        I  F +A+N C L+D            G G      V ++ +L
Subjt:  RFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMD-----------AGDGDFFNAPVGVNSSL

Query:  V---WKSI---------------------------------------------------------------------------VWGREL-----------
        +   W  I                                                                           +WG+E+           
Subjt:  V---WKSI---------------------------------------------------------------------------VWGREL-----------

Query:  ---------------------------------------------FKEG-----FRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDL-ILE
                                                      KEG      RW V +G  I V  +PW+P + + +    HPSL   +V++L ++E
Subjt:  ---------------------------------------------FKEG-----FRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDL-ILE

Query:  NEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIP
           W+ +++ +LFI  D K IL IPL  S   D++ W+ ++ G +SVK+ Y+L   I    +    DD T+   W   W   I  + K   WR  +  +P
Subjt:  NEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIP

Query:  TKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKK--------------------------ASANKVTVDDQKL-------TQWIHRN------------
        T   L  K +D+   C +C  E E+  H    C K                           +A      ++KL         W  RN            
Subjt:  TKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKK--------------------------ASANKVTVDDQKL-------TQWIHRN------------

Query:  --------FEDQRKRTHCHLAEIRLESLR---NHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQ
                + +Q K       E     L+     E W+ P  + +K+N DA+  +S N  G G V  D  G L++   K        +  E   I+E L 
Subjt:  --------FEDQRKRTHCHLAEIRLESLR---NHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQ

Query:  NYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA
            + ++   ++ +E D   V++++  E   +S    ++ + +NL +E   I+ +   R  N VAHN +RA+
Subjt:  NYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAA

M5VU98 Reverse transcriptase domain-containing protein (e value: 7.0e-44; % identity: 29.14)
Query:  DFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGN----RHPLKVHPSLFGKRVKDLILE--NEDWNEELIRNLFIPIDAKDI
        +F+ A +G   S VWKSI   R++ + G R+++G+G  + +  D W+PR         PL     +   +V +LI    +  W+ + + NLF+P+D  DI
Subjt:  DFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGN----RHPLKVHPSLFGKRVKDLILE--NEDWNEELIRNLFIPIDAKDI

Query:  LAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTE-AFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCR
        + IPL      D I+WN D  G F+VK+AY +A  +    E    S +S     WR IW+  +  + KI  WR+  +++PTK NLI KG+D+  +C+ C 
Subjt:  LAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTE-AFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCR

Query:  GEVETPGHIFWRCKKASAN-KVTVDDQKLTQWIHRNFEDQRKRTHCHLAEI---------RLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVI
           E+  H+   C  A A   +++  +   Q + R+  +       ++ E            + +R+   W+ PP   LK N D +++ +   G +G V 
Subjt:  GEVETPGHIFWRCKKASAN-KVTVDDQKLTQWIHRNFEDQRKRTHCHLAEI---------RLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVI

Query:  HDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVA
         D+ G  V A  K +      +  EI A +EG+   L+L    +++ I E D++ V+ ++     D S   T++ DV++L  +     F   PR  N VA
Subjt:  HDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVA

Query:  HNLSR
        H L+R
Subjt:  HNLSR

M5VU98 Reverse transcriptase domain-containing protein (e value: 8.3e-13; % identity: 37.72)
Query:  KGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLG--RWRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGL
        +G SGG  L W  ++ V++ ++SD  ID  +  + G  RWR T FYG P    R  SW LL++L    + PW+  GDFNEI+ ++EK G   RN  Q+  
Subjt:  KGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLG--RWRFTGFYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGL

Query:  FVEAINRCELMDAG
        F   +++    D G
Subjt:  FVEAINRCELMDAG

M5VU98 Reverse transcriptase domain-containing protein (e value: 9.1e-44; % identity: 29.41)
Query:  FFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGN
        F  A  G + SL W+ I WGR+L  +G R+++GNG H+    DPWIP      P+  +       V   IL+  +WN   +   F  ID   IL IPL  
Subjt:  FFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGN

Query:  SRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGH
         +  D +IW+++  G +SVK+++HLAT+I    +   SDD      W+  W L +  + KI  W++++N +P    L  + +  + LC  C+   E+ GH
Subjt:  SRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGH

Query:  IFWRCKKASA----NKVTVDDQKLTQWIHRNFE-----DQRKRTHCHLA----------EIRLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWV
          + CK A +     K  +D     +  + N++      Q ++T  ++            I  +  R+   W PPP NLLK+N DA+ N    + GIG V
Subjt:  IFWRCKKASA----NKVTVDDQKLTQWIHRNFE-----DQRKRTHCHLA----------EIRLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWV

Query:  IHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRV
        + +  G ++ A  K++   ++   +E   +   L      SQN+ S  +VE DA  V  +LN   +DLS    L+ D   L      +      R  N+ 
Subjt:  IHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRV

Query:  AHNLSRAA
        AH L++ A
Subjt:  AHNLSRAA

M5XSK0 Reverse transcriptase domain-containing protein (e value: 8.2e-45; % identity: 26.94)
Query:  FYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDAGDG--------------DFFNAPVGVNSSLV
        F+ N  T+ +   W   +RL A                   E+ G   RNL+   L + A     L+   D                F    V   +S+V
Subjt:  FYGNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDAGDG--------------DFFNAPVGVNSSLV

Query:  WKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFG-KRVKDLIL-ENEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNN
        WKS+   R +  +G RW+VG+G  I +  D W+P+  +       P      +V DLI  ++ +WN  L++N+F P +   I +IPL      D ++W+ 
Subjt:  WKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFG-KRVKDLIL-ENEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNN

Query:  DAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQI-RKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIF--------
        D KG F+VK+AYH+A ++  ST    S +S  + R W  +W   +  R K  +WR++  ++PTK NL  K + ++  C+LC G V++  HI         
Subjt:  DAKGRFSVKNAYHLATNIRRSTEAFGSDDSTQI-RKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIF--------

Query:  ------WRCKKA----SANKVTVDDQKLTQWIHRN--FEDQRKRTHCHL---AEIRL-ESLRNHEC-------------WSPPPQNLLKLNSDASWNESQ
              W C+ A    S +  T        W  RN    + +K  H  +   A +RL + LR   C             W PP +N LK+N D +W    
Subjt:  ------WRCKKA----SANKVTVDDQKLTQWIHRN--FEDQRKRTHCHL---AEIRL-ESLRNHEC-------------WSPPPQNLLKLNSDASWNESQ

Query:  NSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVK
          GG+G V+ DS G  V     ++   +    +E  A +    N +   +    N++ E+DA +++ +L +   D S    ++ D ++L  +     F  
Subjt:  NSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVK

Query:  CPRLGNRVAHNLSRAA
          R  N VAH L+R A
Subjt:  CPRLGNRVAHNLSRAA

SwissProt top hits
No hits found

Arabidopsis top hits
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.1e-09; % identity: 26.95)
Query:  WSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKD
        W  PP   +K N+DA+W       GIGW++ +  G ++    + + R   +   E+ A++  +   LT+S+     +I E+DA  ++  LN ++      
Subjt:  WSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDLSKD

Query:  KTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAAAGFS
        +  L D++ L      + F   PR GN+VA  ++R +  FS
Subjt:  KTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAAAGFS

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 5.5e-17; % identity: 22.15)
Query:  DGDFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENED---WNEELIRNLFIPIDAKDIL
        D    +A V    S  W S++ G  L K+G R  +G+G +I +  D  +     R PL    +     + +L         W++  I       D   I 
Subjt:  DGDFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENED---WNEELIRNLFIPIDAKDIL

Query:  AIPLGNSRDKDEIIWNNDAKGRFSVKNAYHL-----ATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCV
         I L  S+  D+IIWN +  G ++V++ Y L     +TNI       GS D         IW+L I+ + K   WR +   + T   L ++GM I+P C 
Subjt:  AIPLGNSRDKDEIIWNNDAKGRFSVKNAYHL-----ATNIRRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCV

Query:  LCRGEVETPGHIFWRCKKAS-----------ANKVTVDD------------QKLTQ---------------WIHRN------FEDQRKRT----------
         C  E E+  H  + C  A+            N++  +D            Q  T                W  RN      F +   +T          
Subjt:  LCRGEVETPGHIFWRCKKAS-----------ANKVTVDD------------QKLTQ---------------WIHRN------FEDQRKRT----------

Query:  -------HCHLAEIRLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSN
               H        +   N   W  PP   +K N DA ++  +     GW+I +  G+ +     ++         E  A+   LQ          + 
Subjt:  -------HCHLAEIRLESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSN

Query:  LIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVK---CPRLGNRVAHNLSRAAAGFSPMASPS
        + +E D   +I  +N     +S   +L   +E+++  A   A ++     R GN++AH L++    +S   S S
Subjt:  LIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVK---CPRLGNRVAHNLSRAAAGFSPMASPS

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 9.7e-06; % identity: 29.73)
Query:  IWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKASANKVTVDDQKLTQW
        IW L I  + K+  W+ + N +P    L+S+ + I P C  CR + ET  HI + C  A    +        +W
Subjt:  IWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKASANKVTVDDQKLTQW

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.3e-31; % identity: 24.14)
Query:  DFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWI---PRQGNRHPLKVHPSLFGK-----RVKDLILEN-EDWNEELIRNLFIPIDA
        D  NAP+G   S VWKSI   +E+ ++G R  VGNG  I +    W+   P        +V P  +       +V DLI E+  +W +++I  LF  ++ 
Subjt:  DFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGNGHHIYVEHDPWI---PRQGNRHPLKVHPSLFGK-----RVKDLILEN-EDWNEELIRNLFIPIDA

Query:  KDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNI--RRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLC
        K I  +  G  R  D   W+  + G ++VK+ Y + T I  +RS+    S+ S     ++ IW      + +   W+ + N +P  G L  + +     C
Subjt:  KDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNI--RRSTEAFGSDDSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLC

Query:  VLCRGEVETPGHIFWRCKKA--------------------------------SANKVTVDDQKLTQWI----------------HRNFEDQRKRTHCHLA
        + C    ET  H+ ++C  A                                + N       +L  W+                  N ++  +R    L 
Subjt:  VLCRGEVETPGHIFWRCKKA--------------------------------SANKVTVDDQKLTQWI----------------HRNFEDQRKRTHCHLA

Query:  EIRLES----------LRNHEC--WSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSN
        E R+ +          +    C  W PPP   +K N+DA+WN      GIGWV+ +  G +     + + +   +   E+ A++  +   L+LS+ + + 
Subjt:  EIRLES----------LRNHEC--WSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSN

Query:  LIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAAAGF
        +I E+D+  +I+ LN++E   S   T+  D++ L  +   + FV  PR GN +A  ++R +  F
Subjt:  LIVEADASEVIKSLNHEESDLSKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAAAGF

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 8.2e-05; % identity: 23.96)
Query:  WSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESD
        WSPP ++ LK N DAS +E     G+GW++ +S G++++    +   +   +  E + +   +Q        +   +I E D   + + +N + S+
Subjt:  WSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESD


Sequences
CDS sequence
ATGAAACTCTTATGTTGGAATGCTCGAGATCTGGGGAATTCTCGAGCGATCCGAGTGCTTCGACACATGGTAGAAAGCGTTAAACCCCAACTGATCTTCATAATG
GAAACCAAGTGTGATGTGGGCAGGAGTGCTAAAATCAAGCTAGCTTTACAACGTGACAATGTTTTTTGTGTCCCTAGCAAAGGTAAGAGTGGGGGGTTCATGCTT
TTCTGGAATTCTGATATAGGGGTCAATATCAATTCTTATTCGGATGGTCATATCGACACCTTTGTTAATGATAGTTTGGGAAGATGGAGATTCACGGGTTTTTAT
GGCAACCCAGAAACCGACAAAAGGCACTTCTCTTGGGCGCTTCTTGAAAGGTTAAAGGCTTGTTTTGAGGGGCCGTGGATTATCGGGGGCGACTTCAACGAAATC
ATGTTCTCAAATGAGAAGGTAGGGAGTGTGAATAGGAACTTAAATCAAATAGGCCTGTTTGTGGAAGCTATTAACAGATGTGAGTTAATGGACGCGGGAGATGGG
GACTTCTTCAATGCCCCGGTGGGGGTTAACTCCTCCCTCGTTTGGAAAAGCATCGTGTGGGGGAGAGAGCTCTTCAAGGAAGGTTTTAGATGGAGGGTTGGTAAT
GGTCACCATATCTACGTTGAACATGATCCATGGATTCCTAGGCAAGGCAATCGTCACCCTTTGAAGGTGCATCCCTCTTTGTTTGGGAAACGAGTGAAAGATCTA
ATCCTAGAGAACGAAGATTGGAATGAAGAGCTAATTAGAAACCTCTTCATTCCCATAGACGCCAAGGATATCCTTGCTATTCCTCTTGGAAACTCGCGGGACAAG
GATGAGATTATATGGAATAACGATGCTAAGGGTAGATTCAGTGTAAAAAACGCATATCATCTGGCAACTAATATCCGCCGTTCAACAGAAGCTTTCGGTTCGGAC
GACTCCACTCAAATCAGGAAATGGAGATCCATATGGGATCTCAACATTATACTGAGAGCCAAGATAGGCTTTTGGAGAATAGTGAAAAACCTAATCCCTACCAAA
GGCAATCTTATCTCTAAAGGAATGGATATTAATCCCCTTTGTGTTTTGTGCAGGGGCGAGGTGGAAACCCCGGGTCATATCTTCTGGAGATGTAAAAAGGCCTCG
GCTAACAAAGTCACAGTGGATGATCAAAAGCTTACTCAGTGGATTCATCGGAATTTTGAAGATCAAAGGAAACGTACCCACTGTCATCTGGCAGAGATCAGGCTA
GAGAGCCTCCGGAATCACGAATGCTGGTCCCCTCCTCCTCAGAATCTCCTGAAGCTTAATTCTGACGCCTCTTGGAATGAGTCTCAGAATTCTGGGGGAATTGGC
TGGGTAATTCACGATTCCTTAGGATCTCTGGTCCAAGCGTGGTTCAAACGCATTAATCGCAAATGGAAGATAAAATCGTTGGAGATCGCTGCGATTAAGGAAGGG
TTACAAAATTACCTTACTCTAAGCCAGAATCGATCCTCGAATTTGATTGTAGAAGCAGACGCCTCTGAAGTAATCAAGTCGCTTAATCATGAGGAGTCTGATCTC
TCTAAAGATAAGACTTTGCTGATCGATGTTGAAAATCTCACGGTGGAAGCAGGAGTAATCGCCTTCGTCAAATGTCCAAGGCTGGGCAATCGAGTAGCACACAAT
CTCTCGCGAGCAGCTGCGGGCTTCTCGCCGATGGCCTCTCCGTCGACCAACGTAGTCGACGGCTTTTGTGTTTCTTCGACTTCTTCCACGCTGGAAGGAATTTTT
TTTTGTACAGGTGATTTGATCCCTAATTAA
mRNA sequence
ATGAAACTCTTATGTTGGAATGCTCGAGATCTGGGGAATTCTCGAGCGATCCGAGTGCTTCGACACATGGTAGAAAGCGTTAAACCCCAACTGATCTTCATAATG
GAAACCAAGTGTGATGTGGGCAGGAGTGCTAAAATCAAGCTAGCTTTACAACGTGACAATGTTTTTTGTGTCCCTAGCAAAGGTAAGAGTGGGGGGTTCATGCTT
TTCTGGAATTCTGATATAGGGGTCAATATCAATTCTTATTCGGATGGTCATATCGACACCTTTGTTAATGATAGTTTGGGAAGATGGAGATTCACGGGTTTTTAT
GGCAACCCAGAAACCGACAAAAGGCACTTCTCTTGGGCGCTTCTTGAAAGGTTAAAGGCTTGTTTTGAGGGGCCGTGGATTATCGGGGGCGACTTCAACGAAATC
ATGTTCTCAAATGAGAAGGTAGGGAGTGTGAATAGGAACTTAAATCAAATAGGCCTGTTTGTGGAAGCTATTAACAGATGTGAGTTAATGGACGCGGGAGATGGG
GACTTCTTCAATGCCCCGGTGGGGGTTAACTCCTCCCTCGTTTGGAAAAGCATCGTGTGGGGGAGAGAGCTCTTCAAGGAAGGTTTTAGATGGAGGGTTGGTAAT
GGTCACCATATCTACGTTGAACATGATCCATGGATTCCTAGGCAAGGCAATCGTCACCCTTTGAAGGTGCATCCCTCTTTGTTTGGGAAACGAGTGAAAGATCTA
ATCCTAGAGAACGAAGATTGGAATGAAGAGCTAATTAGAAACCTCTTCATTCCCATAGACGCCAAGGATATCCTTGCTATTCCTCTTGGAAACTCGCGGGACAAG
GATGAGATTATATGGAATAACGATGCTAAGGGTAGATTCAGTGTAAAAAACGCATATCATCTGGCAACTAATATCCGCCGTTCAACAGAAGCTTTCGGTTCGGAC
GACTCCACTCAAATCAGGAAATGGAGATCCATATGGGATCTCAACATTATACTGAGAGCCAAGATAGGCTTTTGGAGAATAGTGAAAAACCTAATCCCTACCAAA
GGCAATCTTATCTCTAAAGGAATGGATATTAATCCCCTTTGTGTTTTGTGCAGGGGCGAGGTGGAAACCCCGGGTCATATCTTCTGGAGATGTAAAAAGGCCTCG
GCTAACAAAGTCACAGTGGATGATCAAAAGCTTACTCAGTGGATTCATCGGAATTTTGAAGATCAAAGGAAACGTACCCACTGTCATCTGGCAGAGATCAGGCTA
GAGAGCCTCCGGAATCACGAATGCTGGTCCCCTCCTCCTCAGAATCTCCTGAAGCTTAATTCTGACGCCTCTTGGAATGAGTCTCAGAATTCTGGGGGAATTGGC
TGGGTAATTCACGATTCCTTAGGATCTCTGGTCCAAGCGTGGTTCAAACGCATTAATCGCAAATGGAAGATAAAATCGTTGGAGATCGCTGCGATTAAGGAAGGG
TTACAAAATTACCTTACTCTAAGCCAGAATCGATCCTCGAATTTGATTGTAGAAGCAGACGCCTCTGAAGTAATCAAGTCGCTTAATCATGAGGAGTCTGATCTC
TCTAAAGATAAGACTTTGCTGATCGATGTTGAAAATCTCACGGTGGAAGCAGGAGTAATCGCCTTCGTCAAATGTCCAAGGCTGGGCAATCGAGTAGCACACAAT
CTCTCGCGAGCAGCTGCGGGCTTCTCGCCGATGGCCTCTCCGTCGACCAACGTAGTCGACGGCTTTTGTGTTTCTTCGACTTCTTCCACGCTGGAAGGAATTTTT
TTTTGTACAGGTGATTTGATCCCTAATTAA
Protein sequence
MKLLCWNARDLGNSRAIRVLRHMVESVKPQLIFIMETKCDVGRSAKIKLALQRDNVFCVPSKGKSGGFMLFWNSDIGVNINSYSDGHIDTFVNDSLGRWRFTGFY
GNPETDKRHFSWALLERLKACFEGPWIIGGDFNEIMFSNEKVGSVNRNLNQIGLFVEAINRCELMDAGDGDFFNAPVGVNSSLVWKSIVWGRELFKEGFRWRVGN
GHHIYVEHDPWIPRQGNRHPLKVHPSLFGKRVKDLILENEDWNEELIRNLFIPIDAKDILAIPLGNSRDKDEIIWNNDAKGRFSVKNAYHLATNIRRSTEAFGSD
DSTQIRKWRSIWDLNIILRAKIGFWRIVKNLIPTKGNLISKGMDINPLCVLCRGEVETPGHIFWRCKKASANKVTVDDQKLTQWIHRNFEDQRKRTHCHLAEIRL
ESLRNHECWSPPPQNLLKLNSDASWNESQNSGGIGWVIHDSLGSLVQAWFKRINRKWKIKSLEIAAIKEGLQNYLTLSQNRSSNLIVEADASEVIKSLNHEESDL
SKDKTLLIDVENLTVEAGVIAFVKCPRLGNRVAHNLSRAAAGFSPMASPSTNVVDGFCVSSTSSTLEGIFFCTGDLIPN
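
As a rough illustration of how the CDS and protein sequences in this record relate, the minimal Python sketch below translates a coding sequence with the standard genetic code. The choice of the standard code, the helper name `translate`, and the constants are assumptions made for this example; they are not part of the CuGenDBv2 record or its tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper for illustration only. Codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    # Normalise case and drop whitespace so wrapped sequence lines can be pasted in.
    seq = "".join(cds.upper().split())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed above.
print(translate("ATGAAACTCTTATGTTGGAATGCTCGAGATCTG"))  # MKLLCWNARDL
```

Applied to the start of the CDS above, the sketch reproduces the first residues of the listed protein sequence; applied to the final CDS line, it stops at the terminal TAA codon.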