CuGenDBv2

Spg022476 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022476
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: RNase H domain-containing protein
Genome location: scaffold2:10684438..10690185
RNA-Seq Expression: Spg022476
Synteny: Spg022476
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR040256 - Uncharacterized protein At4g02000-like
IPR044730 - Ribonuclease H-like domain, plant type
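
The genome location in this record follows a `seqid:start..end` pattern. The snippet below is a minimal sketch of how such a string could be split into its parts. It assumes, which this page does not state, that the coordinates are 1-based and inclusive; `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so both end points are counted.
    return seqid, start, end, end - start + 1


print(parse_location("scaffold2:10684438..10690185"))
```

Under that assumption the last tuple element is the length of the genomic span, which covers introns as well as the coding sequence.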


Homology
GenBank top hits | e value | % identity
OMO70721.1 Zinc finger, CCHC-type [Corchorus olitorius] | 6.0e-50 | 22.71
Query:  ESLIKQLTDLKVTAEEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTI-ITNIGFNMFLCKFKNLRIKNFIMESGPWFFD
        E L       K+TAEE+  + +  D   +  + +    LV K+ ++K  N + F + M  IW   + + +  +  N+FL KF     K  +++  PW F 
Subjt:  ESLIKQLTDLKVTAEEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTI-ITNIGFNMFLCKFKNLRIKNFIMESGPWFFD

Query:  KALILLQVPKGDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEID-QSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWI-
          L++     GD   +D  F    FWI  + L     +RD+A  IG  +G++  VDVD+ +D + W   LRV+V IDVT+PL+R I +    + +D  I 
Subjt:  KALILLQVPKGDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEID-QSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWI-

Query:  -PITYEKLPNFCYGCGHLGHTIKECENESQDESQSEHKLPYGAWLREPTNLRMREPFAPPMSPGPYAGRGRGRGWEGGRGGWRK--DQSDDYGGRSFD-S
          + YE+ P+FCY CG +GH  ++C  E   +   E    YG W+   ++L+ +         G  A +  G+   G +G   +   QS +     F  +
Subjt:  -PITYEKLPNFCYGCGHLGHTIKECENESQDESQSEHKLPYGAWLREPTNLRMREPFAPPMSPGPYAGRGRGRGWEGGRGGWRK--DQSDDYGGRSFD-S

Query:  TQHQDGGANGSGDDDPVVNSGEWAEPPPANGLNSPPRMKGPTDKKTEKESEGTNDTELIGGEGIKEDLVNNISSDIQGISQGNKCLGIISKDISIM--EV
        T   D G         V N  E       +GL   P   G   KK+            I    + E+ + +I  DI+     + C   +   + +   E+
Subjt:  TQHQDGGANGSGDDDPVVNSGEWAEPPPANGLNSPPRMKGPTDKKTEKESEGTNDTELIGGEGIKEDLVNNISSDIQGISQGNKCLGIISKDISIM--EV

Query:  DTDGVKSKPIKHKDVGQNTSDGGIKNQDPDL---NAQVKSETGKNKVWKRIPRLKKED--------VCDENM-----GQSSQSLSPQFTWC---------
        +    +++P        +  + G+  +   +   N + +     +KVW+ + R K  D         C            + S+      C         
Subjt:  DTDGVKSKPIKHKDVGQNTSDGGIKNQDPDL---NAQVKSETGKNKVWKRIPRLKKED--------VCDENM-----GQSSQSLSPQFTWC---------

Query:  -NNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDTNKFTENL
           + N   +   +  FLL   +      F    +  G+ D R L A + ++ +  S      +G         E  R ++ Q+   +G    +      
Subjt:  -NNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDTNKFTENL

Query:  QECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWK
           + K+ + +++   +    G WN  L+ + F   +A AI  +        D+I+W+F+  G++SV+S Y   V  N        +    + F +  W 
Subjt:  QECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWK

Query:  TNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW---ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLV
         ++P KIKI GW+V+++I+    NL +RGM+V+P C  C +  ET  H + +C+  + +W   +  +    ND ++G               + K G L 
Subjt:  TNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW---ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLV

Query:  DHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWI
         +    +L+  W IW  RN  +H   +            + TE  G  I+Y  E         P       W       +K++ D ++    + G  G I
Subjt:  DHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWI

Query:  LRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGC------QVRVESDAIQVVRLLNGEDFDSTELIHFI---KEAKSIITELGFIESVSHV
         RD      SAG        ++ +   + + E   A+ +          ++ +E DA+ V++  N  + D + +  +I   KEA++   +  F    SHV
Subjt:  LRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGC------QVRVESDAIQVVRLLNGEDFDSTELIHFI---KEAKSIITELGFIESVSHV

Query:  SRNHNRMAHLLARKACEIQESKSWTNFFPEWLLYVNDMD
         R  NR   +LA+      ES  W    P +L  +   D
Subjt:  SRNHNRMAHLLARKACEIQESKSWTNFFPEWLLYVNDMD

RYR39253.1 hypothetical protein Ahy_A09g044758 [Arachis hypogaea] | 5.1e-41 | 24.73
Query:  QFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDT--N
        ++TW +N  N  +  ER+DR L+N +     Q   +      +SDH  L+ E          + +K+  RFE  WT++EEC++++R+ W ++ GY    N
Subjt:  QFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDT--N

Query:  KFTENLQECLKKLSQWSR--------ILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLG------VQLNQN
        +FT     C ++L +WSR         + + K+  G W+   I + F   +A  I   P S     D  +W +   GQ+SV++ Y          +  + 
Subjt:  KFTENLQECLKKLSQWSR--------ILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLG------VQLNQN

Query:  SQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW--ANFFPLPNNDSLNGRDRW
        ++ASTS +  +   WK+ WK  +P K+K+  WK  + ILP   NL +R   V+P C +C+E  ET  H L  C  T+ +W  ++   +P + ++   ++W
Subjt:  SQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW--ANFFPLPNNDSLNGRDRW

Query:  TIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADS----SSPGDQTLQRWAPIPD
         +        +  + G   D+ L     +CW IW  RN  + ++   N +     +   + E       + T    + D+       G++T   W P P 
Subjt:  TIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADS----SSPGDQTLQRWAPIPD

Query:  GLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGE------DFDSTELIHF
           K++ DA++  E        + RDW G+    G     +       E  A  E L  I +       +E+D + +V+ +         D    +++  
Subjt:  GLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGE------DFDSTELIHF

Query:  IKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPE
        ++EA  +          +   R  N +AH LA  A   Q  + W+ F PE
Subjt:  IKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPE

TXG50387.1 hypothetical protein EZV62_022911 [Acer yangbiense] | 3.0e-41 | 27.94
Query:  NLQECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSF
        ++Q+ L  +   + ++++L    G WN  LIRNSFL  DA+ IL +P  +   DD + W+F+ +G ++V+S Y++ + L +   +S+    P   +W+  
Subjt:  NLQECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSF

Query:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCD---SYWMRNKEGV
        WK N+P+K KI  WK +N  LPT   L +R +DV   C +C +  E+  H+LW C    ++W     L  +D +    R  + D+     S W       
Subjt:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCD---SYWMRNKEGV

Query:  LVDHCLKRSLIL-CWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGV
         VD  +   LI+  W++W  RNS+VH         + T  D    E   A      E  S   S          W     G  K++CDAS+       GV
Subjt:  LVDHCLKRSLIL-CWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGV

Query:  GWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNR
        G I+RD+ G   +A    +     +   E  A  EG+          V +ESDA  V++LL+ +    TEL   I  + ++   +  +  V+ V R  N 
Subjt:  GWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNR

Query:  MAHLLARKACEIQESKSWTNFFPEWLLYVNDMDTRDVQNTGGGSCPISVIP
        +AH +A+ A  +     W    P            D+     G  P SV P
Subjt:  MAHLLARKACEIQESKSWTNFFPEWLLYVNDMDTRDVQNTGGGSCPISVIP

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa] | 6.6e-41 | 29.43
Query:  WNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTLD
        WN  L+   F  AD   IL IP S N   D  IW++E  G+++VKS Y L   L    Q S+S    QET+WK FW   LPSK++I GWKV N+ LP   
Subjt:  WNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTLD

Query:  NLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANF-FPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKK
        NL  R +  + TC LC    E+  H L+ C   K +W N  F L         D        D  ++     +L +  L+R     W IW+ RN+ +H K
Subjt:  NLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANF-FPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKK

Query:  QMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWF
        Q+     + +  +  +      ++     +F  A      D    +W P P+  LK++ DA+    R   G+G I+RD  G   +A  +    N+K    
Subjt:  QMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWF

Query:  ETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPEWLLY
        E  A+  GL+           VE+D + +V  +NG     +     +K+    ++       +SHV R+ N+ AH LA++A ++     W    P  +  
Subjt:  ETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPEWLLY

Query:  V
        V
Subjt:  V

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa] | 1.4e-06 | 26.88
Query:  QFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDT-NK
        +FTW  N+     + ER+D   +N        + ++ HL    SDHR L+A      + P     K   RFE+ W K +EC +I+   W      D+  +
Subjt:  QFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDT-NK

Query:  FTENLQECLKKLSQW-SRILAQLKDSYGIWNEALI-RNSFLNADATAILDIPTSTNLGDD
           +L +C   L QW SR   ++K       +A+   N+ +N+D      I ++  + DD
Subjt:  FTENLQECLKKLSQW-SRILAQLKDSYGIWNEALI-RNSFLNADATAILDIPTSTNLGDD

XP_030943489.1 uncharacterized protein LOC115968280 [Quercus lobata] | 1.6e-42 | 30.25
Query:  WNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQ-ASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTL
        W   L+R+ FL  +A+ IL+IP S NL +D+IIW    KG+F+VKSAY + + LN  +     S+   +   WK  W   +PSKI+I GW+   N LPT 
Subjt:  WNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQ-ASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTL

Query:  DNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKK
        +NL KRG++++  C  C +  E+  H L +C+  K +W  +   P N S     +W   D+ D      ++G   D  L+   ++ W IW  RN +VH+ 
Subjt:  DNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKK

Query:  --QMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTL--QRWAPIPDGLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWK
          Q+PN+        + +            + +  A SSS  D+T     W P P G+ K++ D +  E  R   VG I+RD  G   +A    +   + 
Subjt:  --QMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTL--QRWAPIPDGLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWK

Query:  ISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFD------STELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSW
            ET+A+  G+         Q+ +ESDA+ V++ +   +FD      +  +I  +K  +S          ++H+ R++NR+AH LA+ A   + S+ W
Subjt:  ISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFD------STELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSW

TrEMBL top hits | e value | % identity
A0A1R3HK95 Zinc finger, CCHC-type | 2.9e-50 | 22.71
Query:  ESLIKQLTDLKVTAEEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTI-ITNIGFNMFLCKFKNLRIKNFIMESGPWFFD
        E L       K+TAEE+  + +  D   +  + +    LV K+ ++K  N + F + M  IW   + + +  +  N+FL KF     K  +++  PW F 
Subjt:  ESLIKQLTDLKVTAEEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTI-ITNIGFNMFLCKFKNLRIKNFIMESGPWFFD

Query:  KALILLQVPKGDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEID-QSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWI-
          L++     GD   +D  F    FWI  + L     +RD+A  IG  +G++  VDVD+ +D + W   LRV+V IDVT+PL+R I +    + +D  I 
Subjt:  KALILLQVPKGDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEID-QSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWI-

Query:  -PITYEKLPNFCYGCGHLGHTIKECENESQDESQSEHKLPYGAWLREPTNLRMREPFAPPMSPGPYAGRGRGRGWEGGRGGWRK--DQSDDYGGRSFD-S
          + YE+ P+FCY CG +GH  ++C  E   +   E    YG W+   ++L+ +         G  A +  G+   G +G   +   QS +     F  +
Subjt:  -PITYEKLPNFCYGCGHLGHTIKECENESQDESQSEHKLPYGAWLREPTNLRMREPFAPPMSPGPYAGRGRGRGWEGGRGGWRK--DQSDDYGGRSFD-S

Query:  TQHQDGGANGSGDDDPVVNSGEWAEPPPANGLNSPPRMKGPTDKKTEKESEGTNDTELIGGEGIKEDLVNNISSDIQGISQGNKCLGIISKDISIM--EV
        T   D G         V N  E       +GL   P   G   KK+            I    + E+ + +I  DI+     + C   +   + +   E+
Subjt:  TQHQDGGANGSGDDDPVVNSGEWAEPPPANGLNSPPRMKGPTDKKTEKESEGTNDTELIGGEGIKEDLVNNISSDIQGISQGNKCLGIISKDISIM--EV

Query:  DTDGVKSKPIKHKDVGQNTSDGGIKNQDPDL---NAQVKSETGKNKVWKRIPRLKKED--------VCDENM-----GQSSQSLSPQFTWC---------
        +    +++P        +  + G+  +   +   N + +     +KVW+ + R K  D         C            + S+      C         
Subjt:  DTDGVKSKPIKHKDVGQNTSDGGIKNQDPDL---NAQVKSETGKNKVWKRIPRLKKED--------VCDENM-----GQSSQSLSPQFTWC---------

Query:  -NNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDTNKFTENL
           + N   +   +  FLL   +      F    +  G+ D R L A + ++ +  S      +G         E  R ++ Q+   +G    +      
Subjt:  -NNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDTNKFTENL

Query:  QECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWK
           + K+ + +++   +    G WN  L+ + F   +A AI  +        D+I+W+F+  G++SV+S Y   V  N        +    + F +  W 
Subjt:  QECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWK

Query:  TNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW---ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLV
         ++P KIKI GW+V+++I+    NL +RGM+V+P C  C +  ET  H + +C+  + +W   +  +    ND ++G               + K G L 
Subjt:  TNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW---ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLV

Query:  DHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWI
         +    +L+  W IW  RN  +H   +            + TE  G  I+Y  E         P       W       +K++ D ++    + G  G I
Subjt:  DHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWI

Query:  LRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGC------QVRVESDAIQVVRLLNGEDFDSTELIHFI---KEAKSIITELGFIESVSHV
         RD      SAG        ++ +   + + E   A+ +          ++ +E DA+ V++  N  + D + +  +I   KEA++   +  F    SHV
Subjt:  LRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGC------QVRVESDAIQVVRLLNGEDFDSTELIHFI---KEAKSIITELGFIESVSHV

Query:  SRNHNRMAHLLARKACEIQESKSWTNFFPEWLLYVNDMD
         R  NR   +LA+      ES  W    P +L  +   D
Subjt:  SRNHNRMAHLLARKACEIQESKSWTNFFPEWLLYVNDMD

A0A445BKP3 Uncharacterized protein | 2.5e-41 | 24.73
Query:  QFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDT--N
        ++TW +N  N  +  ER+DR L+N +     Q   +      +SDH  L+ E          + +K+  RFE  WT++EEC++++R+ W ++ GY    N
Subjt:  QFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWARQGGYDT--N

Query:  KFTENLQECLKKLSQWSR--------ILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLG------VQLNQN
        +FT     C ++L +WSR         + + K+  G W+   I + F   +A  I   P S     D  +W +   GQ+SV++ Y          +  + 
Subjt:  KFTENLQECLKKLSQWSR--------ILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLG------VQLNQN

Query:  SQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW--ANFFPLPNNDSLNGRDRW
        ++ASTS +  +   WK+ WK  +P K+K+  WK  + ILP   NL +R   V+P C +C+E  ET  H L  C  T+ +W  ++   +P + ++   ++W
Subjt:  SQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW--ANFFPLPNNDSLNGRDRW

Query:  TIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADS----SSPGDQTLQRWAPIPD
         +        +  + G   D+ L     +CW IW  RN  + ++   N +     +   + E       + T    + D+       G++T   W P P 
Subjt:  TIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADS----SSPGDQTLQRWAPIPD

Query:  GLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGE------DFDSTELIHF
           K++ DA++  E        + RDW G+    G     +       E  A  E L  I +       +E+D + +V+ +         D    +++  
Subjt:  GLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGE------DFDSTELIHF

Query:  IKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPE
        ++EA  +          +   R  N +AH LA  A   Q  + W+ F PE
Subjt:  IKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPE

A0A803NML1 Uncharacterized protein | 2.2e-42 | 29
Query:  IWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTL
        +WN  L+ + F   D   IL IP +   G D ++W+  P G +SVK+ + L   L   + +STSN   Q  +WK FW   LP KI+I  WKV+ NILPT 
Subjt:  IWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSFWKTNLPSKIKICGWKVYNNILPTL

Query:  DNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW-ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHK
          L KR +  +  C LC  + E+  H L+ CK  KD+W  + F +  + + N           +  ++ +   +   H  +  L + W IW  RN +VH 
Subjt:  DNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW-ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLILCWKIWAYRNSIVHK

Query:  KQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPG----DQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNW
         Q  +   +     K   +   A++   + + +S   SSP     DQ +QRW P      KL+ DA+   E++  G+G ILRD  G   +A  + +  ++
Subjt:  KQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPG----DQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWILRDWLGRPRSAGFRCIHRNW

Query:  KISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFP
        K    E  A+   +  +  +      +E+DA +V   LN  + D +     I + + +++    +  V+HV R  N+ AH LA+ A  + E   W    P
Subjt:  KISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFP

A0A803NML1 Uncharacterized protein | 4.4e-22 | 29.33
Query:  EEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTIITNIGFNMFLCKFKNLRIKNFIMESGPWFFDKALILLQVPK-GDNY
        E++  +F+  D         +   L  KI ++KK+     +++M + W     +  +   +MF+  F     K  ++   P+ F    I+L  P+ G N+
Subjt:  EEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTIITNIGFNMFLCKFKNLRIKNFIMESGPWFFDKALILLQVPK-GDNY

Query:  GDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEIDQSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWIPITYEKLPNFCYGCGH
          D D  F  FW+  ++LPF   +R LA  +G+I+G+   V  ++ +++ WG  LRV+V +DV++PLKRG  +     K+  W+   YE+LP +C  CG 
Subjt:  GDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEIDQSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWIPITYEKLPNFCYGCGH

Query:  LGHTIKEC
        +GH   +C
Subjt:  LGHTIKEC

A0A803NML1 Uncharacterized protein | 1.4e-41 | 27.94
Query:  NLQECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSF
        ++Q+ L  +   + ++++L    G WN  LIRNSFL  DA+ IL +P  +   DD + W+F+ +G ++V+S Y++ + L +   +S+    P   +W+  
Subjt:  NLQECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSF

Query:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCD---SYWMRNKEGV
        WK N+P+K KI  WK +N  LPT   L +R +DV   C +C +  E+  H+LW C    ++W     L  +D +    R  + D+     S W       
Subjt:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCD---SYWMRNKEGV

Query:  LVDHCLKRSLIL-CWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGV
         VD  +   LI+  W++W  RNS+VH         + T  D    E   A      E  S   S          W     G  K++CDAS+       GV
Subjt:  LVDHCLKRSLIL-CWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGV

Query:  GWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNR
        G I+RD+ G   +A    +     +   E  A  EG+          V +ESDA  V++LL+ +    TEL   I  + ++   +  +  V+ V R  N 
Subjt:  GWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNR

Query:  MAHLLARKACEIQESKSWTNFFPEWLLYVNDMDTRDVQNTGGGSCPISVIP
        +AH +A+ A  +     W    P            D+     G  P SV P
Subjt:  MAHLLARKACEIQESKSWTNFFPEWLLYVNDMDTRDVQNTGGGSCPISVIP

A0A803P5M6 Uncharacterized protein | 3.4e-43 | 20.17
Query:  KQLTDLKVTAEEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTIITNIGFNMFLCKFKNLRIKNFIMESGPWFFDKALIL
        K  T + +T +E++ +FQ  D         +   L  KI ++KK+     +++M + W     +  +   +MF+  F     K  +++  P+ F    I+
Subjt:  KQLTDLKVTAEEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTIITNIGFNMFLCKFKNLRIKNFIMESGPWFFDKALIL

Query:  LQVPK-GDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEIDQSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWIPITYEK
        L  P+ G N+  D D  F  FW+  ++LPF   +R LA  +G+I+G+   V  ++ +++ WG  LRV+V +DV++PLKRG  +     K+  W+   YE+
Subjt:  LQVPK-GDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEIDQSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWIPITYEK

Query:  LPNFCYGCGHLGHTIKEC----ENESQDESQSEHKLPY--GAWLREPTNLRMREPFAPPMSPGPYAGRGRGRGWEGGRGGWRKDQSDD----YGGRSFDS
        LP +C  CG +GH   +C    E     E  +    P+  G+ L      R R  FA      P   R   +          K         + G S ++
Subjt:  LPNFCYGCGHLGHTIKEC----ENESQDESQSEHKLPY--GAWLREPTNLRMREPFAPPMSPGPYAGRGRGRGWEGGRGGWRKDQSDD----YGGRSFDS

Query:  TQHQDGGANGSGDDDPVVNS---GEWAEPP-----PANGLNSPPRMKGPTDKKTEKESEGTNDTELIGGEGIKEDLVNNISSDIQGISQGNKCLGIISKD
           +D  +N + DD     S    +  +PP     P+  L+S   +    +    K+S  TN ++L          V N ++             I  K 
Subjt:  TQHQDGGANGSGDDDPVVNS---GEWAEPP-----PANGLNSPPRMKGPTDKKTEKESEGTNDTELIGGEGIKEDLVNNISSDIQGISQGNKCLGIISKD

Query:  ISIMEVDTDGVKSKPIKHKDVGQNTSDGGIKNQDPDLNAQVKSE-TGKNKVWKRIPRLKKEDVCD-----------------ENMGQSSQSL--------
           M           +    +       G +N +P++ ++ ++E     +  KR       D                    ++MG    S+        
Subjt:  ISIMEVDTDGVKSKPIKHKDVGQNTSDGGIKNQDPDLNAQVKSE-TGKNKVWKRIPRLKKEDVCD-----------------ENMGQSSQSL--------

Query:  --------SPQFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWA
                  +FTW   + N   + ER+D   LN    +  +     HL   +SDHR +       +S     T K   RFE+ W K  +   I+R  W+
Subjt:  --------SPQFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECRDIVRQVWA

Query:  RQGGYDTNKFTENLQECLKKLSQW-------------------SRI----------LAQLKDSYGI----------------------------------
               + F  NLQ C   L QW                   SR+          + +LKDS  I                                  
Subjt:  RQGGYDTNKFTENLQECLKKLSQW-------------------SRI----------LAQLKDSYGI----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------WNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSF
                                 WN  L+ + F   D   IL IP +   G D ++W+  P G +SVK+ + L   L   + +STSN   Q  +WK F
Subjt:  -------------------------WNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNHKPQETFWKSF

Query:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW-ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLV
        W   LP KI+I  WKV+ NILPT   L KR +  +  C LC  + E+  H L+ CK  KD+W  + F +  + + N           +  ++ +   +  
Subjt:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLW-ANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLV

Query:  DHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQ----TESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGG
         H  +  L + W IW  RN +VH  Q  +   +     K   +   A++  +    T S  S+ S+S  DQ +QRW P      KL+ DA+   E++  G
Subjt:  DHCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQ----TESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGG

Query:  VGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHN
        +G ILRD  G   +A  + +  ++K    E  A+   +  +  +      +E+DA +V   LN  + D +     I + + +++    +  V+HV R  N
Subjt:  VGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHN

Query:  RMAHLLARKACEIQESKSWTNFFP
        + AH LA+ A  + E   W    P
Subjt:  RMAHLLARKACEIQESKSWTNFFP

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 1.2e-19 | 28.28
Query:  TCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLI--LCWKIWAYRNSIVHK-KQMPNKEMLK
        +C  C +SRET  HLL+KC   + +WA   P+P          WT   Y + YW+ N E  +       +L+  L W++W  RN ++ K K+    E+L+
Subjt:  TCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVDHCLKRSLI--LCWKIWAYRNSIVHK-KQMPNKEMLK

Query:  TLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASW-CEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEG
            +A+ +    E + + E    A           +W   P   +K + DA+W  E  RC G+GWILR+  G     G R + R   +   E  A+   
Subjt:  TLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASW-CEERRCGGVGWILRDWLGRPRSAGFRCIHRNWKISWFETIAICEG

Query:  LRAIPSTNGCQVRVESDAIQVVRLLNGEDF---------DSTELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPE
        +  +   N  ++  ESDA  +V LLN +DF         D  +L+H  +E K   T            R  N++A  +AR      ES S++N+ P+
Subjt:  LRAIPSTNGCQVRVESDAIQVVRLLNGEDF---------DSTELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPE

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.6e-05 | 38.18
Query:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKC
        W   +  KIK+  WK  NN LP    L+ R + + P C  CR+  ET  H+L+ C
Subjt:  WKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKC

AT3G42140.1 zinc ion binding;nucleic acid binding | 1.8e-07 | 25.76
Query:  IMESGPWFFDKALILLQVPKGDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEIDQSWGGSLRVKVQIDVTRPLKRGIFLQSK
        I+  GPW F+  + ++Q  +      D +FK + FWI    +P     R L A I + +G+                              + G+FL++ 
Subjt:  IMESGPWFFDKALILLQVPKGDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEIDQSWGGSLRVKVQIDVTRPLKRGIFLQSK

Query:  ARKEDRWIPITYEKLPNFCYGCGHLGHTIKEC
          ++   +   YEKL NFC  CG L H   EC
Subjt:  ARKEDRWIPITYEKLPNFCYGCGHLGHTIKEC


Sequences
CDS sequence
ATGGCTCAATCGCAAGAGGAGGAGTCATTGATTAAACAACTTACAGACCTGAAGGTGACAGCAGAAGAGAAAGCCTGCATTTTTCAACTAAAGGATGAGTATATTGACAG
ATCAGAGAAAAAACTCACAAATGCTCTGGTTTGCAAGATATATTCTCAAAAGAAAATCAATCCAGAGATTTTTAAATCCAAGATGCCGAAGATCTGGAGTCAGGAGCAAA
CAATCATTACAAACATCGGCTTCAACATGTTCTTGTGCAAGTTCAAGAATTTACGAATCAAGAACTTCATCATGGAATCCGGCCCGTGGTTTTTTGATAAGGCTTTGATA
TTACTACAAGTGCCAAAGGGAGATAACTATGGAGATGATGTTGATTTTAAATTTGTTTCATTTTGGATTCATTTTCACAAACTTCCTTTTGCATGTTTTTCCAGGGATTT
GGCTGCGGAGATTGGAAGTATTCTTGGCAAGGTTGATCAAGTCGATGTAGACGAGGAGATTGACCAAAGCTGGGGTGGTTCTCTTCGGGTTAAAGTTCAGATAGATGTTA
CTCGGCCGCTAAAGCGTGGGATATTTTTACAGTCAAAGGCAAGAAAGGAAGATAGATGGATCCCAATAACTTACGAAAAACTCCCGAATTTTTGCTATGGATGTGGACAC
CTCGGGCATACGATAAAAGAATGTGAAAATGAATCACAAGACGAGAGTCAGTCTGAACACAAGCTACCGTACGGGGCATGGCTACGTGAACCAACTAATTTAAGAATGAG
AGAGCCTTTTGCTCCTCCGATGTCACCTGGCCCCTACGCCGGTCGGGGAAGGGGAAGAGGATGGGAGGGAGGCAGAGGTGGCTGGCGGAAAGATCAGTCGGACGATTATG
GAGGGCGTTCGTTTGATAGCACACAACATCAGGATGGCGGCGCTAATGGTAGTGGAGACGATGATCCAGTTGTAAATTCCGGTGAGTGGGCGGAGCCACCGCCGGCGAAT
GGTCTGAATAGCCCACCGAGAATGAAGGGCCCAACGGATAAAAAAACGGAAAAGGAAAGTGAAGGTACAAACGACACAGAATTAATTGGAGGGGAGGGAATTAAAGAGGA
TTTGGTCAATAATATTTCAAGTGATATTCAAGGCATTTCCCAAGGAAATAAATGTCTGGGTATTATTTCCAAAGATATTTCCATTATGGAAGTGGATACAGATGGGGTTA
AGTCAAAACCTATCAAACACAAGGATGTGGGCCAGAATACTAGTGACGGTGGAATAAAAAATCAAGACCCAGATCTGAATGCACAAGTTAAGTCAGAAACGGGAAAAAAC
AAAGTTTGGAAACGTATCCCACGCTTGAAGAAGGAAGATGTCTGTGATGAGAATATGGGGCAGAGTAGCCAATCTCTTAGCCCACAGTTCACTTGGTGCAATAATCAGTT
TAATGGTGTGCTCATCTGGGAAAGAATAGATCGGTTCCTGTTGAATGCCTCTATGCATAGCAGGTGTCAGTATTTCAGAGTTCATCATCTACATTGTGGAGCTTCTGACC
ATAGACCTTTAGTTGCAGAATGGAGTATTGAGTCTTCGATTCCGAGTACTGTCACTATGAAACGTCTAGGGAGATTTGAAGAAGCATGGACCAAGTATGAAGAGTGTAGA
GATATAGTGCGACAAGTTTGGGCGAGACAAGGTGGCTATGATACTAATAAGTTCACTGAGAATTTACAGGAATGCTTAAAAAAGCTAAGTCAATGGAGTCGTATTCTGGC
TCAGCTCAAGGATAGTTATGGTATTTGGAATGAAGCATTAATTCGAAACTCTTTCCTGAATGCAGATGCAACAGCTATATTAGATATTCCAACGAGTACTAATTTGGGTG
ATGACGAGATTATTTGGAATTTTGAGCCTAAGGGCCAATTTTCGGTGAAGAGCGCCTACAGGCTAGGCGTTCAGTTAAACCAAAATTCACAAGCCTCAACATCGAATCAC
AAGCCTCAGGAAACATTTTGGAAGAGTTTCTGGAAAACTAACCTTCCCTCTAAGATCAAAATCTGTGGTTGGAAAGTTTATAACAATATCCTTCCTACTCTAGATAATTT
GATTAAACGGGGGATGGATGTGAATCCTACATGCTTTTTGTGCAGGGAGAGTCGAGAAACGGCCGTGCATCTGCTCTGGAAGTGTAAACTAACCAAAGATCTTTGGGCAA
ATTTTTTTCCCCTTCCTAACAATGACTCTTTGAATGGCAGGGACAGGTGGACTATTGAGGACTATTGTGACAGCTACTGGATGCGGAACAAAGAAGGGGTGCTCGTGGAT
CATTGCCTGAAGAGGAGTCTTATCTTGTGTTGGAAGATATGGGCTTATCGTAACTCTATCGTGCATAAGAAGCAGATGCCCAACAAAGAAATGCTGAAGACGCTAACGGA
TAAAGCAATAACAGAGATTGGGGGAGCTGAAATCACATACCAGACGGAGAGTTTCTCGAGTGCCGATTCCTCTTCTCCCGGCGACCAAACGTTGCAACGATGGGCTCCGA
TTCCGGATGGTCTCCTGAAGCTAAGTTGCGATGCCTCGTGGTGTGAAGAGCGACGATGCGGCGGCGTTGGATGGATTCTCAGAGACTGGCTAGGAAGACCAAGATCGGCT
GGATTCCGTTGTATTCACCGCAATTGGAAGATCAGTTGGTTTGAGACGATCGCGATTTGTGAAGGCCTGCGAGCGATTCCTTCAACGAATGGGTGCCAAGTGCGTGTGGA
ATCTGATGCAATCCAAGTGGTTAGACTTCTAAATGGAGAGGATTTTGATTCAACAGAACTAATCCACTTCATTAAAGAGGCCAAATCCATTATTACTGAGTTGGGCTTCA
TTGAATCGGTTTCACATGTTTCTAGGAACCACAATAGAATGGCCCATCTGTTGGCCCGAAAGGCTTGTGAAATACAGGAGTCTAAAAGCTGGACCAATTTCTTTCCCGAG
TGGCTTTTATATGTAAACGATATGGATACTAGAGATGTTCAGAACACTGGTGGGGGATCCTGTCCTATCAGTGTTATCCCGTTGGGAGCATTTGCTCTTTCGTAA
mRNA sequence
ATGGCTCAATCGCAAGAGGAGGAGTCATTGATTAAACAACTTACAGACCTGAAGGTGACAGCAGAAGAGAAAGCCTGCATTTTTCAACTAAAGGATGAGTATATTGACAG
ATCAGAGAAAAAACTCACAAATGCTCTGGTTTGCAAGATATATTCTCAAAAGAAAATCAATCCAGAGATTTTTAAATCCAAGATGCCGAAGATCTGGAGTCAGGAGCAAA
CAATCATTACAAACATCGGCTTCAACATGTTCTTGTGCAAGTTCAAGAATTTACGAATCAAGAACTTCATCATGGAATCCGGCCCGTGGTTTTTTGATAAGGCTTTGATA
TTACTACAAGTGCCAAAGGGAGATAACTATGGAGATGATGTTGATTTTAAATTTGTTTCATTTTGGATTCATTTTCACAAACTTCCTTTTGCATGTTTTTCCAGGGATTT
GGCTGCGGAGATTGGAAGTATTCTTGGCAAGGTTGATCAAGTCGATGTAGACGAGGAGATTGACCAAAGCTGGGGTGGTTCTCTTCGGGTTAAAGTTCAGATAGATGTTA
CTCGGCCGCTAAAGCGTGGGATATTTTTACAGTCAAAGGCAAGAAAGGAAGATAGATGGATCCCAATAACTTACGAAAAACTCCCGAATTTTTGCTATGGATGTGGACAC
CTCGGGCATACGATAAAAGAATGTGAAAATGAATCACAAGACGAGAGTCAGTCTGAACACAAGCTACCGTACGGGGCATGGCTACGTGAACCAACTAATTTAAGAATGAG
AGAGCCTTTTGCTCCTCCGATGTCACCTGGCCCCTACGCCGGTCGGGGAAGGGGAAGAGGATGGGAGGGAGGCAGAGGTGGCTGGCGGAAAGATCAGTCGGACGATTATG
GAGGGCGTTCGTTTGATAGCACACAACATCAGGATGGCGGCGCTAATGGTAGTGGAGACGATGATCCAGTTGTAAATTCCGGTGAGTGGGCGGAGCCACCGCCGGCGAAT
GGTCTGAATAGCCCACCGAGAATGAAGGGCCCAACGGATAAAAAAACGGAAAAGGAAAGTGAAGGTACAAACGACACAGAATTAATTGGAGGGGAGGGAATTAAAGAGGA
TTTGGTCAATAATATTTCAAGTGATATTCAAGGCATTTCCCAAGGAAATAAATGTCTGGGTATTATTTCCAAAGATATTTCCATTATGGAAGTGGATACAGATGGGGTTA
AGTCAAAACCTATCAAACACAAGGATGTGGGCCAGAATACTAGTGACGGTGGAATAAAAAATCAAGACCCAGATCTGAATGCACAAGTTAAGTCAGAAACGGGAAAAAAC
AAAGTTTGGAAACGTATCCCACGCTTGAAGAAGGAAGATGTCTGTGATGAGAATATGGGGCAGAGTAGCCAATCTCTTAGCCCACAGTTCACTTGGTGCAATAATCAGTT
TAATGGTGTGCTCATCTGGGAAAGAATAGATCGGTTCCTGTTGAATGCCTCTATGCATAGCAGGTGTCAGTATTTCAGAGTTCATCATCTACATTGTGGAGCTTCTGACC
ATAGACCTTTAGTTGCAGAATGGAGTATTGAGTCTTCGATTCCGAGTACTGTCACTATGAAACGTCTAGGGAGATTTGAAGAAGCATGGACCAAGTATGAAGAGTGTAGA
GATATAGTGCGACAAGTTTGGGCGAGACAAGGTGGCTATGATACTAATAAGTTCACTGAGAATTTACAGGAATGCTTAAAAAAGCTAAGTCAATGGAGTCGTATTCTGGC
TCAGCTCAAGGATAGTTATGGTATTTGGAATGAAGCATTAATTCGAAACTCTTTCCTGAATGCAGATGCAACAGCTATATTAGATATTCCAACGAGTACTAATTTGGGTG
ATGACGAGATTATTTGGAATTTTGAGCCTAAGGGCCAATTTTCGGTGAAGAGCGCCTACAGGCTAGGCGTTCAGTTAAACCAAAATTCACAAGCCTCAACATCGAATCAC
AAGCCTCAGGAAACATTTTGGAAGAGTTTCTGGAAAACTAACCTTCCCTCTAAGATCAAAATCTGTGGTTGGAAAGTTTATAACAATATCCTTCCTACTCTAGATAATTT
GATTAAACGGGGGATGGATGTGAATCCTACATGCTTTTTGTGCAGGGAGAGTCGAGAAACGGCCGTGCATCTGCTCTGGAAGTGTAAACTAACCAAAGATCTTTGGGCAA
ATTTTTTTCCCCTTCCTAACAATGACTCTTTGAATGGCAGGGACAGGTGGACTATTGAGGACTATTGTGACAGCTACTGGATGCGGAACAAAGAAGGGGTGCTCGTGGAT
CATTGCCTGAAGAGGAGTCTTATCTTGTGTTGGAAGATATGGGCTTATCGTAACTCTATCGTGCATAAGAAGCAGATGCCCAACAAAGAAATGCTGAAGACGCTAACGGA
TAAAGCAATAACAGAGATTGGGGGAGCTGAAATCACATACCAGACGGAGAGTTTCTCGAGTGCCGATTCCTCTTCTCCCGGCGACCAAACGTTGCAACGATGGGCTCCGA
TTCCGGATGGTCTCCTGAAGCTAAGTTGCGATGCCTCGTGGTGTGAAGAGCGACGATGCGGCGGCGTTGGATGGATTCTCAGAGACTGGCTAGGAAGACCAAGATCGGCT
GGATTCCGTTGTATTCACCGCAATTGGAAGATCAGTTGGTTTGAGACGATCGCGATTTGTGAAGGCCTGCGAGCGATTCCTTCAACGAATGGGTGCCAAGTGCGTGTGGA
ATCTGATGCAATCCAAGTGGTTAGACTTCTAAATGGAGAGGATTTTGATTCAACAGAACTAATCCACTTCATTAAAGAGGCCAAATCCATTATTACTGAGTTGGGCTTCA
TTGAATCGGTTTCACATGTTTCTAGGAACCACAATAGAATGGCCCATCTGTTGGCCCGAAAGGCTTGTGAAATACAGGAGTCTAAAAGCTGGACCAATTTCTTTCCCGAG
TGGCTTTTATATGTAAACGATATGGATACTAGAGATGTTCAGAACACTGGTGGGGGATCCTGTCCTATCAGTGTTATCCCGTTGGGAGCATTTGCTCTTTCGTAA
Protein sequence
MAQSQEEESLIKQLTDLKVTAEEKACIFQLKDEYIDRSEKKLTNALVCKIYSQKKINPEIFKSKMPKIWSQEQTIITNIGFNMFLCKFKNLRIKNFIMESGPWFFDKALI
LLQVPKGDNYGDDVDFKFVSFWIHFHKLPFACFSRDLAAEIGSILGKVDQVDVDEEIDQSWGGSLRVKVQIDVTRPLKRGIFLQSKARKEDRWIPITYEKLPNFCYGCGH
LGHTIKECENESQDESQSEHKLPYGAWLREPTNLRMREPFAPPMSPGPYAGRGRGRGWEGGRGGWRKDQSDDYGGRSFDSTQHQDGGANGSGDDDPVVNSGEWAEPPPAN
GLNSPPRMKGPTDKKTEKESEGTNDTELIGGEGIKEDLVNNISSDIQGISQGNKCLGIISKDISIMEVDTDGVKSKPIKHKDVGQNTSDGGIKNQDPDLNAQVKSETGKN
KVWKRIPRLKKEDVCDENMGQSSQSLSPQFTWCNNQFNGVLIWERIDRFLLNASMHSRCQYFRVHHLHCGASDHRPLVAEWSIESSIPSTVTMKRLGRFEEAWTKYEECR
DIVRQVWARQGGYDTNKFTENLQECLKKLSQWSRILAQLKDSYGIWNEALIRNSFLNADATAILDIPTSTNLGDDEIIWNFEPKGQFSVKSAYRLGVQLNQNSQASTSNH
KPQETFWKSFWKTNLPSKIKICGWKVYNNILPTLDNLIKRGMDVNPTCFLCRESRETAVHLLWKCKLTKDLWANFFPLPNNDSLNGRDRWTIEDYCDSYWMRNKEGVLVD
HCLKRSLILCWKIWAYRNSIVHKKQMPNKEMLKTLTDKAITEIGGAEITYQTESFSSADSSSPGDQTLQRWAPIPDGLLKLSCDASWCEERRCGGVGWILRDWLGRPRSA
GFRCIHRNWKISWFETIAICEGLRAIPSTNGCQVRVESDAIQVVRLLNGEDFDSTELIHFIKEAKSIITELGFIESVSHVSRNHNRMAHLLARKACEIQESKSWTNFFPE
WLLYVNDMDTRDVQNTGGGSCPISVIPLGAFALS
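
The protein sequence above corresponds to the CDS read codon by codon. The sketch below shows one way that relationship could be checked. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper name, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last seven codons of the CDS listed in this record.
print(translate("ATGGCTCAATCGCAAGAGGAGGAG"))
print(translate("GGAGCATTTGCTCTTTCGTAA"))
```

Running `translate` on the full CDS and comparing the result with the listed protein sequence would be a simple consistency check for a record like this one.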