CuGenDBv2

Tan0012460 (gene) of Snake gourd v1 genome

Gene ID: Tan0012460
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG06:68485991..68491911
RNA-Seq Expression: Tan0012460
Synteny: Tan0012460
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR036397 - Ribonuclease H superfamily
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR044730 - Ribonuclease H-like domain, plant type


Homology

GenBank top hits | e value | % identity
KAF5446505.1 hypothetical protein F2P56_032128 [Juglans regia] | 6.0e-47 | 27.36
Query:  ETFSKRLSSLNLQEEELGGVVEVD---DDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPW
        E    +   L L EEE   +   D   D+ LE    + +  +  KI T +SIN  V ++ + K+W +    K +  G N+++  F    +K RI+ G PW
Subjt:  ETFSKRLSSLNLQEEELGGVVEVD---DDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPW

Query:  SFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWV
         FD  L V +   G      ++F+   FW   H+LP VC  + + E +  ++GR + V+    G   G+ LRV +++D+ +PL R   + V   G+  W 
Subjt:  SFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWV

Query:  QISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNG
        Q  YEKLP  C  CG + H          G +  +    R+ +  +  R G  G+K            +    +   QG EM +   + +          
Subjt:  QISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNG

Query:  GTNRGGIILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNKGKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARDKSN
                   + L   TRQ       E+ E    P I   +G+  + KV++ G      R    E+ + E+    +++EG+    N  + K +  +K +
Subjt:  GTNRGGIILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNKGKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARDKSN

Query:  CSEGGSQEAKFLGSKHDLDIRI--VEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSII--KDGKRE-WRLTGFYGDPSVEKRSDS
           GG  +   +    +  +R    E  K  L+ D  F V  VGR GGLMLLW  ++ + I++YS+ HI + I  ++GK E W LTGFYG+P    R ++
Subjt:  CSEGGSQEAKFLGSKHDLDIRI--VEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSII--KDGKRE-WRLTGFYGDPSVEKRSDS

Query:  WLLLDRLSRLYDLPWLVGGDFNY--------------KKPMDAFGECSFRCKLADAGFKGDKFTWRKSRSID----EKLTSCIQNLKSWDRRRLKGSLKE
        W LL  L  L    W V GDFN               +K M  F E      L D G++GDK+TW      +    E+L   + N  +W     +G ++ 
Subjt:  WLLLDRLSRLYDLPWLVGGDFNY--------------KKPMDAFGECSFRCKLADAGFKGDKFTWRKSRSID----EKLTSCIQNLKSWDRRRLKGSLKE

Query:  AISR-KEERILLLS
         + R  + R LLL+
Subjt:  AISR-KEERILLLS

KAF5475845.1 hypothetical protein F2P56_007609 [Juglans regia] | 3.9e-46 | 27.72
Query:  ICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKA
        I   K IN E F+N + K+W  EG ++    G N FL  F K +++ R++ G PWSFDR L+     +G + +  + F    FW   H++P    T +  
Subjt:  ICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKA

Query:  EALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQI------
        + +  ++G+   V  D  G   G+ +R+RV++ +T+ L R   L     G + WVQ  YE+LP FC  CG + H  K     +   +A+   Q       
Subjt:  EALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQI------

Query:  -RKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNG-----GTNRGGIILDPSGLTTDTRQGPQLAEVEVT--E
           KE  I+Q+            A    R   K     + G  +     +D +  SDP          T      L  S + TDT    QL +V VT  +
Subjt:  -RKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNG-----GTNRGGIILDPSGLTTDTRQGPQLAEVEVT--E

Query:  VNFGPIINTNKGKGINRKVIKEGSTVH---TDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARDKSNCSEGGSQEAK---FLGSKHDLDIRI---
        +     + T K      + +++ S      + R  S +  E    L  R +  +++     R+ ++ + +    E   + A    FL  K  L   +   
Subjt:  VNFGPIINTNKGKGINRKVIKEGSTVH---TDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARDKSNCSEGGSQEAK---FLGSKHDLDIRI---

Query:  --------VEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDGK--REWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLV
                VE  K  + FDN F +   G SGGL  LWK+ +   + SYS+ HI  ++K  K   +W LTGFYG P   +R  SW LL  ++ ++ L WL 
Subjt:  --------VEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDGK--REWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLV

Query:  GGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTWRKSR
         GDFN                  +++F +    C L+D GF G+KFTW   R
Subjt:  GGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTWRKSR

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] | 7.1e-64 | 30.2
Query:  MEEETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPW
        ME +  S++   L+L ++  G +  +     E  E+     +  K  T K IN E FK+ +  IW  +  + +E  G N+F   F+   ++ RI+ GGPW
Subjt:  MEEETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPW

Query:  SFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWV
         FD+ L+V  +  G+  +  + FRY  FW   H+LP  C  R+    L   +G+ + ++   +G+C GQ +R+RV IDV  PL+R  ++ +G   +   V
Subjt:  SFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWV

Query:  QISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDKIDQRIGLGGEKDSVP-VAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPN
         I YE+LP+FC  CGK+ HLV+D               +  KE         G    +V     +G  E++ +  G  +G            G+SD   N
Subjt:  QISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDKIDQRIGLGGEKDSVP-VAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPN

Query:  ---GGTNRGGIILDPSGLTTDTRQGPQLAEVE-------VTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQ
            G+ +  +  D S L  D  Q   + E++        T V+   +++  +     +  +++ I E ST ++ R  +     N + +G+       T 
Subjt:  ---GGTNRGGIILDPSGLTTDTRQGPQLAEVE-------VTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQ

Query:  ---------TNNRRWKRIARDKSNCSEGGSQEAKFLGSKH-DLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDG-
                 TN +RWKR+AR+K      G+Q    LG K  D+DI    + K      + F V R+G+ GGL LLWK+ + + I S++K HID++IKD  
Subjt:  ---------TNNRRWKRIARDKSNCSEGGSQEAKFLGSKH-DLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDG-

Query:  KREWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTW
           WR TGFYG+P    R  SW LL RL R+ +LPW+V GDFN                  M +F E    C L D G+ G+K+TW
Subjt:  KREWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTW

XP_018826186.1 uncharacterized protein LOC108995141 [Juglans regia] | 3.8e-49 | 26.97
Query:  ETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFD
        ++  ++   L L E+E    +E+D+D   E   + Q  V  KI +++ I+ EV  + + K+W +    K      N F   F    +K R+  G PW FD
Subjt:  ETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFD

Query:  RGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQIS
          ++V ++ +G   L ++ F    FW   H+LP  C T+ K E +  ++G+ E V++   G   G  LRV+V +D+T+P+ R   L V   G   W   S
Subjt:  RGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQIS

Query:  YEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDK--IDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGG
        YEKLP  C  CG + H V      +   E  +++  R+ E K  ++ R G    K+ +  + E   + RK   G  + +   D+ +++     D    G 
Subjt:  YEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDK--IDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGG

Query:  TNRGGIILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARD
          RG    D       T +  +    +  E  +G ++  +     G G+     K+G      R+   +Q  ++  +    ++            R  +D
Subjt:  TNRGGIILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARD

Query:  KSNCSEGGSQEAKFLGSKHDLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDS-IIKDGKREWRLTGFYGDPSVEKRSDSW
          N  E       F+     L  +  +  +  L  +  F V  VG+ GGL+LLW   +   IV+YS++HI+  I ++G+++W LT FYG P   KR +SW
Subjt:  KSNCSEGGSQEAKFLGSKHDLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDS-IIKDGKREWRLTGFYGDPSVEKRSDSW

Query:  LLLDRLSRLYDLPWLVGGDFNY--------------KKPMDAFGECSFRCKLADAGFKGDKFTWRKSRSID
         LL+ L    ++ W + GDFN               +  M+ F E      L D G+KGDKFTW  S   D
Subjt:  LLLDRLSRLYDLPWLVGGDFNY--------------KKPMDAFGECSFRCKLADAGFKGDKFTWRKSRSID

XP_035544642.1 uncharacterized protein LOC109020982 [Juglans regia] | 5.1e-46 | 26.94
Query:  ETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFD
        E  +++  +  L E+E   +V +  D +E    + +  +   I   K IN E F+N + K+W  EG ++    G N FL  F K +++ R++ G PWSFD
Subjt:  ETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFD

Query:  RGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQIS
        R L+     +G + +  + F    FW   H++P    T +  + +  ++G+   V  D  G   G+ +R+RV++ +T+ L R   L     G + WVQ  
Subjt:  RGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQIS

Query:  YEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQI-------RKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDP
        YE+LP FC  CG + H  K     +   +A+   Q          KE  I+Q+            A    R   K     + G  +     +D +  SDP
Subjt:  YEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQI-------RKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDP

Query:  CPNG-----GTNRGGIILDPSGLTTDTRQGPQLAEVEVT--EVNFGPIINTNKGKGINRKVIKEGSTVH---TDRSLSCEQAENELHLGDRKLEGSDTQT
                  T      L  S + TDT    QL +V VT  ++     + T K      + +++ S      + R  S +  E    L  R +  +++  
Subjt:  CPNG-----GTNRGGIILDPSGLTTDTRQGPQLAEVEVT--EVNFGPIINTNKGKGINRKVIKEGSTVH---TDRSLSCEQAENELHLGDRKLEGSDTQT

Query:  NNRRWKRIARDKSNCSEGGSQEAK---FLGSKHDLDIRI-----------VEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIK
           R+ ++ + +    E   + A    FL  K  L   +           VE  K  + FDN F +   G SGGL  LWK+ +   + SYS+ HI  ++K
Subjt:  NNRRWKRIARDKSNCSEGGSQEAK---FLGSKHDLDIRI-----------VEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIK

Query:  DGK--REWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTWRKSR
          K   +W LTGFYG P   +R  SW LL  ++ ++ L WL  GDFN                  +++F +    C L+D GF G+KFTW   R
Subjt:  DGK--REWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTWRKSR

TrEMBL top hits | e value | % identity
A0A2I4F3F9 uncharacterized protein LOC108995141 | 1.8e-49 | 26.97
Query:  ETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFD
        ++  ++   L L E+E    +E+D+D   E   + Q  V  KI +++ I+ EV  + + K+W +    K      N F   F    +K R+  G PW FD
Subjt:  ETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFD

Query:  RGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQIS
          ++V ++ +G   L ++ F    FW   H+LP  C T+ K E +  ++G+ E V++   G   G  LRV+V +D+T+P+ R   L V   G   W   S
Subjt:  RGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQIS

Query:  YEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDK--IDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGG
        YEKLP  C  CG + H V      +   E  +++  R+ E K  ++ R G    K+ +  + E   + RK   G  + +   D+ +++     D    G 
Subjt:  YEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDK--IDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGG

Query:  TNRGGIILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARD
          RG    D       T +  +    +  E  +G ++  +     G G+     K+G      R+   +Q  ++  +    ++            R  +D
Subjt:  TNRGGIILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARD

Query:  KSNCSEGGSQEAKFLGSKHDLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDS-IIKDGKREWRLTGFYGDPSVEKRSDSW
          N  E       F+     L  +  +  +  L  +  F V  VG+ GGL+LLW   +   IV+YS++HI+  I ++G+++W LT FYG P   KR +SW
Subjt:  KSNCSEGGSQEAKFLGSKHDLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDS-IIKDGKREWRLTGFYGDPSVEKRSDSW

Query:  LLLDRLSRLYDLPWLVGGDFNY--------------KKPMDAFGECSFRCKLADAGFKGDKFTWRKSRSID
         LL+ L    ++ W + GDFN               +  M+ F E      L D G+KGDKFTW  S   D
Subjt:  LLLDRLSRLYDLPWLVGGDFNY--------------KKPMDAFGECSFRCKLADAGFKGDKFTWRKSRSID

A0A2N9E9A1 Reverse transcriptase domain-containing protein | 3.8e-47 | 26.27
Query:  LSSLNLQEEELG-GVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMV
        + SL  Q E+      E D  +L     + +  +A K  T +++NAE        +W  +    I     N  +  F  + +++R++ G PW++D+ L+V
Subjt:  LSSLNLQEEELG-GVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMV

Query:  FEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLP
        F+ I+    +  + F+    W   H LP      + A  L +S+GR E +  + A    G  +R+R+++DVT+PL R  K ++   G E W+   YE+LP
Subjt:  FEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLP

Query:  DFCCGCGKLEHLVKD---------YVYAED-----GAEAAQNEQIRKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQ--------GDEMMDVT
        +FC  CG L H  KD          + AED        AA +   RK E K+D       +K   P + + A     T+T ++                T
Subjt:  DFCCGCGKLEHLVKD---------YVYAED-----GAEAAQNEQIRKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQ--------GDEMMDVT

Query:  DQDKVGASDP-----------CPNGGTNRGGII-----LDPS---GLTTDTRQGPQLAEVEVTEVN----FGPIINTNKGKGINRKVIKEGSTVHTDRSL
          +K   S P            PN      G+I      +P     +  +   G  + E E+ E++    F P    NK       ++    T       
Subjt:  DQDKVGASDP-----------CPNGGTNRGGII-----LDPS---GLTTDTRQGPQLAEVEVTEVN----FGPIINTNKGKGINRKVIKEGSTVHTDRSL

Query:  SCEQAENEL-HLGDRKLEGSDTQTNNRRWKRIAR---DKSNCSEGGSQEAKFLGSKHDLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPI
           +   ++ +      + +      + WK++AR   D++    G  Q  +   S    D   +E  +  L FDN F        GGL L WK  + L +
Subjt:  SCEQAENEL-HLGDRKLEGSDTQTNNRRWKRIAR---DKSNCSEGGSQEAKFLGSKHDLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPI

Query:  VSYSKAHIDSIIKDGKRE-WRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKF
         S+S +HID+I+ + + + WRLTGFYG P    R +SW LL RLS L+ LPW   GDFN               +  M  F +    C   D G+ G  F
Subjt:  VSYSKAHIDSIIKDGKRE-WRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKF

Query:  TWRKSRSID
        TW  +R  D
Subjt:  TWRKSRSID

A0A2N9G3I8 Reverse transcriptase domain-containing protein | 3.1e-49 | 25.53
Query:  EVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMVFEDIKGALNLKAMDFR
        E D  +L    ++++  +A K  T ++IN E        +W  + N  ++  G N+ L  F  + + +R++ G PWS+D+ L+ F+ +     ++ + F 
Subjt:  EVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMVFEDIKGALNLKAMDFR

Query:  YAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLPDFCCGCGKLEHLVKDY
           FW   H+LP +C  +  AE L  SIG     ++      +G+ +RVRVK+D+T+PL R  K+ + + GE  WV   YE+LP+FC  CG   H  +D 
Subjt:  YAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLPDFCCGCGKLEHLVKDY

Query:  VYAEDGAEAAQNEQ---------IRKKEDKIDQRIGLGGE----KDSVPVAGEGA-----REQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGGTNRGG
           E+  +    E+         +R  +D++ +R+ +  E      S P+ GE A     +   K +   S   E +D TD +          GG N+  
Subjt:  VYAEDGAEAAQNEQ---------IRKKEDKIDQRIGLGGE----KDSVPVAGEGA-----REQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGGTNRGG

Query:  IILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNKGKGINRKV------IKEGS--TVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIAR--
             +  T +     QL  +++  +N+  +I    G   +++       I  GS   +H   S S +   + L     K+  S  + N   WK++AR  
Subjt:  IILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNKGKGINRKV------IKEGS--TVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIAR--

Query:  -------------DKSNCSEGGSQEAKFLGSKHDLDI-------------------------------------RIVEEAKNYLSFDNGFEVPRVGRSGG
                     +K  C E    E + L S+    +                                       +E+ +  L F++   V    + GG
Subjt:  -------------DKSNCSEGGSQEAKFLGSKHDLDI-------------------------------------RIVEEAKNYLSFDNGFEVPRVGRSGG

Query:  LMLLWKDTLLLPIVSYSKAHIDSIIKDGKRE-WRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN-----------YKKP---MDAFGECSFR
        L L WKD   + I SYS +HID+IIK+G  + WRLTG YG P  ++R ++W LL  L   + LPW   GDFN           + +P   M  F      
Subjt:  LMLLWKDTLLLPIVSYSKAHIDSIIKDGKRE-WRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN-----------YKKP---MDAFGECSFR

Query:  CKLADAGFKGDKFTWRKSR----SIDEKLTSCIQNLKSWDRRRLKGSLKEAISRKEERILLLSNLP
        C L D  ++G  FTW  +R    +   +L   + N++   R  +       +S+ + + L LS LP
Subjt:  CKLADAGFKGDKFTWRKSR----SIDEKLTSCIQNLKSWDRRRLKGSLKEAISRKEERILLLSNLP

A0A2N9GGJ7 Uncharacterized protein | 1.6e-50 | 26.95
Query:  EVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMVFEDIKGALNLKAMDFR
        E D  +L    ++++  +A K  T ++IN E        +W  + N  ++  G N+ L  F  + + +R++ G PWS+D+ L+ F+ +   + ++ + F 
Subjt:  EVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMVFEDIKGALNLKAMDFR

Query:  YAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLPDFCCGCGKLEHLVKDY
           FW   H+LP +C  +  AE L  SIG     ++      + + +RVRVK+D+T+PL R  K+ + + GE  W    YE+L +FC  CG   H  +D 
Subjt:  YAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLPDFCCGCGKLEHLVKDY

Query:  VYAEDGAEAAQNEQ---------IRKKEDKIDQRIGLGGE----KDSVPVAGEGA-----REQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGGTNRGG
           ED  +  + E+         +R  ++++ +R+ +  E      S P+ G+ A     +  +K +   S   E +D TD +     D     G N+  
Subjt:  VYAEDGAEAAQNEQ---------IRKKEDKIDQRIGLGGE----KDSVPVAGEGA-----REQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGGTNRGG

Query:  IILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNKGKGINRKV------IKEGSTVHTDRSLSCEQAENELHLGD--RKLEGSDTQTNNRRWKRIARDK
         I   +  T +     QL  +++  +N+  +I  N G   ++K       I  G+ + + +  S         LG+   K   S  + N   WK++AR K
Subjt:  IILDPSGLTTDTRQGPQLAEVEVTEVNFGPIINTNKGKGINRKV------IKEGSTVHTDRSLSCEQAENELHLGD--RKLEGSDTQTNNRRWKRIARDK

Query:  SNCSEGGSQEAKF-LGSKHDLDIRIVEEAKNYLSFDNGFEVPRV-GRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDGKRE-WRLTGFYGDPSVEKRSDS
            +G     KF +  K   D  +  E ++  S    + V +   + GGL L WKD   + I SYS +HID+IIK+G  + WRLTG YG P  +KR ++
Subjt:  SNCSEGGSQEAKF-LGSKHDLDIRIVEEAKNYLSFDNGFEVPRV-GRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDGKRE-WRLTGFYGDPSVEKRSDS

Query:  WLLLDRLSRLYDLPWLVGGDFN-----------YKKP---MDAFGECSFRCKLADAGFKGDKFTWRKSR----SIDEKLTSCIQNLKSWDRRRLKGSLKE
        W LL  L   + LPW   GDFN           + +P   M  F      C L D  ++G  FTW  +R    +   +L   + N++      +      
Subjt:  WLLLDRLSRLYDLPWLVGGDFN-----------YKKP---MDAFGECSFRCKLADAGFKGDKFTWRKSR----SIDEKLTSCIQNLKSWDRRRLKGSLKE

Query:  AISRKEERILLLSNLP
         +S+ + + L LS LP
Subjt:  AISRKEERILLLSNLP

A0A5C7H9Y2 CCHC-type domain-containing protein | 3.4e-64 | 30.2
Query:  MEEETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPW
        ME +  S++   L+L ++  G +  +     E  E+     +  K  T K IN E FK+ +  IW  +  + +E  G N+F   F+   ++ RI+ GGPW
Subjt:  MEEETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPW

Query:  SFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWV
         FD+ L+V  +  G+  +  + FRY  FW   H+LP  C  R+    L   +G+ + ++   +G+C GQ +R+RV IDV  PL+R  ++ +G   +   V
Subjt:  SFDRGLMVFEDIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWV

Query:  QISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDKIDQRIGLGGEKDSVP-VAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPN
         I YE+LP+FC  CGK+ HLV+D               +  KE         G    +V     +G  E++ +  G  +G            G+SD   N
Subjt:  QISYEKLPDFCCGCGKLEHLVKDYVYAEDGAEAAQNEQIRKKEDKIDQRIGLGGEKDSVP-VAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPN

Query:  ---GGTNRGGIILDPSGLTTDTRQGPQLAEVE-------VTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQ
            G+ +  +  D S L  D  Q   + E++        T V+   +++  +     +  +++ I E ST ++ R  +     N + +G+       T 
Subjt:  ---GGTNRGGIILDPSGLTTDTRQGPQLAEVE-------VTEVNFGPIINTNK----GKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQ

Query:  ---------TNNRRWKRIARDKSNCSEGGSQEAKFLGSKH-DLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDG-
                 TN +RWKR+AR+K      G+Q    LG K  D+DI    + K      + F V R+G+ GGL LLWK+ + + I S++K HID++IKD  
Subjt:  ---------TNNRRWKRIARDKSNCSEGGSQEAKFLGSKH-DLDIRIVEEAKNYLSFDNGFEVPRVGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDG-

Query:  KREWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTW
           WR TGFYG+P    R  SW LL RL R+ +LPW+V GDFN                  M +F E    C L D G+ G+K+TW
Subjt:  KREWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFN--------------YKKPMDAFGECSFRCKLADAGFKGDKFTW

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 1.3e-07 | 27.08
Query:  EWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICVGGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSE
        +W  P   + K N DA+W+      GIGW++R+ +  ++ +G + + R  ++   E +A+   +   +R    + K +I +SD+  ++  LN DD     
Subjt:  EWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICVGGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSE

Query:  LSNFVGAINSLASCFPVVKFVNCPRSKN-LAHNIVRNVCKFGDF
        L   +  I  L   F  VKF   PR  N +A  I R    F ++
Subjt:  LSNFVGAINSLASCFPVVKFVNCPRSKN-LAHNIVRNVCKFGDF

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1) | 3.9e-04 | 25
Query:  IWKVWSVK---IFISINNCPVSDADKNKVIRDISCAKEELFLTEKLNTPSARSENCESHG---EWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRS
        +W++W  +   +F  I+  P   A K +     +    E  + +  N+ S    N    G   EW+PP   Y K N D+ +    +     W+IRD N  
Subjt:  IWKVWSVK---IFISINNCPVSDADKNKVIRDISCAKEELFLTEKLNTPSARSENCESHG---EWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRS

Query:  LICVGGKQIKRRWSIKVLEAKAILEGLE
        +I  G  ++++ +S    EA   L  L+
Subjt:  LICVGGKQIKRRWSIKVLEAKAILEGLE

AT4G29090.1 Ribonuclease H-like superfamily protein | 7.4e-11 | 27
Query:  IIWKVWSVKIFISINNCPVSDADKNKVIRDISCAKEELFL-TEKLNTPSARSENCESHGEWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICV
        ++W++W  +  +        + +  +V+R      EE  + TE  +  +    N  S G W PP   + K N DA+W       GIGWV+R+    +  +
Subjt:  IIWKVWSVKIFISINNCPVSDADKNKVIRDISCAKEELFL-TEKLNTPSARSENCESHGEWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICV

Query:  GGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSELSNFVGAINSLASCFPVVKFVNCPRSKN-LAHNIVRNVCKF
        G + + +  S+   E +A+   + + +R +     ++I +SDS  +I+ LN +DE    L   +  +  L S F  VKFV  PR  N LA  + R    F
Subjt:  GGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSELSNFVGAINSLASCFPVVKFVNCPRSKN-LAHNIVRNVCKF

AT5G19270.1 unknown protein | 7.2e-06 | 25.69
Query:  WTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICVGGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSEL
        WTPPD    K N +A W N   + G+ W+ RD N   +        R  +  + E + IL  +++     + +   +I+ SD    I  L  +  D    
Subjt:  WTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICVGGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSEL

Query:  SNFVGAINSLASCFPVVKF--VNCPRSKNLAHNIVRNVCKFGDF
        + ++  I      FP + F  V+CP + ++A +I  +V + G F
Subjt:  SNFVGAINSLASCFPVVKF--VNCPRSKNLAHNIVRNVCKFGDF

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 1.8e-04 | 19.79
Query:  IIWKVWSVKIFISINNCPVS-DADKNKVIRDISCAKEELFLTEKLNTPSARSENCESHGEWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICV
        ++W++W     +  N+            + D     +     E+ N    R+ +   + +W+PP  +  K N DAS      + G+GW++R+   ++I  
Subjt:  IIWKVWSVKIFISINNCPVS-DADKNKVIRDISCAKEELFLTEKLNTPSARSENCESHGEWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICV

Query:  GGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSELSNFVGAINSLASCFPVVKFVNCPRSKN
        G  + + R + +  E   ++  ++A       H+K +I + D+  + + +N    +   L +F+  I S    F  ++F    R +N
Subjt:  GGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDSDSSEVIKCLNGDDEDLSELSNFVGAINSLASCFPVVKFVNCPRSKN


Sequences
CDS sequence
ATGGAGGAAGAAACGTTCAGCAAACGACTATCGAGTCTAAATCTACAGGAGGAGGAACTGGGAGGAGTGGTCGAGGTCGATGATGATGAGCTCGAGGAATTCGAAAAAAG
GAATCAAGACGATGTAGCCTGTAAAATTTGTACAACAAAATCCATAAATGCTGAAGTTTTTAAAAACATAGTCCCGAAAATATGGAACCTAGAGGGAAACATCAAGATTG
AGGCGGCAGGTAGAAACCTTTTCCTATGTTCCTTTAGAAAAAAGAAAGAAAAAGACAGAATCGTCTATGGAGGACCTTGGAGCTTTGATAGGGGCTTGATGGTTTTCGAA
GACATCAAAGGAGCGTTGAATCTAAAAGCTATGGACTTCAGGTACGCATATTTCTGGACTAACTTTCATGATCTCCCTAGAGTGTGTTTCACCAGGAAAAAAGCTGAAGC
ATTGTGGAACTCTATCGGAAGATTTGAAGGGGTGGAATTGGACAGGGCAGGAAAATGCAGCGGGCAAACTCTTAGAGTGAGAGTTAAGATAGACGTAACCCGACCCCTGA
GAAGAGCGACCAAGCTGAAGGTTGGATCAATGGGTGAAGAGATTTGGGTTCAAATAAGCTATGAGAAGCTTCCAGACTTCTGCTGTGGTTGTGGTAAATTAGAGCATTTA
GTTAAGGACTATGTTTACGCAGAGGATGGCGCGGAGGCAGCCCAGAACGAACAAATTAGGAAAAAGGAGGATAAGATAGATCAAAGAATAGGGTTAGGAGGTGAGAAGGA
TTCGGTTCCCGTGGCCGGAGAAGGTGCCAGAGAACAAAGGAAAACCTCAACGGGCGATTCTCAGGGCGATGAGATGATGGATGTGACGGATCAGGACAAGGTCGGCGCAT
CAGATCCATGCCCTAATGGTGGTACCAACCGGGGGGGTATCATATTGGACCCATCTGGATTAACTACTGATACAAGGCAAGGCCCACAGCTGGCAGAAGTGGAGGTAACT
GAGGTAAATTTTGGACCTATCATTAATACTAATAAGGGCAAGGGGATTAATAGAAAAGTTATCAAAGAAGGAAGTACTGTCCATACCGACAGATCCTTGAGCTGTGAGCA
AGCCGAAAACGAGTTACATCTGGGCGATAGGAAGTTGGAGGGTTCCGACACTCAGACTAATAACAGAAGATGGAAGAGAATCGCAAGAGATAAATCCAATTGCTCGGAAG
GGGGATCTCAGGAAGCCAAGTTTTTGGGAAGCAAGCACGATCTTGATATTAGGATAGTGGAGGAAGCCAAAAATTATCTGAGTTTTGACAATGGTTTCGAGGTTCCAAGG
GTGGGTAGAAGTGGAGGCCTTATGTTGTTATGGAAGGATACCCTTCTACTCCCGATTGTCTCCTACTCAAAGGCGCACATCGATTCTATTATTAAAGATGGTAAAAGAGA
GTGGAGATTAACAGGCTTCTATGGTGATCCTTCTGTGGAAAAAAGATCAGACTCCTGGCTCCTTCTTGATCGGTTGAGTAGGCTTTATGACCTCCCATGGCTGGTGGGAG
GAGACTTTAATTACAAGAAACCTATGGATGCTTTCGGCGAGTGCAGTTTTAGATGTAAGTTGGCTGATGCAGGGTTCAAGGGAGATAAATTCACGTGGAGGAAGAGCAGA
AGTATTGACGAAAAACTTACCTCCTGCATTCAAAATTTGAAATCCTGGGATAGGCGTAGACTTAAAGGCTCCCTAAAAGAGGCTATCAGTAGGAAAGAGGAGCGGATCCT
TCTCCTTTCAAATCTCCCTAACCCAAATGGCCTAGCTAACAGAAGACTTTGTGACTTGATCGGAGACGATGGGATTTGGAAGGAGGATGAGGTTCGAGATGGGTTCATCC
CGCAAGACATTAAGGATATCTTGAACACTCTGATAGGCCCTAGAGGGTCTAAGGACGAAATTATCTTGGGAGAGGACCCCAAAGGGCTTTTTTCGGTTAAAAGTGCATAT
ACTTTGGCTAAAAGTTATAGCATGAGTCCTTCCACCTCCTCGGTGGATTCTAGGGGAGCTAATATGCAGCTTATGGAAAGCAAAGACTCTCCAAGAGCAAAGTTATGTGT
CTGGAAAGTTATCAAAGATGTCATTCCCACTAAAGAAAACATTATTAGAAGAGGTATCGATTCTAACCCTACTTATTGGGATTGGATGACTAAAAATTTTTCTGTGGATG
AGCTAGATTTGGCTATCATTATTATCTGGAAGGTGTGGAGCGTCAAAATTTTTATATCTATTAACAATTGTCCTGTTTCGGATGCAGATAAAAATAAAGTCATTAGAGAT
ATCTCGTGCGCTAAGGAGGAGCTCTTCTTGACGGAGAAGCTTAACACCCCTTCGGCAAGATCGGAGAACTGCGAGAGTCATGGAGAGTGGACACCACCGGATCCGAACTA
TTGGAAGCTAAATTGTGACGCTTCCTGGAAGAATATTGCAAATATTGGAGGTATTGGTTGGGTTATCCGTGACTTTAATAGATCTCTAATTTGTGTAGGAGGGAAACAAA
TTAAAAGAAGATGGTCTATTAAAGTGTTGGAAGCAAAAGCAATTCTTGAGGGATTGGAAGCGTTTAATAGGTGTGAAGTCGTTCACCGCAAGTGGCTGATAGTGGACTCA
GATTCGAGCGAGGTGATAAAGTGCCTGAATGGGGATGATGAAGACCTTTCTGAGCTAAGCAATTTTGTTGGAGCTATTAATAGTCTAGCTAGTTGCTTTCCAGTTGTAAA
ATTTGTTAACTGCCCTAGGTCTAAAAACTTAGCTCATAATATTGTTAGAAACGTTTGTAAGTTTGGTGATTTTGGAGGAGCTTGCTGCTCCCTTCTTCTTCTGACACTAT
TAGGGTCGGTTTTGGCGAGTACCCACCTTGGATCTCCGATTTGTTGCCTATGGGCTGCCTCCCTGGTGGGTCCCTTCTGGGCTAAATGTTTCTCTGTTGCGTTGATCTTT
ACTTAA
mRNA sequence
ATGGAGGAAGAAACGTTCAGCAAACGACTATCGAGTCTAAATCTACAGGAGGAGGAACTGGGAGGAGTGGTCGAGGTCGATGATGATGAGCTCGAGGAATTCGAAAAAAG
GAATCAAGACGATGTAGCCTGTAAAATTTGTACAACAAAATCCATAAATGCTGAAGTTTTTAAAAACATAGTCCCGAAAATATGGAACCTAGAGGGAAACATCAAGATTG
AGGCGGCAGGTAGAAACCTTTTCCTATGTTCCTTTAGAAAAAAGAAAGAAAAAGACAGAATCGTCTATGGAGGACCTTGGAGCTTTGATAGGGGCTTGATGGTTTTCGAA
GACATCAAAGGAGCGTTGAATCTAAAAGCTATGGACTTCAGGTACGCATATTTCTGGACTAACTTTCATGATCTCCCTAGAGTGTGTTTCACCAGGAAAAAAGCTGAAGC
ATTGTGGAACTCTATCGGAAGATTTGAAGGGGTGGAATTGGACAGGGCAGGAAAATGCAGCGGGCAAACTCTTAGAGTGAGAGTTAAGATAGACGTAACCCGACCCCTGA
GAAGAGCGACCAAGCTGAAGGTTGGATCAATGGGTGAAGAGATTTGGGTTCAAATAAGCTATGAGAAGCTTCCAGACTTCTGCTGTGGTTGTGGTAAATTAGAGCATTTA
GTTAAGGACTATGTTTACGCAGAGGATGGCGCGGAGGCAGCCCAGAACGAACAAATTAGGAAAAAGGAGGATAAGATAGATCAAAGAATAGGGTTAGGAGGTGAGAAGGA
TTCGGTTCCCGTGGCCGGAGAAGGTGCCAGAGAACAAAGGAAAACCTCAACGGGCGATTCTCAGGGCGATGAGATGATGGATGTGACGGATCAGGACAAGGTCGGCGCAT
CAGATCCATGCCCTAATGGTGGTACCAACCGGGGGGGTATCATATTGGACCCATCTGGATTAACTACTGATACAAGGCAAGGCCCACAGCTGGCAGAAGTGGAGGTAACT
GAGGTAAATTTTGGACCTATCATTAATACTAATAAGGGCAAGGGGATTAATAGAAAAGTTATCAAAGAAGGAAGTACTGTCCATACCGACAGATCCTTGAGCTGTGAGCA
AGCCGAAAACGAGTTACATCTGGGCGATAGGAAGTTGGAGGGTTCCGACACTCAGACTAATAACAGAAGATGGAAGAGAATCGCAAGAGATAAATCCAATTGCTCGGAAG
GGGGATCTCAGGAAGCCAAGTTTTTGGGAAGCAAGCACGATCTTGATATTAGGATAGTGGAGGAAGCCAAAAATTATCTGAGTTTTGACAATGGTTTCGAGGTTCCAAGG
GTGGGTAGAAGTGGAGGCCTTATGTTGTTATGGAAGGATACCCTTCTACTCCCGATTGTCTCCTACTCAAAGGCGCACATCGATTCTATTATTAAAGATGGTAAAAGAGA
GTGGAGATTAACAGGCTTCTATGGTGATCCTTCTGTGGAAAAAAGATCAGACTCCTGGCTCCTTCTTGATCGGTTGAGTAGGCTTTATGACCTCCCATGGCTGGTGGGAG
GAGACTTTAATTACAAGAAACCTATGGATGCTTTCGGCGAGTGCAGTTTTAGATGTAAGTTGGCTGATGCAGGGTTCAAGGGAGATAAATTCACGTGGAGGAAGAGCAGA
AGTATTGACGAAAAACTTACCTCCTGCATTCAAAATTTGAAATCCTGGGATAGGCGTAGACTTAAAGGCTCCCTAAAAGAGGCTATCAGTAGGAAAGAGGAGCGGATCCT
TCTCCTTTCAAATCTCCCTAACCCAAATGGCCTAGCTAACAGAAGACTTTGTGACTTGATCGGAGACGATGGGATTTGGAAGGAGGATGAGGTTCGAGATGGGTTCATCC
CGCAAGACATTAAGGATATCTTGAACACTCTGATAGGCCCTAGAGGGTCTAAGGACGAAATTATCTTGGGAGAGGACCCCAAAGGGCTTTTTTCGGTTAAAAGTGCATAT
ACTTTGGCTAAAAGTTATAGCATGAGTCCTTCCACCTCCTCGGTGGATTCTAGGGGAGCTAATATGCAGCTTATGGAAAGCAAAGACTCTCCAAGAGCAAAGTTATGTGT
CTGGAAAGTTATCAAAGATGTCATTCCCACTAAAGAAAACATTATTAGAAGAGGTATCGATTCTAACCCTACTTATTGGGATTGGATGACTAAAAATTTTTCTGTGGATG
AGCTAGATTTGGCTATCATTATTATCTGGAAGGTGTGGAGCGTCAAAATTTTTATATCTATTAACAATTGTCCTGTTTCGGATGCAGATAAAAATAAAGTCATTAGAGAT
ATCTCGTGCGCTAAGGAGGAGCTCTTCTTGACGGAGAAGCTTAACACCCCTTCGGCAAGATCGGAGAACTGCGAGAGTCATGGAGAGTGGACACCACCGGATCCGAACTA
TTGGAAGCTAAATTGTGACGCTTCCTGGAAGAATATTGCAAATATTGGAGGTATTGGTTGGGTTATCCGTGACTTTAATAGATCTCTAATTTGTGTAGGAGGGAAACAAA
TTAAAAGAAGATGGTCTATTAAAGTGTTGGAAGCAAAAGCAATTCTTGAGGGATTGGAAGCGTTTAATAGGTGTGAAGTCGTTCACCGCAAGTGGCTGATAGTGGACTCA
GATTCGAGCGAGGTGATAAAGTGCCTGAATGGGGATGATGAAGACCTTTCTGAGCTAAGCAATTTTGTTGGAGCTATTAATAGTCTAGCTAGTTGCTTTCCAGTTGTAAA
ATTTGTTAACTGCCCTAGGTCTAAAAACTTAGCTCATAATATTGTTAGAAACGTTTGTAAGTTTGGTGATTTTGGAGGAGCTTGCTGCTCCCTTCTTCTTCTGACACTAT
TAGGGTCGGTTTTGGCGAGTACCCACCTTGGATCTCCGATTTGTTGCCTATGGGCTGCCTCCCTGGTGGGTCCCTTCTGGGCTAAATGTTTCTCTGTTGCGTTGATCTTT
ACTTAA
Protein sequence
MEEETFSKRLSSLNLQEEELGGVVEVDDDELEEFEKRNQDDVACKICTTKSINAEVFKNIVPKIWNLEGNIKIEAAGRNLFLCSFRKKKEKDRIVYGGPWSFDRGLMVFE
DIKGALNLKAMDFRYAYFWTNFHDLPRVCFTRKKAEALWNSIGRFEGVELDRAGKCSGQTLRVRVKIDVTRPLRRATKLKVGSMGEEIWVQISYEKLPDFCCGCGKLEHL
VKDYVYAEDGAEAAQNEQIRKKEDKIDQRIGLGGEKDSVPVAGEGAREQRKTSTGDSQGDEMMDVTDQDKVGASDPCPNGGTNRGGIILDPSGLTTDTRQGPQLAEVEVT
EVNFGPIINTNKGKGINRKVIKEGSTVHTDRSLSCEQAENELHLGDRKLEGSDTQTNNRRWKRIARDKSNCSEGGSQEAKFLGSKHDLDIRIVEEAKNYLSFDNGFEVPR
VGRSGGLMLLWKDTLLLPIVSYSKAHIDSIIKDGKREWRLTGFYGDPSVEKRSDSWLLLDRLSRLYDLPWLVGGDFNYKKPMDAFGECSFRCKLADAGFKGDKFTWRKSR
SIDEKLTSCIQNLKSWDRRRLKGSLKEAISRKEERILLLSNLPNPNGLANRRLCDLIGDDGIWKEDEVRDGFIPQDIKDILNTLIGPRGSKDEIILGEDPKGLFSVKSAY
TLAKSYSMSPSTSSVDSRGANMQLMESKDSPRAKLCVWKVIKDVIPTKENIIRRGIDSNPTYWDWMTKNFSVDELDLAIIIIWKVWSVKIFISINNCPVSDADKNKVIRD
ISCAKEELFLTEKLNTPSARSENCESHGEWTPPDPNYWKLNCDASWKNIANIGGIGWVIRDFNRSLICVGGKQIKRRWSIKVLEAKAILEGLEAFNRCEVVHRKWLIVDS
DSSEVIKCLNGDDEDLSELSNFVGAINSLASCFPVVKFVNCPRSKNLAHNIVRNVCKFGDFGGACCSLLLLTLLGSVLASTHLGSPICCLWAASLVGPFWAKCFSVALIF
T