; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019863 (gene) of Snake gourd v1 genome

Gene IDTan0019863
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG06:32938000..32943078
RNA-Seq ExpressionTan0019863
SyntenyTan0019863
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU38731.1 hypothetical protein TSUD_208420 [Trifolium subterraneum]6.2e-5127.81Show/hide
Query:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQ-GDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN
        ++ GR GG+ ++W++  + SI ++S  +ID+ + D+Q G WR TGFYG P    R +SW  L + S+   LPW + GDFN++LS +EK G + + + L+N
Subjt:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQ-GDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN

Query:  NFADCIFRCNLVDTGCRGNKFTWRKS-RHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEA
         F + +    LVD   +G  FTW KS     A +E+LDR   N          ++  LT  ASDH P+L  L+ D     HR  +   +FE  W    E 
Subjt:  NFADCIFRCNLVDTGCRGNKFTWRKS-RHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEA

Query:  RVLIRGHW----MESISRSPADLKEKITSCI---------LKLKTWDRQRLKG--SLKK---------------AIQRKEETWNEEAARNGVSPQDYID-
           ++ HW      +I+R   D    +TS             +  WD+  L    S+KK                +    + W+    R G+   D  D 
Subjt:  RVLIRGHW----MESISRSPADLKEKITSCI---------LKLKTWDRQRLKG--SLKK---------------AIQRKEETWNEEAARNGVSPQDYID-

Query:  IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCK
        I++TPL      D I W  +  G+++VKSAY        G    +V EG     W+ +W+ +   + K  +W++  + +PT+  +  RGV     C  C 
Subjt:  IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCK

Query:  VHREDTIHVMWGCKIAKRIWINF-----IPEMETLLYTCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQA
           ED+ H+ + C+ +   W        I +   L  + K      + ++ M+  LN +       +MW+IWK RN +     ++ N+   ++ R +   
Subjt:  VHREDTIHVMWGCKIAKRIWINF-----IPEMETLLYTCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQA

Query:  REELCSLDRRNTNFQKARLESCESHG----EWTPPEANTWKLNCDASWNDNLEVGGIG-----------WVKEKWPIKLL-----KARAILDGLKAVTSC
        R        RN    + R  + + H     EWT P+A TWK N DAS++ +    GIG             K +W   +L     +A  +L  LK V   
Subjt:  REELCSLDRRNTNFQKARLESCESHG----EWTPPEANTWKLNCDASWNDNLEVGGIG-----------WVKEKWPIKLL-----KARAILDGLKAVTSC

Query:  DVNHRKMMTLETDSSEVVKNIN
           H   +  E DS  VV   N
Subjt:  DVNHRKMMTLETDSSEVVKNIN

KAF7824053.1 hypothetical protein G2W53_022197 [Senna tora]5.1e-5327.1Show/hide
Query:  RSGGLMLLWKEPTHLSIISFSKANIDVIIKD--IQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFA
        R+GGL L W     L++ SFS  +IDVI+ D  +   WR TG +G P E+++  +W LL   +S  DLPWL  GDFNE++   EK GG  KS + M  F 
Subjt:  RSGGLMLLWKEPTHLSIISFSKANIDVIIKD--IQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFA

Query:  DCIFRCNLVDTGCRGNKFTWRKSR--HHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTS-HRRRRPNARFEENWVGCEEAR
        D    C   D G +G  FTW   R   HN  +ERLDR F  +  L++     ++H++  +SDH  +   + FD++  S H RR+   RFEE W   E  +
Subjt:  DCIFRCNLVDTGCRGNKFTWRKSR--HHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTS-HRRRRPNARFEENWVGCEEAR

Query:  VLIRGHWMESISRSPADLKEKITS---------------------CILK------------------LKTWDRQRLKG-----------------SLKKA
         +I   W  + + S     EK+TS                     CI+K                  +  W+   +                    +   
Subjt:  VLIRGHWMESISRSPADLKEKITS---------------------CILK------------------LKTWDRQRLKG-----------------SLKKA

Query:  IQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLG
        I  +  TW  +   N   P +   I++ PL  R  +D  IW  +    +SVKSAYH+  +     +S   S  S+K  W  +WK     + ++  W++  
Subjt:  IQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLG

Query:  DIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW----INFIPEMETLLYTCKREWTPTDCWDWMISNLNVEEIES---TIIIMWNIWKA
        + +PT +N+ KRGV I   C  C++  EDT+H   GC  A+ +W    + F P +           +     DW+ S L  E IE       + W IW  
Subjt:  DIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW----INFIPEMETLLYTCKREWTPTDCWDWMISNLNVEEIES---TIIIMWNIWKA

Query:  R-NFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCESH--GEWTPPEANTWKLNCDASWNDNLE----------VGGIGWVKEKW-
        R NF+   K++ V +     + ++     E    D+     Q    + C S    +W  P++   KLN DA+   + E           G + +V     
Subjt:  R-NFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCESH--GEWTPPEANTWKLNCDASWNDNLE----------VGGIGWVKEKW-

Query:  ----PIKLLKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIAR
               + +A AI  GL  V   DV +   + +ETD     K+ +  A+  S +   ++   ++ +          PR  N VAH +A+
Subjt:  ----PIKLLKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIAR

KAF8408042.1 hypothetical protein HHK36_007182 [Tetracentron sinense]3.2e-5527.57Show/hide
Query:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDI--QGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLM
        E VGRSGGL LLW++   +S+ S+SK +IDV++  +  +  WR TG YG P    +  +W L+   S  + +PW+  GDFNE+   EEK G   K+   M
Subjt:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDI--QGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLM

Query:  NNFADCIFRCNLVDTGCRGNKFTW-RKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEE
          F + I  C+L+  G  GN FTW  K       +ERLDR              ++ HL+ H SDH P+L  L FD       R++ + RFE  W+   E
Subjt:  NNFADCIFRCNLVDTGCRGNKFTW-RKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEE

Query:  ARVLIRGHW----MESISRSPADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFS
           +I   W      S    P D K  +                      I + ++TWN         P +   I + PL  R   D  +W    KG FS
Subjt:  ARVLIRGHW----MESISRSPADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFS

Query:  VKSAYHLAKRYRIGSSSSKVSEGS-------SKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRI
        V+SAYHL    R   S++  S  S       S   W+ +W+     + K+ +WKV  +I+P + N+ KR + +   C  C    E  +HV+  C  A+++
Subjt:  VKSAYHLAKRYRIGSSSSKVSEGS-------SKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRI

Query:  WINFIPEMETLLYTCKREWTPTDCWDWMISNLNVEE-IESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCE
        W+      +  L +          W   I   + EE + +  +I W+IWK RN     +Y+F     +K+       R      D  N N  +A  ES  
Subjt:  WINFIPEMETLLYTCKREWTPTDCWDWMISNLNVEE-IESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCE

Query:  SHGEWTPPEANTWKLNCDASWNDNLEVGGIGWV----------------KEKWPIKLLKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEG
        +   W  P  + +K+N D + +      G+G V                       +++A A  +GLK      +     + LE+DS   ++ +  + E 
Subjt:  SHGEWTPPEANTWKLNCDASWNDNLEVGGIGWV----------------KEKWPIKLLKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEG

Query:  MS
         S
Subjt:  MS

MBA0733287.1 hypothetical protein [Gossypium gossypioides]4.1e-4727.62Show/hide
Query:  LSIISFSKANIDVIIKDIQG--DWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCR
        + + SFSK +IDV+I D +    WRFTGFYG P  ++R  SW  L R +S  ++PWLV  DFNE++   EK GG P+ ++ M  F   +  C L D G  
Subjt:  LSIISFSKANIDVIIKDIQG--DWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMNNFADCIFRCNLVDTGCR

Query:  GNKFTW-RKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEARVLIRGHWMESISRSPA
        G+ FTW R +      +ERLDR  +N   +    +V + HL+   SDH P+L     DT     +    + +FE  W+  +    +++G W      S  
Subjt:  GNKFTW-RKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEARVLIRGHWMESISRSPA

Query:  DLKEKITSCILKLKTWDRQRL-KGSLKKAIQRKE------------ETWNEEAARNGVSPQDYI-DIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLA
        DL +K+    + LK W  + + K  +  +  R+E              WN    +N   P+D +  I+  PL      D   W  ++ G FSV+S Y L 
Subjt:  DLKEKITSCILKLKTWDRQRL-KGSLKKAIQRKE------------ETWNEEAARNGVSPQDYI-DIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLA

Query:  KRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW--INFIPEMETLLY
        +   +   S  + +  +K+ +  LW  +  S+    VW++  D IP  +N+  R V  N  C  C    E ++HV   C     +W  +NF   M    +
Subjt:  KRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW--INFIPEMETLLY

Query:  TCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINTVKYLFVNDDL-LKIIRDISQAREELCSLDRRNTNFQKARLESCESH--GEWTPPEAN
        T   EW       W+    N ++  S    +W IW +RN +   + +     L L I R +S         +++  N  K +  +C S+   E TP    
Subjt:  TCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINTVKYLFVNDDL-LKIIRDISQAREELCSLDRRNTNFQKARLESCESH--GEWTPPEAN

Query:  TWKLNCDASWNDNLE---VGGIGWVKEKWPIKL-------------LKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVEAI
        T +++ DA+++ N      G +GW      + L              +A A L+G+K   S  ++  K+M    DS  V+K     +   S +   +  I
Subjt:  TWKLNCDASWNDNLE---VGGIGWVKEKWPIKL-------------LKARAILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVEAI

Query:  ANVES--RSLIVKFVKCPRSSNSVAHNIAR
           ++  + LI +++   RS N  AH IA+
Subjt:  ANVES--RSLIVKFVKCPRSSNSVAHNIAR

RYR18269.1 hypothetical protein Ahy_B03g062876 [Arachis hypogaea]1.1e-4224.46Show/hide
Query:  EKVGRSGGLMLLWKEPTHLSII----SFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKK
        E  G SGGL LLWK  T++++     ++ KANI+ I  D+  +W+    YG+P  + R   W  L   +   ++P    GDFN++L++ EK G  P+ + 
Subjt:  EKVGRSGGLMLLWKEPTHLSII----SFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKK

Query:  LMNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNA-TKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHR-RRRPNARFEENWVG
         +  F   +   +L+D   +GNK+TW  +  +N  T++RLDR  +N   L     V++       SDH  ++       ++T  R R + + +FE  WV 
Subjt:  LMNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNA-TKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHR-RRRPNARFEENWVG

Query:  CEEARVLIRGHW-MESISRSP-ADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWN-EEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGI
         EE + +I+  W  E  SR+      +K   CI +L  W  ++ K + KK  ++K E    +EAA           I  TP+     KD  +W     G 
Subjt:  CEEARVLIRGHW-MESISRSP-ADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWN-EEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGI

Query:  FSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKA----KTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW
        ++V++ YH+AK  +      ++ + S+ + W  +W+A        + ++ +WK +  I+P   N+ +R + + P C  C+   E   H +  C   + +W
Subjt:  FSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKA----KTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW

Query:  INFIPEMETLLYTCK--REWTPTDCWDWMISNLNVEEIESTI----IIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARL
             ++    Y  +  REW         I   +  E E T+     + W IWKARN      ++F   +++   + I+Q+        +      KA +
Subjt:  INFIPEMETLLYTCK--REWTPTDCWDWMISNLNVEEIESTI----IIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARL

Query:  ESCESHG-----EWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLL---------------KARAILDGLKAVTSCDVNHRKMMTLETDSSEVVK
              G      W PP  N  K+N DA++  +     +      W  K++               +A+A  + L  + +  + +     +ETDS  +V+
Subjt:  ESCESHG-----EWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLL---------------KARAILDGLKAVTSCDVNHRKMMTLETDSSEVVK

Query:  NINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIA
         I      +++    +  I  +   +  V     PR  N +AH +A
Subjt:  NINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIA

TrEMBL top hitse value%identityAlignment
A0A2Z6N4T0 Uncharacterized protein3.0e-5127.81Show/hide
Query:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQ-GDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN
        ++ GR GG+ ++W++  + SI ++S  +ID+ + D+Q G WR TGFYG P    R +SW  L + S+   LPW + GDFN++LS +EK G + + + L+N
Subjt:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQ-GDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN

Query:  NFADCIFRCNLVDTGCRGNKFTWRKS-RHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEA
         F + +    LVD   +G  FTW KS     A +E+LDR   N          ++  LT  ASDH P+L  L+ D     HR  +   +FE  W    E 
Subjt:  NFADCIFRCNLVDTGCRGNKFTWRKS-RHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEA

Query:  RVLIRGHW----MESISRSPADLKEKITSCI---------LKLKTWDRQRLKG--SLKK---------------AIQRKEETWNEEAARNGVSPQDYID-
           ++ HW      +I+R   D    +TS             +  WD+  L    S+KK                +    + W+    R G+   D  D 
Subjt:  RVLIRGHW----MESISRSPADLKEKITSCI---------LKLKTWDRQRLKG--SLKK---------------AIQRKEETWNEEAARNGVSPQDYID-

Query:  IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCK
        I++TPL      D I W  +  G+++VKSAY        G    +V EG     W+ +W+ +   + K  +W++  + +PT+  +  RGV     C  C 
Subjt:  IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCK

Query:  VHREDTIHVMWGCKIAKRIWINF-----IPEMETLLYTCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQA
           ED+ H+ + C+ +   W        I +   L  + K      + ++ M+  LN +       +MW+IWK RN +     ++ N+   ++ R +   
Subjt:  VHREDTIHVMWGCKIAKRIWINF-----IPEMETLLYTCKREWTPTDCWDWMISNLNVEEIESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQA

Query:  REELCSLDRRNTNFQKARLESCESHG----EWTPPEANTWKLNCDASWNDNLEVGGIG-----------WVKEKWPIKLL-----KARAILDGLKAVTSC
        R        RN    + R  + + H     EWT P+A TWK N DAS++ +    GIG             K +W   +L     +A  +L  LK V   
Subjt:  REELCSLDRRNTNFQKARLESCESHG----EWTPPEANTWKLNCDASWNDNLEVGGIG-----------WVKEKWPIKLL-----KARAILDGLKAVTSC

Query:  DVNHRKMMTLETDSSEVVKNIN
           H   +  E DS  VV   N
Subjt:  DVNHRKMMTLETDSSEVVKNIN

A0A444ZVS3 Uncharacterized protein5.1e-4324.46Show/hide
Query:  EKVGRSGGLMLLWKEPTHLSII----SFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKK
        E  G SGGL LLWK  T++++     ++ KANI+ I  D+  +W+    YG+P  + R   W  L   +   ++P    GDFN++L++ EK G  P+ + 
Subjt:  EKVGRSGGLMLLWKEPTHLSII----SFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKK

Query:  LMNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNA-TKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHR-RRRPNARFEENWVG
         +  F   +   +L+D   +GNK+TW  +  +N  T++RLDR  +N   L     V++       SDH  ++       ++T  R R + + +FE  WV 
Subjt:  LMNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNA-TKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHR-RRRPNARFEENWVG

Query:  CEEARVLIRGHW-MESISRSP-ADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWN-EEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGI
         EE + +I+  W  E  SR+      +K   CI +L  W  ++ K + KK  ++K E    +EAA           I  TP+     KD  +W     G 
Subjt:  CEEARVLIRGHW-MESISRSP-ADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWN-EEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGI

Query:  FSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKA----KTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW
        ++V++ YH+AK  +      ++ + S+ + W  +W+A        + ++ +WK +  I+P   N+ +R + + P C  C+   E   H +  C   + +W
Subjt:  FSVKSAYHLAKRYRIGSSSSKVSEGSSKEPWNNLWKA----KTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIW

Query:  INFIPEMETLLYTCK--REWTPTDCWDWMISNLNVEEIESTI----IIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARL
             ++    Y  +  REW         I   +  E E T+     + W IWKARN      ++F   +++   + I+Q+        +      KA +
Subjt:  INFIPEMETLLYTCK--REWTPTDCWDWMISNLNVEEIESTI----IIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARL

Query:  ESCESHG-----EWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLL---------------KARAILDGLKAVTSCDVNHRKMMTLETDSSEVVK
              G      W PP  N  K+N DA++  +     +      W  K++               +A+A  + L  + +  + +     +ETDS  +V+
Subjt:  ESCESHG-----EWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLL---------------KARAILDGLKAVTSCDVNHRKMMTLETDSSEVVK

Query:  NINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIA
         I      +++    +  I  +   +  V     PR  N +AH +A
Subjt:  NINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIA

A0A445CZL3 Uncharacterized protein1.4e-4024.91Show/hide
Query:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGD-WRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN
        E  G SGGL LLW E  ++ I  + + +I   I D +G  W     YG+P    R   W  + R +S    P +  GDFN++LS+EEK G  PK +  + 
Subjt:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGD-WRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN

Query:  NFADCIFRCNLVDTGCRGNKFT-WRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEA
         F   +    L+D   +G +FT +   R+   T+E++DR  +N        + S+  L   +SDH+P++ ++         +R+  N +FE  W   EE 
Subjt:  NFADCIFRCNLVDTGCRGNKFT-WRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEA

Query:  RVLIRGHWMESISRSPADLKEKITSCI---LKLKTWDRQRLKGSLK------------KAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVII
          ++R  W +   +    LKE     I    ++  W    + G  K            K +  + E W+     +    +   +I++TP+     +D++ 
Subjt:  RVLIRGHWMESISRSPADLKEKITSCI---LKLKTWDRQRLKGSLK------------KAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVII

Query:  WGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSK-EPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKI
        W     G +S+K+ Y+ A+R     +    S    K E W  +W+ +   + ++ +WK   DI+P   N+ KR +  +P C  C    E   H +  C  
Subjt:  WGEDMKGIFSVKSAYHLAKRYRIGSSSSKVSEGSSK-EPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKI

Query:  AKRIWINFIPEMETLLYTCKREWTP-----TDCWDWMISNL---------NVEE-IESTIIIMWNIWKARNFINTVKYLFVNDDL--------LKIIRDI
        A+  W           +  + +WTP     T   +W++  +         N E  I     +MW IWK RN       +F   ++         KI+  I
Subjt:  AKRIWINFIPEMETLLYTCKREWTP-----TDCWDWMISNL---------NVEE-IESTIIIMWNIWKARNFINTVKYLFVNDDL--------LKIIRDI

Query:  SQAREELCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLLKARAIL
             +    +++  N  K  L       +W PP +N  K N DA++      G I  V     I+  K R IL
Subjt:  SQAREELCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLLKARAIL

A0A5C7IW34 Uncharacterized protein1.4e-4033.23Show/hide
Query:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGD-WRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN
        ++VG SGGL LLWKE  ++ I SFS ++ID I+ D +G+ WRF GFY       R +SW LL   S LF+LPWL   DFNE+    EK  G  K   L++
Subjt:  EKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGD-WRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKLMN

Query:  NFADCIFRCNLVDTGCRGNKFTWRKSRHH-NATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNA----RFEENWVG
         F D +  C   D G  G+ FTW   R   NA  E LD      S        S+ HL++  SDH+P+L  ++F +++ +     P+       EE W+ 
Subjt:  NFADCIFRCNLVDTGCRGNKFTWRKSRHH-NATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNA----RFEENWVG

Query:  CEEARVLIRGHWMESISRSP-ADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKE--------------------------ETWNEEAARNGVSPQDYID
         ++   +I   W ES   S   D++EKI +C L L  W  Q+  G++K  +  K                             WN    RN   P D   
Subjt:  CEEARVLIRGHWMESISRSP-ADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKE--------------------------ETWNEEAARNGVSPQDYID

Query:  IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLA
        I+N P       D + W  D +G +SV+  Y LA
Subjt:  IMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLA

A0A6J1DUG8 uncharacterized protein LOC1110241351.5e-3938.98Show/hide
Query:  VAREKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKL
        V+    G+SGGLMLLW   +++ I S S  +ID II D  G WRFTGFYG+P    R  SW LL+R + + DLPW++GGDFNE++S  EK GG  +++  
Subjt:  VAREKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGGAPKSKKL

Query:  MNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTIN--TSHRRRRPNARFEENWVGC
        M               GC      W          ERLDR+ +N+SML +   + ++HL   +SDH+PILA   F+     T H++R+   RFEE+W+  
Subjt:  MNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTIN--TSHRRRRPNARFEENWVGC

Query:  EEARVLIRGHWMESISRSPADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEE
        +  R +I G W           + KI SC+ +L  W++ RL  SLK AI  KE+
Subjt:  EEARVLIRGHWMESISRSPADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein1.5e-0733.33Show/hide
Query:  EENRMNSWMLLDRFSS---LFDLPWLVGGDFNELLSEEEKWGGAPKSKKL--MNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNATKERLDRYFLN
        E  R + W  + R S+   L + PWLV GDFN++ S  E +   P +  L  + +   C+   +LVD  CRG  +TW   +  N    +LDR  +N
Subjt:  EENRMNSWMLLDRFSS---LFDLPWLVGGDFNELLSEEEKWGGAPKSKKL--MNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNATKERLDRYFLN

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.5e-0630.65Show/hide
Query:  NLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKR
        ++W  K   + KL +WK L + +P    ++ R + I PFC  C+   E   H+++ C  A+R
Subjt:  NLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKR

AT4G29090.1 Ribonuclease H-like superfamily protein1.4e-2127.37Show/hide
Query:  GPRGAKDVIIWGEDMKGIFSVKSAYH-LAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHRED
        G R   D   W     G ++VKS Y  L +     SS  +VSE S    +  +WK++T  + +  +WK L + +P    +  R +     C  C   +E 
Subjt:  GPRGAKDVIIWGEDMKGIFSVKSAYH-LAKRYRIGSSSSKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHRED

Query:  TIHVMWGCKIAKRIW-INFIPEMETLLYTCKREWTPTD----CWDWMISNLNVE-EIESTII--IMWNIWKARNFINTVKYLFVNDDLLKIIRDISQARE
          H+++ C  A+  W I+ IP           EW  +      W + + N N + E  S ++  ++W +WK RN +      F   ++L+   D  +   
Subjt:  TIHVMWGCKIAKRIW-INFIPEMETLLYTCKREWTPTD----CWDWMISNLNVE-EIESTII--IMWNIWKARNFINTVKYLFVNDDLLKIIRDISQARE

Query:  ELCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDASWNDNLEVGGIGWV--KEKWPIKLLKARAILDGLKAVTSCDVNHRKMMTL----------
             +   T  Q  R  SC   G W PP     K N DA+WN + E  GIGWV   EK  +K + ARA L  LK+V   ++   +   L          
Subjt:  ELCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDASWNDNLEVGGIGWV--KEKWPIKLLKARAILDGLKAVTSCDVNHRKMMTL----------

Query:  --ETDSSEVVKNINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIAR
          E+DS  +++ +N + E    L   ++ +  + S+   VKFV  PR  N++A  +AR
Subjt:  --ETDSSEVVKNINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCCGAGGGAGGGAACGAAAAACAACGATGGAAGCTGTACTCTAAATTTGAGGCCTAAAGATCAGAAGTCGTCGGAAAGGGAGGTCGGAGAGGCGGCCGGCGACCA
AGGAAAGAACCCAACGGGCAAAACTCGAAATTTGGAACCGGTGGATGGGACGTCACAGACCAAGGTCAGGGAGACAGGACAGGGCATGTATGTTGGAGAAGCCACGTGGA
GAGGGGGGACCCACTCTGATGAGAATGGAGGGTTGGCTGGGGACCTCGATAATAACATAGGAGAAGTTTTCTTGGATTTGATTAGTGGGCCGAAAACTTGTGGGCTGGTC
AATGGGCCAGAAAAGAAGTTGGGGCTGAATAGGGGTCTAGAAAAGGAAATGGTTGATAAGGGCTCATTTACCAATGTCAAACCCAATAAGCAAATCAAATGGATTGACGG
GGAAGAAGGAAGAGTTTCGACCCATTCAATTCTAAACCCAGTAACGGAGGAAGGAGCAGACAACAGGAAGCCGATTACAGGTGGTCCATATCACGAACCAAGGAACCAAA
AGGATAAAGAGAGAGACTTCGAGAGCTTGACCTATAAAGGAGAGGGGTCCAACCAGGTTCTCAGGGTTGAAATTGAGAGTCTTGGATCAGATTACTATCCAGCAGATACT
TCACGAAAAGCAGAGAGAGTAGAAAATAAAGCCTCTAGGAATTGGAAAAGAGTTGCTAGAGAGAAAGTGGGCAGAAGTGGCGGGTTAATGTTGTTGTGGAAGGAGCCGAC
TCATCTTTCGATTATCTCTTTCTCTAAAGCGAACATTGATGTCATTATCAAAGACATTCAAGGGGATTGGAGATTCACTGGTTTCTATGGAGATCCTGCGGAGGAGAATA
GAATGAATTCCTGGATGCTTTTGGATCGGTTTAGTAGTTTGTTCGATCTCCCTTGGCTGGTGGGGGGTGACTTCAACGAGCTGCTGTCAGAGGAAGAGAAATGGGGAGGG
GCACCCAAAAGTAAGAAACTCATGAATAACTTTGCTGACTGTATTTTTAGATGTAACTTGGTTGATACAGGTTGTAGGGGCAATAAATTCACATGGAGAAAAAGCAGACA
TCACAATGCAACCAAGGAACGTCTTGACAGGTATTTCTTAAATCAAAGCATGTTGATTCGCACCACGAAGGTCAGTATCTCTCATCTTACTTTTCATGCTTCTGATCATA
AGCCTATTCTTGCTCACCTCAAGTTTGATACAATTAATACTAGCCATAGGAGACGGAGACCTAATGCTCGTTTTGAGGAAAATTGGGTTGGTTGTGAGGAGGCTAGGGTG
CTGATAAGAGGTCACTGGATGGAAAGTATAAGTAGGAGTCCTGCCGATCTGAAAGAGAAAATAACTTCCTGTATTCTTAAGTTGAAAACCTGGGATAGACAAAGATTAAA
AGGGTCCTTAAAAAAGGCCATTCAAAGGAAAGAGGAGACTTGGAATGAAGAAGCGGCCAGAAATGGAGTTTCTCCTCAGGACTATATTGATATTATGAATACTCCTCTTG
GCCCAAGAGGGGCAAAGGATGTCATTATTTGGGGAGAGGATATGAAAGGGATTTTTTCGGTCAAAAGTGCATATCACTTGGCCAAAAGATACAGAATTGGCTCTTCCAGT
TCCAAAGTGAGTGAAGGGAGTTCCAAGGAGCCCTGGAACAATCTCTGGAAAGCCAAGACCTTATCTAGAGCAAAACTCTGTGTGTGGAAAGTGTTAGGAGATATCATCCC
TACCAAAATTAACATTATTAAAAGAGGTGTTGACATTAATCCCTTTTGCTGTTTTTGCAAGGTACACCGTGAGGACACAATCCATGTCATGTGGGGGTGCAAAATCGCCA
AAAGAATCTGGATCAACTTTATCCCCGAGATGGAGACTTTGCTTTATACCTGTAAAAGAGAATGGACGCCTACGGATTGTTGGGACTGGATGATTTCAAACCTCAATGTG
GAGGAGATAGAATCGACCATCATCATCATGTGGAACATTTGGAAAGCCAGGAACTTTATAAACACTGTTAAATATTTGTTTGTAAATGATGATCTGCTGAAAATTATCAG
GGACATTTCACAGGCTAGGGAGGAGCTATGTTCGTTGGACCGACGAAACACAAATTTCCAAAAAGCAAGATTGGAGAGCTGTGAGAGTCATGGAGAATGGACCCCCCCTG
AGGCAAACACGTGGAAACTCAATTGCGACGCCTCTTGGAATGATAATCTAGAGGTTGGTGGGATTGGATGGGTCAAAGAGAAGTGGCCAATCAAACTGCTTAAAGCCAGA
GCTATTCTGGATGGTCTTAAAGCGGTGACAAGTTGCGATGTCAATCACCGAAAGATGATGACATTGGAAACTGACTCAAGCGAAGTAGTAAAGAACATCAATGGGGAAGC
GGAAGGCATGTCGGAATTGTATAACTTTGTTGAGGCGATTGCGAATGTGGAGAGTCGTTCTCTTATTGTTAAGTTTGTAAAGTGCCCTAGATCTAGCAATAGTGTAGCTC
ATAACATTGCTAGGGGGTGTGTATTCATGGTGATTTTCAGGGTCCTTTTAGCTCCCCTCTTGTTGAAGGAGCTTTTGAAGTTGTTTTTGGTGAGTACCCGTCTTGGGTCT
CCAAGTTGTTGCCTGCGGGCTGTCTCCCCAACTCTTCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGCCGAGGGAGGGAACGAAAAACAACGATGGAAGCTGTACTCTAAATTTGAGGCCTAAAGATCAGAAGTCGTCGGAAAGGGAGGTCGGAGAGGCGGCCGGCGACCA
AGGAAAGAACCCAACGGGCAAAACTCGAAATTTGGAACCGGTGGATGGGACGTCACAGACCAAGGTCAGGGAGACAGGACAGGGCATGTATGTTGGAGAAGCCACGTGGA
GAGGGGGGACCCACTCTGATGAGAATGGAGGGTTGGCTGGGGACCTCGATAATAACATAGGAGAAGTTTTCTTGGATTTGATTAGTGGGCCGAAAACTTGTGGGCTGGTC
AATGGGCCAGAAAAGAAGTTGGGGCTGAATAGGGGTCTAGAAAAGGAAATGGTTGATAAGGGCTCATTTACCAATGTCAAACCCAATAAGCAAATCAAATGGATTGACGG
GGAAGAAGGAAGAGTTTCGACCCATTCAATTCTAAACCCAGTAACGGAGGAAGGAGCAGACAACAGGAAGCCGATTACAGGTGGTCCATATCACGAACCAAGGAACCAAA
AGGATAAAGAGAGAGACTTCGAGAGCTTGACCTATAAAGGAGAGGGGTCCAACCAGGTTCTCAGGGTTGAAATTGAGAGTCTTGGATCAGATTACTATCCAGCAGATACT
TCACGAAAAGCAGAGAGAGTAGAAAATAAAGCCTCTAGGAATTGGAAAAGAGTTGCTAGAGAGAAAGTGGGCAGAAGTGGCGGGTTAATGTTGTTGTGGAAGGAGCCGAC
TCATCTTTCGATTATCTCTTTCTCTAAAGCGAACATTGATGTCATTATCAAAGACATTCAAGGGGATTGGAGATTCACTGGTTTCTATGGAGATCCTGCGGAGGAGAATA
GAATGAATTCCTGGATGCTTTTGGATCGGTTTAGTAGTTTGTTCGATCTCCCTTGGCTGGTGGGGGGTGACTTCAACGAGCTGCTGTCAGAGGAAGAGAAATGGGGAGGG
GCACCCAAAAGTAAGAAACTCATGAATAACTTTGCTGACTGTATTTTTAGATGTAACTTGGTTGATACAGGTTGTAGGGGCAATAAATTCACATGGAGAAAAAGCAGACA
TCACAATGCAACCAAGGAACGTCTTGACAGGTATTTCTTAAATCAAAGCATGTTGATTCGCACCACGAAGGTCAGTATCTCTCATCTTACTTTTCATGCTTCTGATCATA
AGCCTATTCTTGCTCACCTCAAGTTTGATACAATTAATACTAGCCATAGGAGACGGAGACCTAATGCTCGTTTTGAGGAAAATTGGGTTGGTTGTGAGGAGGCTAGGGTG
CTGATAAGAGGTCACTGGATGGAAAGTATAAGTAGGAGTCCTGCCGATCTGAAAGAGAAAATAACTTCCTGTATTCTTAAGTTGAAAACCTGGGATAGACAAAGATTAAA
AGGGTCCTTAAAAAAGGCCATTCAAAGGAAAGAGGAGACTTGGAATGAAGAAGCGGCCAGAAATGGAGTTTCTCCTCAGGACTATATTGATATTATGAATACTCCTCTTG
GCCCAAGAGGGGCAAAGGATGTCATTATTTGGGGAGAGGATATGAAAGGGATTTTTTCGGTCAAAAGTGCATATCACTTGGCCAAAAGATACAGAATTGGCTCTTCCAGT
TCCAAAGTGAGTGAAGGGAGTTCCAAGGAGCCCTGGAACAATCTCTGGAAAGCCAAGACCTTATCTAGAGCAAAACTCTGTGTGTGGAAAGTGTTAGGAGATATCATCCC
TACCAAAATTAACATTATTAAAAGAGGTGTTGACATTAATCCCTTTTGCTGTTTTTGCAAGGTACACCGTGAGGACACAATCCATGTCATGTGGGGGTGCAAAATCGCCA
AAAGAATCTGGATCAACTTTATCCCCGAGATGGAGACTTTGCTTTATACCTGTAAAAGAGAATGGACGCCTACGGATTGTTGGGACTGGATGATTTCAAACCTCAATGTG
GAGGAGATAGAATCGACCATCATCATCATGTGGAACATTTGGAAAGCCAGGAACTTTATAAACACTGTTAAATATTTGTTTGTAAATGATGATCTGCTGAAAATTATCAG
GGACATTTCACAGGCTAGGGAGGAGCTATGTTCGTTGGACCGACGAAACACAAATTTCCAAAAAGCAAGATTGGAGAGCTGTGAGAGTCATGGAGAATGGACCCCCCCTG
AGGCAAACACGTGGAAACTCAATTGCGACGCCTCTTGGAATGATAATCTAGAGGTTGGTGGGATTGGATGGGTCAAAGAGAAGTGGCCAATCAAACTGCTTAAAGCCAGA
GCTATTCTGGATGGTCTTAAAGCGGTGACAAGTTGCGATGTCAATCACCGAAAGATGATGACATTGGAAACTGACTCAAGCGAAGTAGTAAAGAACATCAATGGGGAAGC
GGAAGGCATGTCGGAATTGTATAACTTTGTTGAGGCGATTGCGAATGTGGAGAGTCGTTCTCTTATTGTTAAGTTTGTAAAGTGCCCTAGATCTAGCAATAGTGTAGCTC
ATAACATTGCTAGGGGGTGTGTATTCATGGTGATTTTCAGGGTCCTTTTAGCTCCCCTCTTGTTGAAGGAGCTTTTGAAGTTGTTTTTGGTGAGTACCCGTCTTGGGTCT
CCAAGTTGTTGCCTGCGGGCTGTCTCCCCAACTCTTCTCTAG
Protein sequenceShow/hide protein sequence
MLPREGTKNNDGSCTLNLRPKDQKSSEREVGEAAGDQGKNPTGKTRNLEPVDGTSQTKVRETGQGMYVGEATWRGGTHSDENGGLAGDLDNNIGEVFLDLISGPKTCGLV
NGPEKKLGLNRGLEKEMVDKGSFTNVKPNKQIKWIDGEEGRVSTHSILNPVTEEGADNRKPITGGPYHEPRNQKDKERDFESLTYKGEGSNQVLRVEIESLGSDYYPADT
SRKAERVENKASRNWKRVAREKVGRSGGLMLLWKEPTHLSIISFSKANIDVIIKDIQGDWRFTGFYGDPAEENRMNSWMLLDRFSSLFDLPWLVGGDFNELLSEEEKWGG
APKSKKLMNNFADCIFRCNLVDTGCRGNKFTWRKSRHHNATKERLDRYFLNQSMLIRTTKVSISHLTFHASDHKPILAHLKFDTINTSHRRRRPNARFEENWVGCEEARV
LIRGHWMESISRSPADLKEKITSCILKLKTWDRQRLKGSLKKAIQRKEETWNEEAARNGVSPQDYIDIMNTPLGPRGAKDVIIWGEDMKGIFSVKSAYHLAKRYRIGSSS
SKVSEGSSKEPWNNLWKAKTLSRAKLCVWKVLGDIIPTKINIIKRGVDINPFCCFCKVHREDTIHVMWGCKIAKRIWINFIPEMETLLYTCKREWTPTDCWDWMISNLNV
EEIESTIIIMWNIWKARNFINTVKYLFVNDDLLKIIRDISQAREELCSLDRRNTNFQKARLESCESHGEWTPPEANTWKLNCDASWNDNLEVGGIGWVKEKWPIKLLKAR
AILDGLKAVTSCDVNHRKMMTLETDSSEVVKNINGEAEGMSELYNFVEAIANVESRSLIVKFVKCPRSSNSVAHNIARGCVFMVIFRVLLAPLLLKELLKLFLVSTRLGS
PSCCLRAVSPTLL