; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019975 (gene) of Snake gourd v1 genome

Gene IDTan0019975
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG05:5492147..5494352
RNA-Seq ExpressionTan0019975
SyntenyTan0019975
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa]4.0e-5226.94Show/hide
Query:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL
        +RL ++   ++  G V+ ++   +EE ++     +  K++  K  N +  +N +  +W L     + E + +N+F   F + +D++RV  GGPW+FD+ L
Subjt:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL

Query:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK
        I F +  G   I  ++F +  FW++ +++P  C T   A   G  +G  E +      +    T++VR RM+IT PL+R +++ V   G EV +   YE 
Subjt:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK

Query:  LPDFCFGCGKLGHLAKDCD-SDEGGLKANL---QFEDWLKTSSRVGGEESPRGGDLRNKVQGR---GRGRGPR----AYNGRGPRGDDPPEEIGEMDKLE
        LP+FCF CG +GH A DC   D GG        ++  W+   S    +         N    R     G   R       GR        EE+     LE
Subjt:  LPDFCFGCGKLGHLAKDCD-SDEGGLKANL---QFEDWLKTSSRVGGEESPRGGDLRNKVQGR---GRGRGPR----AYNGRGPRGDDPPEEIGEMDKLE

Query:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGA---GTSNNSGTQ-------DSDVVASKLISGTKETNKPLG----------FDIDGPE
        A  +E   +A      H+G+     + +VH     VQ++  G    G S N+          D  V+   L   T    K  G              G  
Subjt:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGA---GTSNNSGTQ-------DSDVVASKLISGTKETNKPLG----------FDIDGPE

Query:  KNLTEG-----LVSDEDLTHKENVSKGLPQGATT---AEVDERMGLANVGLGIEVG-----------------------PPTRTSLGHQLSPGSIKIEKK
        K + EG     +VSD  +        G+ +         VD+  G+ ++   +EV                          +    G +  P SI I  K
Subjt:  KNLTEG-----LVSDEDLTHKENVSKGLPQGATT---AEVDERMGLANVGLGIEVG-----------------------PPTRTSLGHQLSPGSIKIEKK

Query:  DLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGS-----HQSSGSEPRISRFRGGKHGYN-----------FG--SEDMEAILKFENAFE
         L  V+ +        K+        D ++  +N K  R+   S     +   G+   ++  R     Y+           +G  +E +   + F N+F 
Subjt:  DLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGS-----HQSSGSEPRISRFRGGKHGYN-----------FG--SEDMEAILKFENAFE

Query:  VPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLV
        V   G+SGGL++LW     + + S+S GHID ++       WRFTGFYG P    RIDSW LL RL  LFDLPW+ GGDFNE+LS  EK GG  ++ S +
Subjt:  VPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLV

Query:  DSFANCIFNCKLADAEIEIQ
          F   +  C L D   E Q
Subjt:  DSFANCIFNCKLADAEIEIQ

KAF4372682.1 hypothetical protein F8388_000849, partial [Cannabis sativa]6.7e-5526.12Show/hide
Query:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL
        +RL  +   ++  G V+ ++   +EE ++     +  K++  K  N +  +N +  +W L     + E + +N+F   F + +D++RV  GGPW+FD+ L
Subjt:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL

Query:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK
        I F +  G   I  ++F +  FW++ +++P  C T   A   G  +G  E +      +    T++VR RM+IT PL+R +++ V   G EV +   YE 
Subjt:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK

Query:  LPDFCFGCGKLGHLAKDCD-SDEGG---LKANLQFEDWLKTSS-----RVGGEESPRGGDLRNKVQGRGRGRGPRA--YNGRGPRGDDPPEEIGEMDKLE
        LPDFCF CG +GH A DC   D GG        ++  W+   S     R   ++          +   G      A    GR        EE+     LE
Subjt:  LPDFCFGCGKLGHLAKDCD-SDEGG---LKANLQFEDWLKTSS-----RVGGEESPRGGDLRNKVQGRGRGRGPRA--YNGRGPRGDDPPEEIGEMDKLE

Query:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVS
        A  +E   +A      H+G                   +   +G +   G Q S +V ++ + G  + N  +G  +   E  +  G ++D++    +N  
Subjt:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVS

Query:  KG--------LPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKK---WKRI
        +G        L        V E     +V   + VG  T + +G   +    ++   D G   K   +  +++++  T+ +    E  I  +K   WKR+
Subjt:  KG--------LPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKK---WKRI

Query:  -------ARGSHQSSGSEPRISRF--------------RGGKHG---------------------------------YNFGSEDMEAILKFENAFEVPRA
               ++G   S    P++S                 GGK                                   Y   +E +   + F N+F V   
Subjt:  -------ARGSHQSSGSEPRISRF--------------RGGKHG---------------------------------YNFGSEDMEAILKFENAFEVPRA

Query:  GRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFA
        G+SGGL++LW     + + S+S GHID ++       WRFTGFYG P    RI+SW LL RL  LFDLPW+ GGDFNE+LS  EK GG  ++ S +  F 
Subjt:  GRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFA

Query:  NCIFNCKLADAEIEIQ
          +  C L D   E Q
Subjt:  NCIFNCKLADAEIEIQ

KAF4381998.1 hypothetical protein G4B88_006630 [Cannabis sativa]1.5e-5127.99Show/hide
Query:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL
        NRLA + + ++  G V+ ++   +EE +K+    +  K++  +  N +  +  +  +W +     + E + +NIF   F   +D++RV  GGPW+ D+ L
Subjt:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL

Query:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK
        I F +  G   I  ++F +  FW++ +++P  C T   A   G  +G  E +      +    T++VR RM+I  PL+R +++ +   G EV +   Y+ 
Subjt:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK

Query:  LPDFCFGCGKLGHLAKDCD-SDEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKLEAAGKEVRRVA
        LPDFC+ CG +GH A DC   D  G   N QF+     S        PR     NK       R P +   R      P   +GE  K+ +A +  R   
Subjt:  LPDFCFGCGKLGHLAKDCD-SDEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKLEAAGKEVRRVA

Query:  KEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHK--------ENVSKG
          +T   +         +V +G  + +    G G    SG +      S++ S             DG +   +EG   + +   K        E+V+ G
Subjt:  KEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHK--------ENVSKG

Query:  LPQGATTAEVD--ERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQID-NKKWKRIARGSHQSSGS
        L +   + EV   E    ++VG+ +E         G   S    K   K  G ++ S QE   E+        T +  ++   +K WKR    + +SS  
Subjt:  LPQGATTAEVD--ERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQID-NKKWKRIARGSHQSSGS

Query:  EPRISRFRGGKHGYNFGSEDMEAILKFENAFE--VPRAG--------------RSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPS
        +P+ S F          S  +  ++   NA     PR+G               SGGL++LW     + + S++ GHID ++    +  WRFTGFYG P 
Subjt:  EPRISRFRGGKHGYNFGSEDMEAILKFENAFE--VPRAG--------------RSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPS

Query:  GEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLADAEIEIQ
           R +SW LL RL  LFDLPW+ GGDFNE+LS  EK GG+ ++ S +  F N +  C LAD   E Q
Subjt:  GEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLADAEIEIQ

OMO61345.1 reverse transcriptase [Corchorus capsularis]4.3e-5428.37Show/hide
Query:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW
        M EG+ ++   N NL EEE   V  VD   ++E   +  + +  K+L+ + +NVEV +N++  +W L G + +     N+F+  F +  +K+RV    PW
Subjt:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW

Query:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV
        +F++ L+V +       ++ +      FW   HDLP           +G S G  E +++ G     G+ LR R R+++T+PLRR + L   + G ++ +
Subjt:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV

Query:  QISYEKLPDFCFGCGKLGHLAKDCDS------DEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKL
           YEKLPDFC+ CG L H+  +C+       D+G  K   ++  WL+        E PR   ++    G    R  R                 E +K 
Subjt:  QISYEKLPDFCFGCGKLGHLAKDCDS------DEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKL

Query:  EAAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENV
        ++  K  RR  +   G    + L+ +    ++  +  Q +  G   +N            + ++G            D  +K L +   SD  +  K+ V
Subjt:  EAAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENV

Query:  -SKGLPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIA-RGSHQSS
         SKG    + + +V    GLA  G                           D   V K+  E              G S+I+   KKWKR A   S    
Subjt:  -SKGLPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIA-RGSHQSS

Query:  GSEPRISRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIINDDKG--PWRFTGFYGEPSGEKRIDSWALLNR
        GS  ++ +  G K   + G + MEA  K +   EV    RSGGL +LWK    + I SYS  H D I+ D KG  PWRFTGFYG P   +R +SW L+  
Subjt:  GSEPRISRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIINDDKG--PWRFTGFYGEPSGEKRIDSWALLNR

Query:  LSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLADAEI
        L     LPW++GGDFNE++   EK GG+ +  S V  F N I +C+L   ++
Subjt:  LSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLADAEI

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]3.4e-6728.82Show/hide
Query:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW
        M+    S +   L+L +++ G +  +     E  E+     +  K +T K IN E FK+ +  IW  +  VT+E    NIF   F+N  D+KR++ GGPW
Subjt:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW

Query:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV
         FD+ L+V  E  G+  +  L FRY  FW+  H+LP  C  R+    LG  +G  + +++   G C GQ +R+R  +D+  PL+R +++ +G   +   V
Subjt:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV

Query:  QISYEKLPDFCFGCGKLGHLAKDC--DSDEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKLEAAG
         I YE+LP+FC+ CGK+GHL +DC  ++ E    ++ +F  W++  SR                  R +G G +  +  G R      E G  D LE   
Subjt:  QISYEKLPDFCFGCGKLGHLAKDC--DSDEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKLEAAG

Query:  KEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVSKGL
                          ++  TK     D+ V   DG          +  D++       T ET   +   +   ++ L +   S      KE +++  
Subjt:  KEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVSKGL

Query:  PQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGSHQSSGSEPRI
         Q +   E    +              T   +G  +S     +  +D G  I                          + K+WKR+AR   +  GS    
Subjt:  PQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGSHQSSGSEPRI

Query:  SRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIND-DKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDL
        ++  G K G +   E+     K  + F V R G+ GGL +LWK+ + + I S++KGHID +I D D   WRFTGFYGEP    R+ SW+LL RL  + +L
Subjt:  SRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIND-DKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDL

Query:  PWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLAD
        PW+V GDFNE+L  +EK GG  ++ + + SF   + +C L D
Subjt:  PWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLAD

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase2.1e-5428.37Show/hide
Query:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW
        M EG+ ++   N NL EEE   V  VD   ++E   +  + +  K+L+ + +NVEV +N++  +W L G + +     N+F+  F +  +K+RV    PW
Subjt:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW

Query:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV
        +F++ L+V +       ++ +      FW   HDLP           +G S G  E +++ G     G+ LR R R+++T+PLRR + L   + G ++ +
Subjt:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV

Query:  QISYEKLPDFCFGCGKLGHLAKDCDS------DEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKL
           YEKLPDFC+ CG L H+  +C+       D+G  K   ++  WL+        E PR   ++    G    R  R                 E +K 
Subjt:  QISYEKLPDFCFGCGKLGHLAKDCDS------DEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKL

Query:  EAAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENV
        ++  K  RR  +   G    + L+ +    ++  +  Q +  G   +N            + ++G            D  +K L +   SD  +  K+ V
Subjt:  EAAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENV

Query:  -SKGLPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIA-RGSHQSS
         SKG    + + +V    GLA  G                           D   V K+  E              G S+I+   KKWKR A   S    
Subjt:  -SKGLPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIA-RGSHQSS

Query:  GSEPRISRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIINDDKG--PWRFTGFYGEPSGEKRIDSWALLNR
        GS  ++ +  G K   + G + MEA  K +   EV    RSGGL +LWK    + I SYS  H D I+ D KG  PWRFTGFYG P   +R +SW L+  
Subjt:  GSEPRISRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIINDDKG--PWRFTGFYGEPSGEKRIDSWALLNR

Query:  LSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLADAEI
        L     LPW++GGDFNE++   EK GG+ +  S V  F N I +C+L   ++
Subjt:  LSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLADAEI

A0A2N9IXX1 Reverse transcriptase domain-containing protein2.3e-5327.35Show/hide
Query:  EYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGLIVFEELKGALNIKALDFRYARFWVNF
        E E+ +   +A + +T + +N+E        +W  E   T      NI V  F N  D +RV+   PWS+D+ L+ F+ ++   +I  ++ R+  FWV  
Subjt:  EYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGLIVFEELKGALNIKALDFRYARFWVNF

Query:  HDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEKLPDFCFGCGKLGHLAKDCDSDEGGLK
        H+LP    + + A ALG ++G+ E V  D   R     +R+R ++DI++PL R  K  + + G+E+W+   YE+LP+FC+ CG L H  +DC        
Subjt:  HDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEKLPDFCFGCGKLGHLAKDCDSDEGGLK

Query:  ANLQFEDWLKTSSRVGGEESPRGGDLRNKVQ-------------------GRGRGRGPRAYNG-RGPRGDDPPEEIGEMDKLEAAGKEVRRVAKEITGPH
             E WL++   +  EE   G  LR  V                    GRG  R P  ++    P G     E+GE        KE      E   P 
Subjt:  ANLQFEDWLKTSSRVGGEESPRGGDLRNKVQ-------------------GRGRGRGPRAYNG-RGPRGDDPPEEIGEMDKLEAAGKEVRRVAKEITGPH

Query:  VGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVSKGLPQ------------
                 +TVH+ D+ +QS               + V +  +I  T   N P+     G  K  TE  VS E+     ++    P             
Subjt:  VGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVSKGLPQ------------

Query:  ----GATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKK---WKRIAR-------
            G    +V  R  L ++G         +T L H         ++K +G     P  K+    I+  E + G  E+++  +    WKRI+R       
Subjt:  ----GATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKK---WKRIAR-------

Query:  -GSHQSSGSEPRISRFRGGKHGYNFGS-----ED----------------------------MEAI---LKFENAFEVPRAGRSGGLMMLWKSSVHLFIS
          +H S     R   +   +H    GS     ED                            ME I   L+F+  F VP   RSGGL M+W   V L I 
Subjt:  -GSHQSSGSEPRISRFRGGKHGYNFGS-----ED----------------------------MEAI---LKFENAFEVPRAGRSGGLMMLWKSSVHLFIS

Query:  SYSKGHIDTIINDDKG-PWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLAD
        ++S+ HID  +  + G  WRFTGFYG P G ++ +SWALL+ L+ +  LPWL  GD+NE+LS +E++G  + +   +  F + I  C   D
Subjt:  SYSKGHIDTIINDDKG-PWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLAD

A0A5C7H9Y2 CCHC-type domain-containing protein1.6e-6728.82Show/hide
Query:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW
        M+    S +   L+L +++ G +  +     E  E+     +  K +T K IN E FK+ +  IW  +  VT+E    NIF   F+N  D+KR++ GGPW
Subjt:  MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPW

Query:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV
         FD+ L+V  E  G+  +  L FRY  FW+  H+LP  C  R+    LG  +G  + +++   G C GQ +R+R  +D+  PL+R +++ +G   +   V
Subjt:  SFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWV

Query:  QISYEKLPDFCFGCGKLGHLAKDC--DSDEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKLEAAG
         I YE+LP+FC+ CGK+GHL +DC  ++ E    ++ +F  W++  SR                  R +G G +  +  G R      E G  D LE   
Subjt:  QISYEKLPDFCFGCGKLGHLAKDC--DSDEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKLEAAG

Query:  KEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVSKGL
                          ++  TK     D+ V   DG          +  D++       T ET   +   +   ++ L +   S      KE +++  
Subjt:  KEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVSKGL

Query:  PQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGSHQSSGSEPRI
         Q +   E    +              T   +G  +S     +  +D G  I                          + K+WKR+AR   +  GS    
Subjt:  PQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGSHQSSGSEPRI

Query:  SRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIND-DKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDL
        ++  G K G +   E+     K  + F V R G+ GGL +LWK+ + + I S++KGHID +I D D   WRFTGFYGEP    R+ SW+LL RL  + +L
Subjt:  SRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIND-DKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDL

Query:  PWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLAD
        PW+V GDFNE+L  +EK GG  ++ + + SF   + +C L D
Subjt:  PWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLAD

A0A7J6DZ24 CCHC-type domain-containing protein2.0e-5226.94Show/hide
Query:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL
        +RL ++   ++  G V+ ++   +EE ++     +  K++  K  N +  +N +  +W L     + E + +N+F   F + +D++RV  GGPW+FD+ L
Subjt:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL

Query:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK
        I F +  G   I  ++F +  FW++ +++P  C T   A   G  +G  E +      +    T++VR RM+IT PL+R +++ V   G EV +   YE 
Subjt:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK

Query:  LPDFCFGCGKLGHLAKDCD-SDEGGLKANL---QFEDWLKTSSRVGGEESPRGGDLRNKVQGR---GRGRGPR----AYNGRGPRGDDPPEEIGEMDKLE
        LP+FCF CG +GH A DC   D GG        ++  W+   S    +         N    R     G   R       GR        EE+     LE
Subjt:  LPDFCFGCGKLGHLAKDCD-SDEGGLKANL---QFEDWLKTSSRVGGEESPRGGDLRNKVQGR---GRGRGPR----AYNGRGPRGDDPPEEIGEMDKLE

Query:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGA---GTSNNSGTQ-------DSDVVASKLISGTKETNKPLG----------FDIDGPE
        A  +E   +A      H+G+     + +VH     VQ++  G    G S N+          D  V+   L   T    K  G              G  
Subjt:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGA---GTSNNSGTQ-------DSDVVASKLISGTKETNKPLG----------FDIDGPE

Query:  KNLTEG-----LVSDEDLTHKENVSKGLPQGATT---AEVDERMGLANVGLGIEVG-----------------------PPTRTSLGHQLSPGSIKIEKK
        K + EG     +VSD  +        G+ +         VD+  G+ ++   +EV                          +    G +  P SI I  K
Subjt:  KNLTEG-----LVSDEDLTHKENVSKGLPQGATT---AEVDERMGLANVGLGIEVG-----------------------PPTRTSLGHQLSPGSIKIEKK

Query:  DLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGS-----HQSSGSEPRISRFRGGKHGYN-----------FG--SEDMEAILKFENAFE
         L  V+ +        K+        D ++  +N K  R+   S     +   G+   ++  R     Y+           +G  +E +   + F N+F 
Subjt:  DLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGS-----HQSSGSEPRISRFRGGKHGYN-----------FG--SEDMEAILKFENAFE

Query:  VPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLV
        V   G+SGGL++LW     + + S+S GHID ++       WRFTGFYG P    RIDSW LL RL  LFDLPW+ GGDFNE+LS  EK GG  ++ S +
Subjt:  VPRAGRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLV

Query:  DSFANCIFNCKLADAEIEIQ
          F   +  C L D   E Q
Subjt:  DSFANCIFNCKLADAEIEIQ

A0A7J6FPV7 CCHC-type domain-containing protein3.2e-5526.12Show/hide
Query:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL
        +RL  +   ++  G V+ ++   +EE ++     +  K++  K  N +  +N +  +W L     + E + +N+F   F + +D++RV  GGPW+FD+ L
Subjt:  NRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTI-EAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGL

Query:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK
        I F +  G   I  ++F +  FW++ +++P  C T   A   G  +G  E +      +    T++VR RM+IT PL+R +++ V   G EV +   YE 
Subjt:  IVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEK

Query:  LPDFCFGCGKLGHLAKDCD-SDEGG---LKANLQFEDWLKTSS-----RVGGEESPRGGDLRNKVQGRGRGRGPRA--YNGRGPRGDDPPEEIGEMDKLE
        LPDFCF CG +GH A DC   D GG        ++  W+   S     R   ++          +   G      A    GR        EE+     LE
Subjt:  LPDFCFGCGKLGHLAKDCD-SDEGG---LKANLQFEDWLKTSS-----RVGGEESPRGGDLRNKVQGRGRGRGPRA--YNGRGPRGDDPPEEIGEMDKLE

Query:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVS
        A  +E   +A      H+G                   +   +G +   G Q S +V ++ + G  + N  +G  +   E  +  G ++D++    +N  
Subjt:  AAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNKVQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVS

Query:  KG--------LPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKK---WKRI
        +G        L        V E     +V   + VG  T + +G   +    ++   D G   K   +  +++++  T+ +    E  I  +K   WKR+
Subjt:  KG--------LPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIKIEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKK---WKRI

Query:  -------ARGSHQSSGSEPRISRF--------------RGGKHG---------------------------------YNFGSEDMEAILKFENAFEVPRA
               ++G   S    P++S                 GGK                                   Y   +E +   + F N+F V   
Subjt:  -------ARGSHQSSGSEPRISRF--------------RGGKHG---------------------------------YNFGSEDMEAILKFENAFEVPRA

Query:  GRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFA
        G+SGGL++LW     + + S+S GHID ++       WRFTGFYG P    RI+SW LL RL  LFDLPW+ GGDFNE+LS  EK GG  ++ S +  F 
Subjt:  GRSGGLMMLWKSSVHLFISSYSKGHIDTIIN-DDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFA

Query:  NCIFNCKLADAEIEIQ
          +  C L D   E Q
Subjt:  NCIFNCKLADAEIEIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding1.8e-0523.4Show/hide
Query:  FRNTKDKKRVINGGPWSFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLR
        F++ +    ++  GPWSF+  + V +     L+  A +F+   FW+    +P    T +   ++G  +G+F  +E+                        
Subjt:  FRNTKDKKRVINGGPWSFDRGLIVFEELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLR

Query:  RAIKLKVGSMGEEVWV-QISYEKLPDFCFGCGKLGHLAKDC
                ++G +V V +  YEKL +FC  CG L H A +C
Subjt:  RAIKLKVGSMGEEVWV-QISYEKLPDFCFGCGKLGHLAKDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGAAGGAATATTTAGTAATAGATTAGCAAACTTGAATCTTCAGGAAGAGGAACGAGGAGGGGTTGTGGAAGTGGACGATGATGAGCTTGAGGAGTATGAGAAGAA
AAATCAAAGTGAAGTAGCCTGTAAAATTCTCACAACGAAGACTATAAATGTCGAAGTCTTCAAGAACATTGTACCAAAAATCTGGAATTTAGAGGGCAATGTCACGATCG
AAGCAGCATGGCGGAACATATTTGTCTGCTCTTTCAGAAACACAAAGGATAAAAAGCGCGTTATCAATGGAGGCCCCTGGAGCTTCGATAGAGGACTAATAGTGTTTGAA
GAACTCAAAGGAGCTCTAAACATCAAAGCGCTTGATTTCAGGTACGCACGTTTTTGGGTTAATTTCCATGATTTACCAAGAGTTTGTTTCACCAGGAAGAAAGCAGAGGC
ATTAGGCAACTCGCTCGGCATCTTTGAGGGTGTTGAATCTGATGGGCTAGGAAGATGCAGTGGACAAACTCTGAGGGTGCGTTTTAGGATGGACATCACTCGGCCTTTGC
GAAGGGCAATAAAGTTGAAGGTGGGCTCAATGGGAGAAGAGGTATGGGTTCAAATCAGTTATGAGAAACTCCCAGATTTTTGCTTTGGTTGTGGTAAGTTGGGACATCTA
GCTAAAGACTGTGATTCTGATGAGGGCGGGCTAAAGGCCAATCTCCAATTTGAAGATTGGCTAAAAACGAGTTCCCGAGTGGGTGGAGAAGAAAGCCCAAGAGGCGGAGA
CCTGAGAAACAAAGTTCAGGGCAGAGGAAGAGGAAGGGGTCCGCGAGCATACAATGGAAGGGGTCCGAGGGGAGACGACCCGCCAGAAGAGATCGGGGAGATGGACAAAC
TGGAAGCTGCTGGGAAAGAGGTTCGACGAGTGGCGAAAGAGATAACCGGACCCCATGTCGGCGCCTGTCTGCGAGCCCAAACGAAAACTGTGCATGATGGTGACAACAAG
GTGCAAAGCAATGATGGTGGGGCAGGCACAAGTAACAATAGTGGGACCCAGGACAGTGACGTGGTCGCTTCAAAGTTGATAAGTGGAACGAAAGAGACTAACAAACCTCT
TGGGTTCGATATTGATGGGCCTGAGAAGAACTTGACAGAGGGGCTGGTGTCGGATGAAGACCTTACTCATAAGGAAAATGTCTCTAAAGGGCTTCCACAAGGGGCCACTA
CGGCCGAAGTTGATGAAAGGATGGGCCTGGCAAACGTGGGCCTCGGTATTGAAGTTGGTCCCCCTACAAGAACAAGTCTCGGGCATCAGCTTTCTCCCGGGTCTATTAAA
ATAGAAAAGAAGGACTTGGGCTCTGTCATTAAGAGCCCACAAGAAAAAGATACCGAGATTAAAATCCAATTAACTGAAATCGTAACAGGGGACTCAGAAATTCAGATCGA
CAACAAAAAATGGAAGAGGATAGCTAGAGGAAGTCACCAAAGTTCGGGGAGTGAGCCTCGAATTAGCAGATTCAGAGGGGGCAAACATGGATACAATTTCGGAAGTGAAG
ATATGGAAGCGATACTGAAGTTTGAGAATGCTTTTGAAGTTCCAAGGGCTGGGAGGAGTGGTGGCCTTATGATGTTGTGGAAAAGTAGTGTCCATCTCTTTATCTCTTCT
TACTCTAAAGGGCATATCGACACCATTATAAATGATGATAAGGGGCCCTGGCGCTTCACAGGTTTCTATGGGGAGCCTTCTGGGGAAAAAAGAATCGACTCTTGGGCTCT
CCTCAATCGATTGAGCATGCTTTTTGACCTCCCGTGGTTGGTGGGAGGTGATTTTAACGAGCTTCTGTCTGATGAGGAAAAAGCGGGAGGGGCTGCTAAAAATAAAAGTC
TTGTGGACAGTTTTGCAAATTGCATATTCAACTGCAAGCTTGCCGATGCTGAGATTGAAATCCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACGAAGGAATATTTAGTAATAGATTAGCAAACTTGAATCTTCAGGAAGAGGAACGAGGAGGGGTTGTGGAAGTGGACGATGATGAGCTTGAGGAGTATGAGAAGAA
AAATCAAAGTGAAGTAGCCTGTAAAATTCTCACAACGAAGACTATAAATGTCGAAGTCTTCAAGAACATTGTACCAAAAATCTGGAATTTAGAGGGCAATGTCACGATCG
AAGCAGCATGGCGGAACATATTTGTCTGCTCTTTCAGAAACACAAAGGATAAAAAGCGCGTTATCAATGGAGGCCCCTGGAGCTTCGATAGAGGACTAATAGTGTTTGAA
GAACTCAAAGGAGCTCTAAACATCAAAGCGCTTGATTTCAGGTACGCACGTTTTTGGGTTAATTTCCATGATTTACCAAGAGTTTGTTTCACCAGGAAGAAAGCAGAGGC
ATTAGGCAACTCGCTCGGCATCTTTGAGGGTGTTGAATCTGATGGGCTAGGAAGATGCAGTGGACAAACTCTGAGGGTGCGTTTTAGGATGGACATCACTCGGCCTTTGC
GAAGGGCAATAAAGTTGAAGGTGGGCTCAATGGGAGAAGAGGTATGGGTTCAAATCAGTTATGAGAAACTCCCAGATTTTTGCTTTGGTTGTGGTAAGTTGGGACATCTA
GCTAAAGACTGTGATTCTGATGAGGGCGGGCTAAAGGCCAATCTCCAATTTGAAGATTGGCTAAAAACGAGTTCCCGAGTGGGTGGAGAAGAAAGCCCAAGAGGCGGAGA
CCTGAGAAACAAAGTTCAGGGCAGAGGAAGAGGAAGGGGTCCGCGAGCATACAATGGAAGGGGTCCGAGGGGAGACGACCCGCCAGAAGAGATCGGGGAGATGGACAAAC
TGGAAGCTGCTGGGAAAGAGGTTCGACGAGTGGCGAAAGAGATAACCGGACCCCATGTCGGCGCCTGTCTGCGAGCCCAAACGAAAACTGTGCATGATGGTGACAACAAG
GTGCAAAGCAATGATGGTGGGGCAGGCACAAGTAACAATAGTGGGACCCAGGACAGTGACGTGGTCGCTTCAAAGTTGATAAGTGGAACGAAAGAGACTAACAAACCTCT
TGGGTTCGATATTGATGGGCCTGAGAAGAACTTGACAGAGGGGCTGGTGTCGGATGAAGACCTTACTCATAAGGAAAATGTCTCTAAAGGGCTTCCACAAGGGGCCACTA
CGGCCGAAGTTGATGAAAGGATGGGCCTGGCAAACGTGGGCCTCGGTATTGAAGTTGGTCCCCCTACAAGAACAAGTCTCGGGCATCAGCTTTCTCCCGGGTCTATTAAA
ATAGAAAAGAAGGACTTGGGCTCTGTCATTAAGAGCCCACAAGAAAAAGATACCGAGATTAAAATCCAATTAACTGAAATCGTAACAGGGGACTCAGAAATTCAGATCGA
CAACAAAAAATGGAAGAGGATAGCTAGAGGAAGTCACCAAAGTTCGGGGAGTGAGCCTCGAATTAGCAGATTCAGAGGGGGCAAACATGGATACAATTTCGGAAGTGAAG
ATATGGAAGCGATACTGAAGTTTGAGAATGCTTTTGAAGTTCCAAGGGCTGGGAGGAGTGGTGGCCTTATGATGTTGTGGAAAAGTAGTGTCCATCTCTTTATCTCTTCT
TACTCTAAAGGGCATATCGACACCATTATAAATGATGATAAGGGGCCCTGGCGCTTCACAGGTTTCTATGGGGAGCCTTCTGGGGAAAAAAGAATCGACTCTTGGGCTCT
CCTCAATCGATTGAGCATGCTTTTTGACCTCCCGTGGTTGGTGGGAGGTGATTTTAACGAGCTTCTGTCTGATGAGGAAAAAGCGGGAGGGGCTGCTAAAAATAAAAGTC
TTGTGGACAGTTTTGCAAATTGCATATTCAACTGCAAGCTTGCCGATGCTGAGATTGAAATCCAATAA
Protein sequenceShow/hide protein sequence
MDEGIFSNRLANLNLQEEERGGVVEVDDDELEEYEKKNQSEVACKILTTKTINVEVFKNIVPKIWNLEGNVTIEAAWRNIFVCSFRNTKDKKRVINGGPWSFDRGLIVFE
ELKGALNIKALDFRYARFWVNFHDLPRVCFTRKKAEALGNSLGIFEGVESDGLGRCSGQTLRVRFRMDITRPLRRAIKLKVGSMGEEVWVQISYEKLPDFCFGCGKLGHL
AKDCDSDEGGLKANLQFEDWLKTSSRVGGEESPRGGDLRNKVQGRGRGRGPRAYNGRGPRGDDPPEEIGEMDKLEAAGKEVRRVAKEITGPHVGACLRAQTKTVHDGDNK
VQSNDGGAGTSNNSGTQDSDVVASKLISGTKETNKPLGFDIDGPEKNLTEGLVSDEDLTHKENVSKGLPQGATTAEVDERMGLANVGLGIEVGPPTRTSLGHQLSPGSIK
IEKKDLGSVIKSPQEKDTEIKIQLTEIVTGDSEIQIDNKKWKRIARGSHQSSGSEPRISRFRGGKHGYNFGSEDMEAILKFENAFEVPRAGRSGGLMMLWKSSVHLFISS
YSKGHIDTIINDDKGPWRFTGFYGEPSGEKRIDSWALLNRLSMLFDLPWLVGGDFNELLSDEEKAGGAAKNKSLVDSFANCIFNCKLADAEIEIQ