; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024926 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024926
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:7069350..7070978
RNA-Seq ExpressionLag0024926
SyntenyLag0024926
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8516701.1 hypothetical protein F0562_016793 [Nyssa sinensis]2.3e-9343.41Show/hide
Query:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
        T    +++   NPS S    L+NICNL++  LDS+NY+ W+FQIS + K+H L  Y+DGT   P     DE+         +Y+ W  +DQAL+TL+NAT
Subjt:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT

Query:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
        LSQTALS+VIG  TS++ W  LE+ FS+STR+N++ LK+ L +IS K  +SID+Y++++K+  + LA+VS +I+ ED +IY +NGLP  YN FKTS+RT+
Subjt:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR

Query:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
        + ++T  E++ ++K EE  ++   K   +       M T+     S +RG   S+  GRG GR   +   GR      F +P+   S+  +P+  P Q +
Subjt:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD

Query:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
         R +N   V  QIC + GH+ALDCY++M++SYQG+ P  +L AMSA+ +  S     + SP   N W +DTG   H+T+DLANL     Y G++NIT+ N
Subjt:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN

Query:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
        GQ+L ISH G   +   + +F L+N+  VP ++TNLLSVHQ C DN+C FIFDS  F IQDK+T ++LF GPS +GLYPL   S      PSP
Subjt:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP

KAA8519786.1 hypothetical protein F0562_014124 [Nyssa sinensis]6.1e-9443.41Show/hide
Query:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
        T    +++   NPS S    L+NICNL++  LDS+NY+ W+FQIS + K+H L  Y+DGT   P     DE+         +Y+ W  +DQAL+TL+NAT
Subjt:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT

Query:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
        LSQTALS+VIG  TS++ W  LE+ FS+STR+N++ LK+ L +IS K  +SID+Y++++K+  + LA+VS +I+ ED +IY +NGLP  YN FKTS+RT+
Subjt:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR

Query:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
        + ++T  E++ ++K EE  ++   K   +       M T+     S +RG   S+  GRG GR   +   GR      F +P+   S+  +P+  P Q +
Subjt:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD

Query:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
         R +N   V  QIC + GH+ALDCY++M++SYQG+ P  +L AMSA+ +  S     + SP   N W +DTG   H+T+DLANL     Y G++NIT+ N
Subjt:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN

Query:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
        GQ+L ISH G   +   + +F L+N+  VP ++TNLLSVHQ C DN+C FIFDS  F IQDK+T ++LF GPS +GLYPL   S      PSP
Subjt:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]1.7e-10741.8Show/hide
Query:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
        T    +++   NPS S    L+NICNL++  LDS+NY+ W+FQIS + K+H L  Y+DGT   P     DE+         +Y+ W  +DQAL+TL+NAT
Subjt:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT

Query:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
        LSQTALS+VIG  TS++ W  LE+ FS+STR+N++ LK+ L +IS K  +SID+Y++++K+  + LA+VS +I+ ED +IY +NGLP  YN FKTS+RT+
Subjt:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR

Query:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
        + ++T  E++ ++K EE  ++   K   +       M T+     S +RG   S+  GRG GR   +   GR      F +P+   S+  +P+  P Q +
Subjt:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD

Query:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
         R +N   V  QIC + GH+ALDCY++M++SYQG+ P  +L AMSA+ +  S     + SP   N W +DTG   H+T+DLANL     Y G++NIT+ N
Subjt:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN

Query:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---
        GQ+L ISH G   +   + +F L+N+  VP ++TNLLSVHQ C DN+C FIFDS  F IQDK+T ++LF GPS +GLYPL    + K  +P+ Q  L   
Subjt:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---

Query:  ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF
                              TA +G + ST +WHDRLGHP  + L S+L+S+ I   R    +C+HCL GK++K PF
Subjt:  ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF

KAA8535282.1 hypothetical protein F0562_030285 [Nyssa sinensis]1.8e-9343.41Show/hide
Query:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
        T    +++   NPS S    L+NICNL++  LDS+NY+ W+FQIS + K+H L  Y+DGT   P     DE+         +Y+ W  +DQAL+TL+NAT
Subjt:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT

Query:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
        LSQTALS+VIG  TS++ W  LE+ FS+STR+N++ LK+ L +IS K  +SID+Y++++K   + LA+VS +I+ ED +IY +NGLP  YN FKTS+RT+
Subjt:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR

Query:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
        + ++T  E++ ++K EE  ++   K   +       M T+     S +RG   S+  GRG GR   +   GR      F +P+   S+  +P+  P Q +
Subjt:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD

Query:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
         R +N   V  QIC + GH+ALDCY++M++SYQG+ P  +L AMSA+ +  S     + SP   N W +DTG   H+T+DLANL     Y G++NIT+ N
Subjt:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN

Query:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP
        GQ+L ISH G   +   + +F L+N+  VP ++TNLLSVHQ C DN+C FIFDS  F IQDK+T ++LF GPS +GLYPL   S      PSP
Subjt:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKS------PSP

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]2.3e-8546.45Show/hide
Query:  MTTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE--------------------KSVDYEAWYERDQALI
        MT+   N+  + +  +  L+NICNLVS+ LDST++ILW+FQ++ + K+HKLF ++DG+  AP +                     +  +E W  +DQAL+
Subjt:  MTTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE--------------------KSVDYEAWYERDQALI

Query:  TLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFK
        TLINATLS  AL+YV+   TS+QVWE LEKH+SS++RTNV+ LK++LQSI KK+ ESIDAYV+R+KEI +K A VS  I+ E  +IY +NGL + YN   
Subjt:  TLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFK

Query:  TSLRTRAHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNW----RSHGRGRGGGRSDGNRNNGRGGRGF--LFPNPSNAPSHSQF
        TS+RTRA S++F ELH+ MK+EESA+++Q K  +     +    +S  SQ+R +     +SH RGRG       +NNGRG   F   F N     S   F
Subjt:  TSLRTRAHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNW----RSHGRGRGGGRSDGNRNNGRGGRGF--LFPNPSNAPSHSQF

Query:  PSPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPA---YNG
         +  Q D    NR   QIC + GH ALDCYN+MN+ +QGRHPP +LAAM A  ++S     L    S    WL+D+ CN H+T+DL+NL I+     YNG
Subjt:  PSPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPA---YNG

Query:  EENITVGNGQSLPISHFGPGQL
        EENI+VG+GQS PI+HFG GQ+
Subjt:  EENITVGNGQSLPISHFGPGQL

TrEMBL top hitse value%identityAlignment
A0A2N9EZ90 Uncharacterized protein2.4e-9638.45Show/hide
Query:  TTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDEKSVD------------YEAWYERDQALITLINATLSQ
        TT  +    N+   +  L+NI   V+V LD +N++ W+FQI+ + +++ L +YV+G +  P +  +             Y  W  RD+AL++LI+ATLS 
Subjt:  TTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDEKSVD------------YEAWYERDQALITLINATLSQ

Query:  TALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHS
        +A S VIG  ++  +W  L K ++S +R+N++ LK +L  + KK+ ++I  Y++R+KE ++KLAAV T++D ED +   + GLPS Y  F +++ T+  S
Subjt:  TALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHS

Query:  LTF-ELHILMKTEESAL-DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFL---------------------FP-NPS
        ++F ELH+LM ++E  L   Q    E S ++    T S  + SRG + + GRG    R  GN   G    GF                      FP +P 
Subjt:  LTF-ELHILMKTEESAL-DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFL---------------------FP-NPS

Query:  N----APSHSQFP-----SPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLT
        N    +P+ SQ P     S P F     NR   QICQ+ GH ALDCYN+MNYSYQGRHPPAKLAAM+++ S S            +N W+SDTG   H T
Subjt:  N----APSHSQFP-----SPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLT

Query:  SDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY
         DLANL  S  YN  + ++VGNGQ LPISH G  QL   +  F L N+ RVP +++NLLSV++ C DN+C F FDS  F IQD+ +GK L+ G S +GLY
Subjt:  SDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY

Query:  PL---------VAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPFSPIILSFLFSFRV
        PL          ++S SP+      Q  +++S+T+WH R GHP   +L  +L++ F P    D   C+HC  GK+++ PFS    S  F  ++
Subjt:  PL---------VAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPFSPIILSFLFSFRV

A0A2N9G7E3 Integrase catalytic domain-containing protein2.3e-9941.94Show/hide
Query:  LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI
        ++SNP+ +    L+NI NLVSV LD TNY+LW+FQI+   K++KL   VDG+   P+            + D+  W  +DQALI++I ATLS +AL+ VI
Subjt:  LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI

Query:  GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH
        G ++++ VW+ LEK F+S +R+NV+ LK +L SI KK+ ESI+ Y++++KE  +KL AV   I+AE+ +   ++GLP+ +  F +++RTR  S++F ELH
Subjt:  GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH

Query:  ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT
        +LM  EE +L++ T+   + +  HLAM       + GN     + S  +   GGR     N   GRGGR F   N +    ++  P P  ++ + S+R T
Subjt:  ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT

Query:  FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG
         QIC + GH ALDCY++M++++QG+HPP KLAAM+ S++ SS           SN W+SDTG   H T DLANL  +  YNG + +TVGNGQ LPI+H G
Subjt:  FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG

Query:  PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ
          QL        L    RVP++ TNLLSV + C DNNCCF FD+S F+IQD  +GKVL+ G +  GLYP+          ++P P            +A 
Subjt:  PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ

Query:  VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS
           K S++ WH RLGHP   IL SV   L +S I  S S+   CKHC  GK+S+ PFS
Subjt:  VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS

A0A2N9HPA0 Uncharacterized protein1.3e-9741.39Show/hide
Query:  LTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERL
        L+NI NLVSV LD +NY+LW++QI+ + K++ +  +VDGT + P E          ++  Y+ W  RDQ L+TLIN+TLS TALS V+G  T+  VW  L
Subjt:  LTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERL

Query:  EKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFE-LHILMKTEESALDQ
        EK ++SS+R+N++ LK EL +I K+S +SI+++++++K+  ++L AV   ID E+ +   + GLP  Y+ F T++RTR  + +FE +H+L+  EE +L  
Subjt:  EKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFE-LHILMKTEESALDQ

Query:  QTKIYETSNISHLAMTTSVDS---QSRGNWRSHGR---GRGGGRSDGNRNNGRGGRGFLFPNPS-NAPSHSQF-----PSPPQFDGRLSNRVTFQICQEY
        Q+ I  +   +H+AM  + +     S+GN R  GR    RG GR+  N N+GRGG      N S NA S   F      SP Q     + R   QIC + 
Subjt:  QTKIYETSNISHLAMTTSVDS---QSRGNWRSHGR---GRGGGRSDGNRNNGRGGRGFLFPNPS-NAPSHSQF-----PSPPQFDGRLSNRVTFQICQEY

Query:  GHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP
        GH ALDCY++M+YSYQG+ PP+KLAAM+A++         N+  SD + W+SDTG   H T DL+ +     Y G +  TVGNGQ++PI+H G  QL   
Subjt:  GHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP

Query:  NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSIL
        +  F L  + RVP +++NLLSV++ C DNNCCF+FD++ F I+D  TGK+L+ GPS N LYP+   S  P   T         S+ VWHDRLGHP   + 
Subjt:  NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSIL

Query:  NSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPF
          + ++S +  S S+     C HC+ GK++  PF
Subjt:  NSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPF

A0A2N9IB37 Uncharacterized protein5.2e-9941.76Show/hide
Query:  LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI
        ++SNP+ +    L+NI NLVSV LD TNY+LW+FQI+   K++KL   VDG+   P+            + D+  W  +DQALI++I ATLS +AL+ VI
Subjt:  LNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDE----------KSVDYEAWYERDQALITLINATLSQTALSYVI

Query:  GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH
        G ++++ VW+ LEK F+S +R+NV+ LK +L SI KK+ ESI+ Y++++KE  +KL A+   I+AE+ +   ++GLP+ +  F +++RTR  S++F ELH
Subjt:  GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELH

Query:  ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT
        +LM  EE +L++ T+   + +  HLAM       + GN     + S  +   GGR     N   GRGGR F   N +    ++  P P  ++ + S+R T
Subjt:  ILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGN-----WRSHGRGRGGGRSD--GNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVT

Query:  FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG
         QIC + GH ALDCY++M++++QG+HPP KLAAM+ S++ SS           SN W+SDTG   H T DLANL  +  YNG + +TVGNGQ LPI+H G
Subjt:  FQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFG

Query:  PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ
          QL        L    RVP++ TNLLSV + C DNNCCF FD+S F+IQD  +GKVL+ G +  GLYP+          ++P P            +A 
Subjt:  PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLV--------AKSPSPA-------QVTLTAQ

Query:  VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS
           K S++ WH RLGHP   IL SV   L +S I  S S+   CKHC  GK+S+ PFS
Subjt:  VGIKASTTVWHDRLGHPCLSILNSV---LNSSFIPVSRSDIGVCKHCLDGKLSKQPFS

A0A5J5A1U7 Integrase catalytic domain-containing protein8.0e-10841.8Show/hide
Query:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT
        T    +++   NPS S    L+NICNL++  LDS+NY+ W+FQIS + K+H L  Y+DGT   P     DE+         +Y+ W  +DQAL+TL+NAT
Subjt:  TTPENNSALNSNPSVS---FLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP-----DEKSV-------DYEAWYERDQALITLINAT

Query:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR
        LSQTALS+VIG  TS++ W  LE+ FS+STR+N++ LK+ L +IS K  +SID+Y++++K+  + LA+VS +I+ ED +IY +NGLP  YN FKTS+RT+
Subjt:  LSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTR

Query:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD
        + ++T  E++ ++K EE  ++   K   +       M T+     S +RG   S+  GRG GR   +   GR      F +P+   S+  +P+  P Q +
Subjt:  AHSLTF-ELHILMKTEESALDQQTKIYETSNISHLAMTTSVD---SQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPS--PPQFD

Query:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN
         R +N   V  QIC + GH+ALDCY++M++SYQG+ P  +L AMSA+ +  S     + SP   N W +DTG   H+T+DLANL     Y G++NIT+ N
Subjt:  GRLSNR--VTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGN

Query:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---
        GQ+L ISH G   +   + +F L+N+  VP ++TNLLSVHQ C DN+C FIFDS  F IQDK+T ++LF GPS +GLYPL    + K  +P+ Q  L   
Subjt:  GQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL----VAKSPSPA-QVTL---

Query:  ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF
                              TA +G + ST +WHDRLGHP  + L S+L+S+ I   R    +C+HCL GK++K PF
Subjt:  ----------------------TAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPF

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-1124.15Show/hide
Query:  WRFQISPLRKSHKLFKYVDGTTKAPDEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGES
        W+ ++  L     L K +D  +K PD  ++  E W + D+   + I   LS   ++ +I   T++ +W RLE  + S T TN + LK +L ++    G +
Subjt:  WRFQISPLRKSHKLFKYVDGTTKAPDEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGES

Query:  IDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNWRSH
          +++     ++ +LA +   I+ ED+ I  +N LPS+Y+   T++      L  +  I +K   SAL    K+ +       A+ T             
Subjt:  IDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHILMKTEESALDQQTKIYETSNISHLAMTTSVDSQSRGNWRSH

Query:  GRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYN--KMNYSYQGRHPPAKLAAMSASTSH-----SS
        GRGR   RS  + N GR G      N S +              R+ N      C + GH   DC N  K      G+      AAM  +  +     + 
Subjt:  GRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYN--KMNYSYQGRHPPAKLAAMSASTSH-----SS

Query:  PGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP---NASFTLSNLFRVPDISTNLLSVHQLCIDNNCC
            ++ S  +S  W+ DT  + H T  + +L           + +GN     I+  G G + +      +  L ++  VPD+  NL+S   L  D    
Subjt:  PGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLP---NASFTLSNLFRVPDISTNLLSVHQLCIDNNCC

Query:  FIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSR-SDIGVCKHCLDGKLSKQP
        + F +  + +   S   V+  G +   LY   A+     Q  L A    + S  +WH R+GH     L  +   S I  ++ + +  C +CL GK  +  
Subjt:  FIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSR-SDIGVCKHCLDGKLSKQP

Query:  F
        F
Subjt:  F

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-4528.86Show/hide
Query:  NNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVI
        N S LN N     ++N+  L      STNY++W  Q+  L   ++L  ++DG+T  P            + DY  W  +D+ + + +   +S +    V 
Subjt:  NNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVI

Query:  GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHI
           T+ Q+WE L K +++ +  +V  L+T+L+  +K + ++ID Y++ +    ++LA +   +D ++Q+   +  LP  Y      +  +    T     
Subjt:  GCQTSQQVWERLEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHI

Query:  LMKTEESALDQQTKIYETSNISHLAMT-TSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEY
        L +  E  L+ ++KI   S+ + + +T  +V  ++     ++  G    R D NRNN    + +   + +  P+++Q  S P            QIC   
Subjt:  LMKTEESALDQQTKIYETSNISHLAMT-TSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEY

Query:  GHNALDCYNKMNY--SYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLS
        GH+A  C    ++  S   + PP      S  T       L   SP  SN WL D+G   H+TSD  NL +   Y G +++ V +G ++PISH G   LS
Subjt:  GHNALDCYNKMNY--SYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLS

Query:  LPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLS
          +    L N+  VP+I  NL+SV++LC  N     F  +SF ++D +TG  L  G + + LY     S  P  V+L A    KA+ + WH RLGHP  S
Subjt:  LPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLS

Query:  ILNSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPFS
        ILNSV+++  + V         C  CL  K +K PFS
Subjt:  ILNSVLNSSFIPVSRSD--IGVCKHCLDGKLSKQPFS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-4127.7Show/hide
Query:  TNICNLVS---VHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWER
        TNI N+       L STNY++W  Q+  L   ++L  ++DG+T  P            + DY  W  +D+ + + I   +S +    V    T+ Q+WE 
Subjt:  TNICNLVS---VHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAP---------DEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWER

Query:  LEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYN--VFKTSLRTRAHSLTFELHILMKTEESAL
        L K +++ +  +V    T+L+ I++                 ++LA +   +D ++Q+   +  LP  Y   + + + +    SLT E+H      E  +
Subjt:  LEKHFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYN--VFKTSLRTRAHSLTFELHILMKTEESAL

Query:  DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYNK
        ++++K+   ++   + +T +V +    N   +   RG  R+  N NN           PS++ S S    P  + GR       QIC   GH+A  C   
Subjt:  DQQTKIYETSNISHLAMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYNK

Query:  MNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLF
            +Q +    +  + S  T       L   SP ++N WL D+G   H+TSD  NL     Y G +++ + +G ++PI+H G   L   + S  L+ + 
Subjt:  MNYSYQGRHPPAKLAAMSASTSHSSPGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLF

Query:  RVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY--PLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSF
         VP+I  NL+SV++LC  N     F  +SF ++D +TG  L  G + + LY  P+     S   V++ A    KA+ + WH RLGHP L+ILNSV+++  
Subjt:  RVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLY--PLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSF

Query:  IPVSRSD--IGVCKHCLDGKLSKQPFS
        +PV      +  C  C   K  K PFS
Subjt:  IPVSRSD--IGVCKHCLDGKLSKQPFS

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.3e-0824.55Show/hide
Query:  VSVHLDSTNYILWRFQISPLRKSHKLFKYVDG-TTKAPDEKSVDYEAWYERDQALITLINATLSQTALSYVI--GCQTSQQVWERLEKHFSSSTRTNVIG
        V++ L+  NY +WR     L  S  +  ++DG +T  P    +  + W ERD  +   I  T++ + L  +I  GC T++ +W  LE  F  +     + 
Subjt:  VSVHLDSTNYILWRFQISPLRKSHKLFKYVDG-TTKAPDEKSVDYEAWYERDQALITLINATLSQTALSYVI--GCQTSQQVWERLEKHFSSSTRTNVIG

Query:  LKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELHILMKTEESALDQQTKIYETSNISHLA
         + EL++ +     S+  Y +++K + + L  V + I     +++ +NGL   Y+     ++ ++   +F E   ++  EES L  ++K    S+ +H +
Subjt:  LKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTF-ELHILMKTEESALDQQTKIYETSNISHLA

Query:  MTTSV----------------DSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSP---PQF
        ++  +                ++ + G  RS  + RGGG SDG  NN    R    P     P  S +  P   PQF
Subjt:  MTTSV----------------DSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSP---PQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACACCAGAGAATAATTCTGCCCTGAATTCAAATCCATCCGTTTCCTTCCTCACCAATATTTGCAATCTCGTATCTGTTCACCTGGATTCTACGAATTACATTCT
TTGGAGGTTTCAGATTTCACCACTGCGTAAGTCCCATAAGTTGTTTAAGTATGTGGATGGAACAACCAAAGCTCCAGATGAAAAAAGTGTTGATTATGAAGCCTGGTATG
AACGTGATCAGGCTTTGATCACTTTGATCAATGCAACACTCTCACAGACGGCCTTGTCATATGTTATTGGCTGTCAAACTTCACAACAAGTCTGGGAACGATTAGAGAAG
CATTTTTCTTCCTCAACAAGAACGAATGTCATCGGCTTGAAGACCGAATTGCAAAGTATCTCTAAGAAATCTGGCGAATCAATTGATGCATATGTTCGACGTGTCAAGGA
GATTGTTAATAAATTAGCCGCTGTATCTACTGTGATTGATGCCGAGGACCAGATTATTTATACTGTCAATGGCTTACCTTCTGCCTACAATGTCTTCAAGACTTCTCTTC
GGACAAGAGCTCACTCGTTAACTTTCGAATTGCATATCTTGATGAAAACTGAAGAATCAGCTCTTGATCAACAAACGAAGATTTATGAGACTTCTAATATATCTCATCTT
GCTATGACAACTAGTGTTGATTCTCAAAGCCGAGGTAATTGGAGATCTCACGGTCGGGGACGTGGAGGTGGTCGTTCTGATGGCAATCGTAATAATGGTCGTGGAGGTCG
TGGCTTTCTGTTCCCTAATCCGTCAAATGCTCCTTCTCACAGTCAATTTCCTTCACCTCCTCAATTTGATGGTCGATTATCAAATCGTGTTACTTTTCAAATTTGCCAAG
AGTATGGACATAATGCCCTAGATTGTTACAACAAAATGAACTACTCCTATCAGGGTCGTCATCCGCCTGCCAAGCTTGCAGCCATGTCTGCTTCTACTTCGCACTCTTCT
CCAGGCACTTTACTTAATACTTCTCCATCTGATTCTAATGTGTGGTTGTCTGATACAGGGTGTAATGCACATCTTACTAGTGACCTTGCAAACTTGGGCATTTCTCCTGC
TTATAATGGGGAAGAGAACATAACAGTTGGTAATGGTCAGTCACTACCCATTTCTCATTTTGGTCCTGGTCAGCTTTCCCTTCCCAATGCCTCTTTTACTTTATCTAATC
TTTTTCGTGTTCCTGATATATCAACAAATCTCCTTTCTGTTCATCAATTATGTATAGACAATAATTGTTGTTTCATCTTTGATTCATCATCTTTTACCATTCAGGACAAA
TCAACGGGCAAAGTTCTCTTCCACGGACCTAGTGTCAACGGTCTTTATCCACTGGTTGCAAAATCTCCTTCTCCAGCACAAGTAACCCTTACGGCCCAAGTTGGTATCAA
GGCTTCCACTACTGTGTGGCATGATCGGTTAGGTCACCCTTGTCTTTCGATTCTAAATTCTGTTTTGAATTCCTCTTTTATTCCAGTTAGTCGGTCTGATATTGGTGTTT
GTAAACATTGTCTTGATGGCAAGCTGTCTAAACAACCTTTTTCCCCTATCATCCTCTCTTTCTTGTTCTCCTTTAGAGTTACTGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGACAACACCAGAGAATAATTCTGCCCTGAATTCAAATCCATCCGTTTCCTTCCTCACCAATATTTGCAATCTCGTATCTGTTCACCTGGATTCTACGAATTACATTCT
TTGGAGGTTTCAGATTTCACCACTGCGTAAGTCCCATAAGTTGTTTAAGTATGTGGATGGAACAACCAAAGCTCCAGATGAAAAAAGTGTTGATTATGAAGCCTGGTATG
AACGTGATCAGGCTTTGATCACTTTGATCAATGCAACACTCTCACAGACGGCCTTGTCATATGTTATTGGCTGTCAAACTTCACAACAAGTCTGGGAACGATTAGAGAAG
CATTTTTCTTCCTCAACAAGAACGAATGTCATCGGCTTGAAGACCGAATTGCAAAGTATCTCTAAGAAATCTGGCGAATCAATTGATGCATATGTTCGACGTGTCAAGGA
GATTGTTAATAAATTAGCCGCTGTATCTACTGTGATTGATGCCGAGGACCAGATTATTTATACTGTCAATGGCTTACCTTCTGCCTACAATGTCTTCAAGACTTCTCTTC
GGACAAGAGCTCACTCGTTAACTTTCGAATTGCATATCTTGATGAAAACTGAAGAATCAGCTCTTGATCAACAAACGAAGATTTATGAGACTTCTAATATATCTCATCTT
GCTATGACAACTAGTGTTGATTCTCAAAGCCGAGGTAATTGGAGATCTCACGGTCGGGGACGTGGAGGTGGTCGTTCTGATGGCAATCGTAATAATGGTCGTGGAGGTCG
TGGCTTTCTGTTCCCTAATCCGTCAAATGCTCCTTCTCACAGTCAATTTCCTTCACCTCCTCAATTTGATGGTCGATTATCAAATCGTGTTACTTTTCAAATTTGCCAAG
AGTATGGACATAATGCCCTAGATTGTTACAACAAAATGAACTACTCCTATCAGGGTCGTCATCCGCCTGCCAAGCTTGCAGCCATGTCTGCTTCTACTTCGCACTCTTCT
CCAGGCACTTTACTTAATACTTCTCCATCTGATTCTAATGTGTGGTTGTCTGATACAGGGTGTAATGCACATCTTACTAGTGACCTTGCAAACTTGGGCATTTCTCCTGC
TTATAATGGGGAAGAGAACATAACAGTTGGTAATGGTCAGTCACTACCCATTTCTCATTTTGGTCCTGGTCAGCTTTCCCTTCCCAATGCCTCTTTTACTTTATCTAATC
TTTTTCGTGTTCCTGATATATCAACAAATCTCCTTTCTGTTCATCAATTATGTATAGACAATAATTGTTGTTTCATCTTTGATTCATCATCTTTTACCATTCAGGACAAA
TCAACGGGCAAAGTTCTCTTCCACGGACCTAGTGTCAACGGTCTTTATCCACTGGTTGCAAAATCTCCTTCTCCAGCACAAGTAACCCTTACGGCCCAAGTTGGTATCAA
GGCTTCCACTACTGTGTGGCATGATCGGTTAGGTCACCCTTGTCTTTCGATTCTAAATTCTGTTTTGAATTCCTCTTTTATTCCAGTTAGTCGGTCTGATATTGGTGTTT
GTAAACATTGTCTTGATGGCAAGCTGTCTAAACAACCTTTTTCCCCTATCATCCTCTCTTTCTTGTTCTCCTTTAGAGTTACTGCATAG
Protein sequenceShow/hide protein sequence
MTTPENNSALNSNPSVSFLTNICNLVSVHLDSTNYILWRFQISPLRKSHKLFKYVDGTTKAPDEKSVDYEAWYERDQALITLINATLSQTALSYVIGCQTSQQVWERLEK
HFSSSTRTNVIGLKTELQSISKKSGESIDAYVRRVKEIVNKLAAVSTVIDAEDQIIYTVNGLPSAYNVFKTSLRTRAHSLTFELHILMKTEESALDQQTKIYETSNISHL
AMTTSVDSQSRGNWRSHGRGRGGGRSDGNRNNGRGGRGFLFPNPSNAPSHSQFPSPPQFDGRLSNRVTFQICQEYGHNALDCYNKMNYSYQGRHPPAKLAAMSASTSHSS
PGTLLNTSPSDSNVWLSDTGCNAHLTSDLANLGISPAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDK
STGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTTVWHDRLGHPCLSILNSVLNSSFIPVSRSDIGVCKHCLDGKLSKQPFSPIILSFLFSFRVTA