; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi02G001065 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi02G001065
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr2:31100371..31119700
RNA-Seq ExpressionBhi02G001065
SyntenyBhi02G001065
Gene Ontology termsGO:0000178 - exosome (RNase complex) (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR012340 - Nucleic acid-binding, OB-fold
IPR019495 - Exosome complex component CSL4, C-terminal
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035107.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-9941.26Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP
        V + GD  L+K+++SLK++MK  G +D G LVE   L+ G  +D +      ++   E +  +L+QF +VF  P TLPP+R  +H I L+ GT P+NVRP
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP

Query:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------
        YRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI VI+EL DEL GA+VFSK+DLK+            
Subjt:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------

Query:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA
                                       MN + +P LR+FVLVFF DIL+YS+ M++H                  N++KC FA P + YLGH I  
Subjt:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA

Query:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG
         G+ AD  KI+A+ +W  P N++E+R FLGLTGYY +FV NYG+                      AFN LK AM++LP L++P+FN PFEIESDAS VG
Subjt:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG

Query:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD
        +GAVL      VA+                                L+G+           +  +  +V+  +YQ+W+AKLLGY   + Y+ G EN AAD
Subjt:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD

Query:  ALSRLPPAL
        ALSR+PPA+
Subjt:  ALSRLPPAL

KAA0055700.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]9.6e-10041.26Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP
        V + GD  L+K+++SLK++MK  G +D G LVE   L+ G  +D +       +   E +  +L+QF +VF  P TLPP+R  +H I L+ GT P+NVRP
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP

Query:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------
        YRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI VI+EL DEL GA+VFSK+DLK+            
Subjt:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------

Query:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA
                                       MN + +P LR+FVLVFF DIL+YS+ M++H                  N++KC FA P + YLGH I  
Subjt:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA

Query:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG
         G+ AD  KI+A+ +W  P N++E+R FLGLTGYY +FV NYG+                     +AFN LK AM++LP L++P+FN PFEIESDAS VG
Subjt:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG

Query:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD
        +GAVL      VA+                                L+G+           +  +  +V+  +YQ+W+AKLLGY   + Y+ G EN AAD
Subjt:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD

Query:  ALSRLPPAL
        ALSR PPA+
Subjt:  ALSRLPPAL

KAA0059567.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.6e-10243.42Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI
        V + GD  L K+++SLK++MK  G +D G LVE   ++ GL ++     H  D    ++  EA+  +L+QF  VF  PT LPP+R  DH I L+ G  P+
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI

Query:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------
        NVRPYRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI +I+EL DEL GA+VFSKIDLK+        
Subjt:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------

Query:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH
                                           MN + +P LR+FVLVFF DILVYS+ +++H                 ANL+KC FA P + YLGH
Subjt:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH

Query:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA
        +I   G+ AD  KI+A+ +W  P N++E+R FL LTGYY +FV +YG+                      AF+ LK AM++LP L++P+FN PFEIESDA
Subjt:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA

Query:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE
        S  G+GA +       T++A VL  + + G +  + +EV +G    A +YQRW+AKLLGY   + Y+ G EN AADALSR+ P ++
Subjt:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE

KAA0064684.1 tetratricopeptide repeat protein SKI3 [Cucumis melo var. makuwa]1.3e-9943.92Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP
        + + GD  L K+++SLK+M+K  G ED G LVE   ++    D     +   + ++   +  VL+QFE VF  P  LPP+R  +H I L+ G  PINVRP
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP

Query:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLK-------------
        YRY   QK+E+ K+V EML +G+I+P+ SPFSS VL   KKDGSW FCVDYRA+N ATIPDKFP  V++EL DEL GA VFSKIDLK             
Subjt:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLK-------------

Query:  ----------------------------SIAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA
                                       MN I +P LRKFVLVFF DIL+YS++  DH             +   AN KKC FA   VEYLGH++  
Subjt:  ----------------------------SIAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA

Query:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG
         GV  D  KIK++ KW KP NIKE+R FLGLTGYY +FV NYG+                     +AF  LK AM SLP L+LP+FN+PFEI++DAS  G
Subjt:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG

Query:  IGAVLIHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALEFGLMGVVL
        +GAVLI  +        TL   + +        ++V   +YQ WIAKLLGY   + YK G EN AADALS++P  +E   + V++
Subjt:  IGAVLIHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALEFGLMGVVL

TYK14806.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.6e-10243.42Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI
        V + GD  L K+++SLK++MK  G +D G LVE   ++ GL ++     H  D    ++  EA+  +L+QF  VF  PT LPP+R  DH I L+ G  P+
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI

Query:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------
        NVRPYRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI +I+EL DEL GA+VFSKIDLK+        
Subjt:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------

Query:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH
                                           MN + +P LR+FVLVFF DILVYS+ +++H                 ANL+KC FA P + YLGH
Subjt:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH

Query:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA
        +I   G+ AD  KI+A+ +W  P N++E+R FL LTGYY +FV +YG+                      AF+ LK AM++LP L++P+FN PFEIESDA
Subjt:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA

Query:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE
        S  G+GA +       T++A VL  + + G +  + +EV +G    A +YQRW+AKLLGY   + Y+ G EN AADALSR+ P ++
Subjt:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE

TrEMBL top hitse value%identityAlignment
A0A5A7UKN8 Ty3/gypsy retrotransposon protein4.7e-10041.26Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP
        V + GD  L+K+++SLK++MK  G +D G LVE   L+ G  +D +       +   E +  +L+QF +VF  P TLPP+R  +H I L+ GT P+NVRP
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP

Query:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------
        YRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI VI+EL DEL GA+VFSK+DLK+            
Subjt:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------

Query:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA
                                       MN + +P LR+FVLVFF DIL+YS+ M++H                  N++KC FA P + YLGH I  
Subjt:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA

Query:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG
         G+ AD  KI+A+ +W  P N++E+R FLGLTGYY +FV NYG+                     +AFN LK AM++LP L++P+FN PFEIESDAS VG
Subjt:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG

Query:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD
        +GAVL      VA+                                L+G+           +  +  +V+  +YQ+W+AKLLGY   + Y+ G EN AAD
Subjt:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD

Query:  ALSRLPPAL
        ALSR PPA+
Subjt:  ALSRLPPAL

A0A5A7UY55 Ty3/gypsy retrotransposon protein2.2e-10243.42Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI
        V + GD  L K+++SLK++MK  G +D G LVE   ++ GL ++     H  D    ++  EA+  +L+QF  VF  PT LPP+R  DH I L+ G  P+
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI

Query:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------
        NVRPYRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI +I+EL DEL GA+VFSKIDLK+        
Subjt:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------

Query:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH
                                           MN + +P LR+FVLVFF DILVYS+ +++H                 ANL+KC FA P + YLGH
Subjt:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH

Query:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA
        +I   G+ AD  KI+A+ +W  P N++E+R FL LTGYY +FV +YG+                      AF+ LK AM++LP L++P+FN PFEIESDA
Subjt:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA

Query:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE
        S  G+GA +       T++A VL  + + G +  + +EV +G    A +YQRW+AKLLGY   + Y+ G EN AADALSR+ P ++
Subjt:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE

A0A5A7VBR1 Tetratricopeptide repeat protein SKI36.1e-10043.92Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP
        + + GD  L K+++SLK+M+K  G ED G LVE   ++    D     +   + ++   +  VL+QFE VF  P  LPP+R  +H I L+ G  PINVRP
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP

Query:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLK-------------
        YRY   QK+E+ K+V EML +G+I+P+ SPFSS VL   KKDGSW FCVDYRA+N ATIPDKFP  V++EL DEL GA VFSKIDLK             
Subjt:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLK-------------

Query:  ----------------------------SIAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA
                                       MN I +P LRKFVLVFF DIL+YS++  DH             +   AN KKC FA   VEYLGH++  
Subjt:  ----------------------------SIAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA

Query:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG
         GV  D  KIK++ KW KP NIKE+R FLGLTGYY +FV NYG+                     +AF  LK AM SLP L+LP+FN+PFEI++DAS  G
Subjt:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG

Query:  IGAVLIHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALEFGLMGVVL
        +GAVLI  +        TL   + +        ++V   +YQ WIAKLLGY   + YK G EN AADALS++P  +E   + V++
Subjt:  IGAVLIHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALEFGLMGVVL

A0A5A7VBU7 Ty3/gypsy retrotransposon protein6.1e-10041.26Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP
        V + GD  L+K+++SLK++MK  G +D G LVE   L+ G  +D +       +   E +  +L+QF +VF  P TLPP+R  +H I L+ GT P+NVRP
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPINVRP

Query:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------
        YRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI VI+EL DEL GA+VFSK+DLK+            
Subjt:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS------------

Query:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA
                                       MN + +P LR+FVLVFF DIL+YS+ M++H                  N++KC FA P + YLGH I  
Subjt:  -----------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGHLILA

Query:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG
         G+ AD  KI+A+ +W  P N++E+R FLGLTGYY +FV NYGS                      AFN LK AM++LP L++P+FN PFEIESDAS VG
Subjt:  GGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVG

Query:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD
        +GAVL      VA+                                L+G+           +  +  +++  +YQ+W+AKLLGY   + Y+ G EN AAD
Subjt:  IGAVLIHTE--VAF-------------------------------VLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAAD

Query:  ALSRLPPAL
        ALSR+PPA+
Subjt:  ALSRLPPAL

A0A5D3CWK0 Ty3/gypsy retrotransposon protein2.2e-10243.42Show/hide
Query:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI
        V + GD  L K+++SLK++MK  G +D G LVE   ++ GL ++     H  D    ++  EA+  +L+QF  VF  PT LPP+R  DH I L+ G  P+
Subjt:  VHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCD----QQIPEAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTGPI

Query:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------
        NVRPYRY   QK+E+ ++V+EML++GII+PS+SP+SS VL   KKDGSW+FCVDYRALN  TIPDKFPI +I+EL DEL GA+VFSKIDLK+        
Subjt:  NVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS--------

Query:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH
                                           MN + +P LR+FVLVFF DILVYS+ +++H                 ANL+KC FA P + YLGH
Subjt:  ---------------------------------IAMNDILRPCLRKFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQFAVPCVEYLGH

Query:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA
        +I   G+ AD  KI+A+ +W  P N++E+R FL LTGYY +FV +YG+                      AF+ LK AM++LP L++P+FN PFEIESDA
Subjt:  LILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS---------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDA

Query:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE
        S  G+GA +       T++A VL  + + G +  + +EV +G    A +YQRW+AKLLGY   + Y+ G EN AADALSR+ P ++
Subjt:  SRVGIGAVL-----IHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.9e-4029.61Show/hide
Query:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSV-LFGNKKDGS----WQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDL---------
        Y YPQ  + E+   + +ML  GII+ S SP++S + +   K+D S    ++  +DYR LN  T+ D+ PI  +DE+L +L     F+ IDL         
Subjt:  YRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSV-LFGNKKDGS----WQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDL---------

Query:  --KSIA------------------------------MNDILRPCLRKFVLVFFYDILVYSRSMEDHGNNF--------VANLK----KCQFAVPCVEYLG
          +S++                              MNDILRP L K  LV+  DI+V+S S+++H  +          ANLK    KC+F      +LG
Subjt:  --KSIA------------------------------MNDILRPCLRKFVLVFFYDILVYSRSMEDHGNNF--------VANLK----KCQFAVPCVEYLG

Query:  HLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG-----------------------SEAFNHLKTAMISLPFLSLPNFNEPFEIE
        H++   G+  +  KI+A+QK+  P   KE+++FLGLTGYY KF+ N+                          AF  LK  +   P L +P+F + F + 
Subjt:  HLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG-----------------------SEAFNHLKTAMISLPFLSLPNFNEPFEIE

Query:  SDASRVGIGAVLIHTEVAFVLVGQTLR------GTNGSEKSEVFIGIKV-----------IAGEYQ----------------RWIAKLLGYDLSIEYKRG
        +DAS V +GAVL         + +TL        T   E   +    K            I+ ++Q                RW  KL  +D  I+Y +G
Subjt:  SDASRVGIGAVLIHTEVAFVLVGQTLR------GTNGSEKSEVFIGIKV-----------IAGEYQ----------------RWIAKLLGYDLSIEYKRG

Query:  SENSAADALSRL
         EN  ADALSR+
Subjt:  SENSAADALSRL

P20825 Retrovirus-related Pol polyprotein from transposon 2979.5e-4229.9Show/hide
Query:  PINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKD-----GSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDL---
        PI  + Y   Q  + E+   V EML  G+I+ S SP++S      KK        ++  +DYR LN  TIPD++PI  +DE+L +L     F+ IDL   
Subjt:  PINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKD-----GSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDL---

Query:  --------KSIA------------------------------MNDILRPCLRKFVLVFFYDILVYSRSMEDHGNNFV--------ANLK----KCQFAVP
                +SI+                              MN+ILRP L K  LV+  DI+++S S+ +H N+          ANLK    KC+F   
Subjt:  --------KSIA------------------------------MNDILRPCLRKFVLVFFYDILVYSRSMEDHGNNFV--------ANLK----KCQFAVP

Query:  CVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS-----------------------EAFNHLKTAMISLPFLSLPNFN
           +LGH++   G+  +  K+KA+  +  P   KE+R+FLGLTGYY KF+ NY                         EAF  LK  +I  P L LP+F 
Subjt:  CVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS-----------------------EAFNHLKTAMISLPFLSLPNFN

Query:  EPFEIESDASRVGIGAVLIHTEVAFVLVGQTLR----GTNGSEKSEVFI-----------------------------GIKVIAGEYQRWIAKLLGYDLS
        + F + +DAS + +GAVL         + +TL       +  EK  + I                              +K    + +RW  +L  Y   
Subjt:  EPFEIESDASRVGIGAVLIHTEVAFVLVGQTLR----GTNGSEKSEVFI-----------------------------GIKVIAGEYQRWIAKLLGYDLS

Query:  IEYKRGSENSAADALSRL
        I+Y +G ENS ADALSR+
Subjt:  IEYKRGSENSAADALSRL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.9e-3728.13Show/hide
Query:  HVIELEPGTGPINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKI
        H IE++PG     ++PY   +  + EI K+V ++L    I PS+SP SS V+   KKDG+++ CVDYR LN+ATI D FP+  ID LL  +  A +F+ +
Subjt:  HVIELEPGTGPINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKI

Query:  DLKSIAMNDILRPCLR---------------------------------------KFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQF
        DL S      + P  R                                       +FV V+  DIL++S S E+H              N +   KKC+F
Subjt:  DLKSIAMNDILRPCLR---------------------------------------KFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQF

Query:  AVPCVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG--------------------SEAFNHLKTAMISLPFLSLPNFN
        A    E+LG+ I    +A    K  A++ +  PK +K+ + FLG+  YY +F+ N                       +A   LK A+ + P L   N  
Subjt:  AVPCVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG--------------------SEAFNHLKTAMISLPFLSLPNFN

Query:  EPFEIESDASRVGIGAVLIHTEVAFVLVG---------QTLRGTNGSEKSEVFIGIKVI------------------------------AGEYQRWIAKL
          + + +DAS+ GIGAVL   +    LVG         ++ +    + + E+   IK +                              A   QRW+  L
Subjt:  EPFEIESDASRVGIGAVLIHTEVAFVLVG---------QTLRGTNGSEKSEVFIGIKVI------------------------------AGEYQRWIAKL

Query:  LGYDLSIEYKRGSENSAADALSR
          YD ++EY  G +N  ADA+SR
Subjt:  LGYDLSIEYKRGSENSAADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.9e-3825.58Show/hide
Query:  EAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTG---PINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKK-----DGSWQFCV
        E +  +L +F ++F  P +       +  ++ E  T    PI  + Y YP   + E+ + ++E+L  GII+PS SP++S +    KK     +  ++  V
Subjt:  EAVRLVLQQFEKVFMTPTTLPPKRHHDHVIELEPGTG---PINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKK-----DGSWQFCV

Query:  DYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS-----------------------------------------IAMNDILRPCLRKFVLVFFY
        D++ LN  TIPD +PI  I+  L  L  A  F+ +DL S                                           ++DILR  + K   V+  
Subjt:  DYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKS-----------------------------------------IAMNDILRPCLRKFVLVFFY

Query:  DILVYSRSMEDHGN------------NFVANLKKCQFAVPCVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS-----
        DI+V+S   + H              N   NL+K  F    VE+LG+++ A G+ AD  K++A+ +   P ++KEL+ FLG+T YY KF+ +Y       
Subjt:  DILVYSRSMEDHGN------------NFVANLKKCQFAVPCVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGS-----

Query:  ----------------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVGIGAVL---------------------------IHTEV
                                    ++FN LK+ + S   L+ P F +PF + +DAS   IGAVL                           I  E+
Subjt:  ----------------------------EAFNHLKTAMISLPFLSLPNFNEPFEIESDASRVGIGAVL---------------------------IHTEV

Query:  -----------AFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPAL
                   A++    T++     +     +G +    + +RW A++  Y+  + YK G  N  ADALSR+PP L
Subjt:  -----------AFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPAL

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.9e-3728.13Show/hide
Query:  HVIELEPGTGPINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKI
        H IE++PG     ++PY   +  + EI K+V ++L    I PS+SP SS V+   KKDG+++ CVDYR LN+ATI D FP+  ID LL  +  A +F+ +
Subjt:  HVIELEPGTGPINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKI

Query:  DLKSIAMNDILRPCLR---------------------------------------KFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQF
        DL S      + P  R                                       +FV V+  DIL++S S E+H              N +   KKC+F
Subjt:  DLKSIAMNDILRPCLR---------------------------------------KFVLVFFYDILVYSRSMEDH------------GNNFVANLKKCQF

Query:  AVPCVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG--------------------SEAFNHLKTAMISLPFLSLPNFN
        A    E+LG+ I    +A    K  A++ +  PK +K+ + FLG+  YY +F+ N                       +A + LK A+ + P L   N  
Subjt:  AVPCVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG--------------------SEAFNHLKTAMISLPFLSLPNFN

Query:  EPFEIESDASRVGIGAVLIHTEVAFVLVG---------QTLRGTNGSEKSEVFIGIKVI------------------------------AGEYQRWIAKL
          + + +DAS+ GIGAVL   +    LVG         ++ +    + + E+   IK +                              A   QRW+  L
Subjt:  EPFEIESDASRVGIGAVLIHTEVAFVLVG---------QTLRGTNGSEKSEVFIGIKVI------------------------------AGEYQRWIAKL

Query:  LGYDLSIEYKRGSENSAADALSR
          YD ++EY  G +N  ADA+SR
Subjt:  LGYDLSIEYKRGSENSAADALSR

Arabidopsis top hitse value%identityAlignment
AT5G38890.1 Nucleic acid-binding, OB-fold-like protein3.7e-5777.14Show/hide
Query:  KRSTVEVTGHKAHGAVPAPGSIVIARVTKVMSKMASADIMCVGPKSVKEKFTGIIRQQDVRAMEIDKVDMHLSFRPGDVVKALVLSLGDARAYHLSTAKN
        +R+ VEVTGHKAHG +P  GS+VIARVTKVM+KMA+ DI+CVG K+V+E F G+IRQQDVRA EIDKVDMH SF  GD+V+A+VLSLGDARAY+LSTAKN
Subjt:  KRSTVEVTGHKAHGAVPAPGSIVIARVTKVMSKMASADIMCVGPKSVKEKFTGIIRQQDVRAMEIDKVDMHLSFRPGDVVKALVLSLGDARAYHLSTAKN

Query:  ELGVVSAESTAGAVMVPISWTEMQCPLTGQIQQRKVAKVG
        ELGVVSAES AG  MVPISWTEMQCPL+GQ +QRKVAKVG
Subjt:  ELGVVSAESTAGAVMVPISWTEMQCPLTGQIQQRKVAKVG

ATMG00850.1 DNA/RNA polymerases superfamily protein3.6e-0451.28Show/hide
Query:  QKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSW
        ++  ++  + EML A IIQPS SP+SS VL   KKDG W
Subjt:  QKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein5.2e-1944.07Show/hide
Query:  NNFVANLKKCQFAVPCVEYLG--HLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG---------------------SEAFNHLK
        + F AN KKC F  P + YLG  H+I   GV+AD +K++AM  W +PKN  ELR FLGLTGYY +FV NYG                     + AF  LK
Subjt:  NNFVANLKKCQFAVPCVEYLG--HLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYG---------------------SEAFNHLK

Query:  TAMISLPFLSLPNFNEPF
         A+ +LP L+LP+   PF
Subjt:  TAMISLPFLSLPNFNEPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACAACTTTCTTCAATGGTCACAAAGTGTTCAACTGTATGATTGGTTATCTTACTGGAGATAAGAAGGCACCTGCCAAGGATGGTTCTCTATTTTCTATGTGGGA
TGTTGAAAATTCCATGGTTATGACTTGGTTGGTGAATTCCATGACTGAGGAAATTTGTTCCAACCATATGTATTTCGATTGGGGCAATCAATCAAATGTGACTATTGGAG
TTGAATCTTCAACTCAGCAATATTCGACAAGGAGGGGATACCATTACACAATATTTTCACAAATTGACAAGAATTTGGCAAGAGTTGGATTATTTGATAATTATGAATGG
CATTCAACGAAAGATCAAATGCACTACATTAGGAAGTGCAGGTGTGATCTTAGGAGTAACGTGGTGGGAGAATGGATGGTTCACCTCCATGGAGATCATCGTTTGGTGAA
ATCCCAAATTTCCTTGAAGTCGATGATGAAGATGTTGGGAAGGGAAGATCTTGGGATGTTGGTGGAACTTAGTGTTCTAGATACGGGGTTGAAGGATGATAGCAAGGTTG
TACTCCACACTTGTGACCAGCAGATCCCCGAAGCTGTCCGATTAGTTCTTCAGCAATTTGAGAAGGTGTTTATGACCCCAACTACTTTACCACCCAAAAGACATCATGAT
CATGTGATCGAATTGGAGCCAGGGACCGGGCCTATCAATGTTCGACCATACCGATATCCACAATTTCAGAAAGACGAGATCAGAAAAATGGTCAATGAGATGTTGACGGC
TGGAATTATACAGCCAAGTAGAAGCCCTTTCTCTAGCTCGGTTCTCTTCGGCAATAAAAAAGATGGCAGTTGGCAGTTTTGTGTGGACTATCGAGCCCTCAATCGAGCGA
CCATACCAGACAAGTTTCCCATTTCAGTGATTGATGAGTTGTTGGACGAGTTGTTTGGTGCTACTGTTTTTTCAAAGATAGATTTGAAATCTATAGCCATGAATGATATT
CTGAGACCTTGTTTGAGAAAATTCGTGTTGGTTTTTTTTTACGACATCCTCGTTTATAGCCGATCAATGGAGGACCATGGCAATAACTTCGTAGCCAATTTGAAGAAATG
TCAGTTTGCCGTTCCTTGTGTTGAATACCTCGGTCATTTGATTTTGGCTGGTGGGGTAGCAGCTGACTCGTCCAAGATCAAGGCAATGCAGAAGTGGTTAAAACCCAAAA
ATATTAAGGAACTAAGGAGCTTCCTTGGTCTGACAGGTTATTACCCGAAGTTTGTGGCTAATTATGGGTCGGAAGCTTTTAACCATTTGAAAACAGCTATGATATCCCTA
CCATTCCTATCACTACCAAACTTCAATGAACCGTTTGAGATTGAATCAGATGCCTCTAGGGTTGGTATTGGGGCTGTCTTGATCCATACAGAGGTGGCATTTGTACTTGT
TGGGCAGACACTTCGTGGTACGAACGGATCAGAGAAGTCTGAAGTTTTTATTGGAATAAAAGTGATTGCAGGGGAGTATCAAAGATGGATTGCGAAGCTTTTGGGATACG
ACTTAAGCATTGAATATAAGCGAGGTTCTGAGAATTCAGCAGCAGATGCATTATCGAGATTACCTCCAGCCTTGGAGTTTGGTTTAATGGGTGTGGTACTGGATAAACCA
GAGGAGGTGTATGTACTAAATTCTCTAAAGAGATCAACTGTGGAAGTGACTGGTCATAAGGCCCATGGTGCTGTTCCAGCTCCTGGATCTATTGTCATAGCTCGAGTCAC
AAAAGTAATGTCTAAAATGGCATCAGCTGATATCATGTGTGTTGGTCCAAAGTCTGTGAAAGAGAAGTTTACTGGAATTATAAGGCAACAAGATGTTCGAGCAATGGAGA
TCGATAAAGTAGATATGCATTTGTCATTTCGTCCTGGTGACGTTGTGAAAGCTCTTGTTCTTTCTCTTGGAGATGCAAGGGCCTATCATCTATCAACTGCAAAAAATGAA
CTCGGCGTGGTCTCTGCAGAGAGCACAGCAGGTGCAGTGATGGTTCCCATAAGTTGGACAGAAATGCAGTGTCCATTAACAGGCCAAATTCAGCAAAGAAAAGTAGCCAA
AGTTGGAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGACAACTTTCTTCAATGGTCACAAAGTGTTCAACTGTATGATTGGTTATCTTACTGGAGATAAGAAGGCACCTGCCAAGGATGGTTCTCTATTTTCTATGTGGGA
TGTTGAAAATTCCATGGTTATGACTTGGTTGGTGAATTCCATGACTGAGGAAATTTGTTCCAACCATATGTATTTCGATTGGGGCAATCAATCAAATGTGACTATTGGAG
TTGAATCTTCAACTCAGCAATATTCGACAAGGAGGGGATACCATTACACAATATTTTCACAAATTGACAAGAATTTGGCAAGAGTTGGATTATTTGATAATTATGAATGG
CATTCAACGAAAGATCAAATGCACTACATTAGGAAGTGCAGGTGTGATCTTAGGAGTAACGTGGTGGGAGAATGGATGGTTCACCTCCATGGAGATCATCGTTTGGTGAA
ATCCCAAATTTCCTTGAAGTCGATGATGAAGATGTTGGGAAGGGAAGATCTTGGGATGTTGGTGGAACTTAGTGTTCTAGATACGGGGTTGAAGGATGATAGCAAGGTTG
TACTCCACACTTGTGACCAGCAGATCCCCGAAGCTGTCCGATTAGTTCTTCAGCAATTTGAGAAGGTGTTTATGACCCCAACTACTTTACCACCCAAAAGACATCATGAT
CATGTGATCGAATTGGAGCCAGGGACCGGGCCTATCAATGTTCGACCATACCGATATCCACAATTTCAGAAAGACGAGATCAGAAAAATGGTCAATGAGATGTTGACGGC
TGGAATTATACAGCCAAGTAGAAGCCCTTTCTCTAGCTCGGTTCTCTTCGGCAATAAAAAAGATGGCAGTTGGCAGTTTTGTGTGGACTATCGAGCCCTCAATCGAGCGA
CCATACCAGACAAGTTTCCCATTTCAGTGATTGATGAGTTGTTGGACGAGTTGTTTGGTGCTACTGTTTTTTCAAAGATAGATTTGAAATCTATAGCCATGAATGATATT
CTGAGACCTTGTTTGAGAAAATTCGTGTTGGTTTTTTTTTACGACATCCTCGTTTATAGCCGATCAATGGAGGACCATGGCAATAACTTCGTAGCCAATTTGAAGAAATG
TCAGTTTGCCGTTCCTTGTGTTGAATACCTCGGTCATTTGATTTTGGCTGGTGGGGTAGCAGCTGACTCGTCCAAGATCAAGGCAATGCAGAAGTGGTTAAAACCCAAAA
ATATTAAGGAACTAAGGAGCTTCCTTGGTCTGACAGGTTATTACCCGAAGTTTGTGGCTAATTATGGGTCGGAAGCTTTTAACCATTTGAAAACAGCTATGATATCCCTA
CCATTCCTATCACTACCAAACTTCAATGAACCGTTTGAGATTGAATCAGATGCCTCTAGGGTTGGTATTGGGGCTGTCTTGATCCATACAGAGGTGGCATTTGTACTTGT
TGGGCAGACACTTCGTGGTACGAACGGATCAGAGAAGTCTGAAGTTTTTATTGGAATAAAAGTGATTGCAGGGGAGTATCAAAGATGGATTGCGAAGCTTTTGGGATACG
ACTTAAGCATTGAATATAAGCGAGGTTCTGAGAATTCAGCAGCAGATGCATTATCGAGATTACCTCCAGCCTTGGAGTTTGGTTTAATGGGTGTGGTACTGGATAAACCA
GAGGAGGTGTATGTACTAAATTCTCTAAAGAGATCAACTGTGGAAGTGACTGGTCATAAGGCCCATGGTGCTGTTCCAGCTCCTGGATCTATTGTCATAGCTCGAGTCAC
AAAAGTAATGTCTAAAATGGCATCAGCTGATATCATGTGTGTTGGTCCAAAGTCTGTGAAAGAGAAGTTTACTGGAATTATAAGGCAACAAGATGTTCGAGCAATGGAGA
TCGATAAAGTAGATATGCATTTGTCATTTCGTCCTGGTGACGTTGTGAAAGCTCTTGTTCTTTCTCTTGGAGATGCAAGGGCCTATCATCTATCAACTGCAAAAAATGAA
CTCGGCGTGGTCTCTGCAGAGAGCACAGCAGGTGCAGTGATGGTTCCCATAAGTTGGACAGAAATGCAGTGTCCATTAACAGGCCAAATTCAGCAAAGAAAAGTAGCCAA
AGTTGGAGGATGA
Protein sequenceShow/hide protein sequence
METTFFNGHKVFNCMIGYLTGDKKAPAKDGSLFSMWDVENSMVMTWLVNSMTEEICSNHMYFDWGNQSNVTIGVESSTQQYSTRRGYHYTIFSQIDKNLARVGLFDNYEW
HSTKDQMHYIRKCRCDLRSNVVGEWMVHLHGDHRLVKSQISLKSMMKMLGREDLGMLVELSVLDTGLKDDSKVVLHTCDQQIPEAVRLVLQQFEKVFMTPTTLPPKRHHD
HVIELEPGTGPINVRPYRYPQFQKDEIRKMVNEMLTAGIIQPSRSPFSSSVLFGNKKDGSWQFCVDYRALNRATIPDKFPISVIDELLDELFGATVFSKIDLKSIAMNDI
LRPCLRKFVLVFFYDILVYSRSMEDHGNNFVANLKKCQFAVPCVEYLGHLILAGGVAADSSKIKAMQKWLKPKNIKELRSFLGLTGYYPKFVANYGSEAFNHLKTAMISL
PFLSLPNFNEPFEIESDASRVGIGAVLIHTEVAFVLVGQTLRGTNGSEKSEVFIGIKVIAGEYQRWIAKLLGYDLSIEYKRGSENSAADALSRLPPALEFGLMGVVLDKP
EEVYVLNSLKRSTVEVTGHKAHGAVPAPGSIVIARVTKVMSKMASADIMCVGPKSVKEKFTGIIRQQDVRAMEIDKVDMHLSFRPGDVVKALVLSLGDARAYHLSTAKNE
LGVVSAESTAGAVMVPISWTEMQCPLTGQIQQRKVAKVGG