CuGenDBv2

Lag0001650 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001650
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr4:34011150..34019736
RNA-Seq Expression: Lag0001650
Synteny: Lag0001650
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
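The genome location above is given as `chr4:34011150..34019736`. For readers who want to use it programmatically, here is a minimal Python sketch that splits such a string into sequence name, start, and end, and derives the span length. The names `Region` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page does not state explicitly.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings of the form "<seqid>:<start>..<end>".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' location string into a Region."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(match["seqid"], start, end)


region = parse_location("chr4:34011150..34019736")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene spans 8587 bp on chr4.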


Homology
GenBank top hits | e value | % identity
GAU28846.1 hypothetical protein TSUD_21830 [Trifolium subterraneum] | 4.6e-170 | 49.85
Query:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF
        ++L ++H+L        TCP+T  QNG+VERKHRH+V+ GLTLLSQ+ LPLKYWD AF TA +LINRLPTP L N +P   L H  PDY  L+VFGC CF
Subjt:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF

Query:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF-------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS
        P LR Y++ KLA+RSK C+F+GYS   KGYKCLSPDG ++VS++V FN  KFP+       S++++ I +  P+  +P    LP    +P    T P++ 
Subjt:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF-------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS

Query:  -EPQLGMATAPSPTPHASSST----VPSNE---DCSSP-------------SPSGSSTPIPP--PQQNVSNDHPMMTHAKHK-----------------A
                ++P P+ HAS  T     P+N    D +SP             SPS + + IPP  P    S  H M+T +K K                 A
Subjt:  -EPQLGMATAPSPTPHASSST----VPSNE---DCSSP-------------SPSGSSTPIPP--PQQNVSNDHPMMTHAKHK-----------------A

Query:  LKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQ
         +SPHW KAM++EY+AL+KNNTW LV  P   + +GCKW+FR+K NSDG+I++YKARLVAKGFHQ    D+TETFS VVKPVT+R +LT+ +   W ++Q
Subjt:  LKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQ

Query:  IDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPAD
        ID NNAFL+G+L E V+M QP GF+ + ++GLVCKL KALY LKQAPRAW++RL   L   GF  SK DPSL + +T+     +L+YV DII+ GSS + 
Subjt:  IDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPAD

Query:  VSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEI
        +  LI+ LN +F+LK L  L++FLGI+V++  NG + LSQ+ YI DLLS+  M  A  + TPMVS   +S    +   D  LYRSIVGALQY TLTRPEI
Subjt:  VSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEI

Query:  SYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG
        S++VNK CQF+ +P   HW+ VK+I RYL G ++  LLLQ  P+N  L L GF DADWASDPDDR+ST+G
Subjt:  SYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG

KYP61341.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | 5.6e-168 | 49.77
Query:  TLHLNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVF
        T +LN  + + H+L+        CP+T  QNG+ ERKHRHIV++GLTL++Q+ LP+++WD +F TAVYLINRLP+ ++ N  P  +LFH  PDY  LR+F
Subjt:  TLHLNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVF

Query:  GCECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRAS---IHNSNPVVLLPHLASLPQNSNSPISPNT
        GC CFP LR YN HKL FRS+ CVF+GYS S KGYKCL+ DG++++S++V FN +KFP    FSS++AS   +  S P+ + P     P ++    SPN 
Subjt:  GCECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRAS---IHNSNPVVLLPHLASLPQNSNSPISPNT

Query:  SPSNSEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK----------------------HKALKSPHWSKAMKDE
         P+ S   +  + +P+   H S ST+ S    S+  P  SS+PI  P     N HPM T AK                       +AL +P WS AM+ E
Subjt:  SPSNSEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK----------------------HKALKSPHWSKAMKDE

Query:  YDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLT
        Y+AL+ NNTW LVPLP     +GCKW+FR+K N +GS+ +YKARLVAKGF+Q    DY ETFS V+KPVT+R++LTL L + W ++Q+D NNAFL+G L 
Subjt:  YDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLT

Query:  ESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFS
        E V+M QP GF+ A ++ LVCKL KA+Y LKQAPRAW+++L   L  L F  SK DPSL I        YIL+YV DII+ G++ + +  L+S L+S FS
Subjt:  ESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFS

Query:  LKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHH
        LKDLG L+FFLGI+V    +G L L+QS YI DLL+R +M  +K I++PM+SGS +S    + F D  LYRS+VGALQY T+TRPEIS+SVNK CQFM H
Subjt:  LKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHH

Query:  PKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG
        P   HW  VK+I RYLKG  +  L LQ     ++L ++ + DADWASDPDDR+ST+G
Subjt:  PKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG

PNX92571.1 histone deacetylase [Trifolium pratense] | 3.0e-169 | 49.7
Query:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF
        + L ++H+L+        CP+T  QNG+VERKHRHIVD+GLTLLSQ+ LP+ +WD AF TAVYLINRLP+ ++N +TP   LF   PDY FL+VFGC CF
Subjt:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF

Query:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF--------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSN
        P LR Y+NHKL FRS+ C+F+GYSPS KGY+CLSP G+++VS++V FN S+FP+         S+ +    S  +  LP   S+  +  SP+ P T+P  
Subjt:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF--------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSN

Query:  SEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSST-----PIPPPQQNVS----NDHPMMTHAK------------------HKALKSPHWSKAMKD
        S P   +     P    S++    +   S+PSPS +S+      IPP    V     N H M T AK                   +AL  P W  AM+ 
Subjt:  SEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSST-----PIPPPQQNVS----NDHPMMTHAK------------------HKALKSPHWSKAMKD

Query:  EYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHL
        EYDAL+ NNTW LVPLP D + +GCKW+FRIK N DG++++YKARLVAKGFHQ    D+ ETFS VVKPVTIRV+LT+ +  GW+++Q+D NNAFL+G L
Subjt:  EYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHL

Query:  TESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQF
         E V+M QP GF+ + +  LVCKL KALY LKQAPR W+ERL   L  LGFK+SK DPSL +  +A    Y+L+YV DII+  ++   + + IS LN  F
Subjt:  TESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQF

Query:  SLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMH
        SLK LG L++FLGI+V++ ++G L L+QS Y+ DLL+R +M  +  ++TPM S   ++        D ++YRS+VGALQY T+TRPEIS+SVNKACQFM 
Subjt:  SLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMH

Query:  HPKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG
        HP   HW  VK+I RYLKG ++  LLL     P    L  F DADWASDPDDR+ST+G
Subjt:  HPKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG

PNX92906.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] | 2.1e-170 | 51.22
Query:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC
        L ++L + H+L        TCP+TS QNG VERKHR IV++GLTLLSQ++LPLKYWD +F  AVYLIN+LPT  L + K+P   LF+ +PDYS L++FGC
Subjt:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC

Query:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPS
         CFP LR YNNHKL FRS PCV++G SP  KG+KCL  +G+I+VS++V F+ ++FP+      SST  S   + P        +LP NS  P  P T P 
Subjt:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPS

Query:  NSEP----QLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDA
         + P    Q+    +   TP    S   S    SSP     + P+  P   +SN+HPM+T  K                    AL  P W KAM+ EY A
Subjt:  NSEP----QLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDA

Query:  LIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESV
        L+ NNTW LV LP   K +GCKWIFR+K N DG++++YKARLVAKGF QT   D+ ETFS V+KP TIR++LTL + Y W ++QID NNAFL+G L E V
Subjt:  LIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESV

Query:  FMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKD
        +M QPSGF+ A ++ LVCKL K+LY LKQAPRAWYERL+  L  +GF  SK DPSL+I     AC Y+LIYV DI++ GS+P  + +LI  LN QFSLK 
Subjt:  FMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKD

Query:  LGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKL
        LG++++FLGI+V++  +GGL L+QS YI DLLSR  M   KAI +PMVS   +S    D  +D  LYRS VGALQY TLTRP+ISYSVNK CQFM +P  
Subjt:  LGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKL

Query:  IHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG
         HW+ VK+I RYLKG  N  LL+   P++    L  ++DADWA+D DDR+ST+G
Subjt:  IHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG

PNX93770.1 histone deacetylase [Trifolium pratense] | 1.9e-168 | 50.46
Query:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC
        L ++L + H+L        TCP+TS QNG VERKHR IV++GLTLLSQ++LPL+YWD +F  AVYLIN+LPT  L + K+P   LF+ +PDYS +++FGC
Subjt:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC

Query:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS
         CFP LR YN +KL FRS PC+++G SP  KG+KCL  +G+I+VS++V F+ ++FP    F ++  S  NS   +   + +  P  SNS   P T+P  +
Subjt:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS

Query:  EPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDALIKNNT
         P L +    S      +S+  + +  SS S        PP   +  N+HPM+T  K                    AL  P W KAM+ EY AL+ NNT
Subjt:  EPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDALIKNNT

Query:  WELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPS
        W LV LP   K +GCKWIFR+K N DG+I++YKARLVAKGF QT   D+ ETFS V+KP TIRV+LTL + Y W+++QID NNAFL+G L E V+M QP 
Subjt:  WELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPS

Query:  GFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNF
        GF+ A ++ LVCKL K+LY LKQAPRAWYERL+  L  +GF TSK DPSL++     AC Y+LIYV DI++ GS+P  + +LI  LN QFSLK LG++++
Subjt:  GFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNF

Query:  FLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLV
        FLGI+V++  +GGL L+QS YI DLLSR  M   KAI +PMVS   +S    D  +D  LYRS VGALQY TLTRP+ISYSVNK CQFM +P   HW+ V
Subjt:  FLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLV

Query:  KQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG
        K+I RYLKG  N  LLL   P++    L  ++DADWA+D DDR+ST+G
Subjt:  KQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG
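In the alignment rows above, the unlabeled middle line appears to follow the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds for this page, the following Python sketch counts identities and positives for one row. The function name `midline_stats` is hypothetical, and the input is expected to have the `Query:` label and its leading padding already removed from both lines.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, positive, columns) for one alignment row.

    Assumes the BLAST-style midline convention: letters are identities,
    '+' is a conservative substitution, spaces are mismatches or gaps.
    """
    width = len(query)
    # Trailing spaces of the midline are often trimmed, so pad it back out.
    mid = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in mid)
    similar = mid.count("+")
    return identical, identical + similar, width


# Fragment taken from the first row of the GAU28846.1 alignment.
query = "TCPYTSQQNGIVERKHRH"
midline = "TCP+T  QNG+VERKHRH"
identical, positive, columns = midline_stats(query, midline)
print(f"identities {identical}/{columns}, positives {positive}/{columns}")
```

Summing these counts over every row of a hit and dividing the identities by the total number of columns gives a percent identity that can be compared with the value reported for that hit. Gap columns are counted in the denominator here, which is an assumption about how the reported figure is defined.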

TrEMBL top hits | e value | % identity
A0A151RUP0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.7e-168 | 49.77
Query:  TLHLNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVF
        T +LN  + + H+L+        CP+T  QNG+ ERKHRHIV++GLTL++Q+ LP+++WD +F TAVYLINRLP+ ++ N  P  +LFH  PDY  LR+F
Subjt:  TLHLNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVF

Query:  GCECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRAS---IHNSNPVVLLPHLASLPQNSNSPISPNT
        GC CFP LR YN HKL FRS+ CVF+GYS S KGYKCL+ DG++++S++V FN +KFP    FSS++AS   +  S P+ + P     P ++    SPN 
Subjt:  GCECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRAS---IHNSNPVVLLPHLASLPQNSNSPISPNT

Query:  SPSNSEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK----------------------HKALKSPHWSKAMKDE
         P+ S   +  + +P+   H S ST+ S    S+  P  SS+PI  P     N HPM T AK                       +AL +P WS AM+ E
Subjt:  SPSNSEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK----------------------HKALKSPHWSKAMKDE

Query:  YDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLT
        Y+AL+ NNTW LVPLP     +GCKW+FR+K N +GS+ +YKARLVAKGF+Q    DY ETFS V+KPVT+R++LTL L + W ++Q+D NNAFL+G L 
Subjt:  YDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLT

Query:  ESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFS
        E V+M QP GF+ A ++ LVCKL KA+Y LKQAPRAW+++L   L  L F  SK DPSL I        YIL+YV DII+ G++ + +  L+S L+S FS
Subjt:  ESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFS

Query:  LKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHH
        LKDLG L+FFLGI+V    +G L L+QS YI DLL+R +M  +K I++PM+SGS +S    + F D  LYRS+VGALQY T+TRPEIS+SVNK CQFM H
Subjt:  LKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHH

Query:  PKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG
        P   HW  VK+I RYLKG  +  L LQ     ++L ++ + DADWASDPDDR+ST+G
Subjt:  PKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG

A0A2K3MP35 Histone deacetylase | 1.4e-169 | 49.7
Query:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF
        + L ++H+L+        CP+T  QNG+VERKHRHIVD+GLTLLSQ+ LP+ +WD AF TAVYLINRLP+ ++N +TP   LF   PDY FL+VFGC CF
Subjt:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF

Query:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF--------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSN
        P LR Y+NHKL FRS+ C+F+GYSPS KGY+CLSP G+++VS++V FN S+FP+         S+ +    S  +  LP   S+  +  SP+ P T+P  
Subjt:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF--------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSN

Query:  SEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSST-----PIPPPQQNVS----NDHPMMTHAK------------------HKALKSPHWSKAMKD
        S P   +     P    S++    +   S+PSPS +S+      IPP    V     N H M T AK                   +AL  P W  AM+ 
Subjt:  SEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSST-----PIPPPQQNVS----NDHPMMTHAK------------------HKALKSPHWSKAMKD

Query:  EYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHL
        EYDAL+ NNTW LVPLP D + +GCKW+FRIK N DG++++YKARLVAKGFHQ    D+ ETFS VVKPVTIRV+LT+ +  GW+++Q+D NNAFL+G L
Subjt:  EYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHL

Query:  TESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQF
         E V+M QP GF+ + +  LVCKL KALY LKQAPR W+ERL   L  LGFK+SK DPSL +  +A    Y+L+YV DII+  ++   + + IS LN  F
Subjt:  TESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQF

Query:  SLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMH
        SLK LG L++FLGI+V++ ++G L L+QS Y+ DLL+R +M  +  ++TPM S   ++        D ++YRS+VGALQY T+TRPEIS+SVNKACQFM 
Subjt:  SLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMH

Query:  HPKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG
        HP   HW  VK+I RYLKG ++  LLL     P    L  F DADWASDPDDR+ST+G
Subjt:  HPKLIHWQLVKQIFRYLKGIINTSLLLQ---KPNNLCLYGFADADWASDPDDRKSTTG

A0A2K3MQ67 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.0e-170 | 51.22
Query:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC
        L ++L + H+L        TCP+TS QNG VERKHR IV++GLTLLSQ++LPLKYWD +F  AVYLIN+LPT  L + K+P   LF+ +PDYS L++FGC
Subjt:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC

Query:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPS
         CFP LR YNNHKL FRS PCV++G SP  KG+KCL  +G+I+VS++V F+ ++FP+      SST  S   + P        +LP NS  P  P T P 
Subjt:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPS

Query:  NSEP----QLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDA
         + P    Q+    +   TP    S   S    SSP     + P+  P   +SN+HPM+T  K                    AL  P W KAM+ EY A
Subjt:  NSEP----QLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDA

Query:  LIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESV
        L+ NNTW LV LP   K +GCKWIFR+K N DG++++YKARLVAKGF QT   D+ ETFS V+KP TIR++LTL + Y W ++QID NNAFL+G L E V
Subjt:  LIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESV

Query:  FMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKD
        +M QPSGF+ A ++ LVCKL K+LY LKQAPRAWYERL+  L  +GF  SK DPSL+I     AC Y+LIYV DI++ GS+P  + +LI  LN QFSLK 
Subjt:  FMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKD

Query:  LGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKL
        LG++++FLGI+V++  +GGL L+QS YI DLLSR  M   KAI +PMVS   +S    D  +D  LYRS VGALQY TLTRP+ISYSVNK CQFM +P  
Subjt:  LGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKL

Query:  IHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG
         HW+ VK+I RYLKG  N  LL+   P++    L  ++DADWA+D DDR+ST+G
Subjt:  IHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG

A0A2K3MSH2 Histone deacetylase | 9.3e-169 | 50.46
Query:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC
        L ++L + H+L        TCP+TS QNG VERKHR IV++GLTLLSQ++LPL+YWD +F  AVYLIN+LPT  L + K+P   LF+ +PDYS +++FGC
Subjt:  LNSSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNN-KTPSEQLFHVKPDYSFLRVFGC

Query:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS
         CFP LR YN +KL FRS PC+++G SP  KG+KCL  +G+I+VS++V F+ ++FP    F ++  S  NS   +   + +  P  SNS   P T+P  +
Subjt:  ECFPCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFP----FSSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS

Query:  EPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDALIKNNT
         P L +    S      +S+  + +  SS S        PP   +  N+HPM+T  K                    AL  P W KAM+ EY AL+ NNT
Subjt:  EPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTPIPPPQQNVSNDHPMMTHAK------------------HKALKSPHWSKAMKDEYDALIKNNT

Query:  WELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPS
        W LV LP   K +GCKWIFR+K N DG+I++YKARLVAKGF QT   D+ ETFS V+KP TIRV+LTL + Y W+++QID NNAFL+G L E V+M QP 
Subjt:  WELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPS

Query:  GFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNF
        GF+ A ++ LVCKL K+LY LKQAPRAWYERL+  L  +GF TSK DPSL++     AC Y+LIYV DI++ GS+P  + +LI  LN QFSLK LG++++
Subjt:  GFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNF

Query:  FLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLV
        FLGI+V++  +GGL L+QS YI DLLSR  M   KAI +PMVS   +S    D  +D  LYRS VGALQY TLTRP+ISYSVNK CQFM +P   HW+ V
Subjt:  FLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLV

Query:  KQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG
        K+I RYLKG  N  LLL   P++    L  ++DADWA+D DDR+ST+G
Subjt:  KQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG

A0A2Z6M8W8 Integrase catalytic domain-containing protein | 2.2e-170 | 49.85
Query:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF
        ++L ++H+L        TCP+T  QNG+VERKHRH+V+ GLTLLSQ+ LPLKYWD AF TA +LINRLPTP L N +P   L H  PDY  L+VFGC CF
Subjt:  SSLSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF

Query:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF-------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS
        P LR Y++ KLA+RSK C+F+GYS   KGYKCLSPDG ++VS++V FN  KFP+       S++++ I +  P+  +P    LP    +P    T P++ 
Subjt:  PCLRSYNNHKLAFRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPF-------SSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNS

Query:  -EPQLGMATAPSPTPHASSST----VPSNE---DCSSP-------------SPSGSSTPIPP--PQQNVSNDHPMMTHAKHK-----------------A
                ++P P+ HAS  T     P+N    D +SP             SPS + + IPP  P    S  H M+T +K K                 A
Subjt:  -EPQLGMATAPSPTPHASSST----VPSNE---DCSSP-------------SPSGSSTPIPP--PQQNVSNDHPMMTHAKHK-----------------A

Query:  LKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQ
         +SPHW KAM++EY+AL+KNNTW LV  P   + +GCKW+FR+K NSDG+I++YKARLVAKGFHQ    D+TETFS VVKPVT+R +LT+ +   W ++Q
Subjt:  LKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQ

Query:  IDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPAD
        ID NNAFL+G+L E V+M QP GF+ + ++GLVCKL KALY LKQAPRAW++RL   L   GF  SK DPSL + +T+     +L+YV DII+ GSS + 
Subjt:  IDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPAD

Query:  VSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEI
        +  LI+ LN +F+LK L  L++FLGI+V++  NG + LSQ+ YI DLLS+  M  A  + TPMVS   +S    +   D  LYRSIVGALQY TLTRPEI
Subjt:  VSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEI

Query:  SYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG
        S++VNK CQF+ +P   HW+ VK+I RYL G ++  LLLQ  P+N  L L GF DADWASDPDDR+ST+G
Subjt:  SYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQ-KPNN--LCLYGFADADWASDPDDRKSTTG

SwissProt top hits | e value | % identity
P04146 Copia protein | 6.0e-64 | 29.04
Query:  TCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTL--NNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRS
        T P+T Q NG+ ER  R I +   T++S + L   +W EA  TA YLINR+P+  L  ++KTP E   + KP    LRVFG   +  +++    K   +S
Subjt:  TCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTL--NNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRS

Query:  KPCVFIGYSPSQKGYKCL-SPDGKIFVSRNVAFNVSKFPFS---------------STRASIHNSNPVVLLPHL---------------------ASLPQ
           +F+GY P+  G+K   + + K  V+R+V  + +    S               S   +  N +  ++                          + P 
Subjt:  KPCVFIGYSPSQKGYKCL-SPDGKIFVSRNVAFNVSKFPFS---------------STRASIHNSNPVVLLPHL---------------------ASLPQ

Query:  NSNSPIS---PNTSPSNSEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTP----------------IPPPQQN---------------------
        +S   I    PN S      Q    +  S     + S     +D  + S  GS  P                I  P +N                     
Subjt:  NSNSPIS---PNTSPSNSEPQLGMATAPSPTPHASSSTVPSNEDCSSPSPSGSSTP----------------IPPPQQN---------------------

Query:  ----------------VSNDHPMMTHAKHKALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDI
                        + ND P              W +A+  E +A   NNTW +   P +   V  +W+F +K N  G+  RYKARLVA+GF Q   I
Subjt:  ----------------VSNDHPMMTHAKHKALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDI

Query:  DYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKAD
        DY ETF+ V +  + R +L+LV+ Y   + Q+D   AFL+G L E ++M  P G   + N   VCKL KA+Y LKQA R W+E     LK   F  S  D
Subjt:  DYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKAD

Query:  PSLMI--KQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGS
          + I  K       Y+L+YV D+++       ++N    L  +F + DL ++  F+GI++    +  ++LSQS+Y+  +LS+ NM    A++TP+ S  
Subjt:  PSLMI--KQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGS

Query:  IISAHQGDFFTDVYLYRSIVGALQYVTL-TRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLC----LYGFADADWASDPDDR
               D   +    RS++G L Y+ L TRP+++ +VN   ++        WQ +K++ RYLKG I+  L+ +K  NL     + G+ D+DWA    DR
Subjt:  IISAHQGDFFTDVYLYRSIVGALQYVTL-TRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLC----LYGFADADWASDPDDR

Query:  KSTTGF
        KSTTG+
Subjt:  KSTTGF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.4e-89 | 33.97
Query:  TCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRSKP
        T P T Q NG+ ER +R IV+   ++L  + LP  +W EA  TA YLINR P+  L  + P     + +  YS L+VFGC  F  +      KL  +S P
Subjt:  TCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRSKP

Query:  CVFIGYSPSQKGYKCLSP-DGKIFVSRNVAFNVSKFPFSSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNSEPQLGMATAPSPTPHASSSTVPS
        C+FIGY   + GY+   P   K+  SR+V F  S+     T A +       ++P+  ++P  SN+P S  ++      Q      P             
Subjt:  CVFIGYSPSQKGYKCLSP-DGKIFVSRNVAFNVSKFPFSSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNSEPQLGMATAPSPTPHASSSTVPS

Query:  NEDCSSPSPS--------GSSTPIPPPQQNVSNDHPMMTHAKH-----KALKSPHWS---KAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNS
         E+   P+           S  P    ++  S ++ +++  +      + L  P  +   KAM++E ++L KN T++LV LP   + + CKW+F++K++ 
Subjt:  NEDCSSPSPS--------GSSTPIPPPQQNVSNDHPMMTHAKH-----KALKSPHWS---KAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNS

Query:  DGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAP
        D  + RYKARLV KGF Q   ID+ E FS VVK  +IR +L+L       + Q+D   AFLHG L E ++M+QP GF+ AG + +VCKL K+LY LKQAP
Subjt:  DGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAP

Query:  RAWYERLSHFLKTLGFKTSKADPSLMIKQ-TAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKVYYP-TNGGLFLSQSSYIS
        R WY +   F+K+  +  + +DP +  K+ +      +L+YV D++++G     ++ L   L+  F +KDLG     LG+K+    T+  L+LSQ  YI 
Subjt:  RAWYERLSHFLKTLGFKTSKADPSLMIKQ-TAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKVYYP-TNGGLFLSQSSYIS

Query:  DLLSRANMTYAKAIATPMVSGSIISAHQGDFFTD------VYLYRSIVGALQY-VTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLL
         +L R NM  AK ++TP+     +S        +         Y S VG+L Y +  TRP+I+++V    +F+ +P   HW+ VK I RYL+G     L 
Subjt:  DLLSRANMTYAKAIATPMVSGSIISAHQGDFFTD------VYLYRSIVGALQY-VTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLL

Query:  LQKPNNLCLYGFADADWASDPDDRKSTTGF
            + + L G+ DAD A D D+RKS+TG+
Subjt:  LQKPNNLCLYGFADADWASDPDDRKSTTGF

P92519 Uncharacterized mitochondrial protein AtMg00810 | 2.0e-35 | 45.26
Query:  YILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKV-YYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPM---VSGSIISAHQGDFFT
        Y+L+YV DI++ GSS   ++ LI  L+S FS+KDLG +++FLGI++  +P+  GLFLSQ+ Y   +L+ A M   K ++TP+   ++ S+ +A     + 
Subjt:  YILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKV-YYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPM---VSGSIISAHQGDFFT

Query:  DVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF
        D   +RSIVGALQY+TLTRP+ISY+VN  CQ MH P L  + L+K++ RY+KG I   L + K + L +  F D+DWA     R+STTGF
Subjt:  DVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 3.7e-138 | 42.26
Query:  PYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRSKPCV
        P+T + NG+ ERKHRHIV+ GLTLLS +++P  YW  AFA AVYLINRLPTP L  ++P ++LF   P+Y  LRVFGC C+P LR YN HKL  +S+ CV
Subjt:  PYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRSKPCV

Query:  FIGYSPSQKGYKCLS-PDGKIFVSRNVAFNVSKFPFS--------------------STRASIHNSNPVVLL-----PHLASLPQNS------NSPIS--
        F+GYS +Q  Y CL     ++++SR+V F+ + FPFS                    S   ++    PV+       PH A+ P +S      NS +S  
Subjt:  FIGYSPSQKGYKCLS-PDGKIFVSRNVAFNVSKFPFS--------------------STRASIHNSNPVVLL-----PHLASLPQNS------NSPIS--

Query:  ------PNTSPSNSEPQLGMATAPSPT---------PHASSST------------------VPSNEDCSSPSP----SGSST----------PIPPPQQN
               ++ PS+ EP       P PT          H+S +T                   P+    SSPSP    S SST          P PP  Q 
Subjt:  ------PNTSPSNSEPQLGMATAPSPT---------PHASSST------------------VPSNEDCSSPSP----SGSST----------PIPPPQQN

Query:  VSND-------HPMMTHAKH-------------------------KALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKT-VGCKWIFRIKRNSDGSIS
        V+N+       H M T AK                          +ALK   W  AM  E +A I N+TW+LVP P  + T VGC+WIF  K NSDGS++
Subjt:  VSND-------HPMMTHAKH-------------------------KALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKT-VGCKWIFRIKRNSDGSIS

Query:  RYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYE
        RYKARLVAKG++Q   +DY ETFS V+K  +IR++L + +   W +RQ+D NNAFL G LT+ V+M QP GF        VCKL+KALY LKQAPRAWY 
Subjt:  RYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYE

Query:  RLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKV-YYPTNGGLFLSQSSYISDLLSRA
         L ++L T+GF  S +D SL + Q  K+  Y+L+YV DI++ G+ P  + N +  L+ +FS+KD  +L++FLGI+    PT  GL LSQ  YI DLL+R 
Subjt:  RLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKV-YYPTNGGLFLSQSSYISDLLSRA

Query:  NMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFA
        NM  AK + TPM     +S + G   TD   YR IVG+LQY+  TRP+ISY+VN+  QFMH P   H Q +K+I RYL G  N  + L+K N L L+ ++
Subjt:  NMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFA

Query:  DADWASDPDDRKSTTGF
        DADWA D DD  ST G+
Subjt:  DADWASDPDDRKSTTGF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 5.3e-137 | 40.28
Query:  PYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRSKPCV
        P+T + NG+ ERKHRHIV++GLTLLS +++P  YW  AF+ AVYLINRLPTP L  ++P ++LF   P+Y  L+VFGC C+P LR YN HKL  +SK C 
Subjt:  PYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLAFRSKPCV

Query:  FIGYSPSQKGYKCLS-PDGKIFVSRNVAFNVSKFPFSSTRASIHNSN-------------------------PVVLLPHLASLPQ--------------N
        F+GYS +Q  Y CL  P G+++ SR+V F+   FPFS+T   +  S                          P  L PHL + P+              +
Subjt:  FIGYSPSQKGYKCLS-PDGKIFVSRNVAFNVSKFPFSSTRASIHNSN-------------------------PVVLLPHLASLPQ--------------N

Query:  SNSPISPNTSPSNSEPQLGMATAPSPT--PHA-----SSSTVPSNEDCSSPSPSG------------SSTPIPPPQQNVS--------------------
        SN P S  +SPS+SEP       P PT  PH      S+S + +N + +SPSP+             SS  IP P  ++S                    
Subjt:  SNSPISPNTSPSNSEPQLGMATAPSPT--PHA-----SSSTVPSNEDCSSPSPSG------------SSTPIPPPQQNVS--------------------

Query:  -------------NDHPMMTHAKH-------------------------KALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKT-VGCKWIFRIKRNSD
                     N H M T AK                          +A+K   W +AM  E +A I N+TW+LVP P  + T VGC+WIF  K NSD
Subjt:  -------------NDHPMMTHAKH-------------------------KALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKT-VGCKWIFRIKRNSD

Query:  GSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPR
        GS++RYKARLVAKG++Q   +DY ETFS V+K  +IR++L + +   W +RQ+D NNAFL G LT+ V+M QP GF        VC+L+KA+Y LKQAPR
Subjt:  GSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPR

Query:  AWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLL
        AWY  L  +L T+GF  S +D SL + Q  ++  Y+L+YV DI++ G+    + + + AL+ +FS+K+   L++FLGI+       GL LSQ  Y  DLL
Subjt:  AWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLL

Query:  SRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLY
        +R NM  AK +ATPM +   ++ H G    D   YR IVG+LQY+  TRP++SY+VN+  Q+MH P   HW  +K++ RYL G  +  + L+K N L L+
Subjt:  SRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLY

Query:  GFADADWASDPDDRKSTTGF
         ++DADWA D DD  ST G+
Subjt:  GFADADWASDPDDRKSTTGF

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 1.2e-75 | % identity: 41.19
Query:  WSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNN
        W  AM DE  A+   +TWE+  LP + K +GCKW+++IK NSDG+I RYKARLVAKG+ Q   ID+ ETFS V K  +++++L +   Y +T+ Q+D +N
Subjt:  WSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTLVLYYGWTMRQIDFNN

Query:  AFLHGHLTESVFMDQPSGFKYAGNQG------LVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPA
        AFL+G L E ++M  P G  YA  QG       VC LKK++Y LKQA R W+ + S  L   GF  S +D +  +K TA     +L+YV DII+  ++ A
Subjt:  AFLHGHLTESVFMDQPSGFKYAGNQG------LVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVYDIIVIGSSPA

Query:  DVSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPE
         V  L S L S F L+DLG L +FLG+++   +  G+ + Q  Y  DLL    +   K  + PM      SAH G  F D   YR ++G L Y+ +TR +
Subjt:  DVSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPE

Query:  ISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF
        IS++VNK  QF   P+L H Q V +I  Y+KG +   L       + L  F+DA + S  D R+ST G+
Subjt:  ISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e value: 5.5e-12 | % identity: 40.26
Query:  YVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF
        Y+T+TRP+++++VN+  QF    +    Q V ++  Y+KG +   L     ++L L  FAD+DWAS PD R+S TGF
Subjt:  YVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 2.6e-06 | % identity: 35.29
Query:  HRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF
        +R I++   ++L +  LP  +  +A  TAV++IN+ P+  +N   P E  F   P YS+LR FGC  +
Subjt:  HRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECF

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 1.4e-36 | % identity: 45.26
Query:  YILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKV-YYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPM---VSGSIISAHQGDFFT
        Y+L+YV DI++ GSS   ++ LI  L+S FS+KDLG +++FLGI++  +P+  GLFLSQ+ Y   +L+ A M   K ++TP+   ++ S+ +A     + 
Subjt:  YILIYVYDIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKV-YYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPM---VSGSIISAHQGDFFT

Query:  DVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF
        D   +RSIVGALQY+TLTRP+ISY+VN  CQ MH P L  + L+K++ RY+KG I   L + K + L +  F D+DWA     R+STTGF
Subjt:  DVYLYRSIVGALQYVTLTRPEISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 2.0e-22 | % identity: 53.85
Query:  ALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTL
        ALK P W +AM++E DAL +N TW LVP P +   +GCKW+F+ K +SDG++ R KARLVAKGFHQ   I + ET+S VV+  TIR +L +
Subjt:  ALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVVKPVTIRVLLTL


Sequences
CDS sequence
ATGGATGCAGTCATGAAGATTCTCAGGTACTTAAAATCTACTCCTGGTTCAGGGTTGTTTTTCAAGAAGACAACTAATCGAGAGGTTGAAGTATATACTGATGCCGACTG
GGCAGGGTCTATTACGGATCGGAAGTCAACTTCAGGTATTTGTACATTTGTGTGGGGAAACCTTGTCACTTGGCGGTGCAAAAAGCAAACTGTAGTCGCCAGGAGTAGTA
CCGAGGCTGAACTTCGATCTCTTGCTAGTGGAATATGCGAAGGAATGTGGTTACGGAGACTGTTAATGGAGTTACACATGTACAGTAAACATCCAATGAAGTTCAACTGT
GATAATCAATCTACACTTTGCATGGCAAAGGATCCCGTACAACATGATAGGACGAAACATATCGAAATCGATCGACATTTCATCAATGATAGTATCGAGCAGAAACTCAT
CGTCCTAAACTACGTTCCTTCCAAACAACAAGCAGCTGACATTCTCACCAAGCCATTACCACAAACAAGCTTCAAGGAGATGACTAGCAAGCTTAATCTGTTGAACATTT
ACAGAGAACTCTTGTATATATACCCCACGATGAGAATGAAAGATGTACTTCTCTTCTACATTTATTTCTTTAGTGGGATTTCTTGGTCAAAACTGTTCACAATTGGCCCT
ATTTCTGGAATTCGGGCCCCTTTGACTTTTGTGGGATTCGATGAACTACTGATGGAGACCAACGAAGGACAACTAAGTTTGTACAACATGAACACCCAGCACTTCAAACA
CCTTCCAGTCAAAGGCCTTCCGGGTGAGTCCCAAGCTGCAGTTTTCATCAAGACTTTGCTTTCCATCAATGCCCATGGAGGACGTACGGACCACCGGACACACACATATC
TCCACAATGGTATCATATTGTCCACTTTGGGCATAAGCCCTCATGGCTTTGCTTTTGGTTCACCCAAAAGGCCTCATACCAATGGAGATACTTGTCCTCCCTTATATACC
CATGATCATCCCCTTATCCTCGCTTATATACCCATGATCATCCCCTTATCTAGCCGATGTGGGACTTTGGACGCACTCCCAACAATCCTCCCCTCGAACAAAGACGACTC
CTCTTCTCTGGAGTATACCCGCCCACCCAGAGCTCAACCACGGACCTCCACCATGACTATTAAGGCTCACAACTTCTTTGTTCGGAACCTGAGGATTCCACCCAACACGG
CTACTCCCGGGGATCCCTCTCATTCGGATGTGGCCTCGGTTCATTCATGTACCCCTCCTAACTCGGGTGTTACATGCCCACCAGCTTCCACCTTACATCTCAATTCTTCC
CTGAGTCTTCTTCATCAACTTCTCATCAAATGGGATAACACACATACTTGTCCTTACACATCCCAACAAAATGGCATAGTAGAACGAAAACATCGACACATAGTTGATGT
TGGTCTTACCCTCTTATCTCAATCAAACTTGCCTCTTAAGTATTGGGATGAGGCATTTGCCACTGCGGTTTATCTCATCAACAGGTTACCTACTCCAACTCTAAATAATA
AAACCCCTTCCGAACAACTGTTCCATGTTAAACCTGATTACTCCTTCCTTCGTGTTTTTGGGTGTGAATGTTTTCCTTGTCTTCGTTCCTATAACAATCACAAATTAGCC
TTTCGTTCCAAACCGTGTGTATTCATTGGCTATAGTCCATCCCAAAAGGGATACAAATGTCTCTCTCCCGATGGCAAAATATTTGTTTCAAGAAATGTTGCCTTTAATGT
GAGTAAGTTTCCTTTCTCATCCACTCGAGCCTCTATACACAATTCTAATCCTGTTGTCCTTTTACCTCATCTTGCATCCCTTCCCCAAAATTCCAATTCACCTATTTCTC
CAAATACTTCTCCATCCAACTCAGAACCTCAATTAGGCATGGCTACTGCTCCATCTCCCACACCACATGCTTCCTCATCAACAGTTCCTTCTAATGAGGATTGTTCTTCA
CCTAGTCCTAGCGGTTCATCTACACCAATCCCTCCCCCCCAACAGAATGTTTCAAATGATCATCCCATGATGACTCATGCGAAACACAAAGCCTTAAAATCTCCACATTG
GAGTAAGGCAATGAAGGATGAATACGATGCTCTCATAAAGAACAATACTTGGGAATTGGTTCCTTTACCAAATGATAACAAAACTGTGGGATGTAAGTGGATCTTTCGTA
TCAAGCGCAACTCTGATGGATCAATTTCCAGGTACAAAGCCCGGCTTGTAGCCAAGGGCTTTCATCAGACGGTTGACATTGATTATACTGAAACATTCAGTCTCGTAGTT
AAACCAGTTACCATAAGAGTTCTTTTGACCCTTGTTTTATACTATGGCTGGACAATGCGTCAAATTGATTTCAATAACGCCTTTTTGCACGGTCATCTAACTGAATCTGT
TTTTATGGATCAACCATCAGGTTTTAAATATGCAGGCAATCAAGGGCTAGTATGCAAGCTCAAAAAGGCCCTATACGACCTTAAACAAGCCCCAAGGGCATGGTATGAAA
GATTGAGCCATTTCCTCAAAACTCTTGGGTTTAAAACTTCGAAGGCAGATCCTTCTTTGATGATTAAACAAACTGCCAAAGCCTGTTGTTACATACTTATCTATGTTTAT
GATATAATTGTGATAGGGAGTTCTCCAGCTGATGTTTCAAATCTGATATCCGCCTTAAATTCTCAGTTCTCCTTAAAGGACCTCGGTAAGTTGAATTTCTTTCTGGGCAT
TAAGGTGTATTACCCAACTAATGGGGGTCTTTTTCTGTCACAATCCTCATATATTTCAGACCTTCTCTCTCGAGCCAATATGACATATGCAAAGGCCATTGCAACACCAA
TGGTAAGTGGCTCTATTATTTCTGCTCATCAAGGTGATTTTTTTACAGATGTTTATTTATATCGGAGCATTGTTGGCGCGTTGCAATATGTAACTTTGACTAGACCCGAA
ATATCCTATAGCGTTAACAAAGCATGTCAATTTATGCACCATCCCAAACTTATACACTGGCAGCTTGTTAAACAGATATTTAGGTACTTGAAAGGAATTATAAACACTAG
TTTGTTACTTCAGAAACCGAATAATTTATGCCTTTATGGTTTTGCTGATGCCGACTGGGCATCCGACCCAGATGATAGAAAGAGTACAACTGGGTTCTGA
mRNA sequence
ATGGATGCAGTCATGAAGATTCTCAGGTACTTAAAATCTACTCCTGGTTCAGGGTTGTTTTTCAAGAAGACAACTAATCGAGAGGTTGAAGTATATACTGATGCCGACTG
GGCAGGGTCTATTACGGATCGGAAGTCAACTTCAGGTATTTGTACATTTGTGTGGGGAAACCTTGTCACTTGGCGGTGCAAAAAGCAAACTGTAGTCGCCAGGAGTAGTA
CCGAGGCTGAACTTCGATCTCTTGCTAGTGGAATATGCGAAGGAATGTGGTTACGGAGACTGTTAATGGAGTTACACATGTACAGTAAACATCCAATGAAGTTCAACTGT
GATAATCAATCTACACTTTGCATGGCAAAGGATCCCGTACAACATGATAGGACGAAACATATCGAAATCGATCGACATTTCATCAATGATAGTATCGAGCAGAAACTCAT
CGTCCTAAACTACGTTCCTTCCAAACAACAAGCAGCTGACATTCTCACCAAGCCATTACCACAAACAAGCTTCAAGGAGATGACTAGCAAGCTTAATCTGTTGAACATTT
ACAGAGAACTCTTGTATATATACCCCACGATGAGAATGAAAGATGTACTTCTCTTCTACATTTATTTCTTTAGTGGGATTTCTTGGTCAAAACTGTTCACAATTGGCCCT
ATTTCTGGAATTCGGGCCCCTTTGACTTTTGTGGGATTCGATGAACTACTGATGGAGACCAACGAAGGACAACTAAGTTTGTACAACATGAACACCCAGCACTTCAAACA
CCTTCCAGTCAAAGGCCTTCCGGGTGAGTCCCAAGCTGCAGTTTTCATCAAGACTTTGCTTTCCATCAATGCCCATGGAGGACGTACGGACCACCGGACACACACATATC
TCCACAATGGTATCATATTGTCCACTTTGGGCATAAGCCCTCATGGCTTTGCTTTTGGTTCACCCAAAAGGCCTCATACCAATGGAGATACTTGTCCTCCCTTATATACC
CATGATCATCCCCTTATCCTCGCTTATATACCCATGATCATCCCCTTATCTAGCCGATGTGGGACTTTGGACGCACTCCCAACAATCCTCCCCTCGAACAAAGACGACTC
CTCTTCTCTGGAGTATACCCGCCCACCCAGAGCTCAACCACGGACCTCCACCATGACTATTAAGGCTCACAACTTCTTTGTTCGGAACCTGAGGATTCCACCCAACACGG
CTACTCCCGGGGATCCCTCTCATTCGGATGTGGCCTCGGTTCATTCATGTACCCCTCCTAACTCGGGTGTTACATGCCCACCAGCTTCCACCTTACATCTCAATTCTTCC
CTGAGTCTTCTTCATCAACTTCTCATCAAATGGGATAACACACATACTTGTCCTTACACATCCCAACAAAATGGCATAGTAGAACGAAAACATCGACACATAGTTGATGT
TGGTCTTACCCTCTTATCTCAATCAAACTTGCCTCTTAAGTATTGGGATGAGGCATTTGCCACTGCGGTTTATCTCATCAACAGGTTACCTACTCCAACTCTAAATAATA
AAACCCCTTCCGAACAACTGTTCCATGTTAAACCTGATTACTCCTTCCTTCGTGTTTTTGGGTGTGAATGTTTTCCTTGTCTTCGTTCCTATAACAATCACAAATTAGCC
TTTCGTTCCAAACCGTGTGTATTCATTGGCTATAGTCCATCCCAAAAGGGATACAAATGTCTCTCTCCCGATGGCAAAATATTTGTTTCAAGAAATGTTGCCTTTAATGT
GAGTAAGTTTCCTTTCTCATCCACTCGAGCCTCTATACACAATTCTAATCCTGTTGTCCTTTTACCTCATCTTGCATCCCTTCCCCAAAATTCCAATTCACCTATTTCTC
CAAATACTTCTCCATCCAACTCAGAACCTCAATTAGGCATGGCTACTGCTCCATCTCCCACACCACATGCTTCCTCATCAACAGTTCCTTCTAATGAGGATTGTTCTTCA
CCTAGTCCTAGCGGTTCATCTACACCAATCCCTCCCCCCCAACAGAATGTTTCAAATGATCATCCCATGATGACTCATGCGAAACACAAAGCCTTAAAATCTCCACATTG
GAGTAAGGCAATGAAGGATGAATACGATGCTCTCATAAAGAACAATACTTGGGAATTGGTTCCTTTACCAAATGATAACAAAACTGTGGGATGTAAGTGGATCTTTCGTA
TCAAGCGCAACTCTGATGGATCAATTTCCAGGTACAAAGCCCGGCTTGTAGCCAAGGGCTTTCATCAGACGGTTGACATTGATTATACTGAAACATTCAGTCTCGTAGTT
AAACCAGTTACCATAAGAGTTCTTTTGACCCTTGTTTTATACTATGGCTGGACAATGCGTCAAATTGATTTCAATAACGCCTTTTTGCACGGTCATCTAACTGAATCTGT
TTTTATGGATCAACCATCAGGTTTTAAATATGCAGGCAATCAAGGGCTAGTATGCAAGCTCAAAAAGGCCCTATACGACCTTAAACAAGCCCCAAGGGCATGGTATGAAA
GATTGAGCCATTTCCTCAAAACTCTTGGGTTTAAAACTTCGAAGGCAGATCCTTCTTTGATGATTAAACAAACTGCCAAAGCCTGTTGTTACATACTTATCTATGTTTAT
GATATAATTGTGATAGGGAGTTCTCCAGCTGATGTTTCAAATCTGATATCCGCCTTAAATTCTCAGTTCTCCTTAAAGGACCTCGGTAAGTTGAATTTCTTTCTGGGCAT
TAAGGTGTATTACCCAACTAATGGGGGTCTTTTTCTGTCACAATCCTCATATATTTCAGACCTTCTCTCTCGAGCCAATATGACATATGCAAAGGCCATTGCAACACCAA
TGGTAAGTGGCTCTATTATTTCTGCTCATCAAGGTGATTTTTTTACAGATGTTTATTTATATCGGAGCATTGTTGGCGCGTTGCAATATGTAACTTTGACTAGACCCGAA
ATATCCTATAGCGTTAACAAAGCATGTCAATTTATGCACCATCCCAAACTTATACACTGGCAGCTTGTTAAACAGATATTTAGGTACTTGAAAGGAATTATAAACACTAG
TTTGTTACTTCAGAAACCGAATAATTTATGCCTTTATGGTTTTGCTGATGCCGACTGGGCATCCGACCCAGATGATAGAAAGAGTACAACTGGGTTCTGA
Protein sequence
MDAVMKILRYLKSTPGSGLFFKKTTNREVEVYTDADWAGSITDRKSTSGICTFVWGNLVTWRCKKQTVVARSSTEAELRSLASGICEGMWLRRLLMELHMYSKHPMKFNC
DNQSTLCMAKDPVQHDRTKHIEIDRHFINDSIEQKLIVLNYVPSKQQAADILTKPLPQTSFKEMTSKLNLLNIYRELLYIYPTMRMKDVLLFYIYFFSGISWSKLFTIGP
ISGIRAPLTFVGFDELLMETNEGQLSLYNMNTQHFKHLPVKGLPGESQAAVFIKTLLSINAHGGRTDHRTHTYLHNGIILSTLGISPHGFAFGSPKRPHTNGDTCPPLYT
HDHPLILAYIPMIIPLSSRCGTLDALPTILPSNKDDSSSLEYTRPPRAQPRTSTMTIKAHNFFVRNLRIPPNTATPGDPSHSDVASVHSCTPPNSGVTCPPASTLHLNSS
LSLLHQLLIKWDNTHTCPYTSQQNGIVERKHRHIVDVGLTLLSQSNLPLKYWDEAFATAVYLINRLPTPTLNNKTPSEQLFHVKPDYSFLRVFGCECFPCLRSYNNHKLA
FRSKPCVFIGYSPSQKGYKCLSPDGKIFVSRNVAFNVSKFPFSSTRASIHNSNPVVLLPHLASLPQNSNSPISPNTSPSNSEPQLGMATAPSPTPHASSSTVPSNEDCSS
PSPSGSSTPIPPPQQNVSNDHPMMTHAKHKALKSPHWSKAMKDEYDALIKNNTWELVPLPNDNKTVGCKWIFRIKRNSDGSISRYKARLVAKGFHQTVDIDYTETFSLVV
KPVTIRVLLTLVLYYGWTMRQIDFNNAFLHGHLTESVFMDQPSGFKYAGNQGLVCKLKKALYDLKQAPRAWYERLSHFLKTLGFKTSKADPSLMIKQTAKACCYILIYVY
DIIVIGSSPADVSNLISALNSQFSLKDLGKLNFFLGIKVYYPTNGGLFLSQSSYISDLLSRANMTYAKAIATPMVSGSIISAHQGDFFTDVYLYRSIVGALQYVTLTRPE
ISYSVNKACQFMHHPKLIHWQLVKQIFRYLKGIINTSLLLQKPNNLCLYGFADADWASDPDDRKSTTGF