; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G28680 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G28680
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr3:26311934..26313541
RNA-Seq ExpressionCSPI03G28680
SyntenyCSPI03G28680
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033341.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]6.7e-11558.33Show/hide
Query:  ESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVY
        E KS+ FEKFKHF AKV KQSGMF+KSLR+DRGGEFL N FNHF KERGIHRELITPYT EQN +A+RKNRTVV M RSMLQ+K LS+ FW EAVSTS+Y
Subjt:  ESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVY

Query:  LLNISPTKAIMNKTPFEAWCSKNPNV-------SHLRVFGCISYALVPSQVR--QKLDGKF---EKCIFVEN-----------VSLVGGESANDGAQTVV
        LLNISPTK +MNKTPFEAW  K PN        S   +F  + Y       R    L+GK       +F E+           VSLV GE  NDG QTVV
Subjt:  LLNISPTKAIMNKTPFEAWCSKNPNV-------SHLRVFGCISYALVPSQVR--QKLDGKF---EKCIFVEN-----------VSLVGGESANDGAQTVV

Query:  KNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWR--------------------------------QQPMKEEITTIEKNGTWKMVE-SEG
        +    SSMET TSTPP S PSTPQSYHS SS+DET DELP  R                                QQ MKEE+  IEKNGTWK+V+  EG
Subjt:  KNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWR--------------------------------QQPMKEEITTIEKNGTWKMVE-SEG

Query:  KSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKE
        K+AI LKWV+KTKF ADG LEK+KARLVAKGY QQHG DFE+TFS +A FE V+IVLA A Q+QW VYQFDVK  FL+GELQ EVYV QP+GFV + ++E
Subjt:  KSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKE

Query:  KVYKLTKALYGFETYEKLCGFTDSDWASSLDD
        KVYKLTKALYG +   +        W S +D+
Subjt:  KVYKLTKALYGFETYEKLCGFTDSDWASSLDD

KAA0040613.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.8e-12165.28Show/hide
Query:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT
        MQ KSL  SFYFL+F +DYS MSW+YFLESKS+TFEKFKHFKAKV KQSGMFIKSLR+D+GGEFL N FNHFY+E GIHREL T YT EQN VA++KNRT
Subjt:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT

Query:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDG
        VV M RS+LQ+K LS+ FW+EAVSTS+YLLNISPTKA+MNKTPFEAW  K   +  +      S+           D K  +    E VSLV GE  NDG
Subjt:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDG

Query:  AQTVVKNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARL
         QTVV+    SSMETP STP  S P TPQSYHSPS+               MKEE+TTIEKNGTWKMV+  +GK+AIDLKWV+KTKF ADG LEK+KARL
Subjt:  AQTVVKNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARL

Query:  VAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        VAKG+ QQHG +FE+TFS +A FE V++VLALAAQ+QWSVYQFDVK  FL+ EL +EVYV QP+GFV + S+EKVYKLTKALYG +
Subjt:  VAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

KAA0055915.1 copia protein [Cucumis melo var. makuwa]6.8e-10750.79Show/hide
Query:  SKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYL
        S+S+TFEKFKHFKAKV KQSGMFIKSLR+DRGG+FL N FNHF +E GIHREL TPYT EQN VA+RKNRTVV M RSMLQ+K LS+ FW EAVSTS+YL
Subjt:  SKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYL

Query:  LNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDGAQTVVKNSNGSSMETPTSTPPLSVPSTPQ
        LNISPTK +MNKTPFEA               C  Y                       VSLV GE  NDG QTVV+    SSMETPTSTP  S  STPQ
Subjt:  LNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDGAQTVVKNSNGSSMETPTSTPPLSVPSTPQ

Query:  SYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVESEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVL
        SYHS  ++DET +ELP  RQQ  K                                   DG LEK+KARLV KGY QQHG DFE+TFS +A F+ ++IVL
Subjt:  SYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVESEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVL

Query:  ALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKVYKLTKALYGFE--------------------------------------------
        ALAAQ+QW VYQFDVK AFL+GELQ EVYV QPEGFV + S+EKVYKLTK LYG                                              
Subjt:  ALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKVYKLTKALYGFE--------------------------------------------

Query:  ---------------------------------------------TYEKLCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYA
                                                     +  KLC F DSDWASSLDDR+SVSANVFTL  GV+TWSSKKQ  VALSSSE EYA
Subjt:  ---------------------------------------------TYEKLCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYA

Query:  AATSAA
        AATSAA
Subjt:  AATSAA

RVX14143.1 putative alpha-mannosidase [Vitis vinifera]1.1e-10450.93Show/hide
Query:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT
        MQ  S GGS YFLLF +D+S MSWVYFL+SK++TFE FK FKA V KQSG  IK LR DR GEFL N F  F +E G+HREL TPY+ EQN VA+RKNRT
Subjt:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT

Query:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGG----ES
        VV M RSM++ K+LS+ FW E V+T+VYLLNISPTKA++N+TP+EAW  + P VSHL+VFG ++Y L+ S  R KLD K  KCIF+   S   G      
Subjt:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGG----ES

Query:  ANDGAQTVVKNSNGSSMETPTSTPPLSVPSTPQSYHSP-------SSHDETLDELPPWRQQ-------------------------------PMKEEITT
         +DGA   + +S     ++    P + +P++P   HSP       SS  ++ +E PP + +                                MKEEI  
Subjt:  ANDGAQTVVKNSNGSSMETPTSTPPLSVPSTPQSYHSP-------SSHDETLDELPPWRQQ-------------------------------PMKEEITT

Query:  IEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEV
        IEKN TW++VE  E K+ I +KWVF+TK++ADG ++K+KARLVAKGY QQHG D++ TFS +A FE V+ +LALAA   W VYQFDVK AFL+GEL +EV
Subjt:  IEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEV

Query:  YVGQPEGFVIEGSKEKVYKLTKALYGFE
        Y  QPEGF++   +E VY+L KALYG +
Subjt:  YVGQPEGFVIEGSKEKVYKLTKALYGFE

TYK00906.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]3.2e-11763.61Show/hide
Query:  KQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEA
        KQSGMFIKSLR+DRGGEFL N FNHF K+ GIHREL TPYT EQN VA+RKNRTVV MTRSMLQ+K LS+ FW EAVSTS+YLLNISPTKA+MNKTPFE 
Subjt:  KQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEA

Query:  WCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV---------------------------------------ENVSLVGGESANDGAQTVVKN
        W  K PNV+HLRVFGCISYALVPSQVRQKLD K EKCIFV                                       E VSLV GE  NDG QTVV+ 
Subjt:  WCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV---------------------------------------ENVSLVGGESANDGAQTVVKN

Query:  SNGSSMETPTSTPPLSVPSTPQSYHSPSSHDE-----TLDELPPW--------------------RQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWV
           SSMETPTSTPP S PSTPQSYHS SS+DE      L    PW                    RQQ MKEE+  IEKNGTWKMV+  EGK+AI LKWV
Subjt:  SNGSSMETPTSTPPLSVPSTPQSYHSPSSHDE-----TLDELPPW--------------------RQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWV

Query:  FKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKV
        +K+KF ADG LEK+KA LVAKGY QQHG DF++T S IA FE VKIVLAL A +QW VYQFDVK AFL+GELQ EVYV QPEGFV + S+EKV
Subjt:  FKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKV

TrEMBL top hitse value%identityAlignment
A0A0V0IV83 Putative ovule protein (Fragment)4.4e-11252.64Show/hide
Query:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT
        MQ KSLGGS YFLLF +DYS MSWVYFLESKS+TFEKF+ FKA V  QS   IK LR DRGGEF+ N FN F +  GIHREL TPYT EQN VA+RKNRT
Subjt:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT

Query:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV--------------
        VV M RSMLQ K+L++ FW EAV+ S+YLLN+SPTK +MNKTP+EAW  + PNVSHLRVFGC++YALV SQ RQKLD K EKCIF+              
Subjt:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV--------------

Query:  --------------ENVSL--------------VGGESANDGAQTVVKNSNGSSMETPTSTPPLS----------VP-------STPQ------------
                      EN S               +G     +   T   + +GSSME PTS  P+S          VP        TP             
Subjt:  --------------ENVSL--------------VGGESANDGAQTVVKNSNGSSMETPTSTPPLS----------VP-------STPQ------------

Query:  ---SYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENV
           S   P +++E   E   WR++ + EE+ + EKNGTW+M+E  +GK+AI LKWVFKTKF ADG L+K+KARLVAKGY QQ+G DFE+TFS +A FE V
Subjt:  ---SYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENV

Query:  KIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKVYKLTKALYGFET-----YEKLCGF
        ++VLALAAQ +W VYQFDVK AFL+G+LQ EVYV QP+GF+ EG++ KV+KL K LYG +      Y K+ G+
Subjt:  KIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKVYKLTKALYGFET-----YEKLCGF

A0A5A7SV62 Putative gag-pol polyprotein, identical3.3e-11558.33Show/hide
Query:  ESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVY
        E KS+ FEKFKHF AKV KQSGMF+KSLR+DRGGEFL N FNHF KERGIHRELITPYT EQN +A+RKNRTVV M RSMLQ+K LS+ FW EAVSTS+Y
Subjt:  ESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVY

Query:  LLNISPTKAIMNKTPFEAWCSKNPNV-------SHLRVFGCISYALVPSQVR--QKLDGKF---EKCIFVEN-----------VSLVGGESANDGAQTVV
        LLNISPTK +MNKTPFEAW  K PN        S   +F  + Y       R    L+GK       +F E+           VSLV GE  NDG QTVV
Subjt:  LLNISPTKAIMNKTPFEAWCSKNPNV-------SHLRVFGCISYALVPSQVR--QKLDGKF---EKCIFVEN-----------VSLVGGESANDGAQTVV

Query:  KNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWR--------------------------------QQPMKEEITTIEKNGTWKMVE-SEG
        +    SSMET TSTPP S PSTPQSYHS SS+DET DELP  R                                QQ MKEE+  IEKNGTWK+V+  EG
Subjt:  KNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWR--------------------------------QQPMKEEITTIEKNGTWKMVE-SEG

Query:  KSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKE
        K+AI LKWV+KTKF ADG LEK+KARLVAKGY QQHG DFE+TFS +A FE V+IVLA A Q+QW VYQFDVK  FL+GELQ EVYV QP+GFV + ++E
Subjt:  KSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKE

Query:  KVYKLTKALYGFETYEKLCGFTDSDWASSLDD
        KVYKLTKALYG +   +        W S +D+
Subjt:  KVYKLTKALYGFETYEKLCGFTDSDWASSLDD

A0A5A7TC06 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-12165.28Show/hide
Query:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT
        MQ KSL  SFYFL+F +DYS MSW+YFLESKS+TFEKFKHFKAKV KQSGMFIKSLR+D+GGEFL N FNHFY+E GIHREL T YT EQN VA++KNRT
Subjt:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT

Query:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDG
        VV M RS+LQ+K LS+ FW+EAVSTS+YLLNISPTKA+MNKTPFEAW  K   +  +      S+           D K  +    E VSLV GE  NDG
Subjt:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDG

Query:  AQTVVKNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARL
         QTVV+    SSMETP STP  S P TPQSYHSPS+               MKEE+TTIEKNGTWKMV+  +GK+AIDLKWV+KTKF ADG LEK+KARL
Subjt:  AQTVVKNSNGSSMETPTSTPPLSVPSTPQSYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARL

Query:  VAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        VAKG+ QQHG +FE+TFS +A FE V++VLALAAQ+QWSVYQFDVK  FL+ EL +EVYV QP+GFV + S+EKVYKLTKALYG +
Subjt:  VAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

A0A5A7UQM0 Copia protein3.3e-10750.79Show/hide
Query:  SKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYL
        S+S+TFEKFKHFKAKV KQSGMFIKSLR+DRGG+FL N FNHF +E GIHREL TPYT EQN VA+RKNRTVV M RSMLQ+K LS+ FW EAVSTS+YL
Subjt:  SKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYL

Query:  LNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDGAQTVVKNSNGSSMETPTSTPPLSVPSTPQ
        LNISPTK +MNKTPFEA               C  Y                       VSLV GE  NDG QTVV+    SSMETPTSTP  S  STPQ
Subjt:  LNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDGAQTVVKNSNGSSMETPTSTPPLSVPSTPQ

Query:  SYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVESEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVL
        SYHS  ++DET +ELP  RQQ  K                                   DG LEK+KARLV KGY QQHG DFE+TFS +A F+ ++IVL
Subjt:  SYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVESEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVL

Query:  ALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKVYKLTKALYGFE--------------------------------------------
        ALAAQ+QW VYQFDVK AFL+GELQ EVYV QPEGFV + S+EKVYKLTK LYG                                              
Subjt:  ALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKVYKLTKALYGFE--------------------------------------------

Query:  ---------------------------------------------TYEKLCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYA
                                                     +  KLC F DSDWASSLDDR+SVSANVFTL  GV+TWSSKKQ  VALSSSE EYA
Subjt:  ---------------------------------------------TYEKLCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYA

Query:  AATSAA
        AATSAA
Subjt:  AATSAA

A0A5D3BRM6 Putative gag-pol polyprotein, identical1.6e-11763.61Show/hide
Query:  KQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEA
        KQSGMFIKSLR+DRGGEFL N FNHF K+ GIHREL TPYT EQN VA+RKNRTVV MTRSMLQ+K LS+ FW EAVSTS+YLLNISPTKA+MNKTPFE 
Subjt:  KQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEA

Query:  WCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV---------------------------------------ENVSLVGGESANDGAQTVVKN
        W  K PNV+HLRVFGCISYALVPSQVRQKLD K EKCIFV                                       E VSLV GE  NDG QTVV+ 
Subjt:  WCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV---------------------------------------ENVSLVGGESANDGAQTVVKN

Query:  SNGSSMETPTSTPPLSVPSTPQSYHSPSSHDE-----TLDELPPW--------------------RQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWV
           SSMETPTSTPP S PSTPQSYHS SS+DE      L    PW                    RQQ MKEE+  IEKNGTWKMV+  EGK+AI LKWV
Subjt:  SNGSSMETPTSTPPLSVPSTPQSYHSPSSHDE-----TLDELPPW--------------------RQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWV

Query:  FKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKV
        +K+KF ADG LEK+KA LVAKGY QQHG DF++T S IA FE VKIVLAL A +QW VYQFDVK AFL+GELQ EVYV QPEGFV + S+EKV
Subjt:  FKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQ-EVYVGQPEGFVIEGSKEKV

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.3e-3225.17Show/hide
Query:  YFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQ
        YF++F + ++H    Y ++ KS  F  F+ F AK      + +  L  D G E+L N    F  ++GI   L  P+T + N V++R  RT+    R+M+ 
Subjt:  YFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQ

Query:  VKDLSDVFWVEAVSTSVYLLNISPTKAIM--NKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIF-----------------------
           L   FW EAV T+ YL+N  P++A++  +KTP+E W +K P + HLRVFG   Y  + ++ + K D K  K IF                       
Subjt:  VKDLSDVFWVEAVSTSVYLLNISPTKAIM--NKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIF-----------------------

Query:  --VENVSLVGG--------------ESAN-----------------------------DGAQTVVKN---------------------------------
          V+  ++V                ES N                             D  ++  KN                                 
Subjt:  --VENVSLVGG--------------ESAN-----------------------------DGAQTVVKN---------------------------------

Query:  ------------------SNGS------------------SMETPTSTPPLSV--------PSTPQ-SYHS-------------------PSSHDET--L
                          S GS                   ++ PT    + +         + PQ SY+                    P+S DE    
Subjt:  ------------------SNGS------------------SMETPTSTPPLSV--------PSTPQ-SYHS-------------------PSSHDET--L

Query:  DELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVY
        D+   W ++ +  E+   + N TW + +  E K+ +D +WVF  K+   G   +YKARLVA+G+ Q++  D+E+TF+ +A   + + +L+L  Q    V+
Subjt:  DELPPWRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVY

Query:  QFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYG--------FETYE---KLCGFTDS--DWASSLDDRQSVSANVFTL
        Q DVK AFL+G L +E+Y+  P+G  I  + + V KL KA+YG        FE +E   K C F +S  D    + D+ +++ N++ L
Subjt:  QFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYG--------FETYE---KLCGFTDS--DWASSLDDRQSVSANVFTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.9e-6234.46Show/hide
Query:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT
        M+++S+GG+ YF+ F +D S   WVY L++K + F+ F+ F A V +++G  +K LR+D GGE+    F  +    GI  E   P T + N VA+R NRT
Subjt:  MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRT

Query:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV--------------
        +V   RSML++  L   FW EAV T+ YL+N SP+  +  + P   W +K  + SHL+VFGC ++A VP + R KLD K   CIF+              
Subjt:  VVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV--------------

Query:  -----------------------------------------------------ENVSLVG---------GESANDGAQTVVKNSNGSSMETP---TSTPP
                                                             + VS  G         GE  ++G + V   + G     P   +  P 
Subjt:  -----------------------------------------------------ENVSLVG---------GESANDGAQTVVKNSNGSSMETP---TSTPP

Query:  LS---VPSTPQSYHSPSSHDETLDEL--PPWRQQPMK---EEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF
        +     PST     S     E+L E+   P + Q MK   EE+ +++KNGT+K+VE  +GK  +  KWVFK K   D  L +YKARLV KG+ Q+ G DF
Subjt:  LS---VPSTPQSYHSPSSHDETLDEL--PPWRQQPMK---EEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF

Query:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        ++ FS +    +++ +L+LAA     V Q DVK AFLHG+L +E+Y+ QPEGF + G K  V KL K+LYG +
Subjt:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-0548.15Show/hide
Query:  LCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYAAAT
        L G+TD+D A  +D+R+S +  +FT   G ++W SK Q  VALS++E EY AAT
Subjt:  LCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYAAAT

P92520 Uncharacterized mitochondrial protein AtMg008202.1e-1039.56Show/hide
Query:  PPWRQQPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ
        P W  Q M+EE+  + +N TW +V     ++ +  KWVFKTK  +DG L++ KARLVAKG+ Q+ G  F +T+S +     ++ +L +A Q
Subjt:  PPWRQQPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.0e-2823.63Show/hide
Query:  YFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQ
        Y+++F + ++  +W+Y L+ KS+  E F  FK  +  +    I +  +D GGEF+  +   ++ + GI      P+T E N +++RK+R +V    ++L 
Subjt:  YFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQ

Query:  VKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV------------------------
           +   +W  A + +VYL+N  PT  +  ++PF+     +PN   LRVFGC  Y  +    + KLD K  +C+F+                        
Subjt:  VKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV------------------------

Query:  ----EN----------------------------------VSLVGGESAND-------------------------------------------------
            EN                                    ++   S +D                                                 
Subjt:  ----EN----------------------------------VSLVGGESAND-------------------------------------------------

Query:  -----GAQTVVKNSNGSSMETPT-----------STPPLSVPSTPQSYHSPSSHDET------LDELPP-------------------------------
               QT   +S  +S   PT           STP  S  S+P    S SS   +      L   PP                               
Subjt:  -----GAQTVVKNSNGSSMETPT-----------STPPLSVPSTPQSYHSPSSHDET------LDELPP-------------------------------

Query:  --------------------------WRQQPMKEEITTIEKNGTWKMVESEGK--SAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSS
                                  WR   M  EI     N TW +V       + +  +W+F  K+ +DG L +YKARLVAKGY Q+ G D+ +TFS 
Subjt:  --------------------------WRQQPMKEEITTIEKNGTWKMVESEGK--SAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSS

Query:  IAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        +    +++IVL +A  R W + Q DV  AFL G L  +VY+ QP GF+ +     V KL KALYG +
Subjt:  IAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.7e-0236.84Show/hide
Query:  LCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYAAATSAA
        L  ++D+DWA   DD  S +  +  L    ++WSSKKQ  V  SS+E EY +  + +
Subjt:  LCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYAAATSAA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-3124.54Show/hide
Query:  YFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQ
        Y+++F + ++  +W+Y L+ KS+  + F  FK+ V  +    I +L +D GGEF+      +  + GI      P+T E N +++RK+R +V M  ++L 
Subjt:  YFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQ

Query:  VKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV------------------------
           +   +W  A S +VYL+N  PT  +  ++PF+    + PN   L+VFGC  Y  +    R KL+ K ++C F+                        
Subjt:  VKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFV------------------------

Query:  ------------ENVSL-VGGESANDGA------------------------------------------------------------------------
                     N  +    E  +D A                                                                        
Subjt:  ------------ENVSL-VGGESANDGA------------------------------------------------------------------------

Query:  -----QTVVKNSNGSSM--------------------ETPTSTPPLSVPSTPQSY-HSPSSHDETLDELPP-----------------------------
             QT   NSN   +                    ++P S+P +  PST  S  +SPSS   +   LPP                             
Subjt:  -----QTVVKNSNGSSM--------------------ETPTSTPPLSVPSTPQSY-HSPSSHDETLDELPP-----------------------------

Query:  ------------------------------WRQQPMKEEITTIEKNGTWKMVESEGKSA--IDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEK
                                      WR Q M  EI     N TW +V     S   +  +W+F  KF +DG L +YKARLVAKGY Q+ G D+ +
Subjt:  ------------------------------WRQQPMKEEITTIEKNGTWKMVESEGKSA--IDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEK

Query:  TFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE-----------TYEKLCGFTDSDWASSL
        TFS +    +++IVL +A  R W + Q DV  AFL G L  EVY+ QP GFV +   + V +L KA+YG +           TY    GF +S   +SL
Subjt:  TFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGEL-QEVYVGQPEGFVIEGSKEKVYKLTKALYGFE-----------TYEKLCGFTDSDWASSL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.1e-2239.57Show/hide
Query:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH
        M +EI  +E   TW++      K  I  KWV+K K+ +DG +E+YKARLVAKGY QQ G DF +TFS +    +VK++LA++A   ++++Q D+  AFL+
Subjt:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH

Query:  GEL-QEVYVGQPEGFVIEGS----KEKVYKLTKALYGFE
        G+L +E+Y+  P G+            V  L K++YG +
Subjt:  GEL-QEVYVGQPEGFVIEGS----KEKVYKLTKALYGFE

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.7e-0739.71Show/hide
Query:  NRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISY
        NRT++   RSML    L   F  +A +T+V+++N  P+ AI    P E W    P  S+LR FGC++Y
Subjt:  NRTVVGMTRSMLQVKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISY

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.5e-1139.56Show/hide
Query:  PPWRQQPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ
        P W  Q M+EE+  + +N TW +V     ++ +  KWVFKTK  +DG L++ KARLVAKG+ Q+ G  F +T+S +     ++ +L +A Q
Subjt:  PPWRQQPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATGAAGTCTCTTGGTGGAAGCTTTTATTTCTTGCTTTTCCCGAACGATTATAGCCATATGAGTTGGGTGTATTTTCTAGAAAGCAAATCGAAGACATTTGAGAA
GTTCAAGCACTTTAAGGCGAAGGTGGGAAAGCAAAGTGGCATGTTCATCAAATCTCTTCGCAACGATAGAGGTGGAGAATTCTTGTTCAATTACTTCAACCATTTTTACA
AAGAACGTGGCATCCATAGGGAGTTGATAACGCCTTACACTCTAGAGCAAAACGAGGTCGCTAAGAGGAAGAATCGAACTGTGGTGGGGATGACGAGAAGCATGTTGCAA
GTAAAAGACCTTTCAGATGTTTTTTGGGTTGAAGCGGTCTCAACTTCTGTCTACTTATTAAACATCTCACCAACGAAGGCTATAATGAATAAGACTCCATTTGAAGCTTG
GTGCAGCAAAAACCCGAATGTAAGTCATTTAAGAGTTTTTGGTTGTATTTCTTATGCTTTGGTACCTTCTCAAGTTCGTCAAAAACTTGATGGAAAATTCGAAAAATGCA
TTTTTGTTGAAAATGTTTCCTTGGTGGGTGGTGAATCGGCAAATGATGGAGCACAAACGGTGGTCAAAAACTCGAATGGATCCTCAATGGAGACGCCTACCTCAACACCT
CCATTAAGTGTTCCATCAACACCACAAAGCTACCACTCTCCGTCAAGCCATGATGAAACATTGGATGAGTTGCCACCTTGGAGGCAGCAACCAATGAAGGAAGAAATAAC
AACGATTGAGAAGAATGGGACGTGGAAAATGGTAGAATCGGAGGGAAAAAGTGCAATCGACTTGAAGTGGGTCTTTAAGACGAAATTTGTTGCGGATGGAATTTTAGAGA
AGTACAAAGCTCGACTCGTGGCGAAAGGATACGTGCAGCAACACGGTAGTGATTTTGAGAAAACTTTCTCTTCAATAGCTCATTTTGAAAACGTGAAGATTGTTCTAGCA
TTGGCAGCACAACGACAATGGTCGGTTTATCAATTTGATGTCAAGTTAGCCTTTCTCCATGGAGAATTGCAAGAAGTCTATGTTGGACAACCAGAAGGTTTTGTCATAGA
AGGCAGCAAAGAAAAGGTGTATAAGTTGACAAAGGCTTTGTACGGGTTTGAAACTTATGAGAAGCTATGCGGGTTCACGGACAGCGATTGGGCGAGCTCATTGGATGATA
GGCAGAGTGTTTCAGCAAATGTATTCACACTCGAGTTAGGAGTTGTCACTTGGAGCTCGAAGAAACAAGTAAGAGTTGCTTTGTCGTCTTCTGAAGTGGAATATGCTGCA
GCAACTTCAGCAGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATGAAGTCTCTTGGTGGAAGCTTTTATTTCTTGCTTTTCCCGAACGATTATAGCCATATGAGTTGGGTGTATTTTCTAGAAAGCAAATCGAAGACATTTGAGAA
GTTCAAGCACTTTAAGGCGAAGGTGGGAAAGCAAAGTGGCATGTTCATCAAATCTCTTCGCAACGATAGAGGTGGAGAATTCTTGTTCAATTACTTCAACCATTTTTACA
AAGAACGTGGCATCCATAGGGAGTTGATAACGCCTTACACTCTAGAGCAAAACGAGGTCGCTAAGAGGAAGAATCGAACTGTGGTGGGGATGACGAGAAGCATGTTGCAA
GTAAAAGACCTTTCAGATGTTTTTTGGGTTGAAGCGGTCTCAACTTCTGTCTACTTATTAAACATCTCACCAACGAAGGCTATAATGAATAAGACTCCATTTGAAGCTTG
GTGCAGCAAAAACCCGAATGTAAGTCATTTAAGAGTTTTTGGTTGTATTTCTTATGCTTTGGTACCTTCTCAAGTTCGTCAAAAACTTGATGGAAAATTCGAAAAATGCA
TTTTTGTTGAAAATGTTTCCTTGGTGGGTGGTGAATCGGCAAATGATGGAGCACAAACGGTGGTCAAAAACTCGAATGGATCCTCAATGGAGACGCCTACCTCAACACCT
CCATTAAGTGTTCCATCAACACCACAAAGCTACCACTCTCCGTCAAGCCATGATGAAACATTGGATGAGTTGCCACCTTGGAGGCAGCAACCAATGAAGGAAGAAATAAC
AACGATTGAGAAGAATGGGACGTGGAAAATGGTAGAATCGGAGGGAAAAAGTGCAATCGACTTGAAGTGGGTCTTTAAGACGAAATTTGTTGCGGATGGAATTTTAGAGA
AGTACAAAGCTCGACTCGTGGCGAAAGGATACGTGCAGCAACACGGTAGTGATTTTGAGAAAACTTTCTCTTCAATAGCTCATTTTGAAAACGTGAAGATTGTTCTAGCA
TTGGCAGCACAACGACAATGGTCGGTTTATCAATTTGATGTCAAGTTAGCCTTTCTCCATGGAGAATTGCAAGAAGTCTATGTTGGACAACCAGAAGGTTTTGTCATAGA
AGGCAGCAAAGAAAAGGTGTATAAGTTGACAAAGGCTTTGTACGGGTTTGAAACTTATGAGAAGCTATGCGGGTTCACGGACAGCGATTGGGCGAGCTCATTGGATGATA
GGCAGAGTGTTTCAGCAAATGTATTCACACTCGAGTTAGGAGTTGTCACTTGGAGCTCGAAGAAACAAGTAAGAGTTGCTTTGTCGTCTTCTGAAGTGGAATATGCTGCA
GCAACTTCAGCAGCATGA
Protein sequenceShow/hide protein sequence
MQMKSLGGSFYFLLFPNDYSHMSWVYFLESKSKTFEKFKHFKAKVGKQSGMFIKSLRNDRGGEFLFNYFNHFYKERGIHRELITPYTLEQNEVAKRKNRTVVGMTRSMLQ
VKDLSDVFWVEAVSTSVYLLNISPTKAIMNKTPFEAWCSKNPNVSHLRVFGCISYALVPSQVRQKLDGKFEKCIFVENVSLVGGESANDGAQTVVKNSNGSSMETPTSTP
PLSVPSTPQSYHSPSSHDETLDELPPWRQQPMKEEITTIEKNGTWKMVESEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLA
LAAQRQWSVYQFDVKLAFLHGELQEVYVGQPEGFVIEGSKEKVYKLTKALYGFETYEKLCGFTDSDWASSLDDRQSVSANVFTLELGVVTWSSKKQVRVALSSSEVEYAA
ATSAA