; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G14810 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G14810
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase
Genome locationChr2:14467963..14469419
RNA-Seq ExpressionCSPI02G14810
SyntenyCSPI02G14810
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038926.1 integrase [Cucumis melo var. makuwa]6.8e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

KAA0039947.1 integrase [Cucumis melo var. makuwa]6.8e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

KAA0048003.1 integrase [Cucumis melo var. makuwa]6.8e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

KAA0057291.1 integrase [Cucumis melo var. makuwa]6.8e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

KAA0060377.1 integrase [Cucumis melo var. makuwa]6.8e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

TrEMBL top hitse value%identityAlignment
A0A5A7UDP7 Integrase3.3e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

A0A5A7UZJ8 Integrase3.3e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

A0A5D3BQ81 Integrase3.3e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

A0A5D3CXM6 Integrase3.3e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

A0A5D3E3T2 Integrase3.3e-15760.34Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+                                                
Subjt:  MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTH------------------------------------------------

Query:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH
                     G+F KE GI HQ T R TPQQNGVAERKNR IME+ARSMLKAK LP++FWGDAV C VY+LNRA TKSV G+TP EAW G KP+VSH
Subjt:  -------------GDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNP+S+K+IISRDV F E + W WN   +E ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAP-NEDQNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS+ E + R+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEID IRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.4e-5627.93Show/hide
Query:  MSNIKKEDQLCEACVFRKHHRNSF-PTGGSWRASKPLELVHTDLCGPMRTTTHGD---------------------------------------------
        ++N++   ++CE C+  K  R  F          +PL +VH+D+CGP+   T  D                                             
Subjt:  MSNIKKEDQLCEACVFRKHHRNSF-PTGGSWRASKPLELVHTDLCGPMRTTTHGD---------------------------------------------

Query:  ------------------FLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSV--QGITPQEAWSGLK
                          F  + GI +  TV  TPQ NGV+ER  R I E AR+M+   KL   FWG+AV  A YL+NR  ++++     TP E W   K
Subjt:  ------------------FLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSV--QGITPQEAWSGLK

Query:  PTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEA-----------------------------------
        P + HLRVFG   Y HI + K+GK DDKS K IFVGY  N   ++L++ +++K I++RDV  DE                                    
Subjt:  PTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEA-----------------------------------

Query:  --------------------------------KLWQWNAPNEDQNPLHVDM-------------DGKKDARDLELEVTQPLTSPSSSHSTSDKE------
                                        K+ Q   PNE +   ++               + KK  RD  L  ++   +P+ S  +   E      
Subjt:  --------------------------------KLWQWNAPNEDQNPLHVDM-------------DGKKDARDLELEVTQPLTSPSSSHSTSDKE------

Query:  -TTSRKTRNIQEIYNTSRRILDEEHVDF---------------ALFANVDPVYFEEAIQDE--NWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIY
             K   I+ I   S R+  +  + +                +F +V   + E   +D+  +W++A+N E++  + N TW + K PENK  +  +W++
Subjt:  -TTSRKTRNIQEIYNTSRRILDEEHVDF---------------ALFANVDPVYFEEAIQDE--NWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIY

Query:  RTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP
          K  + G   +YKARLV +G+ QK+ +DYEE FAPV R+ + R +L+L  + N KVHQMDVK+AFLNG L++EIY+  P
Subjt:  RTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.0e-7233.4Show/hide
Query:  CEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHG---------------------------------------------------------
        C+ C+F K HR SF T  S R    L+LV++D+CGPM   + G                                                         
Subjt:  CEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHG---------------------------------------------------------

Query:  ------DFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIA
              ++   +GI+H+KTV  TPQ NGVAER NR I+E  RSML+  KLP  FWG+AV  A YL+NR+ +  +    P+  W+  + + SHL+VFGC A
Subjt:  ------DFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIA

Query:  YSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPLHVDMDGKKDARDLELEVTQPLTS--PSSSHSTS
        ++H+  E+R KLDDKS  CIF+GY +    YRL++P+ KKVI SRDV F E+++               DM  K     +   VT P TS  P+S+ ST+
Subjt:  YSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPLHVDMDGKKDARDLELEVTQPLTS--PSSSHSTS

Query:  DK------------ETTSRKTRNIQEIYNTSR--------RILDEEHVDFALFANVDPVYFEEAIQDENWKD------------AMNQEIDTIRRNETWE
        D+            E   +    ++E+ + ++        R  +   V+   + + + V   +  + E+ K+            AM +E++++++N T++
Subjt:  DK------------ETTSRKTRNIQEIYNTSR--------RILDEEHVDFALFANVDPVYFEEAIQDENWKD------------AMNQEIDTIRRNETWE

Query:  LVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPVM
        LV+LP+ K+ L  KW+++ K   + ++ +YKARLVVKG++QK G+D++E+F+PV ++ ++R +L+LAA  + +V Q+DVK+AFL+G LE+EIY+EQP   
Subjt:  LVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPVM

Query:  QRLEKK
        +   KK
Subjt:  QRLEKK

P92520 Uncharacterized mitochondrial protein AtMg008201.1e-1643.48Show/hide
Query:  AIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALA
        A++D  W  AM +E+D + RN+TW LV  P N+  LG KW+++TKL  +G + + KARLV KG+ Q+ G+ + E ++PV R  T+R +L +A
Subjt:  AIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.6e-4626.24Show/hide
Query:  DFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISD
        ++  ++GI H  +   TP+ NG++ERK+R I+E   ++L    +P  +W  A   AVYL+NR  T  +Q  +P +   G  P    LRVFGC  Y  +  
Subjt:  DFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISD

Query:  EKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDE----------------------AKLWQ----------------------------
          + KLDDKS +C+F+GYS    AY   +  + ++ ISR V+FDE                      + +W                             
Subjt:  EKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDE----------------------AKLWQ----------------------------

Query:  ----------------------------------------WNAPNEDQNPLHVDMDGKKD--ARDLELEVTQPLTSPSSSHSTSDKETTSRKTRN-----
                                                   P + Q   H   +  ++    +   ++ Q L++P+ S S+S   TTS  + +     
Subjt:  ----------------------------------------WNAPNEDQNPLHVDMDGKKD--ARDLELEVTQPLTSPSSSHSTSDKETTSRKTRN-----

Query:  ----------IQEIYNTSRRILDEEH------------------VDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKAL-GVK
                  + +I N + +     H                  +  +L A  +P    +A++DE W++AM  EI+    N TW+LV  P +   + G +
Subjt:  ----------IQEIYNTSRRILDEEH------------------VDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKAL-GVK

Query:  WIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPP
        WI+  K   +G + +YKARLV KGY Q+ G+DY E F+PV +  ++R++L +A   +W + Q+DV +AFL G L D++Y+ QPP
Subjt:  WIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-4326.84Show/hide
Query:  DFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISD
        D+L ++GI H  +   TP+ NG++ERK+R I+E+  ++L    +P  +W  A + AVYL+NR  T  +Q  +P +   G  P    L+VFGC  Y  +  
Subjt:  DFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISD

Query:  EKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDE---------------AKLWQWNAPNEDQN------PL----------HVDMDGKK
          R KL+DKS++C F+GYS    AY   +  + ++  SR V+FDE                +    +APN   +      PL          H+D   + 
Subjt:  EKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDE---------------AKLWQWNAPNEDQN------PL----------HVDMDGKK

Query:  DARDLELEVTQ---------PLTSPSSSHSTSDKETTSRKTRNIQEIYNTS--------------------------RRILDEEHVD-------------
         +    L  TQ          ++SPSSS  T+      + T    +  N++                          +  +   H+              
Subjt:  DARDLELEVTQ---------PLTSPSSSHSTSDKETTSRKTRNIQEIYNTS--------------------------RRILDEEHVD-------------

Query:  --------------------------------------------------FALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELV-KLPENKKA
                                                           +L AN +P    +A++D+ W+ AM  EI+    N TW+LV   P +   
Subjt:  --------------------------------------------------FALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELV-KLPENKKA

Query:  LGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPP
        +G +WI+  K   +G + +YKARLV KGY Q+ G+DY E F+PV +  ++R++L +A   +W + Q+DV +AFL G L DE+Y+ QPP
Subjt:  LGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.8e-3038.42Show/hide
Query:  PSSSHSTSDKETTSRKTRNIQEIYNTSRRILDEEHV-DFALFANVDPVY---------------FEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENK
        P  S  TS + T  RK   +Q+ Y  S   L    +  F  +  V P+Y               + EA +   W  AM+ EI  +    TWE+  LP NK
Subjt:  PSSSHSTSDKETTSRKTRNIQEIYNTSRRILDEEHV-DFALFANVDPVY---------------FEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENK

Query:  KALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPP
        K +G KW+Y+ K   +G +++YKARLV KGY Q+ G+D+ E F+PV +L +V+L+LA++A  N+ +HQ+D+ +AFLNG L++EIY++ PP
Subjt:  KALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPP

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.4e-1140Show/hide
Query:  NRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISDEKRGKLDDKSEK
        NR I+E  RSML    LP  F  DA   AV+++N+  + ++    P E W    PT S+LR FGC+AY H  +   GKL  +++K
Subjt:  NRIIMELARSMLKAKKLPDQFWGDAVTCAVYLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISDEKRGKLDDKSEK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.8e-1843.48Show/hide
Query:  AIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALA
        A++D  W  AM +E+D + RN+TW LV  P N+  LG KW+++TKL  +G + + KARLV KG+ Q+ G+ + E ++PV R  T+R +L +A
Subjt:  AIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTTTCAGAAAGCATCATCGAAATTCATTTCCGACTGGAGGTTCTTGGAGAGCATCAAAACCACTCGA
GCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAA
ACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGAGACGCAGTAACTTGTGCTGTT
TATCTCCTAAATAGAGCTTCAACGAAAAGTGTGCAAGGTATTACTCCTCAAGAAGCATGGAGCGGATTGAAACCAACCGTTAGTCACCTAAGAGTGTTTGGGTGCATTGC
TTACTCTCACATTTCAGATGAGAAAAGAGGTAAGCTAGATGATAAATCAGAGAAATGCATTTTTGTTGGGTACAGTGAGAACTCTAAGGCCTACAGACTATACAATCCGA
TAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGA
AAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATAAAGAAACTACTTCAAGGAAGACCAGAAATAT
TCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAATGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAA
ATTGGAAAGATGCAATGAATCAAGAGATTGATACAATAAGAAGAAACGAGACATGGGAGTTAGTAAAATTACCAGAAAATAAAAAGGCTCTTGGAGTCAAATGGATCTAT
AGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGACTCGTTGTAAAAGGTTACAAACAAAAGTTTGGTGTGGATTATGAAGAAGTTTTTGCACCGGT
AACTCGCTTGGAGACTGTTCGTTTGTTGTTAGCCCTTGCAGCAAAAAATAACTGGAAAGTTCATCAAATGGATGTAAAGTCAGCATTCCTAAATGGGTATTTAGAGGATG
AAATATATGTTGAGCAACCCCCGGTTATGCAAAGATTGGAGAAGAAAATAAGGTGTGTCGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTTTCAGAAAGCATCATCGAAATTCATTTCCGACTGGAGGTTCTTGGAGAGCATCAAAACCACTCGA
GCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGATTTCTTGAAGGAAAATGGAATCAAGCATCAGAAGACTGTTCGAAGAACTCCTCAACAAA
ACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGAACTTGCAAGAAGTATGTTGAAGGCAAAGAAGCTTCCTGATCAATTTTGGGGAGACGCAGTAACTTGTGCTGTT
TATCTCCTAAATAGAGCTTCAACGAAAAGTGTGCAAGGTATTACTCCTCAAGAAGCATGGAGCGGATTGAAACCAACCGTTAGTCACCTAAGAGTGTTTGGGTGCATTGC
TTACTCTCACATTTCAGATGAGAAAAGAGGTAAGCTAGATGATAAATCAGAGAAATGCATTTTTGTTGGGTACAGTGAGAACTCTAAGGCCTACAGACTATACAATCCGA
TAAGTAAGAAAGTTATTATTAGTCGAGATGTCAAGTTCGATGAAGCAAAATTGTGGCAATGGAATGCACCAAATGAAGACCAAAATCCATTACATGTTGATATGGATGGA
AAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATAAAGAAACTACTTCAAGGAAGACCAGAAATAT
TCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAATGTTGATCCTGTATACTTTGAAGAAGCAATTCAAGATGAAA
ATTGGAAAGATGCAATGAATCAAGAGATTGATACAATAAGAAGAAACGAGACATGGGAGTTAGTAAAATTACCAGAAAATAAAAAGGCTCTTGGAGTCAAATGGATCTAT
AGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGACTCGTTGTAAAAGGTTACAAACAAAAGTTTGGTGTGGATTATGAAGAAGTTTTTGCACCGGT
AACTCGCTTGGAGACTGTTCGTTTGTTGTTAGCCCTTGCAGCAAAAAATAACTGGAAAGTTCATCAAATGGATGTAAAGTCAGCATTCCTAAATGGGTATTTAGAGGATG
AAATATATGTTGAGCAACCCCCGGTTATGCAAAGATTGGAGAAGAAAATAAGGTGTGTCGATTAA
Protein sequenceShow/hide protein sequence
MSNIKKEDQLCEACVFRKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGDFLKENGIKHQKTVRRTPQQNGVAERKNRIIMELARSMLKAKKLPDQFWGDAVTCAV
YLLNRASTKSVQGITPQEAWSGLKPTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPISKKVIISRDVKFDEAKLWQWNAPNEDQNPLHVDMDG
KKDARDLELEVTQPLTSPSSSHSTSDKETTSRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDTIRRNETWELVKLPENKKALGVKWIY
RTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPVMQRLEKKIRCVD