; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0016251 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0016251
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr01:14351934..14353398
RNA-Seq ExpressionCmc01g0016251
SyntenyCmc01g0016251
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]1.5e-17768.66Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +ELLSHF AFH EIKNQFNVSIKTLRTDNAGEYFSH+LGSYLCE+GIIHQSSC DT SQNG+AE KNRH+LETA AL FQMHV K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMPSS+LNGEI YR                                                                       DT FTSSPSS 
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE
        CQGEDDNL IYE+TSP     TD  PSR L S+VY RRPP QPS SC  S+  SSCDP PSDDL I LRKGKRKCTYPVSSF+ YHQLS  TYAFITSLE
Subjt:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE

Query:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS
        STSI N+VHEALSH GW+NAMIEEMTALDDNGTWDLVS P GKKAI CKWVF+VK+N DGTV +LKA LVAKGYAQ YG +YSDTFSPVAKLTSIRLFLS
Subjt:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS

Query:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF
        MAAT+ WSL QLDIKN FLHGDLQEEVYMEQP GF+AQGESDKVCRLRKSLYGLKQ P AWFGKFS AL+ FGM+KST DH VF+
Subjt:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]1.5e-17768.66Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +ELLSHF AFH EIKNQFNVSIKTLRTDNAGEYFSH+LGSYLCE+GIIHQSSC DT SQNG+AE KNRH+LETA AL FQMHV K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMPSS+LNGEI YR                                                                       DT FTSSPSS 
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE
        CQGEDDNL IYE+TSP     TD  PSR L S+VY RRPP QPS SC  S+  SSCDP PSDDL I LRKGKRKCTYPVSSF+ YHQLS  TYAFITSLE
Subjt:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE

Query:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS
        STSI N+VHEALSH GW+NAMIEEMTALDDNGTWDLVS P GKKAI CKWVF+VK+N DGTV +LKA LVAKGYAQ YG +YSDTFSPVAKLTSIRLFLS
Subjt:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS

Query:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF
        MAAT+ WSL QLDIKN FLHGDLQEEVYMEQP GF+AQGESDKVCRLRKSLYGLKQ P AWFGKFS AL+ FGM+KST DH VF+
Subjt:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]1.5e-17768.66Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +ELLSHF AFH EIKNQFNVSIKTLRTDNAGEYFSH+LGSYLCE+GIIHQSSC DT SQNG+AE KNRH+LETA AL FQMHV K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMPSS+LNGEI YR                                                                       DT FTSSPSS 
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE
        CQGEDDNL IYE+TSP     TD  PSR L S+VY RRPP QPS SC  S+  SSCDP PSDDL I LRKGKRKCTYPVSSF+ YHQLS  TYAFITSLE
Subjt:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE

Query:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS
        STSI N+VHEALSH GW+NAMIEEMTALDDNGTWDLVS P GKKAI CKWVF+VK+N DGTV +LKA LVAKGYAQ YG +YSDTFSPVAKLTSIRLFLS
Subjt:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS

Query:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF
        MAAT+ WSL QLDIKN FLHGDLQEEVYMEQP GF+AQGESDKVCRLRKSLYGLKQ P AWFGKFS AL+ FGM+KST DH VF+
Subjt:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]1.5e-17768.66Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +ELLSHF AFH EIKNQFNVSIKTLRTDNAGEYFSH+LGSYLCE+GIIHQSSC DT SQNG+AE KNRH+LETA AL FQMHV K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMPSS+LNGEI YR                                                                       DT FTSSPSS 
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE
        CQGEDDNL IYE+TSP     TD  PSR L S+VY RRPP QPS SC  S+  SSCDP PSDDL I LRKGKRKCTYPVSSF+ YHQLS  TYAFITSLE
Subjt:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE

Query:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS
        STSI N+VHEALSH GW+NAMIEEMTALDDNGTWDLVS P GKKAI CKWVF+VK+N DGTV +LKA LVAKGYAQ YG +YSDTFSPVAKLTSIRLFLS
Subjt:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS

Query:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF
        MAAT+ WSL QLDIKN FLHGDLQEEVYMEQP GF+AQGESDKVCRLRKSLYGLKQ P AWFGKFS AL+ FGM+KST DH VF+
Subjt:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]1.5e-17768.66Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +ELLSHF AFH EIKNQFNVSIKTLRTDNAGEYFSH+LGSYLCE+GIIHQSSC DT SQNG+AE KNRH+LETA AL FQMHV K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMPSS+LNGEI YR                                                                       DT FTSSPSS 
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE
        CQGEDDNL IYE+TSP     TD  PSR L S+VY RRPP QPS SC  S+  SSCDP PSDDL I LRKGKRKCTYPVSSF+ YHQLS  TYAFITSLE
Subjt:  CQGEDDNLSIYEITSP-----TDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLE

Query:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS
        STSI N+VHEALSH GW+NAMIEEMTALDDNGTWDLVS P GKKAI CKWVF+VK+N DGTV +LKA LVAKGYAQ YG +YSDTFSPVAKLTSIRLFLS
Subjt:  STSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLS

Query:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF
        MAAT+ WSL QLDIKN FLHGDLQEEVYMEQP GF+AQGESDKVCRLRKSLYGLKQ P AWFGKFS AL+ FGM+KST DH VF+
Subjt:  MAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF

TrEMBL top hitse value%identityAlignment
A0A438CP53 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-12048.02Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +E+ SHF AF  EIK Q++VS+K LR+DN  EY S++  +Y+  +GI+HQ+SCVDT SQNG+AE KNRH+LETA AL+FQM V K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMP+ +L G+I Y+                                                                       DT F SSP+SS
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITS--PT--------------------DAPPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKC--TYPV
           ED+   +Y++ +  PT                    + PP+   P   +VY RRP +  + +C    PSSS DP    DL I+LRKGKR C   Y +
Subjt:  CQGEDDNLSIYEITS--PT--------------------DAPPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKC--TYPV

Query:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG
        ++FV Y  LSS +   + S++S S+  TV EAL+H GW+NAM+EE+ AL+DN TW LV LP GKK + CKWVF+VKVN DG+V +LKA LVA+GYAQTYG
Subjt:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG

Query:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY
        ++YSDTFSPVAKL S+RLF+S+AA+  W + QLDIKN FLHGDL+EEVY+EQP GF+AQGE  KVCRL+K+LYGLKQ P AWFGKFS  +  FGM KS  
Subjt:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY

Query:  DHYVFF
        DH VF+
Subjt:  DHYVFF

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-12048.42Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +E+ SHF AF  EIK Q++VS+K LR+DN  EY S++  +Y+  +GI+HQ+SCVDT SQNG+AE KNRH+LETA AL+FQM V K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMP+ +L  +I Y+                                                                       DT F SSP+SS
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITS--PT---------DA-----------PPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKC--TYPV
           ED+   +Y++ +  PT         DA           PP+   P   +VY RRP +  + +C T  PSSS DP    DL I+LRKGKR C   Y +
Subjt:  CQGEDDNLSIYEITS--PT---------DA-----------PPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKC--TYPV

Query:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG
        ++FV Y  LSS +   + S++S S+  TV EAL+H GW+NAM+EE+ AL+DN TW LV LP GKK + CKWVF+VKVN DG+V +LKA LVA+GYAQTYG
Subjt:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG

Query:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY
        ++YSDTFSPVAKL S+RLF+S+AA+  W + QLDIKN FLHGDL+EEVY+EQP GF+AQGE  KVCRL+K+LYGLKQ P AWFGKFS  +  FGM KS  
Subjt:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY

Query:  DHYVFF
        DH VF+
Subjt:  DHYVFF

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-946.3e-12148.62Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +E+ SHF AF  EIK Q++VS+K LR+DN  EY S++  +Y+ ++GI+HQ+SCVDT SQNG+AE KNRH+LETA AL+FQM V K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMP+ +L G+I Y+                                                                       DT F SSP+SS
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITS--PT---------DA-----------PPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKR--KCTYPV
           ED+   +Y++ +  PT         DA           PP+   P   +VY RRP +  + +C    PSSS DP    DL I+LRKGKR  K  Y +
Subjt:  CQGEDDNLSIYEITS--PT---------DA-----------PPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKR--KCTYPV

Query:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG
        ++FV Y  LSS +   + S++S S+  TV EAL+H GW+NAM+EE+ AL+DN TW LV LP GKK + CKWVF+VKVNLDG+V +LKA LVA+GYAQTYG
Subjt:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG

Query:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY
        ++YSDTFSPVAKL S+RLF+S+AA+  W + QLDIKN FLHGDL+EEVY+EQP GF+AQGE  KVCRL+K+LYGLKQ P AWFGKFS  +  FGM KS  
Subjt:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY

Query:  DHYVFF
        DH VF+
Subjt:  DHYVFF

A0A5D3DJ35 Cysteine-rich RLK (RECEPTOR-like protein kinase) 82.6e-15999.64Show/hide
Query:  DTLFTSSPSSSCQGEDDNLSIYEITSPTDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYA
        DTLFTSSPSSSCQGEDDNLSIYEITSPTDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYA
Subjt:  DTLFTSSPSSSCQGEDDNLSIYEITSPTDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYA

Query:  FITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTS
        FITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTS
Subjt:  FITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTS

Query:  IRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGM
        IRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFG+
Subjt:  IRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGM

B0FBS2 Uncharacterized protein2.4e-12048.42Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        MK+ +E+ SHF AF  EIK Q++VS+K LR+DN  EY S++  +Y+  +GI+HQ+SCVDT SQNG+AE KNRH+LETA AL+FQM V K FW D VST C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS
        FLINRMP+ +L G+I Y+                                                                       DT F SSP+SS
Subjt:  FLINRMPSSLLNGEISYR-----------------------------------------------------------------------DTLFTSSPSSS

Query:  CQGEDDNLSIYEITS--PT---------DA-----------PPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKC--TYPV
           ED+   +Y++ +  PT         DA           PP+   P   +VY RRP +  + +C    PSSS DP    DL I+LRKGKR C   Y +
Subjt:  CQGEDDNLSIYEITS--PT---------DA-----------PPSRLLPS--RVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKC--TYPV

Query:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG
        ++FV Y  LSS +   + S++S S+  TV EAL+H GW+NAM+EE+ AL+DN TW LV LP GKK + CKWVF+VKVN DG+V +LKA LVA+GYAQTYG
Subjt:  SSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYG

Query:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY
        ++YSDTFSPVAKL S+RLF+S+AA+  W + QLDIKN FLHGDL+EEVY+EQP GF+AQGE  KVCRL+K+LYGLKQ P AWFGKFS  +  FGM KS  
Subjt:  INYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTY

Query:  DHYVFF
        DH VF+
Subjt:  DHYVFF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-3234.18Show/hide
Query:  DPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHL-------GWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCK
        +P  +D + I  R+ +R  T P  S   Y++  +     +  L + +I N V  +   +        W  A+  E+ A   N TW +   P  K  +  +
Subjt:  DPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHL-------GWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCK

Query:  WVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRK
        WVFSVK N  G   + KA LVA+G+ Q Y I+Y +TF+PVA+++S R  LS+   +N  + Q+D+K  FL+G L+EE+YM  P G      SD VC+L K
Subjt:  WVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRK

Query:  SLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFFI
        ++YGLKQ    WF  F  AL       S+ D  ++ +
Subjt:  SLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFFI

P04146 Copia protein2.8e-0930.63Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        +K  +++ S F  F  + +  FN+ +  L  DN  EY S+ +  +  + GI +  +   T   NG++E   R + E A  ++    + K+FWG+ V T  
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLL
        +LINR+PS  L
Subjt:  FLINRMPSSLL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.1e-5631.93Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        +K+  ++   F  FH  ++ +    +K LR+DN GEY S     Y   HGI H+ +   T   NG+AE  NR ++E   ++L    + K+FWG+ V T C
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLL----------NGEISY------------------RDTLFTSSPSSSCQGEDDNLSIYEITSP------------------------TDA
        +LINR PS  L          N E+SY                  R  L   S      G  D    Y +  P                        ++ 
Subjt:  FLINRMPSSLL----------NGEISY------------------RDTLFTSSPSSSCQGEDDNLSIYEITSP------------------------TDA

Query:  PPSRLLPSRVYFRRPPSQPSGSCHTS--VPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLESTSIS----------NTVHEAL
          + ++P+ V      + P+ +  T+  V      PG   +    L +G  +  +P      +  L       + S    S             ++ E L
Subjt:  PPSRLLPSRVYFRRPPSQPSGSCHTS--VPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFITSLESTSIS----------NTVHEAL

Query:  SH---LGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSMAATHNWSL
        SH        AM EEM +L  NGT+ LV LP GK+ + CKWVF +K + D  + + KA LV KG+ Q  GI++ + FSPV K+TSIR  LS+AA+ +  +
Subjt:  SH---LGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSMAATHNWSL

Query:  DQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF
        +QLD+K  FLHGDL+EE+YMEQP GF   G+   VC+L KSLYGLKQ P  W+ KF   +      K+  D  V+F
Subjt:  DQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFF

P92520 Uncharacterized mitochondrial protein AtMg008203.7e-1741.59Show/hide
Query:  SPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPV
        +P Y+   +        +V  AL   GW  AM EE+ AL  N TW LV  P  +  + CKWVF  K++ DGT+ +LKA LVAKG+ Q  GI + +T+SPV
Subjt:  SPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPV

Query:  AKLTSIRLFLSMA
         +  +IR  L++A
Subjt:  AKLTSIRLFLSMA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-4627.54Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        +K  +++   F  F   ++N+F   I T  +DN GE+ +  L  Y  +HGI H +S   T   NG++E K+RH++ET   LL    + KT+W    +   
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYRDTLFTSSPS-----------------------------------SSCQGE-------------------DDNLSIYEITSPT
        +LINR+P+ LL  E  ++  LF +SP+                                   S  Q                     D+N   +     T
Subjt:  FLINRMPSSLLNGEISYRDTLFTSSPS-----------------------------------SSCQGE-------------------DDNLSIYEITSPT

Query:  DAP-------------PSRLLPSRV------------YFRRPPSQPSGSCHTS-VPSSSCDPG------PSDDLLITLRKGKRKCTYPVSSFVFYH----
         +P             P   LP+R             +   PPS PS     S V SS+ D         S +     + G +  T P  +    H    
Subjt:  DAP-------------PSRLLPSRV------------YFRRPPSQPSGSCHTS-VPSSSCDPG------PSDDLLITLRKGKRKCTYPVSSFVFYH----

Query:  ------------QLS--------------------------------------------------------------------SPTYAFITSLESTSISN
                    QL+                                                                    +P Y+   SL + S   
Subjt:  ------------QLS--------------------------------------------------------------------SPTYAFITSLESTSISN

Query:  TVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAI-SCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSMAATH
        T  +AL    WRNAM  E+ A   N TWDLV  P     I  C+W+F+ K N DG++ + KA LVAKGY Q  G++Y++TFSPV K TSIR+ L +A   
Subjt:  TVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAI-SCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSMAATH

Query:  NWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFFI
        +W + QLD+ N FL G L ++VYM QP GFI +   + VC+LRK+LYGLKQ P AW+ +  + L+  G   S  D  +F +
Subjt:  NWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFFI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-4628.03Show/hide
Query:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC
        +K  +++   F  F   ++N+F   I TL +DN GE+    L  YL +HGI H +S   T   NG++E K+RH++E    LL    V KT+W    S   
Subjt:  MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTC

Query:  FLINRMPSSLLNGEISYR---------------------------------------------------------DTLFTS-----------------SP
        +LINR+P+ LL  +  ++                                                           L+TS                   
Subjt:  FLINRMPSSLLNGEISYR---------------------------------------------------------DTLFTS-----------------SP

Query:  SSSCQGEDDNLSIY--EITSPT-----DAPPSRLLPSRVYFRRPPSQPSGSCHTSV-----PSSSCDPGPSDDLLITLRKGKRKCTYP------------
        S+S +   D+   +    T PT      APP  L P      RPPS PS  C T V     PSSS     S +       G +    P            
Subjt:  SSSCQGEDDNLSIY--EITSPT-----DAPPSRLLPSRVYFRRPPSQPSGSCHTSV-----PSSSCDPGPSDDLLITLRKGKRKCTYP------------

Query:  ---------------VSSFVFYHQLSSP-------------------------------------------------------------TYAFITSLEST
                        +S +    +SSP                                                              Y++ TSL + 
Subjt:  ---------------VSSFVFYHQLSSP-------------------------------------------------------------TYAFITSLEST

Query:  SISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAI-SCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSM
        S   T  +A+    WR AM  E+ A   N TWDLV  P     I  C+W+F+ K N DG++ + KA LVAKGY Q  G++Y++TFSPV K TSIR+ L +
Subjt:  SISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAI-SCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSM

Query:  AATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFFI
        A   +W + QLD+ N FL G L +EVYM QP GF+ +   D VCRLRK++YGLKQ P AW+ +    L+  G   S  D  +F +
Subjt:  AATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFFI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 87.6e-5046.23Show/hide
Query:  YPVSSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQ
        + +S F+ Y ++S   ++F+  +      +T +EA   L W  AM +E+ A++   TW++ +LP  KK I CKWV+ +K N DGT+ + KA LVAKGY Q
Subjt:  YPVSSFVFYHQLSSPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQ

Query:  TYGINYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIA-QGES---DKVCRLRKSLYGLKQGPHAWFGKFSHALIRF
          GI++ +TFSPV KLTS++L L+++A +N++L QLDI N FL+GDL EE+YM+ P G+ A QG+S   + VC L+KS+YGLKQ    WF KFS  LI F
Subjt:  TYGINYSDTFSPVAKLTSIRLFLSMAATHNWSLDQLDIKNDFLHGDLQEEVYMEQPLGFIA-QGES---DKVCRLRKSLYGLKQGPHAWFGKFSHALIRF

Query:  GMQKSTYDHYVF
        G  +S  DH  F
Subjt:  GMQKSTYDHYVF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.6e-1841.59Show/hide
Query:  SPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPV
        +P Y+   +        +V  AL   GW  AM EE+ AL  N TW LV  P  +  + CKWVF  K++ DGT+ +LKA LVAKG+ Q  GI + +T+SPV
Subjt:  SPTYAFITSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPV

Query:  AKLTSIRLFLSMA
         +  +IR  L++A
Subjt:  AKLTSIRLFLSMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGTCATACTGAGTTATTATCTCACTTTTATGCCTTTCATGTTGAAATAAAAAATCAGTTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATA
TTTTTCTCATACTCTTGGCTCTTACTTGTGTGAACATGGCATTATTCATCAGTCTTCATGTGTTGACACTCAATCTCAAAATGGTATCGCAGAACACAAAAACAGGCATG
TACTTGAAACTGCCCATGCTTTGTTGTTTCAAATGCATGTTTTGAAAACTTTTTGGGGTGATGTCGTGTCTACTACTTGTTTTTTAATTAATAGAATGCCTTCATCGCTT
CTTAATGGTGAGATTTCTTATCGTGATACACTTTTTACTTCATCACCATCGAGTTCGTGTCAGGGGGAGGATGACAATCTTTCTATATATGAGATTACCTCTCCCACTGA
TGCGCCTCCTTCCCGTCTGTTGCCTTCTCGAGTCTACTTCCGACGACCTCCATCACAACCTTCAGGCTCATGTCATACATCAGTGCCTTCTTCATCATGTGATCCGGGAC
CAAGTGATGATCTTCTCATTACTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCTGTTTCTTCTTTTGTTTTCTATCACCAATTATCTTCCCCCACATATGCTTTCATT
ACATCTCTTGAGTCCACTTCTATTTCTAACACTGTTCATGAAGCTTTATCTCATCTTGGCTGGCGAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTAC
TTGGGATTTGGTATCTCTTCCTGGAGGAAAGAAGGCTATTAGTTGTAAATGGGTGTTTTCTGTTAAGGTGAATCTTGATGGAACAGTGACTCAATTGAAAGCTCATCTTG
TTGCCAAAGGTTATGCTCAAACCTATGGTATTAATTATTCAGATACATTTTCTCCAGTTGCCAAATTAACTTCCATCCGCCTATTTCTTTCCATGGCTGCTACCCATAAC
TGGTCTTTGGATCAACTTGACATTAAAAATGATTTTCTGCATGGTGATCTTCAAGAGGAAGTTTATATGGAGCAACCACTTGGGTTTATCGCTCAAGGGGAGAGTGATAA
AGTATGTCGCCTTCGAAAATCTTTATATGGTTTGAAACAGGGTCCACATGCATGGTTTGGTAAGTTTAGTCATGCTCTTATACGTTTTGGTATGCAAAAGAGTACATATG
ATCATTATGTTTTTTTTATCGTCGATCTGATAATGGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGTCATACTGAGTTATTATCTCACTTTTATGCCTTTCATGTTGAAATAAAAAATCAGTTTAATGTCTCTATCAAAACTTTGCGTACTGATAATGCGGGTGAATA
TTTTTCTCATACTCTTGGCTCTTACTTGTGTGAACATGGCATTATTCATCAGTCTTCATGTGTTGACACTCAATCTCAAAATGGTATCGCAGAACACAAAAACAGGCATG
TACTTGAAACTGCCCATGCTTTGTTGTTTCAAATGCATGTTTTGAAAACTTTTTGGGGTGATGTCGTGTCTACTACTTGTTTTTTAATTAATAGAATGCCTTCATCGCTT
CTTAATGGTGAGATTTCTTATCGTGATACACTTTTTACTTCATCACCATCGAGTTCGTGTCAGGGGGAGGATGACAATCTTTCTATATATGAGATTACCTCTCCCACTGA
TGCGCCTCCTTCCCGTCTGTTGCCTTCTCGAGTCTACTTCCGACGACCTCCATCACAACCTTCAGGCTCATGTCATACATCAGTGCCTTCTTCATCATGTGATCCGGGAC
CAAGTGATGATCTTCTCATTACTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCTGTTTCTTCTTTTGTTTTCTATCACCAATTATCTTCCCCCACATATGCTTTCATT
ACATCTCTTGAGTCCACTTCTATTTCTAACACTGTTCATGAAGCTTTATCTCATCTTGGCTGGCGAAATGCAATGATTGAGGAGATGACTGCTTTAGATGATAATGGTAC
TTGGGATTTGGTATCTCTTCCTGGAGGAAAGAAGGCTATTAGTTGTAAATGGGTGTTTTCTGTTAAGGTGAATCTTGATGGAACAGTGACTCAATTGAAAGCTCATCTTG
TTGCCAAAGGTTATGCTCAAACCTATGGTATTAATTATTCAGATACATTTTCTCCAGTTGCCAAATTAACTTCCATCCGCCTATTTCTTTCCATGGCTGCTACCCATAAC
TGGTCTTTGGATCAACTTGACATTAAAAATGATTTTCTGCATGGTGATCTTCAAGAGGAAGTTTATATGGAGCAACCACTTGGGTTTATCGCTCAAGGGGAGAGTGATAA
AGTATGTCGCCTTCGAAAATCTTTATATGGTTTGAAACAGGGTCCACATGCATGGTTTGGTAAGTTTAGTCATGCTCTTATACGTTTTGGTATGCAAAAGAGTACATATG
ATCATTATGTTTTTTTTATCGTCGATCTGATAATGGTATAG
Protein sequenceShow/hide protein sequence
MKSHTELLSHFYAFHVEIKNQFNVSIKTLRTDNAGEYFSHTLGSYLCEHGIIHQSSCVDTQSQNGIAEHKNRHVLETAHALLFQMHVLKTFWGDVVSTTCFLINRMPSSL
LNGEISYRDTLFTSSPSSSCQGEDDNLSIYEITSPTDAPPSRLLPSRVYFRRPPSQPSGSCHTSVPSSSCDPGPSDDLLITLRKGKRKCTYPVSSFVFYHQLSSPTYAFI
TSLESTSISNTVHEALSHLGWRNAMIEEMTALDDNGTWDLVSLPGGKKAISCKWVFSVKVNLDGTVTQLKAHLVAKGYAQTYGINYSDTFSPVAKLTSIRLFLSMAATHN
WSLDQLDIKNDFLHGDLQEEVYMEQPLGFIAQGESDKVCRLRKSLYGLKQGPHAWFGKFSHALIRFGMQKSTYDHYVFFIVDLIMV