; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0227491 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0227491
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionBeta-galactosidase
Genome locationCMiso1.1chr08:20558421..20560653
RNA-Seq ExpressionCmc08g0227491
SyntenyCmc08g0227491
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035474.1 Beta-galactosidase [Cucumis melo var. makuwa]6.2e-28672.96Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET
        G                SVV                   RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SLGI EEIWLQKVL DLHQE ET
Subjt:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET

Query:  PLKLFCDNKA
        PLKLFCDNKA
Subjt:  PLKLFCDNKA

KAA0041227.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.9e-29780.82Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAY                           RGYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNL KEVGSPTSQSPAPVQDFEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q---------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLE
        Q                                       GT+S TKHPICNYV YD+LSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLE
Subjt:  Q---------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLE

Query:  KNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYM
        KNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPI KLNTVRVLLYVVVNKNW LYQLDVKNAFWNGDLVEEVYM
Subjt:  KNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYM

Query:  SSRQDLKPNLVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQ
        S RQDLKPNL+SRFTTFVKSQGYS GHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEI+QLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQ
Subjt:  SSRQDLKPNLVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQ

Query:  RKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV----------------------------------------------GSVVDRK
        RKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQR V                                              GSVVDRK
Subjt:  RKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV----------------------------------------------GSVVDRK

Query:  STSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQEC
        STSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQEC
Subjt:  STSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQEC

TYJ97179.1 Beta-galactosidase [Cucumis melo var. makuwa]3.3e-28773.1Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET
        G                SVV                   RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SLGI EEIWLQKVL DLHQECET
Subjt:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET

Query:  PLKLFCDNKA
        PLKLFCDNKA
Subjt:  PLKLFCDNKA

TYK21237.1 Beta-galactosidase [Cucumis melo var. makuwa]9.6e-28771.75Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  ----------------------------------------------GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQ
                                                      GSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SL I EEIWLQ
Subjt:  ----------------------------------------------GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQ

Query:  KVLLDLHQECETPLKLFCDNKA
        KVL DLHQECETPLKLFCDNKA
Subjt:  KVLLDLHQECETPLKLFCDNKA

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]6.2e-28672.96Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET
        G                SVV                   RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SLGI EEIWLQKVL DLHQE ET
Subjt:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET

Query:  PLKLFCDNKA
        PLKLFCDNKA
Subjt:  PLKLFCDNKA

TrEMBL top hitse value%identityAlignment
A0A5A7SW06 Beta-galactosidase3.0e-28672.96Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET
        G                SVV                   RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SLGI EEIWLQKVL DLHQE ET
Subjt:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET

Query:  PLKLFCDNKA
        PLKLFCDNKA
Subjt:  PLKLFCDNKA

A0A5D3BDV4 Beta-galactosidase1.6e-28773.1Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET
        G                SVV                   RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SLGI EEIWLQKVL DLHQECET
Subjt:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET

Query:  PLKLFCDNKA
        PLKLFCDNKA
Subjt:  PLKLFCDNKA

A0A5D3CXF8 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-29780.82Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAY                           RGYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNL KEVGSPTSQSPAPVQDFEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q---------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLE
        Q                                       GT+S TKHPICNYV YD+LSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLE
Subjt:  Q---------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLE

Query:  KNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYM
        KNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPI KLNTVRVLLYVVVNKNW LYQLDVKNAFWNGDLVEEVYM
Subjt:  KNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYM

Query:  SSRQDLKPNLVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQ
        S RQDLKPNL+SRFTTFVKSQGYS GHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEI+QLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQ
Subjt:  SSRQDLKPNLVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQ

Query:  RKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV----------------------------------------------GSVVDRK
        RKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQR V                                              GSVVDRK
Subjt:  RKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV----------------------------------------------GSVVDRK

Query:  STSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQEC
        STSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQEC
Subjt:  STSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQEC

A0A5D3DC56 Beta-galactosidase4.7e-28771.75Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  ----------------------------------------------GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQ
                                                      GSV+DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SL I EEIWLQ
Subjt:  ----------------------------------------------GSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQ

Query:  KVLLDLHQECETPLKLFCDNKA
        KVL DLHQECETPLKLFCDNKA
Subjt:  KVLLDLHQECETPLKLFCDNKA

A0A5D3E603 Beta-galactosidase3.0e-28672.96Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +EVA SLMLS SLPSYLWG+AILT+AHLIN M SRILHLQTPLDCLKESYP TRLVSEV LRVFGCTAYVHNFGPNQTKF P AQACVFVGYPLHQ GYK
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD
        CFH  SRKYF+TMD+TFCENRPYFPVSHLQGE+VSEESN++FEF+EPT  TVSD+DPHPIILPTNQVP KTYYRRNL KEVGSPTSQ PAPVQ+FEPPRD
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRD

Query:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA
        Q                                                                       GT+S TKHPICNYV YD+LSPQFRAFTA
Subjt:  Q-----------------------------------------------------------------------GTKSYTKHPICNYVFYDSLSPQFRAFTA

Query:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV
        +LDS IIPKNIYTALECPEWKN VM+EMK LEKNRTWEICALPK HKTVGCKWVFS KYKADGTLDRHKARLVAKGFTQTYG+DYSETFSP+ KLNTVRV
Subjt:  SLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRV

Query:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV
        LL V VNK+WPLYQLDVKNAF NGDLVEEVYMS                  S   LK +      RFTTFVKSQGYS GH DHTLFTK SKT KI +LIV
Subjt:  LLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMS------------------SRQDLKPN---LVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIV

Query:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
        YVDDIVL GDDQTEISQLKQ+MGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV
Subjt:  YVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLV

Query:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET
        G                SVV                   RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA+SLGI EEIWLQKVL DLHQE ET
Subjt:  G----------------SVVD------------------RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECET

Query:  PLKLFCDNKA
        PLKLFCDNKA
Subjt:  PLKLFCDNKA

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.9e-5023.87Show/hide
Query:  EVACSLMLSISLPSYLWGNAILTSAHLINGMSSRIL--HLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGY
        E A +++    L    WG A+LT+ +LIN + SR L    +TP +      P  +      LRVFG T YVH     Q KF+  +   +FVGY     G+
Subjt:  EVACSLMLSISLPSYLWGNAILTSAHLINGMSSRIL--HLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGY

Query:  KCFHLSSRKYFITMDITFCENRPY------FPVSHLQGESVSEESN--------------------SSFEFIEPT--------------------PSTVS
        K +   + K+ +  D+   E          F    L+    SE  N                     + +F++ +                    P+   
Subjt:  KCFHLSSRKYFITMDITFCENRPY------FPVSHLQGESVSEESN--------------------SSFEFIEPT--------------------PSTVS

Query:  DVDPHPIILPTNQ-------VPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRDQGTKSYTKHPICNYVFYDS----LSPQ--FRAFTASLDSFIIPKNI
        + D    +  + +         +K     +L +  GS         +  E  ++ G  + TK+     +   S      PQ  +     SL+  ++  N 
Subjt:  DVDPHPIILPTNQ-------VPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRDQGTKSYTKHPICNYVFYDS----LSPQ--FRAFTASLDSFIIPKNI

Query:  YTAL--------------ECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNT
        +T                +   W+  +  E+   + N TW I   P+    V  +WVFS KY   G   R+KARLVA+GFTQ Y +DY ETF+P+ ++++
Subjt:  YTAL--------------ECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNT

Query:  VRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDLKPN--------------------LVSRFTTFVKSQGYSHGHCDHTLF-TKVSKTRKIVV
         R +L +V+  N  ++Q+DVK AF NG L EE+YM   Q +  N                        F   +K   + +   D  ++        + + 
Subjt:  VRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDLKPN--------------------LVSRFTTFVKSQGYSHGHCDHTLF-TKVSKTRKIVV

Query:  LIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPI--EFNCKLGNSDDQVPV----
        +++YVDD+V+   D T ++  K+ + ++F + DL  +K+F+G+ +   ++ I +SQ  Y   +L++  M  C    TP+  + N +L NSD+        
Subjt:  LIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPI--EFNCKLGNSDDQVPV----

Query:  ------------------------------DKEQYQRL-----------------------------------VGSVVDRKSTSGYCTFVWG-NLVTWRS
                                      + E +Q L                                    GS +DRKST+GY   ++  NL+ W +
Subjt:  ------------------------------DKEQYQRL-----------------------------------VGSVVDRKSTSGYCTFVWG-NLVTWRS

Query:  KKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECETPLKLFCDNK
        K+Q+ VA SS EAEY A+   + E +WL+ +L  ++ + E P+K++ DN+
Subjt:  KKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECETPLKLFCDNK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-5827.1Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +E   S++    LP   WG A+ T+ +LIN   S  L  + P     E     + VS   L+VFGC A+ H     +TK +  +  C+F+GY   + GY+
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFI----------EPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPT--SQS
         +    +K   + D+ F E+           E V      +F  I          E T   VS+    P      +V  +       ++EV  PT   + 
Subjt:  CFHLSSRKYFITMDITFCENRPYFPVSHLQGESVSEESNSSFEFI----------EPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPT--SQS

Query:  PAPVQDFEPPRDQGTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMK----EMKTLEKNRTWEICALPKEHKTVGCKWVF
          P++  E PR +      ++P   YV               +     P+++   L  PE KN +MK    EM++L+KN T+++  LPK  + + CKWVF
Subjt:  PAPVQDFEPPRDQGTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMK----EMKTLEKNRTWEICALPKEHKTVGCKWVF

Query:  SPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDL----KPNLV--------
          K   D  L R+KARLV KGF Q  G+D+ E FSP+VK+ ++R +L +  + +  + QLDVK AF +GDL EE+YM   +      K ++V        
Subjt:  SPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDL----KPNLV--------

Query:  ----------SRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEG--ISVS
                   +F +F+KSQ Y   + D  ++ K       ++L++YVDD++++G D+  I++LK  +   F++KDLG  +  LGM++ R +    + +S
Subjt:  ----------SRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEG--ISVS

Query:  QRKYTLDLLTETGMLGCRPADTPIEFNCKLGNS------DDQVPVDKEQYQRLVGSVV------------------------------------------
        Q KY   +L    M   +P  TP+  + KL         +++  + K  Y   VGS++                                          
Subjt:  QRKYTLDLLTETGMLGCRPADTPIEFNCKLGNS------DDQVPVDKEQYQRLVGSVV------------------------------------------

Query:  ---------------------------DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVL--LDLHQECETPLKLFCDNK
                                   +RKS++GY     G  ++W+SK Q  VA S+ EAEY A +    E IWL++ L  L LHQ+      ++CD++
Subjt:  ---------------------------DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVL--LDLHQECETPLKLFCDNK

Query:  A
        +
Subjt:  A

P92520 Uncharacterized mitochondrial protein AtMg008201.9e-1946.39Show/hide
Query:  PKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYV
        PK++  AL+ P W   + +E+  L +N+TW +   P     +GCKWVF  K  +DGTLDR KARLVAKGF Q  G+ + ET+SP+V+  T+R +L V
Subjt:  PKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-7026.4Show/hide
Query:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK
        +E   +L+   S+P   W  A   + +LIN + + +L L++P   L  + P     +   LRVFGC  Y      NQ K +  ++ CVF+GY L Q  Y 
Subjt:  MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYK

Query:  CFHLSSRKYFITMDITFCEN------------------------------------------------------RPYFPVSHLQGESVSEESNSSFEF-I
        C HL + + +I+  + F EN                                                       P  P  + Q  S + +S+ S  F  
Subjt:  CFHLSSRKYFITMDITFCEN------------------------------------------------------RPYFPVSHLQGESVSEESNSSFEF-I

Query:  EPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPV-QDFEPPRDQGTKS---------------------YTKHPICNYVFYDSLSP-
         P P+      P P   PT Q   +T+  +N  +   +PT++SP+ + Q    P    + S                     +   P+   V  ++ +P 
Subjt:  EPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPV-QDFEPPRDQGTKS---------------------YTKHPICNYVFYDSLSP-

Query:  -----------------QFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKT-VGCKWVFSPKYKADGTLDRHKARLVAK
                            +   SL +   P+    AL+   W+N +  E+     N TW++   P  H T VGC+W+F+ KY +DG+L+R+KARLVAK
Subjt:  -----------------QFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKT-VGCKWVFSPKYKADGTLDRHKARLVAK

Query:  GFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDL----KPNLVSR------------------FTTFVKSQ
        G+ Q  G+DY+ETFSP++K  ++R++L V V+++WP+ QLDV NAF  G L ++VYMS         +PN V +                     ++ + 
Subjt:  GFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDL----KPNLVSR------------------FTTFVKSQ

Query:  GYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADT
        G+ +   D +LF  + + + IV ++VYVDDI++ G+D T +      +   F +KD   L YFLG+E  R   G+ +SQR+Y LDLL  T M+  +P  T
Subjt:  GYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADT

Query:  PIEFNCKLGNSDDQVPVDKEQYQRLVGSVV---------------------------------------------------------------------D
        P+  + KL         D  +Y+ +VGS+                                                                      D
Subjt:  PIEFNCKLGNSDDQVPVDKEQYQRLVGSVV---------------------------------------------------------------------D

Query:  RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECETPLKLFCDN
          ST+GY  ++  + ++W SKKQ  V RSS EAEYR+++    E  W+  +L +L      P  ++CDN
Subjt:  RKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECETPLKLFCDN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.7e-5729.11Show/hide
Query:  SEESNSSFEFI-EPTPSTVSDVDPH--------PIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRDQ--GTKSYTKHPICNYVFYDSLSP-
        ++ SNS+   +  P P++ S   P+        PI  P    P  +    N      S TS  P P     PP  Q         H +          P 
Subjt:  SEESNSSFEFI-EPTPSTVSDVDPH--------PIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRDQ--GTKSYTKHPICNYVFYDSLSP-

Query:  QFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKT-VGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPI
        Q  ++  SL +   P+    A++   W+  +  E+     N TW++   P    T VGC+W+F+ K+ +DG+L+R+KARLVAKG+ Q  G+DY+ETFSP+
Subjt:  QFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKT-VGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPI

Query:  VKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDL----KPNLVSR------------------FTTFVKSQGYSHGHCDHTLFTKVSK
        +K  ++R++L V V+++WP+ QLDV NAF  G L +EVYMS         +P+ V R                    T++ + G+ +   D +LF  + +
Subjt:  VKLNTVRVLLYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDL----KPNLVSR------------------FTTFVKSQGYSHGHCDHTLFTKVSK

Query:  TRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLG-NSDDQVP
         R I+ ++VYVDDI++ G+D   +      +   F +K+  +L YFLG+E  R  +G+ +SQR+YTLDLL  T ML  +P  TP+  + KL  +S  ++P
Subjt:  TRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLG-NSDDQVP

Query:  VDKEQYQRLVGSVV---------------------------------------------------------------------DRKSTSGYCTFVWGNLV
         D  +Y+ +VGS+                                                                      D  ST+GY  ++  + +
Subjt:  VDKEQYQRLVGSVV---------------------------------------------------------------------DRKSTSGYCTFVWGNLV

Query:  TWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECETPLKLFCDN
        +W SKKQ  V RSS EAEYR+++    E  W+  +L +L  +   P  ++CDN
Subjt:  TWRSKKQSVVARSSAEAEYRAISLGIYEEIWLQKVLLDLHQECETPLKLFCDN

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-8133.97Show/hide
Query:  SDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRDQGTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNV
        S +D  P     N VP  + +            ++ PA +QD+         S T H I  ++ Y+ +SP + +F   +     P     A E   W   
Subjt:  SDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRDQGTKSYTKHPICNYVFYDSLSPQFRAFTASLDSFIIPKNIYTALECPEWKNV

Query:  VMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWN
        +  E+  +E   TWEIC LP   K +GCKWV+  KY +DGT++R+KARLVAKG+TQ  G+D+ ETFSP+ KL +V+++L +    N+ L+QLD+ NAF N
Subjt:  VMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYVVVNKNWPLYQLDVKNAFWN

Query:  GDLVEEVYM------SSRQ--DLKPNLVS------------------RFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQL
        GDL EE+YM      ++RQ   L PN V                   +F+  +   G+   H DHT F K++ T  + VL VYVDDI++  ++   + +L
Subjt:  GDLVEEVYM------SSRQ--DLKPNLVS------------------RFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQL

Query:  KQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLVGSVV--------------
        K ++   F+++DLG LKYFLG+E+ARS  GI++ QRKY LDLL ETG+LGC+P+  P++ +           VD + Y+RL+G ++              
Subjt:  KQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLVGSVV--------------

Query:  -------------------------------------------------------DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYE
                                                                R+ST+GYC F+  +L++W+SKKQ VV++SSAEAEYRA+S    E
Subjt:  -------------------------------------------------------DRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRAISLGIYE

Query:  EIWLQKVLLDLHQECETPLKLFCDNKA
         +WL +   +L      P  LFCDN A
Subjt:  EIWLQKVLLDLHQECETPLKLFCDNKA

ATMG00810.1 DNA/RNA polymerases superfamily protein8.8e-2026.79Show/hide
Query:  LIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQ
        L++YVDDI+L G   T ++ L  ++   F +KDLG + YFLG+++     G+ +SQ KY   +L   GML C+P  TP+        S  + P D   ++
Subjt:  LIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLGNLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQ

Query:  RLVGSV---------------------------------------------------------------------VDRKSTSGYCTFVWGNLVTWRSKKQ
         +VG++                                                                       R+ST+G+CTF+  N+++W +K+Q
Subjt:  RLVGSV---------------------------------------------------------------------VDRKSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAEYRAISLGIYEEIW
          V+RSS E EYRA++L   E  W
Subjt:  SVVARSSAEAEYRAISLGIYEEIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.4e-2046.39Show/hide
Query:  PKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYV
        PK++  AL+ P W   + +E+  L +N+TW +   P     +GCKWVF  K  +DGTLDR KARLVAKGF Q  G+ + ET+SP+V+  T+R +L V
Subjt:  PKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVLLYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTAGCCTGTTCCCTTATGCTTTCCATTTCCCTTCCATCATACCTGTGGGGAAATGCTATTCTTACATCCGCTCACTTAATCAATGGAATGTCTTCTCGTATCCT
CCACCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCTTACCCCCCTACTCGTCTTGTTTCTGAAGTTTCTCTTCGTGTGTTTGGGTGTACGGCTTATGTCCATAATTTTG
GCCCTAATCAGACCAAATTTAACCCTTGGGCTCAGGCTTGTGTGTTTGTTGGGTATCCCCTTCACCAGCGCGGTTATAAATGTTTTCACTTGTCGTCTAGGAAATACTTT
ATCACTATGGATATTACTTTCTGTGAGAACCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGCGTGAGTGAAGAGTCTAACAGCAGCTTTGAATTTATTGAACC
TACTCCTAGTACCGTATCTGACGTTGATCCTCATCCCATAATCCTACCCACAAACCAAGTTCCCAGGAAAACATATTATAGGAGGAATCTCATAAAGGAAGTTGGGTCCC
CTACTAGTCAATCGCCAGCTCCAGTCCAAGACTTCGAACCTCCTCGAGACCAAGGTACCAAGTCCTACACTAAACATCCCATTTGCAATTATGTTTTCTATGATAGTCTC
TCTCCACAGTTTAGAGCATTTACAGCAAGCCTGGACTCTTTCATAATACCGAAAAATATCTACACTGCTCTAGAGTGTCCTGAATGGAAAAATGTTGTTATGAAAGAGAT
GAAGACTCTTGAAAAAAATAGAACTTGGGAGATTTGTGCTCTACCCAAGGAACATAAAACTGTTGGATGCAAATGGGTGTTCTCTCCCAAATACAAAGCAGATGGTACGC
TTGATAGACACAAGGCAAGGTTAGTTGCAAAGGGATTTACTCAGACCTATGGTGTTGACTATTCAGAAACTTTTTCTCCAATTGTTAAGTTGAATACTGTTAGAGTCCTG
CTATATGTTGTTGTGAACAAAAATTGGCCTCTATACCAGCTGGATGTTAAGAATGCTTTTTGGAATGGAGACCTTGTGGAGGAAGTCTACATGAGCTCCCGCCAGGATTT
GAAGCCCAATTTGGTCAGTAGATTCACTACCTTTGTCAAGTCTCAAGGGTACAGTCATGGGCACTGTGACCATACTTTATTTACAAAGGTTTCCAAGACAAGGAAGATTG
TTGTCCTAATAGTTTATGTGGATGACATTGTTTTAATTGGAGATGATCAAACAGAAATCAGTCAACTAAAGCAAAAAATGGGTGATGAATTTGAAATCAAGGATTTGGGA
AATCTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTGTCTCAAAGAAAATACACCCTTGATTTGCTAACCGAGACAGGTATGTTGGGATG
TCGTCCTGCTGATACTCCTATTGAATTCAACTGTAAATTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAATATCAGCGTCTTGTAGGATCTGTTGTTGACA
GAAAGTCGACCTCCGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGAAGCAGTGCTGAGGCTGAATATAGAGCT
ATAAGTTTGGGAATATATGAGGAAATTTGGCTCCAAAAAGTCTTGTTAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTTTGTGACAATAAAGCCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTAGCCTGTTCCCTTATGCTTTCCATTTCCCTTCCATCATACCTGTGGGGAAATGCTATTCTTACATCCGCTCACTTAATCAATGGAATGTCTTCTCGTATCCT
CCACCTTCAAACTCCCTTAGATTGTCTTAAGGAGTCTTACCCCCCTACTCGTCTTGTTTCTGAAGTTTCTCTTCGTGTGTTTGGGTGTACGGCTTATGTCCATAATTTTG
GCCCTAATCAGACCAAATTTAACCCTTGGGCTCAGGCTTGTGTGTTTGTTGGGTATCCCCTTCACCAGCGCGGTTATAAATGTTTTCACTTGTCGTCTAGGAAATACTTT
ATCACTATGGATATTACTTTCTGTGAGAACCGACCCTACTTTCCCGTTAGCCATCTTCAGGGGGAGAGCGTGAGTGAAGAGTCTAACAGCAGCTTTGAATTTATTGAACC
TACTCCTAGTACCGTATCTGACGTTGATCCTCATCCCATAATCCTACCCACAAACCAAGTTCCCAGGAAAACATATTATAGGAGGAATCTCATAAAGGAAGTTGGGTCCC
CTACTAGTCAATCGCCAGCTCCAGTCCAAGACTTCGAACCTCCTCGAGACCAAGGTACCAAGTCCTACACTAAACATCCCATTTGCAATTATGTTTTCTATGATAGTCTC
TCTCCACAGTTTAGAGCATTTACAGCAAGCCTGGACTCTTTCATAATACCGAAAAATATCTACACTGCTCTAGAGTGTCCTGAATGGAAAAATGTTGTTATGAAAGAGAT
GAAGACTCTTGAAAAAAATAGAACTTGGGAGATTTGTGCTCTACCCAAGGAACATAAAACTGTTGGATGCAAATGGGTGTTCTCTCCCAAATACAAAGCAGATGGTACGC
TTGATAGACACAAGGCAAGGTTAGTTGCAAAGGGATTTACTCAGACCTATGGTGTTGACTATTCAGAAACTTTTTCTCCAATTGTTAAGTTGAATACTGTTAGAGTCCTG
CTATATGTTGTTGTGAACAAAAATTGGCCTCTATACCAGCTGGATGTTAAGAATGCTTTTTGGAATGGAGACCTTGTGGAGGAAGTCTACATGAGCTCCCGCCAGGATTT
GAAGCCCAATTTGGTCAGTAGATTCACTACCTTTGTCAAGTCTCAAGGGTACAGTCATGGGCACTGTGACCATACTTTATTTACAAAGGTTTCCAAGACAAGGAAGATTG
TTGTCCTAATAGTTTATGTGGATGACATTGTTTTAATTGGAGATGATCAAACAGAAATCAGTCAACTAAAGCAAAAAATGGGTGATGAATTTGAAATCAAGGATTTGGGA
AATCTGAAATATTTCCTTGGAATGGAGGTGGCCAGATCTAAAGAAGGTATCTCCGTGTCTCAAAGAAAATACACCCTTGATTTGCTAACCGAGACAGGTATGTTGGGATG
TCGTCCTGCTGATACTCCTATTGAATTCAACTGTAAATTAGGAAACTCTGATGATCAAGTTCCAGTTGATAAAGAACAATATCAGCGTCTTGTAGGATCTGTTGTTGACA
GAAAGTCGACCTCCGGTTATTGTACCTTTGTTTGGGGCAATCTTGTAACTTGGAGGAGTAAGAAGCAAAGTGTTGTGGCCAGAAGCAGTGCTGAGGCTGAATATAGAGCT
ATAAGTTTGGGAATATATGAGGAAATTTGGCTCCAAAAAGTCTTGTTAGATCTTCATCAGGAATGTGAGACACCATTGAAGCTTTTTTGTGACAATAAAGCCACTTGA
Protein sequenceShow/hide protein sequence
MEVACSLMLSISLPSYLWGNAILTSAHLINGMSSRILHLQTPLDCLKESYPPTRLVSEVSLRVFGCTAYVHNFGPNQTKFNPWAQACVFVGYPLHQRGYKCFHLSSRKYF
ITMDITFCENRPYFPVSHLQGESVSEESNSSFEFIEPTPSTVSDVDPHPIILPTNQVPRKTYYRRNLIKEVGSPTSQSPAPVQDFEPPRDQGTKSYTKHPICNYVFYDSL
SPQFRAFTASLDSFIIPKNIYTALECPEWKNVVMKEMKTLEKNRTWEICALPKEHKTVGCKWVFSPKYKADGTLDRHKARLVAKGFTQTYGVDYSETFSPIVKLNTVRVL
LYVVVNKNWPLYQLDVKNAFWNGDLVEEVYMSSRQDLKPNLVSRFTTFVKSQGYSHGHCDHTLFTKVSKTRKIVVLIVYVDDIVLIGDDQTEISQLKQKMGDEFEIKDLG
NLKYFLGMEVARSKEGISVSQRKYTLDLLTETGMLGCRPADTPIEFNCKLGNSDDQVPVDKEQYQRLVGSVVDRKSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAEYRA
ISLGIYEEIWLQKVLLDLHQECETPLKLFCDNKAT