; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038010 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038010
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:11563697..11580529
RNA-Seq ExpressionLag0038010
SyntenyLag0038010
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR013766 - Thioredoxin domain
IPR036249 - Thioredoxin-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR039542 - Endoplasmic reticulum vesicle transporter, N-terminal
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]4.2e-20858.26Show/hide
Query:  STTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSSSTKWVIDSSATNHMT------------------------------------------
        S  EFAKFQ YQ+SL+A   SS  TPI +T   GN+  CLL+SSTKWVIDS AT HMT                                          
Subjt:  STTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSSSTKWVIDSSATNHMT------------------------------------------

Query:  --------------------------------------------------------------------------------GHPSLSMLRKLCPQFHNLSS
                                                                                        GHPSL +L+KL P+F +LSS
Subjt:  --------------------------------------------------------------------------------GHPSLSMLRKLCPQFHNLSS

Query:  LNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAG
        LNCDSCQF KFHRLSSSPR+D RA  PF+LVH DIWGPCP+VS+TGFRYFVTFVDD+SR+TWLY MK+RSELLSHFC FH EI+ QF+VS+K LR+DNAG
Subjt:  LNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAG

Query:  EYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFG
        EYF  +  SYLC++ IIH SSCADTPSQNGVAERKNRH LETARAL FQM V K FWVDA+ST  FLIN MPS VL G+IPY VLFPTK LFPI  KIFG
Subjt:  EYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFG

Query:  CTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPS
        C CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+L RYLVS D   FEDTPF SS  S  Q E+D+LFIY             + SPTPSL   
Subjt:  CTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPS

Query:  PSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNA
         S   P+RP I+QVYS+R  P P +   P    SS D   SD+LPIALRKGKR CT+ +SSF+SY  LS ST +F+  L+S SIP +VHEALSH GW+NA
Subjt:  PSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNA

Query:  MIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH
        MIEEM AL DNGTWDLVSRPAGKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVAKLTSIRLF+SMAA+  W LH
Subjt:  MIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]3.0e-20671.57Show/hide
Query:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH
        GHPSL +L+KL P+F +LSSLNCDSCQF KFHRLSSSPR+D RA  PF+LVH DIWGPCP+VS+TGFRYFVTFVDD+SR+TWLY MK+RSELLSHFC FH
Subjt:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH

Query:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI
         EI+ QF+VS+K LR+DNAGEYF  +  SYLC++ IIH SSCADTPSQNGVAERKNRH LETARAL FQM V K FWVDA+ST  FLIN MPS VL G+I
Subjt:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI

Query:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP
        PY VLFPTK LFPI  KIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+L RYLVS D   FEDTPF SS  S  Q E+D+LFIY    
Subjt:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP

Query:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD
                 + SPTPSL    S   P+RP I+QVYS+R  P P +   P    SS D   SD+LPIALRKGKR CT+ +SSF+SY  LS ST +F+  L+
Subjt:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD

Query:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS
        S SIP +VHEALSH GW+NAMIEEM AL DNGTWDLVSRPAGKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVAKLTSIRLF+S
Subjt:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS

Query:  MAASQCWPLH
        MAA+  W LH
Subjt:  MAASQCWPLH

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]3.0e-20671.57Show/hide
Query:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH
        GHPSL +L+KL P+F +LSSLNCDSCQF KFHRLSSSPR+D RA  PF+LVH DIWGPCP+VS+TGFRYFVTFVDD+SR+TWLY MK+RSELLSHFC FH
Subjt:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH

Query:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI
         EI+ QF+VS+K LR+DNAGEYF  +  SYLC++ IIH SSCADTPSQNGVAERKNRH LETARAL FQM V K FWVDA+ST  FLIN MPS VL G+I
Subjt:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI

Query:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP
        PY VLFPTK LFPI  KIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+L RYLVS D   FEDTPF SS  S  Q E+D+LFIY    
Subjt:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP

Query:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD
                 + SPTPSL    S   P+RP I+QVYS+R  P P +   P    SS D   SD+LPIALRKGKR CT+ +SSF+SY  LS ST +F+  L+
Subjt:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD

Query:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS
        S SIP +VHEALSH GW+NAMIEEM AL DNGTWDLVSRPAGKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVAKLTSIRLF+S
Subjt:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS

Query:  MAASQCWPLH
        MAA+  W LH
Subjt:  MAASQCWPLH

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]3.0e-20671.57Show/hide
Query:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH
        GHPSL +L+KL P+F +LSSLNCDSCQF KFHRLSSSPR+D RA  PF+LVH DIWGPCP+VS+TGFRYFVTFVDD+SR+TWLY MK+RSELLSHFC FH
Subjt:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH

Query:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI
         EI+ QF+VS+K LR+DNAGEYF  +  SYLC++ IIH SSCADTPSQNGVAERKNRH LETARAL FQM V K FWVDA+ST  FLIN MPS VL G+I
Subjt:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI

Query:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP
        PY VLFPTK LFPI  KIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+L RYLVS D   FEDTPF SS  S  Q E+D+LFIY    
Subjt:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP

Query:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD
                 + SPTPSL    S   P+RP I+QVYS+R  P P +   P    SS D   SD+LPIALRKGKR CT+ +SSF+SY  LS ST +F+  L+
Subjt:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD

Query:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS
        S SIP +VHEALSH GW+NAMIEEM AL DNGTWDLVSRPAGKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVAKLTSIRLF+S
Subjt:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS

Query:  MAASQCWPLH
        MAA+  W LH
Subjt:  MAASQCWPLH

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]3.0e-20671.57Show/hide
Query:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH
        GHPSL +L+KL P+F +LSSLNCDSCQF KFHRLSSSPR+D RA  PF+LVH DIWGPCP+VS+TGFRYFVTFVDD+SR+TWLY MK+RSELLSHFC FH
Subjt:  GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFH

Query:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI
         EI+ QF+VS+K LR+DNAGEYF  +  SYLC++ IIH SSCADTPSQNGVAERKNRH LETARAL FQM V K FWVDA+ST  FLIN MPS VL G+I
Subjt:  VEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQI

Query:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP
        PY VLFPTK LFPI  KIFGC CFVRDVRP+ TKLDPKSLKCIFLGYSRVQKGYRCYCP+L RYLVS D   FEDTPF SS  S  Q E+D+LFIY    
Subjt:  PYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVP

Query:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD
                 + SPTPSL    S   P+RP I+QVYS+R  P P +   P    SS D   SD+LPIALRKGKR CT+ +SSF+SY  LS ST +F+  L+
Subjt:  PTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQP-PGECLVPQDFASS-DRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLD

Query:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS
        S SIP +VHEALSH GW+NAMIEEM AL DNGTWDLVSRPAGKKAIGCKWVFA+K+NPDG+VARLKARLVAKGYAQ YG DY DTFSPVAKLTSIRLF+S
Subjt:  SISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFIS

Query:  MAASQCWPLH
        MAA+  W LH
Subjt:  MAASQCWPLH

TrEMBL top hitse value%identityAlignment
A0A438CP53 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-18850.42Show/hide
Query:  SPSSFVGSSCADGSRADEEFYSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSSSTKWVI
        S S       A G  A+    +RR     G   F +     + S+K+  +T+  EF+K+  YQ +LKA       TP+   AESG    CL+SSS KW+I
Subjt:  SPSSFVGSSCADGSRADEEFYSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSSSTKWVI

Query:  DSSATNHMT-------------------------------------------------------------------------------------------
        DS AT+HMT                                                                                           
Subjt:  DSSATNHMT-------------------------------------------------------------------------------------------

Query:  --------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSEL
                GHPSL +L+KLCPQF  L SL+C+SC F K HR S  PR++ RA   F+LVH D+WGPCP+ S+TGFRYFVTFVDD+SRMTW+YFMK+RSE+
Subjt:  --------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSEL

Query:  LSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMP
         SHFC F  EI+TQ+DVS+KILRSDN  EY   +F++Y+  + I+H +SC DTPSQNGVAERKNRH LETARAL FQM VPKQFW DA+ST  FLIN MP
Subjt:  LSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMP

Query:  SFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDD
        + VLKG IPY V+ P KSLFP+  +IFGCTC+VRD RP +TKLDPK+L+C+FLGYSR+QKGYRC+ P LN+YLVS D    EDT F SS  S++ EE+++
Subjt:  SFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDD

Query:  LFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSDELPIALRKGKRSC--TFCISSFVSYDHL
          +Y +V   P+   S +V    SL  S         P +PPI QVYS+R      C  P   +SSD  S  +LPI+LRKGKR C   + I++FVSYDHL
Subjt:  LFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSDELPIALRKGKRSC--TFCISSFVSYDHL

Query:  SSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSP
        SSS+   VA +DSIS+PKTV EAL+H GW+NAM+EE+ AL DN TW LV  P GKK +GCKWVFA+KVNPDGSVARLKARLVA+GYAQTYG+DY DTFSP
Subjt:  SSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSP

Query:  VAKLTSIRLFISMAASQCWPLH
        VAKL S+RLFIS+AASQ W +H
Subjt:  VAKLTSIRLFISMAASQCWPLH

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-18648.27Show/hide
Query:  RGVESPSSFVGSSCADGSRADEEF--YSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSS
        RG +S ++ V   C +     +       R+   +  +   S T +  DSS      +  EF+K+  YQ +LKA       TP++  AESG    CL+SS
Subjt:  RGVESPSSFVGSSCADGSRADEEF--YSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSS

Query:  STKWVIDSSATNHMT-------------------------------------------------------------------------------------
        S KW+IDS AT+HMT                                                                                     
Subjt:  STKWVIDSSATNHMT-------------------------------------------------------------------------------------

Query:  ------------------------------------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSK
                                            GHPSL +L+KLCPQF  L SL+C+SC F K HR S  PR++ RA   F+LVH D+WGPCP+ S+
Subjt:  ------------------------------------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSK

Query:  TGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETAR
        TGFRYFVTFVDD+SRMTW+YFMK+RSE+ SHFC F  EI+TQ+DVS+KILRSDN  EY   +F++Y+  + I+H +SC DTPSQNGVAERKNRH LETAR
Subjt:  TGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETAR

Query:  ALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRY
        AL FQM VPKQFW DA+ST  FLIN MP+ VLK  IPY V+ P KSLFP+  +IFGCTC+VRD RP + KLDPK+L+C+FLGYSR+QKGYRC+ P LN+Y
Subjt:  ALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRY

Query:  LVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSD
        LVS D    EDT F SS  S++ EE+++  +Y +V   P+   S +V    SL PS         P +PPI QVYS+R      C  P   +SSD  S  
Subjt:  LVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSD

Query:  ELPIALRKGKRSC--TFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDG
        +LPI+LRKGKR C   + I++FVSYDHLSSS+   VA +DSIS+PKTV EAL+H GW+NAM+EE+ AL DN TW LV  P GKK +GCKWVFA+KVNPDG
Subjt:  ELPIALRKGKRSC--TFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDG

Query:  SVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH
        SVARLKARLVA+GYAQTYG+DY DTFSPVAKL S+RLFIS+AASQ W +H
Subjt:  SVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH

A0A438GWA1 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-18951.13Show/hide
Query:  EEFYSRRHT---LARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSSSTKWVIDSSATNHMT-----
        E   S +HT   +A+ R C         DSS      +  EF+K+  YQ +LKA       TP++  AESG    CL+SSS KW+IDS AT+HMT     
Subjt:  EEFYSRRHT---LARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSSSTKWVIDSSATNHMT-----

Query:  ----------------------------------------------------------------------------------------------GHPSLS
                                                                                                      GHPSL 
Subjt:  ----------------------------------------------------------------------------------------------GHPSLS

Query:  MLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQ
        +L+KLCPQF  L SL+C+SC F K HR S  PR++ RA   F+LVH D+WGPCP+ S+TGFRYFVTFVDD+SRMTW+YFMK+RSE+ SHFC F  EI+TQ
Subjt:  MLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQ

Query:  FDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLF
        +DVS+KILRSDN  EY   +F++Y+  + I+H +SC DTPSQNGVAERKNRH LETARAL FQM VPKQFW DA+ST  FLIN MP+ VLKG IPY V+ 
Subjt:  FDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLF

Query:  PTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTD
        P KSLFP+  +IFGCTC+VRD RP +TKLDPK+L+C+FLGYSR+QKGYRC+ P LN+YLVS D    EDT F SS  S++ EE+++  +Y +V   P+  
Subjt:  PTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTD

Query:  PSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSDELPIALRKGKRSC--TFCISSFVSYDHLSSSTCSFVAPLDSI
         S +V    SL PS         P +PPI QVYS+R      C  P   +SSD  S  +LPI+LRKGKR C   + I++FVSYDHLSSS+   VA +DSI
Subjt:  PSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSDELPIALRKGKRSC--TFCISSFVSYDHLSSSTCSFVAPLDSI

Query:  SIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMA
        S+PKTV EAL+H GW+NAM+EE+ AL DN TW LV  P GKK +GCKWVFA+KVNPDGSVARLKARLVA+GYAQTYG+DY DTFSPVAKL S+RLFIS+ 
Subjt:  SIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMA

Query:  ASQCWPLH
        ASQ W +H
Subjt:  ASQCWPLH

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-18548.27Show/hide
Query:  RGVESPSSFVGSSCADGSRADEEF--YSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSS
        RG +S ++ V   C +     +       R+   +  +   S T +  DSS      +  EF+K+  YQ +LKA       TP++  AESG    CL+SS
Subjt:  RGVESPSSFVGSSCADGSRADEEF--YSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSS

Query:  STKWVIDSSATNHMT-------------------------------------------------------------------------------------
        S KW+IDS AT+HMT                                                                                     
Subjt:  STKWVIDSSATNHMT-------------------------------------------------------------------------------------

Query:  ------------------------------------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSK
                                            GHPSL +L+KLCPQF  L SL+C+SC F K HR S  PR++ RA   F+LVH D+WGPCP+ S+
Subjt:  ------------------------------------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSK

Query:  TGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETAR
        TGFRYFVTFVDD+SRMTW+YFMK+RSE+ SHFC F  EI+TQ+DVS+KILRSDN  EY   +F++Y+ ++ I+H +SC DTPSQNGVAERKNRH LETAR
Subjt:  TGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETAR

Query:  ALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRY
        AL FQM VPKQFW DA+ST  FLIN MP+ VLKG IPY V+ P KSLFP+  +IFGCTC+VRD RP +TKLDPK+L+C+FLGYSR+QKGYRC+ P LN+Y
Subjt:  ALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRY

Query:  LVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSD
        LVS D    EDT F SS  S++ EE+++  +Y +V   P+   S +V    SL PS         P +PPI QVYS+R      C  P   +SSD  S  
Subjt:  LVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSD

Query:  ELPIALRKGKR--SCTFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDG
        +LPI+LRKGKR     + I++FVSYDHLSSS+   VA +DSIS+PKTV EAL+H GW+NAM+EE+ AL DN TW LV  P GKK +GCKWVFA+KVN DG
Subjt:  ELPIALRKGKR--SCTFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDG

Query:  SVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH
        SVARLKARLVA+GYAQTYG+DY DTFSPVAKL S+RLFIS+AASQ W +H
Subjt:  SVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH

B0FBS2 Uncharacterized protein1.5e-18748.53Show/hide
Query:  RGVESPSSFVGSSCADGSRADEEF--YSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSS
        RG +S ++ V   C +     +       R+   +  +   S T +  DSS      +  EF+K+  YQ +LKA       TP++  AESG    CL+SS
Subjt:  RGVESPSSFVGSSCADGSRADEEF--YSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSS

Query:  STKWVIDSSATNHMT-------------------------------------------------------------------------------------
        S KW+IDS AT+HMT                                                                                     
Subjt:  STKWVIDSSATNHMT-------------------------------------------------------------------------------------

Query:  ------------------------------------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSK
                                            GHPSL +L+KLCPQF  L SL+C+SC F K HR S  PR++ RA   F+LVH D+WGPCP+ S+
Subjt:  ------------------------------------GHPSLSMLRKLCPQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSK

Query:  TGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETAR
        TGFRYFVTFVDD+SRMTW+YFMK+RSE+ SHFC F  EI+TQ+DVS+KILRSDN  EY   +F++Y+  + I+H +SC DTPSQNGVAERKNRH LETAR
Subjt:  TGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETAR

Query:  ALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRY
        AL FQM VPKQFW DA+ST  FLIN MP+ VLKG IPY V+ P KSLFP+  +IFGCTC+VRD RP +TKLDPK+L+C+FLGYSR+QKGYRC+ P LN+Y
Subjt:  ALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRY

Query:  LVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSD
        LVS D    EDT F SS  S++ EE+++  +Y +V   P+   S +V    SL PS         P +PPI QVYS+R      C  P   +SSD  S  
Subjt:  LVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTF----VPTRPPITQVYSKRQQPPGECLVPQDFASSDRGSSD

Query:  ELPIALRKGKRSC--TFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDG
        +LPI+LRKGKR C   + I++FVSYDHLSSS+   VA +DSIS+PKTV EAL+H GW+NAM+EE+ AL DN TW LV  P GKK +GCKWVFA+KVNPDG
Subjt:  ELPIALRKGKRSC--TFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDG

Query:  SVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH
        SVARLKARLVA+GYAQTYG+DY DTFSPVAKL S+RLFIS+AASQ W +H
Subjt:  SVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-6734.01Show/hide
Query:  CDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEY
        CD C F K HR+S       R      LV+ D+ GP  I S  G +YFVTF+DD SR  W+Y +K++ ++   F  FH  +  +    LK LRSDN GEY
Subjt:  CDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEY

Query:  FFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCT
            F  Y   H I H  +   TP  NGVAER NR  +E  R++     +PK FW +A+ T  +LIN  PS  L  +IP  V +  K +     K+FGC 
Subjt:  FFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCT

Query:  CFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPS
         F    +   TKLD KS+ CIF+GY   + GYR + P   + + SRD  +F ++  +++   + + +   +  +  +P T S +P+   S T  +     
Subjt:  CFVRDVRPNLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPS

Query:  TFVPTRPPITQVYSKRQQPPGECLVP--------QDFASSDRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSH--
                     S++ + PGE +          ++     +G     P  LR+ +R            +     +  +V   D    P+++ E LSH  
Subjt:  TFVPTRPPITQVYSKRQQPPGECLVP--------QDFASSDRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSH--

Query:  -LGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAAS
              AM EEM +L  NGT+ LV  P GK+ + CKWVF +K + D  + R KARLV KG+ Q  GID+ + FSPV K+TSIR  +S+AAS
Subjt:  -LGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAAS

Q69SA9 Protein disulfide isomerase-like 5-43.2e-9478.95Show/hide
Query:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI
        MISS+KLKSVDFYRKIPRDLTEA+LSGAGLSIVAAL+MVFLFGMELSNYL+V+TSTSVIVD SSDG+FLR+DFN+SFPALSCEFA+VDV+DVLGTNRLNI
Subjt:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI

Query:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL
        TKT+RK+SID NL  TGSEFH GP+  + KHGD+V+E  ++GS  L++RNFD +++Q+P+LVVNFYAPWCYWSNRLKPSWEK AK +RERYDPE+DGRI+
Subjt:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL

Query:  LAKVDCTDE
        LAKVDCT+E
Subjt:  LAKVDCTDE

Q9LJU2 Protein disulfide-isomerase 5-31.6e-8573.58Show/hide
Query:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI
        M+SSTKLKSVDFYRKIPRDLTEA+LSGAGLSIVAAL M+FLFGMELS+YL V+T+T+VIVD SSDGDFLR+DFN+SFPALSCEFA+VDV+DVLGTNRLNI
Subjt:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI

Query:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL
        TKT+RKF ID +LRSTG+EFHSG   + I HG+E  EE  +G+  LT+ +F+  ++  PILVVNF APWCYWSNRLKPSWEKAA  I++RYDPE DGR+L
Subjt:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL

Query:  LAKVDCTDEEFL
        L  VDCT+E  L
Subjt:  LAKVDCTDEEFL

Q9T042 Protein disulfide-isomerase 5-44.7e-9378.47Show/hide
Query:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI
        M+S++K+KSVDFYRKIPRDLTEA+LSGAGLSI+AALSM+FLFGMEL+NYL+VSTSTSVIVD S+DGDFLR+DFN+SFP+LSCEFA+VDV+DVLGTNRLN+
Subjt:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI

Query:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL
        TKTIRKFSIDSN+R TGSEFH+G + +LI HGDE  EE  E S  LT RNFD F +Q PILVVNFYAPWCYW N LKPSWEKAAK I+ERYDPE+DGR++
Subjt:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL

Query:  LAKVDCTDE
        LAKVDCT E
Subjt:  LAKVDCTDE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.5e-6732.4Show/hide
Query:  SATNHMTGHPSLSMLRKLCPQFHNLSSLN-------CDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFM
        S+ +   GHPSL++L  +    H+L  LN       C  C   K H++  S      +SKP + ++ D+W   PI+S   +RY+V FVD ++R TWLY +
Subjt:  SATNHMTGHPSLSMLRKLCPQFHNLSSLN-------CDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFM

Query:  KSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFF
        K +S++   F  F   +  +F   +  L SDN GE  F   R YL +H I H +S   TP  NG++ERK+RH +E    L    SVPK +W  A S   +
Subjt:  KSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEYFFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFF

Query:  LINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRP-NLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDT-PFKSSL--
        LIN +P+ +L+ Q P+  LF     +  K K+FGC C+   +RP N  KL+ KS +C F+GYS  Q  Y C      R   SR     E   PF ++   
Subjt:  LINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRP-NLTKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDT-PFKSSL--

Query:  PSTSQEEEDDLF----IYTLVPPTP-------------STDPSPLVSPTP-------------SLEPSPSTFVPTRP------PITQVYSKRQQPPGECL
         STSQE+  D       +T +P TP              T P P  SP+P             S   SPS+  PT P      P  Q +  +       +
Subjt:  PSTSQEEEDDLF----IYTLVPPTP-------------STDPSPLVSPTP-------------SLEPSPSTFVPTRP------PITQVYSKRQQPPGECL

Query:  V--PQDFASSDRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLDSI--------------------------------------------
        +  P   + S    +   P+             S+ +S  +  SS+ +   PL  +                                            
Subjt:  V--PQDFASSDRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLDSI--------------------------------------------

Query:  SIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAI-GCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISM
        S P+T  +A+    WR AM  E+NA   N TWDLV  P     I GC+W+F  K N DGS+ R KARLVAKGY Q  G+DY +TFSPV K TSIR+ + +
Subjt:  SIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAI-GCKWVFAIKVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISM

Query:  AASQCWPL
        A  + WP+
Subjt:  AASQCWPL

Arabidopsis top hitse value%identityAlignment
AT1G50950.1 Thioredoxin protein with domain of unknown function (DUF1692)4.8e-7767.14Show/hide
Query:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI
        M+S++K+KSVDFYRKIPRDLTEA+LSGAGLSIVAAL+M+FLFGMELS+YL+++TSTSVIVD SSDGDFL +DFN+SFPALSCEFA+VDV+DV GT+RLNI
Subjt:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI

Query:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDE-VDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRI
        +KTIRK  ID +LR+T  EFHS    +LI HGDE   + +      LT   F++F +   ILVVNFYAPWCYWSNRLKPSW KA++  RERY+P  D R+
Subjt:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDE-VDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRI

Query:  LLAKVDCTDEEFL
        LL  VDCT+E  L
Subjt:  LLAKVDCTDEEFL

AT3G20560.1 PDI-like 5-31.1e-8673.58Show/hide
Query:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI
        M+SSTKLKSVDFYRKIPRDLTEA+LSGAGLSIVAAL M+FLFGMELS+YL V+T+T+VIVD SSDGDFLR+DFN+SFPALSCEFA+VDV+DVLGTNRLNI
Subjt:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI

Query:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL
        TKT+RKF ID +LRSTG+EFHSG   + I HG+E  EE  +G+  LT+ +F+  ++  PILVVNF APWCYWSNRLKPSWEKAA  I++RYDPE DGR+L
Subjt:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL

Query:  LAKVDCTDEEFL
        L  VDCT+E  L
Subjt:  LAKVDCTDEEFL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-2846.21Show/hide
Query:  ISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTY
        IS F+SY+ +S    SF+  +     P T +EA   L W  AM +E+ A+    TW++ + P  KK IGCKWV+ IK N DG++ R KARLVAKGY Q  
Subjt:  ISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAIKVNPDGSVARLKARLVAKGYAQTY

Query:  GIDYFDTFSPVAKLTSIRLFISMAASQCWPLH
        GID+ +TFSPV KLTS++L ++++A   + LH
Subjt:  GIDYFDTFSPVAKLTSIRLFISMAASQCWPLH

AT4G27080.1 PDI-like 5-43.3e-9478.47Show/hide
Query:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI
        M+S++K+KSVDFYRKIPRDLTEA+LSGAGLSI+AALSM+FLFGMEL+NYL+VSTSTSVIVD S+DGDFLR+DFN+SFP+LSCEFA+VDV+DVLGTNRLN+
Subjt:  MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNI

Query:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL
        TKTIRKFSIDSN+R TGSEFH+G + +LI HGDE  EE  E S  LT RNFD F +Q PILVVNFYAPWCYW N LKPSWEKAAK I+ERYDPE+DGR++
Subjt:  TKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRIL

Query:  LAKVDCTDE
        LAKVDCT E
Subjt:  LAKVDCTDE

AT4G27080.2 PDI-like 5-41.3e-8778.57Show/hide
Query:  RKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSIDSNL
        +KIPRDLTEA+LSGAGLSI+AALSM+FLFGMEL+NYL+VSTSTSVIVD S+DGDFLR+DFN+SFP+LSCEFA+VDV+DVLGTNRLN+TKTIRKFSIDSN+
Subjt:  RKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSIDSNL

Query:  RSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRILLAKVDCTDE
        R TGSEFH+G + +LI HGDE  EE  E S  LT RNFD F +Q PILVVNFYAPWCYW N LKPSWEKAAK I+ERYDPE+DGR++LAKVDCT E
Subjt:  RSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTARNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRILLAKVDCTDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAGTCAAAACCCCTCATTTTTATCTCCCTTAGGAGCAGAAGCTGCGGCGGTACTCGAAGGTCTTCGACTTGCTAAAAATATGAGTTTTGATCAAGTAATGTTGCT
TTCAGACTGCCTTGGGTTAATCTCCATGATTAATGGAGAAACGGAGACTGATATGGAAATCCAATCTTCCATTTGGGATATTAAAGAACTTTGCTCTCGGTTTTGTAGGG
TCATCCTACACGCACCTCGACTAATCTCACGGGAACTCACCTGGCCTTCACGTGACATCGACAGTGAATTTGTAGAAGCTGGGGAAGACGTGACCTGCAAGACCAGCAAG
AATCGCTGGACACCGGGTTTGGAGGCGTTCTGGGACGAAGCAGGCTGGACCGAGGCAGCTGGAAGCGGTAGGGGCCGAGCGGAGGCAAACGGACTCGGCCTTGGGCCGAG
GCCGACCGTACGGGTCGGGCCACGTTGGCCCAGCCCGTCGGTATGGTCTCCCTCTGGGTCCTTTTCCTGGTCCTATCTCCTCCCGGTTATCTCGTCAGCTCCTCCCATGA
TTTCCTCCACCAAGCTCAAATCTGTAGATTTCTACAGGAAAATACCAAGAGATTTAACGGAGGCAACATTATCAGGTGCAGGATTATCCATAGTAGCAGCTCTGTCCATG
GTGTTTTTATTTGGAATGGAATTGAGCAATTATTTGAGTGTTAGCACGTCTACATCTGTAATTGTTGACAACAGTAGTGATGGAGACTTCTTACGGATGGACTTTAATAT
GAGTTTCCCTGCACTCTCATGTGAATTTGCTGCCGTAGATGTGAACGATGTGTTAGGAACTAATAGGTTGAATATTACGAAAACAATCCGTAAATTTTCTATAGATTCAA
ATTTGCGGTCCACTGGATCTGAGTTTCACTCAGGACCACTATCGAATTTGATTAAGCATGGGGATGAAGTAGATGAAGAAACCAATGAAGGTTCTGCTACGTTAACTGCT
CGGAACTTTGATAGATTTGCTAATCAGCATCCTATATTGGTGGTTAATTTTTATGCTCCTTGGTGCTACTGGAGTAATAGGCTGAAACCTTCGTGGGAGAAGGCTGCCAA
AACTATAAGAGAAAGATATGATCCAGAATTAGATGGACGCATTCTCTTGGCAAAGGTTGATTGCACGGATGAAGAGTTTCTTCTCATCCAAGTGGGGTTAGGGTTTGAAA
TTTGGGTCACCACATTAGAGGGGTTGTTTGTCGACACCACCCACCTAAAGAGAAGTGCTGCCGTTAGGGGAGTCGAATCGCCGTCGTCGTTCGTCGGAAGTAGCTGCGCT
GATGGCTCACGTGCCGACGAAGAGTTCTACAGCCGCCGCCACACGCTGGCGCGTGGGAGGTCTTGTTTCCCTTCCGTCACTTGGTCCATCCATGATTCATCTAAGCTACC
ATTTGTTACTTCTACTACCGAGTTTGCTAAATTTCAACTGTACCAGAAGTCATTGAAAGCATATTCTTTATCATCCGTGCTTACCCCTATTACAACCACTGCTGAGTCAG
GTAACATGAATCATTGTCTTCTTTCCTCCTCTACCAAATGGGTCATAGACTCTAGTGCAACCAATCATATGACAGGTCATCCTTCTCTTTCAATGCTAAGAAAACTTTGT
CCTCAATTTCATAACTTGTCTTCATTAAATTGTGACTCATGTCAGTTTGTCAAATTTCATCGTCTTAGCTCTAGTCCTAGAATAGATAATAGAGCAAGTAAGCCCTTCAA
ATTAGTTCATTATGATATTTGGGGTCCTTGTCCTATTGTTTCCAAAACTGGATTTCGATACTTCGTTACCTTTGTTGATGATTACTCTCGTATGACTTGGTTATACTTTA
TGAAGAGTCGTTCTGAGTTACTTTCTCACTTTTGTAATTTTCATGTTGAAATTCGAACTCAGTTTGATGTTTCTCTTAAAATTTTGAGAAGTGATAATGCTGGTGAGTAC
TTTTTTGAAGCATTTAGGTCGTACCTATGTAAGCATGAAATTATTCATCATTCATCTTGTGCTGATACTCCTTCCCAAAATGGAGTTGCTGAACGAAAAAATAGGCATTT
CCTTGAAACTGCAAGAGCTTTATTCTTTCAAATGAGTGTTCCAAAGCAATTTTGGGTTGATGCAATTTCTACAACTTTCTTCTTGATTAATCACATGCCTTCATTTGTTC
TTAAAGGTCAGATTCCTTATCTTGTTCTATTCCCCACAAAATCGTTGTTTCCTATTAAGTCGAAAATATTTGGTTGTACTTGTTTTGTTAGAGATGTTCGTCCTAATCTA
ACAAAATTAGACCCCAAATCTTTAAAGTGCATTTTCTTAGGCTATTCTCGTGTTCAAAAAGGTTATAGATGCTATTGTCCTAGTTTGAATAGGTATCTTGTATCTCGTGA
TTTTACTTTATTTGAGGACACGCCTTTCAAATCATCTTTGCCTAGTACAAGTCAGGAGGAGGAGGATGATCTTTTTATTTATACACTTGTACCTCCTACACCCTCTACCG
ATCCATCTCCACTTGTGTCTCCTACACCCTCTCTCGAACCATCTCCTTCGACATTTGTTCCTACTCGTCCACCTATTACTCAAGTATACTCTAAACGACAACAACCTCCA
GGTGAATGTCTTGTACCACAAGATTTTGCGTCATCGGATCGGGGATCAAGTGATGAGCTTCCTATTGCTCTCCGTAAAGGTAAACGTTCTTGCACTTTTTGTATTTCTTC
ATTTGTCTCTTATGACCACTTGTCATCTTCTACATGTTCTTTTGTTGCACCTCTGGACTCTATCTCGATTCCTAAAACGGTTCATGAAGCCTTGTCTCATCTTGGTTGGC
GCAATGCGATGATAGAAGAGATGAATGCTTTATATGATAATGGTACCTGGGATTTAGTATCTCGTCCAGCAGGAAAGAAAGCTATCGGGTGTAAATGGGTGTTTGCAATT
AAGGTAAATCCTGATGGTTCAGTTGCCCGATTGAAAGCACGTCTTGTTGCCAAAGGCTATGCTCAAACATATGGGATTGATTATTTTGATACGTTTTCTCCTGTTGCTAA
ATTAACCTCTATAAGGTTATTCATTTCCATGGCTGCTTCTCAATGTTGGCCTTTGCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATAGTCAAAACCCCTCATTTTTATCTCCCTTAGGAGCAGAAGCTGCGGCGGTACTCGAAGGTCTTCGACTTGCTAAAAATATGAGTTTTGATCAAGTAATGTTGCT
TTCAGACTGCCTTGGGTTAATCTCCATGATTAATGGAGAAACGGAGACTGATATGGAAATCCAATCTTCCATTTGGGATATTAAAGAACTTTGCTCTCGGTTTTGTAGGG
TCATCCTACACGCACCTCGACTAATCTCACGGGAACTCACCTGGCCTTCACGTGACATCGACAGTGAATTTGTAGAAGCTGGGGAAGACGTGACCTGCAAGACCAGCAAG
AATCGCTGGACACCGGGTTTGGAGGCGTTCTGGGACGAAGCAGGCTGGACCGAGGCAGCTGGAAGCGGTAGGGGCCGAGCGGAGGCAAACGGACTCGGCCTTGGGCCGAG
GCCGACCGTACGGGTCGGGCCACGTTGGCCCAGCCCGTCGGTATGGTCTCCCTCTGGGTCCTTTTCCTGGTCCTATCTCCTCCCGGTTATCTCGTCAGCTCCTCCCATGA
TTTCCTCCACCAAGCTCAAATCTGTAGATTTCTACAGGAAAATACCAAGAGATTTAACGGAGGCAACATTATCAGGTGCAGGATTATCCATAGTAGCAGCTCTGTCCATG
GTGTTTTTATTTGGAATGGAATTGAGCAATTATTTGAGTGTTAGCACGTCTACATCTGTAATTGTTGACAACAGTAGTGATGGAGACTTCTTACGGATGGACTTTAATAT
GAGTTTCCCTGCACTCTCATGTGAATTTGCTGCCGTAGATGTGAACGATGTGTTAGGAACTAATAGGTTGAATATTACGAAAACAATCCGTAAATTTTCTATAGATTCAA
ATTTGCGGTCCACTGGATCTGAGTTTCACTCAGGACCACTATCGAATTTGATTAAGCATGGGGATGAAGTAGATGAAGAAACCAATGAAGGTTCTGCTACGTTAACTGCT
CGGAACTTTGATAGATTTGCTAATCAGCATCCTATATTGGTGGTTAATTTTTATGCTCCTTGGTGCTACTGGAGTAATAGGCTGAAACCTTCGTGGGAGAAGGCTGCCAA
AACTATAAGAGAAAGATATGATCCAGAATTAGATGGACGCATTCTCTTGGCAAAGGTTGATTGCACGGATGAAGAGTTTCTTCTCATCCAAGTGGGGTTAGGGTTTGAAA
TTTGGGTCACCACATTAGAGGGGTTGTTTGTCGACACCACCCACCTAAAGAGAAGTGCTGCCGTTAGGGGAGTCGAATCGCCGTCGTCGTTCGTCGGAAGTAGCTGCGCT
GATGGCTCACGTGCCGACGAAGAGTTCTACAGCCGCCGCCACACGCTGGCGCGTGGGAGGTCTTGTTTCCCTTCCGTCACTTGGTCCATCCATGATTCATCTAAGCTACC
ATTTGTTACTTCTACTACCGAGTTTGCTAAATTTCAACTGTACCAGAAGTCATTGAAAGCATATTCTTTATCATCCGTGCTTACCCCTATTACAACCACTGCTGAGTCAG
GTAACATGAATCATTGTCTTCTTTCCTCCTCTACCAAATGGGTCATAGACTCTAGTGCAACCAATCATATGACAGGTCATCCTTCTCTTTCAATGCTAAGAAAACTTTGT
CCTCAATTTCATAACTTGTCTTCATTAAATTGTGACTCATGTCAGTTTGTCAAATTTCATCGTCTTAGCTCTAGTCCTAGAATAGATAATAGAGCAAGTAAGCCCTTCAA
ATTAGTTCATTATGATATTTGGGGTCCTTGTCCTATTGTTTCCAAAACTGGATTTCGATACTTCGTTACCTTTGTTGATGATTACTCTCGTATGACTTGGTTATACTTTA
TGAAGAGTCGTTCTGAGTTACTTTCTCACTTTTGTAATTTTCATGTTGAAATTCGAACTCAGTTTGATGTTTCTCTTAAAATTTTGAGAAGTGATAATGCTGGTGAGTAC
TTTTTTGAAGCATTTAGGTCGTACCTATGTAAGCATGAAATTATTCATCATTCATCTTGTGCTGATACTCCTTCCCAAAATGGAGTTGCTGAACGAAAAAATAGGCATTT
CCTTGAAACTGCAAGAGCTTTATTCTTTCAAATGAGTGTTCCAAAGCAATTTTGGGTTGATGCAATTTCTACAACTTTCTTCTTGATTAATCACATGCCTTCATTTGTTC
TTAAAGGTCAGATTCCTTATCTTGTTCTATTCCCCACAAAATCGTTGTTTCCTATTAAGTCGAAAATATTTGGTTGTACTTGTTTTGTTAGAGATGTTCGTCCTAATCTA
ACAAAATTAGACCCCAAATCTTTAAAGTGCATTTTCTTAGGCTATTCTCGTGTTCAAAAAGGTTATAGATGCTATTGTCCTAGTTTGAATAGGTATCTTGTATCTCGTGA
TTTTACTTTATTTGAGGACACGCCTTTCAAATCATCTTTGCCTAGTACAAGTCAGGAGGAGGAGGATGATCTTTTTATTTATACACTTGTACCTCCTACACCCTCTACCG
ATCCATCTCCACTTGTGTCTCCTACACCCTCTCTCGAACCATCTCCTTCGACATTTGTTCCTACTCGTCCACCTATTACTCAAGTATACTCTAAACGACAACAACCTCCA
GGTGAATGTCTTGTACCACAAGATTTTGCGTCATCGGATCGGGGATCAAGTGATGAGCTTCCTATTGCTCTCCGTAAAGGTAAACGTTCTTGCACTTTTTGTATTTCTTC
ATTTGTCTCTTATGACCACTTGTCATCTTCTACATGTTCTTTTGTTGCACCTCTGGACTCTATCTCGATTCCTAAAACGGTTCATGAAGCCTTGTCTCATCTTGGTTGGC
GCAATGCGATGATAGAAGAGATGAATGCTTTATATGATAATGGTACCTGGGATTTAGTATCTCGTCCAGCAGGAAAGAAAGCTATCGGGTGTAAATGGGTGTTTGCAATT
AAGGTAAATCCTGATGGTTCAGTTGCCCGATTGAAAGCACGTCTTGTTGCCAAAGGCTATGCTCAAACATATGGGATTGATTATTTTGATACGTTTTCTCCTGTTGCTAA
ATTAACCTCTATAAGGTTATTCATTTCCATGGCTGCTTCTCAATGTTGGCCTTTGCATTAG
Protein sequenceShow/hide protein sequence
MYSQNPSFLSPLGAEAAAVLEGLRLAKNMSFDQVMLLSDCLGLISMINGETETDMEIQSSIWDIKELCSRFCRVILHAPRLISRELTWPSRDIDSEFVEAGEDVTCKTSK
NRWTPGLEAFWDEAGWTEAAGSGRGRAEANGLGLGPRPTVRVGPRWPSPSVWSPSGSFSWSYLLPVISSAPPMISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSM
VFLFGMELSNYLSVSTSTSVIVDNSSDGDFLRMDFNMSFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSIDSNLRSTGSEFHSGPLSNLIKHGDEVDEETNEGSATLTA
RNFDRFANQHPILVVNFYAPWCYWSNRLKPSWEKAAKTIRERYDPELDGRILLAKVDCTDEEFLLIQVGLGFEIWVTTLEGLFVDTTHLKRSAAVRGVESPSSFVGSSCA
DGSRADEEFYSRRHTLARGRSCFPSVTWSIHDSSKLPFVTSTTEFAKFQLYQKSLKAYSLSSVLTPITTTAESGNMNHCLLSSSTKWVIDSSATNHMTGHPSLSMLRKLC
PQFHNLSSLNCDSCQFVKFHRLSSSPRIDNRASKPFKLVHYDIWGPCPIVSKTGFRYFVTFVDDYSRMTWLYFMKSRSELLSHFCNFHVEIRTQFDVSLKILRSDNAGEY
FFEAFRSYLCKHEIIHHSSCADTPSQNGVAERKNRHFLETARALFFQMSVPKQFWVDAISTTFFLINHMPSFVLKGQIPYLVLFPTKSLFPIKSKIFGCTCFVRDVRPNL
TKLDPKSLKCIFLGYSRVQKGYRCYCPSLNRYLVSRDFTLFEDTPFKSSLPSTSQEEEDDLFIYTLVPPTPSTDPSPLVSPTPSLEPSPSTFVPTRPPITQVYSKRQQPP
GECLVPQDFASSDRGSSDELPIALRKGKRSCTFCISSFVSYDHLSSSTCSFVAPLDSISIPKTVHEALSHLGWRNAMIEEMNALYDNGTWDLVSRPAGKKAIGCKWVFAI
KVNPDGSVARLKARLVAKGYAQTYGIDYFDTFSPVAKLTSIRLFISMAASQCWPLH