; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035455 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035455
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionexportin-4 isoform X1
Genome locationchr3:21818272..21831378
RNA-Seq ExpressionLag0035455
SyntenyLag0035455
Gene Ontology termsGO:0015031 - protein transport (biological process)
GO:0051168 - nuclear export (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005049 - nuclear export signal receptor activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily
IPR044189 - Exportin 4/7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588304.1 Exportin-4, partial [Cucurbita argyrosperma subsp. sororia]1.7e-21693.1Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLE NYLKTFYCWAIDAAVSVT++I+DSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGD+TKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

KAG7020867.1 Exportin-4 [Cucurbita argyrosperma subsp. argyrosperma]1.7e-21693.1Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLE NYLKTFYCWAIDAAVSVT++I+DSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGD+TKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

XP_023530569.1 exportin-4 isoform X1 [Cucurbita pepo subsp. pepo]2.3e-21693.35Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLE NYLKTFYCWAIDAAVSVT++IIDSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWI+PPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

XP_023530570.1 exportin-4 isoform X2 [Cucurbita pepo subsp. pepo]2.3e-21693.35Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLE NYLKTFYCWAIDAAVSVT++IIDSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWI+PPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

XP_023530571.1 exportin-4 isoform X3 [Cucurbita pepo subsp. pepo]2.3e-21693.35Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLE NYLKTFYCWAIDAAVSVT++IIDSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWI+PPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

TrEMBL top hitse value%identityAlignment
A0A6J1EZQ7 exportin-4 isoform X23.2e-21692.86Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLE NYLKTFYCWAIDAAVSVT++I+DSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGD+TKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAA+LFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

A0A6J1F4U8 exportin-4 isoform X33.2e-21692.86Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLE NYLKTFYCWAIDAAVSVT++I+DSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGD+TKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAA+LFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

A0A6J1HMV7 exportin-4 isoform X13.2e-21692.86Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCR+SLE NYLKTFYCWAIDAAVSVT++I+DSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLS LMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

A0A6J1HS30 exportin-4 isoform X33.2e-21692.86Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCR+SLE NYLKTFYCWAIDAAVSVT++I+DSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLS LMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

A0A6J1HTQ6 exportin-4 isoform X23.2e-21692.86Show/hide
Query:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL
        ++ S+ + +NQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCR+SLE NYLKTFYCWAIDAAVSVT++I+DSQTEVPEVKVCTAALRL
Subjt:  DRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRL

Query:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI
        MFQ LNWDFRN  GAK +ISFYFAG KDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSL GTI
Subjt:  MFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTI

Query:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS
        FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPF FD+LLKSIRPFGTLQLLS LMGEVVKVLMTHNSDEETWS
Subjt:  FHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWS

Query:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
        WQARDILLDSW ALL+P+ERSGQ+SLLPHEGISAAANLFALIVESELKAASASA DD+VESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER
Subjt:  WQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTER

Query:  LSKLNQ
        LSKLNQ
Subjt:  LSKLNQ

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.3e-3227.57Show/hide
Query:  LMEKLKALKYKIEEWSKENHSKATSKKRDLLSKIEEIDWLEDSN-------------NIPQNHIEERKSLKGQLMDLITDEQRSLHQKYKEIEEEILEWY
        L  +LK L+ + +  SK +  +  +K R  L +IE    L+  N             + P   + ++K  K Q+ D I +++  +     EI+  I E+Y
Subjt:  LMEKLKALKYKIEEWSKENHSKATSKKRDLLSKIEEIDWLEDSN-------------NIPQNHIEERKSLKGQLMDLITDEQRSLHQKYKEIEEEILEWY

Query:  RKLY----ESDNNQRFVLDGVDWSPTDSTWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEFWKIFWNILKPDIVEVFQEFFQKGIINKRTNETYIC
        + LY    E+       LD       +      L    +  EI  +I+ L   KSPGPDG T EF++ +   L P ++++FQ   ++GI+     E  I 
Subjt:  RKLY----ESDNNQRFVLDGVDWSPTDSTWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEFWKIFWNILKPDIVEVFQEFFQKGIINKRTNETYIC

Query:  LIPKK-KKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDYRIGK-RQGALLKLDLEKAYDMVNWDFLDE
        LIPK  +   K  ++RPISL+    K++ K+LA R+++ +   I   Q  F+ G Q    I  +   ++     K +   ++ +D EKA+D +   F+ +
Subjt:  LIPKK-KKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDYRIGK-RQGALLKLDLEKAYDMVNWDFLDE

Query:  ILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDTLIFCPN
         L   G    +   IR        +I++NG+         G RQG PLSP LF +V + + R+++   ++  +KG ++GK  E + L  +ADD +++  N
Subjt:  ILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDTLIFCPN

Query:  SELQLINWRDLIMLIMAGSGLRINMLKS
          +   N   LI      SG +IN+ KS
Subjt:  SELQLINWRDLIMLIMAGSGLRINMLKS

P08548 LINE-1 reverse transcriptase homolog5.9e-3424.58Show/hide
Query:  EKLKALKYKIEEWSKENHSKATSKKRDLLSKIE-EIDWLEDSNNIPQ------------NHIEE------RKSLKGQLMDLITDEQRSLHQKYKEIEEEI
        E++  L   +++  KE HS     +R  ++KI  E++ +E+   I Q            N I++      RK     L+  I +    +     EI++ +
Subjt:  EKLKALKYKIEEWSKENHSKATSKKRDLLSKIE-EIDWLEDSNNIPQ------------NHIEE------RKSLKGQLMDLITDEQRSLHQKYKEIEEEI

Query:  LEWYRKL----YESDNNQRFVLDGVDWSPTDSTWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEFWKIFWNILKPDIVEVFQEFFQKGIINKRTNE
         E+Y+KL    YE+       L+              L    S  EI   I +L   KSPGPDG T EF++ F   L P ++ +FQ   ++GI+     E
Subjt:  LEWYRKL----YESDNNQRFVLDGVDWSPTDSTWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEFWKIFWNILKPDIVEVFQEFFQKGIINKRTNE

Query:  TYICLIPKK-KKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDY-RIGKRQGALLKLDLEKAYDMVNWD
          I LIPK  K   +  +YRPISL+    K++ K+L  R+++ +   I   Q  F+ G Q    I  +   ++   ++  +   +L +D EKA+D +   
Subjt:  TYICLIPKK-KKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDY-RIGKRQGALLKLDLEKAYDMVNWD

Query:  FLDEILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDTLI
        F+   L   G    +   I         +I++NG          G RQG PLSP LF +V + +  +++   E+  +KG  IG   E I L  +ADD ++
Subjt:  FLDEILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDTLI

Query:  FCPNSELQLINWRDLIMLIMAGSGLRINMLKSSLIGINVDGRQQGT---GLRFLAVKGRL--------------------TIGR--IPNIQRWESYSSQS
        +  N+        ++I      SG +IN  KS       + + + T    + F  V  ++                    T+ +    ++ +W++     
Subjt:  FCPNSELQLINWRDLIMLIMAGSGLRINMLKSSLIGINVDGRQQGT---GLRFLAVKGRL--------------------TIGR--IPNIQRWESYSSQS

Query:  V-------LNSLP--IYNFSL--LRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTALPIQQGGLGVGSLR--QRNLAFMSKWLWRFSQEKNSLWR
        +       ++ LP  IYNF+   ++AP +  + LEKII +F+WN    +    L+  +  A     GG+ +  LR   +++   + W W  ++E + +W 
Subjt:  V-------LNSLP--IYNFSL--LRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTALPIQQGGLGVGSLR--QRNLAFMSKWLWRFSQEKNSLWR

Query:  KV
        ++
Subjt:  KV

P0C2F6 Putative ribonuclease H protein At1g657501.0e-3026.26Show/hide
Query:  RINMLKSSLIGINVDGRQQGTGLRFLAVKGRLTIGRIPNIQRWESYSSQSVLNSLPIYNFSLLRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTA
        RIN      I   V  R  G   + L+  GRLT+             +++VL+S+P+++ S +  P++I+  L+++ R F+W   A K   +LVKW    
Subjt:  RINMLKSSLIGINVDGRQQGTGLRFLAVKGRLTIGRIPNIQRWESYSSQSVLNSLPIYNFSLLRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTA

Query:  LPIQQGGLGVGSLRQRNLAFMSKWLWRFSQEKNSLWRKVIVSIYGSSHWGWKSDN---LHGKKGNRIWPTISANYHQ-FDQFTDFIVKSGRSIKFWEDCW
         P ++GGLGV + +  N A +SK  WR  QEKNSLW  V+   Y   H G   D+   +     +  W +I+            +I   G+ I+FW D W
Subjt:  LPIQQGGLGVGSLRQRNLAFMSKWLWRFSQEKNSLWRKVIVSIYGSSHWGWKSDN---LHGKKGNRIWPTISANYHQ-FDQFTDFIVKSGRSIKFWEDCW

Query:  CDELPLKSLFSDLFLISNKEASIADCWSYDSQTWDLAFRRGLFDREICSWVALVDKIKEVNLVTD-HDLIRWKLEASGKYSTKSMFYKLVNDSPKLKQPV
            PL  L +             D W    + WD A      D    +   L  +   ++LVT   D + WK    G++S +S +  L  D        
Subjt:  CDELPLKSLFSDLFLISNKEASIADCWSYDSQTWDLAFRRGLFDREICSWVALVDKIKEVNLVTD-HDLIRWKLEASGKYSTKSMFYKLVNDSPKLKQPV

Query:  S--TLIWNHKCPKKVKVFLWSLVYRSLNTDEKLQKKFSKWSLSPSACRLCLKAEENLDHLFLQCDFARSVWCFVGRLLGISFCLPRKIDDWLLEGL
        S    +W  + P++VK FLW +  +++ T+E+  ++    S   + C++C    E++ H+   C     +W  V           + + +WL + L
Subjt:  S--TLIWNHKCPKKVKVFLWSLVYRSLNTDEKLQKKFSKWSLSPSACRLCLKAEENLDHLFLQCDFARSVWCFVGRLLGISFCLPRKIDDWLLEGL

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-3224.96Show/hide
Query:  LMEKLKALKYKIEEWSKENHSKATSKKRDLLSKIEEIDWLEDSNNIPQNHIEERKSL--------KGQ----LMDLITDEQRSLHQKYKEIEEEILEWYR
        L   LKAL+ K     K +  +   K R  ++++E    ++  N       E+   +        KG     L++ I +E+  +    +EI+  I  +Y+
Subjt:  LMEKLKALKYKIEEWSKENHSKATSKKRDLLSKIEEIDWLEDSNNIPQNHIEERKSL--------KGQ----LMDLITDEQRSLHQKYKEIEEEILEWYR

Query:  KLYES-----DNNQRFVLDGVDWSPTDSTWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEFWKIFWNILKPDIVEVFQEFFQKGIINKRTNETYIC
        +LY +     D   +F LD       +    + L    S +EI  VI+ L   KSPGPDG + EF++ F   L P + ++F +   +G +     E  I 
Subjt:  KLYES-----DNNQRFVLDGVDWSPTDSTWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEFWKIFWNILKPDIVEVFQEFFQKGIINKRTNETYIC

Query:  LIPK-KKKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDY-RIGKRQGALLKLDLEKAYDMVNWDFLDE
        LIPK +K   K+ ++RPISL+    K++ K+LA R+++ +   I   Q  F+ G Q    I  +   +    ++  +   ++ LD EKA+D +   F+ +
Subjt:  LIPK-KKKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDY-RIGKRQGALLKLDLEKAYDMVNWDFLDE

Query:  ILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDTLIFC--
        +L   G    +   I+        +I +NG     I    G RQG PLSP+LF +V + + R+++   ++  +KG +IGK    ISLL  ADD +++   
Subjt:  ILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDTLIFC--

Query:  -PNSELQLINWRDLIMLIMAGSGLRINMLKSS--LIGINVDGRQQ----------GTGLRFLAVKGRLTIGRI-------------PNIQRWESYSSQSV
          NS  +L+N   LI       G +IN  KS   L   N    ++             +++L V     +  +              +++RW+      +
Subjt:  -PNSELQLINWRDLIMLIMAGSGLRINMLKSS--LIGINVDGRQQ----------GTGLRFLAVKGRLTIGRI-------------PNIQRWESYSSQSV

Query:  -------LNSLP--IYNFSL--LRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTALPIQQGGLGVGSLR--QRNLAFMSKWLWRFSQEKNSLWRK
               +  LP  IY F+   ++ P      LE  I  FVWN    +   +L+K + T+     GG+ +  L+   R +   + W W +   +   W +
Subjt:  -------LNSLP--IYNFSL--LRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTALPIQQGGLGVGSLR--QRNLAFMSKWLWRFSQEKNSLWRK

Query:  V
        +
Subjt:  V

P14381 Transposon TX1 uncharacterized 149 kDa protein2.8e-3123.85Show/hide
Query:  IEERKSLKGQLMDLITDEQRSLHQKYKEIEEEILEWYRKLYESDNNQRFVLDGV-DWSPTDS-TWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEF
        +E++K  + Q+  L  ++   L    + I +    +Y+ L+  D       + + D  P  S     +LE   + +E+ + +  + + KSPG DG+T EF
Subjt:  IEERKSLKGQLMDLITDEQRSLHQKYKEIEEEILEWYRKLYESDNNQRFVLDGV-DWSPTDS-TWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEF

Query:  WKIFWNILKPDIVEVFQEFFQKGIINKRTNETYICLIPKKKKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATE
        ++ FW+ L PD   V  E F+KG +        + L+PKK     + ++RP+SL+++ YK++AK ++ RLK VL   I   Q+  V GR I D + +  +
Subjt:  WKIFWNILKPDIVEVFQEFFQKGIINKRTNETYICLIPKKKKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATE

Query:  AVEDYRIGKRQGALLKLDLEKAYDMVNWDFLDEILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFC
         +   R      A L LD EKA+D V+  +L   L    FG ++  +++    +    + IN      +   RG+RQG PLS  L++L  +        C
Subjt:  AVEDYRIGKRQGALLKLDLEKAYDMVNWDFLDEILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFC

Query:  LEKSILKGWEIGKNFEMISLLQYADDTLIFCPNSELQLINWRDLIMLIMAGSGLRINMLKSS---------------------------LIGINVDGRQQ
        L +  L G  + +    + L  YADD ++   +  + L   ++   +  A S  RIN  KSS                            +G+ +   + 
Subjt:  LEKSILKGWEIGKNFEMISLLQYADDTLIFCPNSELQLINWRDLIMLIMAGSGLRINMLKSS---------------------------LIGINVDGRQQ

Query:  GTGLRFLAVKGRLTIGRIPNIQRWESYSS-------QSVLNSLPI----YNFSLLRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTALPIQQGGL
             F+ ++  +    +  + +W+ ++          V+N L      Y    L   +  I  +++ + +F+W       G + V    ++LP+++GG 
Subjt:  GTGLRFLAVKGRLTIGRIPNIQRWESYSS-------QSVLNSLPI----YNFSLLRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWTALPIQQGGL

Query:  GVGSLRQRNLAFMSKWLWRF
        GV  +R +   F  + + R+
Subjt:  GVGSLRQRNLAFMSKWLWRF

Arabidopsis top hitse value%identityAlignment
AT3G04490.1 unknown protein6.1e-14363.66Show/hide
Query:  INQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRLMFQTLNWD
        INQ+I G HG+DVQF GVNFLESLVSEFSPSTSSAMGLPREFHE CR+SLE N+LK+FY WA DAA+SVTS II+S + VPEVKVC A LRLM Q LNW+
Subjt:  INQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTSIIIDSQTEVPEVKVCTAALRLMFQTLNWD

Query:  F-RNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTIFHSDNGQ
        F  +  G + SI+ +  G +     ++++E  +VQPG +W DVL+SS H+ WL+N Y+++RQKF  + +WLDCP+AVSARKLIVQ CSL G IF S+N Q
Subjt:  F-RNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKLIVQFCSLTGTIFHSDNGQ

Query:  MHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWSWQARDIL
        M + HLL LL G++ WIDPPD +S+ IE G+  SEM+DGCRALLSI TVTTP VFDQLL+S+RPFGTL LLS LMGEVVKVLM +++DEETWS++ARDIL
Subjt:  MHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETWSWQARDIL

Query:  LDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTERLSKLNQ
        LD+WT LL  ++ SG ++ LP EGI AAA+LF+LIVESELK ASASA+ +  +     ASVSAMDERL +YALIARAA++ T+PFL  LF++ +++L+Q
Subjt:  LDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTERLSKLNQ

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.5e-0525.13Show/hide
Query:  VKSGRSIKFWEDCWCDELPLKSLFSDL---FLISNKEASIADCWSYDSQTWDLAFRRGLFDREICSWVALVDKIKEVNLVTDHDLIRWKLE---ASGKYS
        V SG + KFW D W    PL  +   L    +    +A + D  +    +W +A  R   +  I     L+ + + +      D   WK +    S ++S
Subjt:  VKSGRSIKFWEDCWCDELPLKSLFSDL---FLISNKEASIADCWSYDSQTWDLAFRRGLFDREICSWVALVDKIKEVNLVTDHDLIRWKLE---ASGKYS

Query:  TKSMFYKLVNDSPKLKQPVSTLIWNHKCPKKVKVFLWSLVYRSLNTDEKLQKKFSKWSLS-PSACRLCLKAEENLDHLFLQCDFARSVWCF
            +  L   S  +  P    +W      K     W + +  L+T ++LQ     W LS P+ C LC   +++  HLF +C F+  VW F
Subjt:  TKSMFYKLVNDSPKLKQPVSTLIWNHKCPKKVKVFLWSLVYRSLNTDEKLQKKFSKWSLS-PSACRLCLKAEENLDHLFLQCDFARSVWCF

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.4e-0943.21Show/hide
Query:  LAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDYR--IGKRQGALLKLDLEKAYDMVNWDFLDEILALKGFGMKW
        + ERLK ++   I   QA+F+ GR   D I+   EAV   R   G +   LLKLDLEKAYD + WD+L++ L   GF   W
Subjt:  LAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDYR--IGKRQGALLKLDLEKAYDMVNWDFLDEILALKGFGMKW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.3e-0947.89Show/hide
Query:  MINGRPRGKILASRGLRQGDPLSPFLFTL---VGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDT
        +ING P+G +  SRGLRQGDPLSP+LF L   V   +CR  Q   E+  L G  +  N   I+ L +ADDT
Subjt:  MINGRPRGKILASRGLRQGDPLSPFLFTL---VGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATTGGAATGGGGTCGCGTGTTTGGAATCGGACTTATCCATGTCAAGCCCAGCTAGTAGAACCACCGTAGATATGGAGGAAGAGCGTTTAAACTCCATCGAGAA
AAATCAGGAAGAATTCCCAGAAGGATACCATGAATGCTTTGCCCTTGATAATGTGTATGAAAGCAGAAATCCCACGCAAGAGGGGAACATTAGAGATAGCAAAATACAGG
GACTGGTGATGAATAATATTGAAATCCCAAACAAGAACTCTCATGAAAAATTTGACTATGAGGAGACGATTGAGGAAACTCCAGAGGCCCTTGCTGTTATGCCGCCTGAG
AAGGAGAAGGGTAAAAGTAAAGTTAGTTGTCACATTGAGGGGTTTGCGATTAGTAAAGAGATGGTGCTAACCCTTAGGAAAAATAACTTATGCATTAGACCGATTATGGG
AACTAACAACAGAAAAGGTAGTACTTCGCAGAAGAGAAGGAACAGAGAGGTGACAAGCCTTTTAAGAACGTGGGAGAAGGAACCAGAACCGCCGATAGACCTGATAGAGG
AAGTAGAGGAAGATATTTTGTCTGAAAATAGGGAGTCTAGGTTGAATCATGTAGACAGAAGGCTCGTGAAATCCATTTGGAGCTCTAAGCACATAGCTTGGCTTGCTTTG
GATGCCATTAACTCGGCAGGGGGAATTCTCGTGATGTGGAAGGAAAATGGGATTGATGTGTTTGACTCGGTCATTGGAGCTTTCTCCATCTTGATTCACTGCCATTTTCA
AGGTCACAACGAGGGCTGGATTACGGGGGTGTATGGGCATTGTTCCTACTCGGAAAGAGGTGACTTTAATATGATAAGATGGTCCAAGGAGAGACTCAACGCTAGCAATT
CCTCTAGAAGTATGGCTAAATTCAATAGATTCATTGATTTGTCTGATCTCATTGATCCCCCCATGTTCAATGGTAGAGTGGAGAGGCTTCTTCGCCCAACCTCGAACCAC
TTCCCTATTATGATGGCTATTGGTACGATGAAATGGGGACCAACCCCATTTAGATTCGAGAATACTTGGCTGGATAACCCAAATTTCAAAAGCAAAGTAGACTCTTGGTG
GAAAGAGCTGAATCCTATTGGCTGGGCCGGTTTTAAACTTATGGAAAAGTTGAAAGCCCTGAAGTATAAAATAGAGGAATGGAGCAAGGAAAATCATTCCAAGGCTACCA
GTAAAAAGAGGGACCTTTTGAGTAAGATAGAGGAGATAGACTGGTTAGAAGATAGTAACAATATCCCGCAGAATCATATTGAGGAGAGGAAAAGTCTCAAGGGCCAGCTC
ATGGATCTTATTACGGATGAGCAGAGGAGTTTACACCAAAAATATAAGGAAATTGAAGAGGAAATTCTAGAATGGTACAGAAAATTATATGAGTCCGATAACAATCAAAG
ATTTGTGCTCGATGGGGTGGACTGGTCTCCAACTGACAGTACATGGAGTAATAAACTGGAAGATAGCTTCAGTGAAGAGGAAATCCGCAAAGTTATCAGTGATTTGGGTA
ATTTGAAGTCTCCAGGTCCAGATGGGATGACAGGTGAATTTTGGAAAATTTTTTGGAACATTTTGAAGCCCGATATAGTAGAGGTGTTCCAAGAATTTTTTCAAAAGGGT
ATTATCAACAAGAGAACTAATGAGACCTATATTTGCTTAATTCCTAAAAAGAAGAAAGCTGCCAAGGTGAGTGATTACAGACCGATAAGCCTAATTACTTCGCCATACAA
GCTGATTGCTAAAGTGCTAGCCGAGAGATTGAAGAAAGTTCTTCCCCTTACCATAAGTGATTGCCAAGCGGCTTTTGTCCAAGGCAGGCAGATTCTTGATGCTATTTTAG
TGGCCACTGAAGCGGTGGAAGACTACAGAATCGGGAAAAGACAAGGTGCTTTGCTTAAGCTCGACCTTGAAAAGGCTTATGATATGGTTAATTGGGATTTTTTAGATGAG
ATTCTAGCCTTGAAGGGCTTTGGGATGAAATGGAGGACATGGATCAGGGGCTGTCTTAAAAATACTAACTTCTCGATTATGATTAATGGGCGACCGAGAGGGAAGATCCT
TGCTTCTAGAGGCCTTAGACAAGGGGATCCGTTATCTCCGTTCCTCTTTACGTTAGTTGGGGATGCTATTTGTAGATCTGTGCAATTTTGTCTTGAGAAAAGTATCCTCA
AGGGATGGGAGATAGGGAAGAACTTTGAAATGATTTCCTTGCTTCAATATGCAGATGATACTCTTATTTTCTGTCCGAACAGTGAGCTTCAGTTGATCAATTGGAGGGAC
TTAATTATGTTGATTATGGCTGGATCGGGCTTGAGGATAAACATGTTGAAATCTTCTTTGATTGGCATTAATGTTGATGGGAGACAGCAAGGCACTGGGCTGAGATTTTT
GGCTGTCAAAGGAAGATTGACCATTGGAAGGATTCCCAATATCCAAAGGTGGGAGAGTTACTCTTCCCAATCGGTGCTTAATAGTCTTCCTATCTATAATTTCTCCCTTC
TTAGAGCTCCCAAAGCCATTATTAGATCTTTGGAAAAGATTATCAGAAACTTTGTGTGGAATGGGGGAGCTTATAAACCGGGGGCTAATCTCGTTAAATGGGAGTGGACT
GCTTTGCCTATTCAACAGGGTGGCTTGGGTGTAGGCTCTCTAAGGCAAAGAAATTTGGCCTTTATGTCTAAGTGGCTTTGGAGGTTTTCCCAGGAGAAGAATTCTCTATG
GAGAAAGGTGATAGTCAGCATATATGGATCCTCTCATTGGGGATGGAAGTCCGACAATCTGCATGGAAAAAAAGGGAATAGAATATGGCCTACTATTTCTGCGAATTACC
ACCAATTTGATCAGTTTACTGACTTCATTGTCAAAAGTGGCAGAAGTATCAAGTTTTGGGAAGATTGTTGGTGTGATGAGCTCCCCCTAAAATCCCTTTTCTCAGATTTA
TTCCTGATTTCAAACAAGGAGGCGTCCATAGCAGATTGTTGGAGCTATGATTCTCAAACGTGGGATTTGGCCTTTAGAAGAGGCCTTTTTGATAGGGAAATTTGCAGCTG
GGTAGCGCTGGTGGACAAAATTAAAGAGGTAAATTTGGTGACTGACCATGATTTGATTAGATGGAAGCTGGAAGCCTCTGGGAAGTACTCGACCAAATCCATGTTCTATA
AGCTGGTCAATGATTCCCCTAAATTGAAGCAGCCCGTGAGTACTCTTATATGGAACCATAAATGCCCTAAAAAAGTTAAAGTTTTCTTATGGTCCCTAGTCTATAGAAGC
TTAAACACGGACGAGAAATTGCAAAAGAAGTTCAGTAAGTGGTCGCTTTCCCCCTCTGCTTGTAGATTGTGCCTTAAAGCTGAAGAAAATCTAGACCACCTCTTCCTGCA
ATGTGATTTTGCGAGGTCAGTCTGGTGCTTTGTTGGAAGGCTGCTGGGAATATCCTTTTGTCTGCCTAGGAAAATTGACGATTGGCTCTTGGAAGGTCTGAATGCGTGGA
ACCTTAAGAGCAAGGCGAGGGTTTTGGCTAGTTGTGCTTTAGAACTACTCTTTGGACCCTGTGGAAAGAAAGAAATGCTAGAACCTTTGAAGACAAGAATCCCTTCTTCT
ATTGCCTGTGGGTTGGAAAGGATCCCTTCTTCTATTGCATGTAAAGGAGTCGAGGGCAAGGTTCTCACTCGGTTAGATGGGAGGTTGTTACAAGATCGGCCGAGTTGGAT
GGTTTGGATAAATCAGTCTATTCATGGTGTTCATGGCATTGATGTGCAATTTTGTGGAGTTAACTTCCTGGAATCATTGGTATCGGAATTTTCACCCTCTACTTCAAGTG
CAATGGGTCTTCCAAGGGAGTTTCATGAGCAGTGTAGGAGGTCATTGGAGTTGAACTACCTGAAGACGTTCTACTGTTGGGCAATAGATGCTGCCGTAAGTGTCACGAGC
ATAATAATTGATTCTCAGACAGAGGTTCCAGAAGTCAAAGTTTGTACGGCTGCTTTACGTTTGATGTTTCAAACCTTGAACTGGGATTTTCGTAATACTGCTGGTGCTAA
GGGCAGTATAAGCTTTTACTTTGCTGGAGCCAAGGATCATGGTGATGCAACCAAGAGATCCGAGTATAACTTGGTGCAGCCTGGTCCAGCTTGGCGTGATGTTTTGATTT
CAAGTGGCCATATTTCATGGCTTTTGAATTTGTATGCGGCACTTCGACAGAAGTTTTCATGTCAAGCCTTTTGGCTTGACTGCCCTATAGCTGTATCTGCTCGAAAGCTA
ATTGTACAATTTTGCTCCTTGACAGGGACGATATTTCATTCTGATAATGGGCAGATGCATGAAAATCATCTATTACAGCTTCTTTTGGGGATTATACAGTGGATTGATCC
TCCTGATGCTGTTTCGAGAGCTATCGAAAGTGGAAAATGTGAAAGTGAGATGCTTGATGGTTGTCGTGCATTGCTATCCATCGCGACTGTAACAACTCCCTTTGTGTTTG
ACCAACTACTTAAATCAATCAGGCCATTTGGCACGCTTCAACTGTTATCTAGTTTGATGGGCGAAGTTGTAAAGGTTCTTATGACCCATAACAGTGATGAGGAGACGTGG
AGTTGGCAAGCCCGTGATATATTACTCGATTCCTGGACTGCCCTTCTCATACCACTAGAGAGGTCTGGTCAGAGTTCATTGCTTCCACACGAAGGAATCAGCGCTGCAGC
TAACCTGTTTGCTCTGATTGTGGAGTCAGAGCTAAAAGCTGCATCTGCCTCAGCATCGGATGACAATGTTGAATCTGAATATTTTCAAGCTTCAGTTTCTGCCATGGATG
AAAGATTAAGTGCTTATGCTCTTATTGCAAGGGCAGCAATAAATGTCACAGTTCCCTTCCTCATAGGACTGTTTACAGAGCGTCTCTCTAAGCTTAATCAGGTTCCCCTT
TTCAAGCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGATTGGAATGGGGTCGCGTGTTTGGAATCGGACTTATCCATGTCAAGCCCAGCTAGTAGAACCACCGTAGATATGGAGGAAGAGCGTTTAAACTCCATCGAGAA
AAATCAGGAAGAATTCCCAGAAGGATACCATGAATGCTTTGCCCTTGATAATGTGTATGAAAGCAGAAATCCCACGCAAGAGGGGAACATTAGAGATAGCAAAATACAGG
GACTGGTGATGAATAATATTGAAATCCCAAACAAGAACTCTCATGAAAAATTTGACTATGAGGAGACGATTGAGGAAACTCCAGAGGCCCTTGCTGTTATGCCGCCTGAG
AAGGAGAAGGGTAAAAGTAAAGTTAGTTGTCACATTGAGGGGTTTGCGATTAGTAAAGAGATGGTGCTAACCCTTAGGAAAAATAACTTATGCATTAGACCGATTATGGG
AACTAACAACAGAAAAGGTAGTACTTCGCAGAAGAGAAGGAACAGAGAGGTGACAAGCCTTTTAAGAACGTGGGAGAAGGAACCAGAACCGCCGATAGACCTGATAGAGG
AAGTAGAGGAAGATATTTTGTCTGAAAATAGGGAGTCTAGGTTGAATCATGTAGACAGAAGGCTCGTGAAATCCATTTGGAGCTCTAAGCACATAGCTTGGCTTGCTTTG
GATGCCATTAACTCGGCAGGGGGAATTCTCGTGATGTGGAAGGAAAATGGGATTGATGTGTTTGACTCGGTCATTGGAGCTTTCTCCATCTTGATTCACTGCCATTTTCA
AGGTCACAACGAGGGCTGGATTACGGGGGTGTATGGGCATTGTTCCTACTCGGAAAGAGGTGACTTTAATATGATAAGATGGTCCAAGGAGAGACTCAACGCTAGCAATT
CCTCTAGAAGTATGGCTAAATTCAATAGATTCATTGATTTGTCTGATCTCATTGATCCCCCCATGTTCAATGGTAGAGTGGAGAGGCTTCTTCGCCCAACCTCGAACCAC
TTCCCTATTATGATGGCTATTGGTACGATGAAATGGGGACCAACCCCATTTAGATTCGAGAATACTTGGCTGGATAACCCAAATTTCAAAAGCAAAGTAGACTCTTGGTG
GAAAGAGCTGAATCCTATTGGCTGGGCCGGTTTTAAACTTATGGAAAAGTTGAAAGCCCTGAAGTATAAAATAGAGGAATGGAGCAAGGAAAATCATTCCAAGGCTACCA
GTAAAAAGAGGGACCTTTTGAGTAAGATAGAGGAGATAGACTGGTTAGAAGATAGTAACAATATCCCGCAGAATCATATTGAGGAGAGGAAAAGTCTCAAGGGCCAGCTC
ATGGATCTTATTACGGATGAGCAGAGGAGTTTACACCAAAAATATAAGGAAATTGAAGAGGAAATTCTAGAATGGTACAGAAAATTATATGAGTCCGATAACAATCAAAG
ATTTGTGCTCGATGGGGTGGACTGGTCTCCAACTGACAGTACATGGAGTAATAAACTGGAAGATAGCTTCAGTGAAGAGGAAATCCGCAAAGTTATCAGTGATTTGGGTA
ATTTGAAGTCTCCAGGTCCAGATGGGATGACAGGTGAATTTTGGAAAATTTTTTGGAACATTTTGAAGCCCGATATAGTAGAGGTGTTCCAAGAATTTTTTCAAAAGGGT
ATTATCAACAAGAGAACTAATGAGACCTATATTTGCTTAATTCCTAAAAAGAAGAAAGCTGCCAAGGTGAGTGATTACAGACCGATAAGCCTAATTACTTCGCCATACAA
GCTGATTGCTAAAGTGCTAGCCGAGAGATTGAAGAAAGTTCTTCCCCTTACCATAAGTGATTGCCAAGCGGCTTTTGTCCAAGGCAGGCAGATTCTTGATGCTATTTTAG
TGGCCACTGAAGCGGTGGAAGACTACAGAATCGGGAAAAGACAAGGTGCTTTGCTTAAGCTCGACCTTGAAAAGGCTTATGATATGGTTAATTGGGATTTTTTAGATGAG
ATTCTAGCCTTGAAGGGCTTTGGGATGAAATGGAGGACATGGATCAGGGGCTGTCTTAAAAATACTAACTTCTCGATTATGATTAATGGGCGACCGAGAGGGAAGATCCT
TGCTTCTAGAGGCCTTAGACAAGGGGATCCGTTATCTCCGTTCCTCTTTACGTTAGTTGGGGATGCTATTTGTAGATCTGTGCAATTTTGTCTTGAGAAAAGTATCCTCA
AGGGATGGGAGATAGGGAAGAACTTTGAAATGATTTCCTTGCTTCAATATGCAGATGATACTCTTATTTTCTGTCCGAACAGTGAGCTTCAGTTGATCAATTGGAGGGAC
TTAATTATGTTGATTATGGCTGGATCGGGCTTGAGGATAAACATGTTGAAATCTTCTTTGATTGGCATTAATGTTGATGGGAGACAGCAAGGCACTGGGCTGAGATTTTT
GGCTGTCAAAGGAAGATTGACCATTGGAAGGATTCCCAATATCCAAAGGTGGGAGAGTTACTCTTCCCAATCGGTGCTTAATAGTCTTCCTATCTATAATTTCTCCCTTC
TTAGAGCTCCCAAAGCCATTATTAGATCTTTGGAAAAGATTATCAGAAACTTTGTGTGGAATGGGGGAGCTTATAAACCGGGGGCTAATCTCGTTAAATGGGAGTGGACT
GCTTTGCCTATTCAACAGGGTGGCTTGGGTGTAGGCTCTCTAAGGCAAAGAAATTTGGCCTTTATGTCTAAGTGGCTTTGGAGGTTTTCCCAGGAGAAGAATTCTCTATG
GAGAAAGGTGATAGTCAGCATATATGGATCCTCTCATTGGGGATGGAAGTCCGACAATCTGCATGGAAAAAAAGGGAATAGAATATGGCCTACTATTTCTGCGAATTACC
ACCAATTTGATCAGTTTACTGACTTCATTGTCAAAAGTGGCAGAAGTATCAAGTTTTGGGAAGATTGTTGGTGTGATGAGCTCCCCCTAAAATCCCTTTTCTCAGATTTA
TTCCTGATTTCAAACAAGGAGGCGTCCATAGCAGATTGTTGGAGCTATGATTCTCAAACGTGGGATTTGGCCTTTAGAAGAGGCCTTTTTGATAGGGAAATTTGCAGCTG
GGTAGCGCTGGTGGACAAAATTAAAGAGGTAAATTTGGTGACTGACCATGATTTGATTAGATGGAAGCTGGAAGCCTCTGGGAAGTACTCGACCAAATCCATGTTCTATA
AGCTGGTCAATGATTCCCCTAAATTGAAGCAGCCCGTGAGTACTCTTATATGGAACCATAAATGCCCTAAAAAAGTTAAAGTTTTCTTATGGTCCCTAGTCTATAGAAGC
TTAAACACGGACGAGAAATTGCAAAAGAAGTTCAGTAAGTGGTCGCTTTCCCCCTCTGCTTGTAGATTGTGCCTTAAAGCTGAAGAAAATCTAGACCACCTCTTCCTGCA
ATGTGATTTTGCGAGGTCAGTCTGGTGCTTTGTTGGAAGGCTGCTGGGAATATCCTTTTGTCTGCCTAGGAAAATTGACGATTGGCTCTTGGAAGGTCTGAATGCGTGGA
ACCTTAAGAGCAAGGCGAGGGTTTTGGCTAGTTGTGCTTTAGAACTACTCTTTGGACCCTGTGGAAAGAAAGAAATGCTAGAACCTTTGAAGACAAGAATCCCTTCTTCT
ATTGCCTGTGGGTTGGAAAGGATCCCTTCTTCTATTGCATGTAAAGGAGTCGAGGGCAAGGTTCTCACTCGGTTAGATGGGAGGTTGTTACAAGATCGGCCGAGTTGGAT
GGTTTGGATAAATCAGTCTATTCATGGTGTTCATGGCATTGATGTGCAATTTTGTGGAGTTAACTTCCTGGAATCATTGGTATCGGAATTTTCACCCTCTACTTCAAGTG
CAATGGGTCTTCCAAGGGAGTTTCATGAGCAGTGTAGGAGGTCATTGGAGTTGAACTACCTGAAGACGTTCTACTGTTGGGCAATAGATGCTGCCGTAAGTGTCACGAGC
ATAATAATTGATTCTCAGACAGAGGTTCCAGAAGTCAAAGTTTGTACGGCTGCTTTACGTTTGATGTTTCAAACCTTGAACTGGGATTTTCGTAATACTGCTGGTGCTAA
GGGCAGTATAAGCTTTTACTTTGCTGGAGCCAAGGATCATGGTGATGCAACCAAGAGATCCGAGTATAACTTGGTGCAGCCTGGTCCAGCTTGGCGTGATGTTTTGATTT
CAAGTGGCCATATTTCATGGCTTTTGAATTTGTATGCGGCACTTCGACAGAAGTTTTCATGTCAAGCCTTTTGGCTTGACTGCCCTATAGCTGTATCTGCTCGAAAGCTA
ATTGTACAATTTTGCTCCTTGACAGGGACGATATTTCATTCTGATAATGGGCAGATGCATGAAAATCATCTATTACAGCTTCTTTTGGGGATTATACAGTGGATTGATCC
TCCTGATGCTGTTTCGAGAGCTATCGAAAGTGGAAAATGTGAAAGTGAGATGCTTGATGGTTGTCGTGCATTGCTATCCATCGCGACTGTAACAACTCCCTTTGTGTTTG
ACCAACTACTTAAATCAATCAGGCCATTTGGCACGCTTCAACTGTTATCTAGTTTGATGGGCGAAGTTGTAAAGGTTCTTATGACCCATAACAGTGATGAGGAGACGTGG
AGTTGGCAAGCCCGTGATATATTACTCGATTCCTGGACTGCCCTTCTCATACCACTAGAGAGGTCTGGTCAGAGTTCATTGCTTCCACACGAAGGAATCAGCGCTGCAGC
TAACCTGTTTGCTCTGATTGTGGAGTCAGAGCTAAAAGCTGCATCTGCCTCAGCATCGGATGACAATGTTGAATCTGAATATTTTCAAGCTTCAGTTTCTGCCATGGATG
AAAGATTAAGTGCTTATGCTCTTATTGCAAGGGCAGCAATAAATGTCACAGTTCCCTTCCTCATAGGACTGTTTACAGAGCGTCTCTCTAAGCTTAATCAGGTTCCCCTT
TTCAAGCATTAA
Protein sequenceShow/hide protein sequence
MDDWNGVACLESDLSMSSPASRTTVDMEEERLNSIEKNQEEFPEGYHECFALDNVYESRNPTQEGNIRDSKIQGLVMNNIEIPNKNSHEKFDYEETIEETPEALAVMPPE
KEKGKSKVSCHIEGFAISKEMVLTLRKNNLCIRPIMGTNNRKGSTSQKRRNREVTSLLRTWEKEPEPPIDLIEEVEEDILSENRESRLNHVDRRLVKSIWSSKHIAWLAL
DAINSAGGILVMWKENGIDVFDSVIGAFSILIHCHFQGHNEGWITGVYGHCSYSERGDFNMIRWSKERLNASNSSRSMAKFNRFIDLSDLIDPPMFNGRVERLLRPTSNH
FPIMMAIGTMKWGPTPFRFENTWLDNPNFKSKVDSWWKELNPIGWAGFKLMEKLKALKYKIEEWSKENHSKATSKKRDLLSKIEEIDWLEDSNNIPQNHIEERKSLKGQL
MDLITDEQRSLHQKYKEIEEEILEWYRKLYESDNNQRFVLDGVDWSPTDSTWSNKLEDSFSEEEIRKVISDLGNLKSPGPDGMTGEFWKIFWNILKPDIVEVFQEFFQKG
IINKRTNETYICLIPKKKKAAKVSDYRPISLITSPYKLIAKVLAERLKKVLPLTISDCQAAFVQGRQILDAILVATEAVEDYRIGKRQGALLKLDLEKAYDMVNWDFLDE
ILALKGFGMKWRTWIRGCLKNTNFSIMINGRPRGKILASRGLRQGDPLSPFLFTLVGDAICRSVQFCLEKSILKGWEIGKNFEMISLLQYADDTLIFCPNSELQLINWRD
LIMLIMAGSGLRINMLKSSLIGINVDGRQQGTGLRFLAVKGRLTIGRIPNIQRWESYSSQSVLNSLPIYNFSLLRAPKAIIRSLEKIIRNFVWNGGAYKPGANLVKWEWT
ALPIQQGGLGVGSLRQRNLAFMSKWLWRFSQEKNSLWRKVIVSIYGSSHWGWKSDNLHGKKGNRIWPTISANYHQFDQFTDFIVKSGRSIKFWEDCWCDELPLKSLFSDL
FLISNKEASIADCWSYDSQTWDLAFRRGLFDREICSWVALVDKIKEVNLVTDHDLIRWKLEASGKYSTKSMFYKLVNDSPKLKQPVSTLIWNHKCPKKVKVFLWSLVYRS
LNTDEKLQKKFSKWSLSPSACRLCLKAEENLDHLFLQCDFARSVWCFVGRLLGISFCLPRKIDDWLLEGLNAWNLKSKARVLASCALELLFGPCGKKEMLEPLKTRIPSS
IACGLERIPSSIACKGVEGKVLTRLDGRLLQDRPSWMVWINQSIHGVHGIDVQFCGVNFLESLVSEFSPSTSSAMGLPREFHEQCRRSLELNYLKTFYCWAIDAAVSVTS
IIIDSQTEVPEVKVCTAALRLMFQTLNWDFRNTAGAKGSISFYFAGAKDHGDATKRSEYNLVQPGPAWRDVLISSGHISWLLNLYAALRQKFSCQAFWLDCPIAVSARKL
IVQFCSLTGTIFHSDNGQMHENHLLQLLLGIIQWIDPPDAVSRAIESGKCESEMLDGCRALLSIATVTTPFVFDQLLKSIRPFGTLQLLSSLMGEVVKVLMTHNSDEETW
SWQARDILLDSWTALLIPLERSGQSSLLPHEGISAAANLFALIVESELKAASASASDDNVESEYFQASVSAMDERLSAYALIARAAINVTVPFLIGLFTERLSKLNQVPL
FKH