CuGenDBv2

CSPI01G21540 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G21540
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Nuclear factor of activated T-cells 5 isoform X1
Genome location: Chr1:17103963..17113890
RNA-Seq Expression: CSPI01G21540
Synteny: CSPI01G21540
Gene Ontology terms:
  GO:0006396 - RNA processing (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (columns: e value, %identity, alignment)
KAA0057094.1 nuclear factor of activated T-cells 5 isoform X1 [Cucumis melo var. makuwa] (e value: 0.0e+00; %identity: 90.71)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLK-AVDLCGTIHDVMKVWNRHIKLFPQSIRAMPY
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLK AVDLCGTIHDVMKVWNRHIKLFPQSIR MPY
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLK-AVDLCGTIHDVMKVWNRHIKLFPQSIRAMPY

Query:  KDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQ
        KDPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ
Subjt:  KDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQ

Query:  VNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGS
         NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGS
Subjt:  VNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGS

Query:  SSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQK
        S SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQK
Subjt:  SSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQK

Query:  PQVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQM
        PQVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQM
Subjt:  PQVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQM

Query:  QQQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH
        QQQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH
Subjt:  QQQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH

XP_008443864.1 PREDICTED: uncharacterized protein LOC103487357 isoform X1 [Cucumis melo] (e value: 0.0e+00; %identity: 90.84)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIR MPYK
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK

Query:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
        DPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ 
Subjt:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV

Query:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
        NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGSS
Subjt:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS

Query:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
         SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQKP
Subjt:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP

Query:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ
        QVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQMQ
Subjt:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ

Query:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH
        QQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH
Subjt:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH

XP_008443873.1 PREDICTED: uncharacterized protein LOC103487357 isoform X2 [Cucumis melo] (e value: 0.0e+00; %identity: 90.74)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIR MPYK
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK

Query:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
        DPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ 
Subjt:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV

Query:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
        NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGSS
Subjt:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS

Query:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
         SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQKP
Subjt:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP

Query:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ
        QVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQMQ
Subjt:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ

Query:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDHLDS
        QQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH DS
Subjt:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDHLDS

XP_008443890.1 PREDICTED: uncharacterized protein LOC103487357 isoform X3 [Cucumis melo] (e value: 0.0e+00; %identity: 90.84)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIR MPYK
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK

Query:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
        DPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ 
Subjt:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV

Query:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
        NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGSS
Subjt:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS

Query:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
         SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQKP
Subjt:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP

Query:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ
        QVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQMQ
Subjt:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ

Query:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH
        QQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH
Subjt:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH

XP_031740258.1 pre-mRNA-processing factor 39 isoform X2 [Cucumis sativus] (e value: 0.0e+00; %identity: 99.46)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
        EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK

Query:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
        DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
Subjt:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV

Query:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
        NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
Subjt:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS

Query:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
        SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
Subjt:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP

Query:  QVERISQDHNHIQSA-QQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNT
        QVERISQDHNHIQSA QQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNT
Subjt:  QVERISQDHNHIQSA-QQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNT

TrEMBL top hits (columns: e value, %identity, alignment)
A0A0A0LUZ9 Uncharacterized protein (e value: 4.5e-273; %identity: 99.38)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLK-AVDLCGTIHDVMKVWNRHIKLFPQSIRAMPY
        EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLK AVDLCGTIHDVMKVWNRHIKLFPQSIRAMPY
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLK-AVDLCGTIHDVMKVWNRHIKLFPQSIRAMPY

Query:  KDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQ
        KDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQ
Subjt:  KDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQ

Query:  VNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGS
        VNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGS
Subjt:  VNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGS

Query:  SSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTS
        SSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTS
Subjt:  SSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTS

A0A1S3B905 uncharacterized protein LOC103487357 isoform X1 (e value: 0.0e+00; %identity: 90.84)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIR MPYK
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK

Query:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
        DPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ 
Subjt:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV

Query:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
        NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGSS
Subjt:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS

Query:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
         SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQKP
Subjt:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP

Query:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ
        QVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQMQ
Subjt:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ

Query:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH
        QQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH
Subjt:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH

A0A1S3B928 uncharacterized protein LOC103487357 isoform X3 (e value: 0.0e+00; %identity: 90.84)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIR MPYK
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK

Query:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
        DPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ 
Subjt:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV

Query:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
        NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGSS
Subjt:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS

Query:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
         SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQKP
Subjt:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP

Query:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ
        QVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQMQ
Subjt:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ

Query:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH
        QQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH
Subjt:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH

A0A1S3B9U5 uncharacterized protein LOC103487357 isoform X2 (e value: 0.0e+00; %identity: 90.74)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIR MPYK
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYK

Query:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV
        DPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ 
Subjt:  DPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQV

Query:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS
        NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGSS
Subjt:  NSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSS

Query:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP
         SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQKP
Subjt:  SSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKP

Query:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ
        QVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQMQ
Subjt:  QVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQ

Query:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDHLDS
        QQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH DS
Subjt:  QQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDHLDS

A0A5D3DAZ3 Nuclear factor of activated T-cells 5 isoform X1 (e value: 0.0e+00; %identity: 90.71)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +KVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYR+AL+MALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLK-AVDLCGTIHDVMKVWNRHIKLFPQSIRAMPY
        EVLIDGIRNVPLCKLLLEELINFVMV+GVPKLINLVDPIVANAISLKADVS+GWSEQDREDISTLYLK AVDLCGTIHDVMKVWNRHIKLFPQSIR MPY
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLK-AVDLCGTIHDVMKVWNRHIKLFPQSIRAMPY

Query:  KDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQ
        KDPI   EAIK T+GGKQT D+TVT QPIRD +VNPSNQPPLEENK+SLLDNQNFKNDQSSNGNEPTSCLLVK NIA KESTID+INLGDSEI AEEREQ
Subjt:  KDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQ

Query:  VNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGS
         NSPKVLE YGSGGNQIESAQM  PMDNSKKDE GDALGVTLKNLSI SLSLNAKNNDKINL S+ACHEGEPPLENSLSSESV+NTDEEVVMHNPL+VGS
Subjt:  VNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGS

Query:  SSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQK
        S SIQISNE  SPSS PSP KPTH QVH+QFHMH+TGDRKWHHKR+AGNL HD QH FQGHSRRRPHRTWKDSPQDYRGM+SGQT GDQDYTSE+IASQK
Subjt:  SSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQK

Query:  PQVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQM
        PQVERISQD+NHIQS QQQNF TTSQSQLPSQGFTQEKSQ TT N EQYGHMQS Q PNTYEQMWQYY YQQQQQYLLQQQQLQQ+QNFQQQYYQ+QVQM
Subjt:  PQVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQM

Query:  QQQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH
        QQQYFQS  QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQ QEASQTDQLSFQQHEHQP ELEE EQKQHTKQVSSLSIQIQ GERDH
Subjt:  QQQYFQS--QYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQP-ELEEDEQKQHTKQVSSLSIQIQTGERDH

SwissProt top hits (columns: e value, %identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 8.5e-19; %identity: 37.82)
Query:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV
        MDF+  LP+ +G   + VVVDRFSK    +P      A+  + +F + V+   G PK I++D D +F S  WK+        +  S  Y PQ+DGQTE  
Subjt:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV

Query:  NRGVEIYLRCFCGEKLKEW
        N+ VE  LRC C      W
Subjt:  NRGVEIYLRCFCGEKLKEW

P0CT35 Transposon Tf2-2 polyprotein (e value: 8.5e-19; %identity: 37.82)
Query:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV
        MDF+  LP+ +G   + VVVDRFSK    +P      A+  + +F + V+   G PK I++D D +F S  WK+        +  S  Y PQ+DGQTE  
Subjt:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV

Query:  NRGVEIYLRCFCGEKLKEW
        N+ VE  LRC C      W
Subjt:  NRGVEIYLRCFCGEKLKEW

P0CT36 Transposon Tf2-3 polyprotein (e value: 8.5e-19; %identity: 37.82)
Query:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV
        MDF+  LP+ +G   + VVVDRFSK    +P      A+  + +F + V+   G PK I++D D +F S  WK+        +  S  Y PQ+DGQTE  
Subjt:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV

Query:  NRGVEIYLRCFCGEKLKEW
        N+ VE  LRC C      W
Subjt:  NRGVEIYLRCFCGEKLKEW

P0CT41 Transposon Tf2-12 polyprotein (e value: 8.5e-19; %identity: 37.82)
Query:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV
        MDF+  LP+ +G   + VVVDRFSK    +P      A+  + +F + V+   G PK I++D D +F S  WK+        +  S  Y PQ+DGQTE  
Subjt:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV

Query:  NRGVEIYLRCFCGEKLKEW
        N+ VE  LRC C      W
Subjt:  NRGVEIYLRCFCGEKLKEW

Q9UR07 Transposon Tf2-11 polyprotein (e value: 8.5e-19; %identity: 37.82)
Query:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV
        MDF+  LP+ +G   + VVVDRFSK    +P      A+  + +F + V+   G PK I++D D +F S  WK+        +  S  Y PQ+DGQTE  
Subjt:  MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVV

Query:  NRGVEIYLRCFCGEKLKEW
        N+ VE  LRC C      W
Subjt:  NRGVEIYLRCFCGEKLKEW

Arabidopsis top hits (columns: e value, %identity, alignment)
AT1G04080.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.7e-28; %identity: 31.58)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +K P IHLF +R KEQ  D++GARAA+  +  ++    +E +I  ANME R+G   +AF++Y   + +   K+   +LP LY  +SR  ++++   + A 
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLF
         ++++ + +V   K L+E LI+F  +   P+ I+ ++P+V   I   AD     S  +RE++S +Y++ + + G +  + K  ++H+KLF
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLF

AT1G04080.2 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.7e-28; %identity: 31.58)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +K P IHLF +R KEQ  D++GARAA+  +  ++    +E +I  ANME R+G   +AF++Y   + +   K+   +LP LY  +SR  ++++   + A 
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLF
         ++++ + +V   K L+E LI+F  +   P+ I+ ++P+V   I   AD     S  +RE++S +Y++ + + G +  + K  ++H+KLF
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLF

AT1G04080.3 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.7e-28; %identity: 31.58)
Query:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM
        +K P IHLF +R KEQ  D++GARAA+  +  ++    +E +I  ANME R+G   +AF++Y   + +   K+   +LP LY  +SR  ++++   + A 
Subjt:  EKVPVIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAM

Query:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLF
         ++++ + +V   K L+E LI+F  +   P+ I+ ++P+V   I   AD     S  +RE++S +Y++ + + G +  + K  ++H+KLF
Subjt:  EVLIDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLF

AT5G46400.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.9e-46; %identity: 30.92)
Query:  VIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALM-KKKLDVLPALYVHFSRLKHMITGSVDAAMEVL
        VIHLFN+RFKE + D S A  A  +   +L   FVEN+  KANMEKR+G    A   YR+AL   L+ K+ L+    LYV FSRLK++IT S D A ++L
Subjt:  VIHLFNSRFKEQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALM-KKKLDVLPALYVHFSRLKHMITGSVDAAMEVL

Query:  IDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYKDPI
        ++G  NVP CKLLLEEL+  +M+HG  + ++L+DPI+   +S +AD S+G S +D+E+IS LY++ +DL GTIHDV K   RHIKLFP S RA   +   
Subjt:  IDGIRNVPLCKLLLEELINFVMVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYKDPI

Query:  PGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGN----EPT-SCLLVKHNIAMKESTIDKINLGDSEICAEERE
        P     ++ +  ++     +    + +  ++     P +E KES LD+   ++  +   +    EP   CL   H +   ++ I++  L +S+       
Subjt:  PGIEAIKKTMGGKQTADSTVTNQPIRDDNVNPSNQPPLEENKESLLDNQNFKNDQSSNGN----EPT-SCLLVKHNIAMKESTIDKINLGDSEICAEERE

Query:  QVNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVG
        + N          GG +  S ++ +P+  S +       G   K     S S++   +D I +       G    ++  S ES+  T         LN  
Subjt:  QVNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNLSIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVG

Query:  SSSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSP-QDYRGMQSGQTSGDQDYTSETIAS
             Q+  +    S    P  P  P           G  +    +H    H D +   Q  + + P   +++S  Q +  +Q+           + +  
Subjt:  SSSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQHDFQGHSRRRPHRTWKDSP-QDYRGMQSGQTSGDQDYTSETIAS

Query:  QKPQVERISQDHNHIQSAQQQNF--PTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQ-----YYCYQQQQQYLLQQQQLQQSQNFQQ
          P+ +       +  S  Q +F  P T   Q P Q            NY+Q G MQS +    Y QMWQ     YY YQQQQQ  L  +Q Q +QN Q 
Subjt:  QKPQVERISQDHNHIQSAQQQNF--PTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMWQ-----YYCYQQQQQYLLQQQQLQQSQNFQQ

Query:  QYYQRQVQMQQQYFQSQYPYQPVELQQQYHI-----------QQQLQQTQQQQQLLGLQLQEASQTDQ--LSFQQHEHQPELEEDEQKQHTKQVSSLSIQ
        Q  Q  VQ+  + +QSQ   Q ++ QQ   +           QQQ+Q  QQQQQ    Q Q+  Q  Q  L  QQ + Q E + DEQ+    Q S+ +  
Subjt:  QYYQRQVQMQQQYFQSQYPYQPVELQQQYHI-----------QQQLQQTQQQQQLLGLQLQEASQTDQ--LSFQQHEHQPELEEDEQKQHTKQVSSLSIQ

Query:  IQTGE
        IQ  +
Subjt:  IQTGE
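
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, the share of identical positions in one block can be counted as shown below. The function name `percent_identity` is hypothetical, and the assumption that identity is identical positions divided by aligned length is ours; the page does not state how its %identity column was computed, and a single block need not reproduce the value reported for the whole hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of aligned columns whose midline character is a letter.

    Assumption: identity = identical columns / alignment length,
    with gap columns counted in the length.
    """
    aligned_length = len(query)
    if aligned_length == 0:
        raise ValueError("empty alignment block")
    # Trailing spaces of the midline are often stripped in text dumps, so pad it.
    padded = midline.ljust(aligned_length)[:aligned_length]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / aligned_length


# Last block of the SwissProt Tf2 alignments shown above.
query = "NRGVEIYLRCFCGEKLKEW"
midline = "N+ VE  LRC C      W"
print(round(percent_identity(query, midline), 2))
```

Only the middle line is used here because it already encodes the comparison between the two sequences.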


Sequences
CDS sequence
ATGGACTTTGTAGAAGGCTTACCAAAAGTGAATGGTATCGAGGTAATTCTGGTGGTGGTAGATCGCTTCAGTAAATATGGTCACTTTTTACCATTGAAGCATCCATATAA
TGCGAAGTCTGTATCAGAATTGTTTGTCAAAGAAGTGGTGCGACTACATGGCTTTCCAAAGTCCATTGTGTCAGATAGGGATAAAGTATTTCTGAGCTCATTCTGGAAAG
AGTTGTTCAGGCTAGCGGGCACAAGATTAAACCATAGTACAGCATACCATCCTCAATCGGATGGACAAACAGAAGTAGTGAATCGTGGAGTGGAGATTTATTTACGGTGT
TTTTGTGGAGAAAAACTGAAGGAGTGGGGTTTACCAAGTCATGAGGCAACTTGGGAGTTATATGAAGATTTGCAGCAACGATTTCCAGATTTTCACCTTGAGGACAAGGT
GAATTTGGAAAGGGAAAGTAATGATAGACCTCCAATACTATATCAATATAGTAGGAGAGGGAAGAAGGGTAGTGGCACGGCACGTGTTGGTGCTCGAGAAGGTTCCCCAG
TTCCTCGGCAACACCTTGAGGTTCATCTCAATTTTTGCAACAGTTGCTCAGAGGATGAAGGATTAAGTGAGAAAGTTCCTGTTATACATCTTTTCAATTCAAGGTTTAAG
GAACAAATAAGAGATTTATCGGGTGCACGTGCTGCTTTTCTTCAGCTTGATGGAGATTTAGATTCTAAGTTTGTGGAAAATATCATATTGAAGGCTAATATGGAGAAACG
AATGGGAAAATCTACAGAAGCTTTTAATATTTACCGAGATGCCCTGCAAATGGCTTTGATGAAGAAGAAATTGGATGTTTTACCAGCTCTGTATGTACATTTTTCTCGAC
TTAAACACATGATTACAGGAAGTGTTGATGCTGCTATGGAGGTCTTAATAGATGGGATCCGAAATGTACCTCTCTGCAAATTGCTTCTTGAGGAACTTATAAACTTCGTC
ATGGTGCATGGAGTGCCAAAGCTTATAAATTTAGTTGATCCCATCGTAGCTAATGCAATATCTCTCAAGGCAGACGTATCTGAAGGTTGGAGTGAGCAAGACAGAGAGGA
TATTTCAACTCTGTATTTAAAGGCTGTTGACTTGTGTGGAACCATCCATGATGTAATGAAGGTGTGGAATCGGCATATTAAATTGTTTCCACAGTCTATTAGAGCAATGC
CATATAAAGACCCCATCCCAGGGATAGAAGCCATAAAAAAGACCATGGGAGGAAAACAAACAGCAGATTCCACTGTAACCAACCAACCAATCAGAGATGACAATGTCAAT
CCATCAAATCAGCCTCCTTTAGAAGAAAATAAAGAGTCTCTGTTAGATAACCAAAACTTCAAGAATGACCAATCTTCCAATGGGAATGAACCAACATCCTGTTTACTCGT
TAAGCATAATATTGCTATGAAAGAGTCTACCATCGACAAGATTAATTTAGGAGATTCTGAAATTTGTGCAGAGGAAAGGGAGCAGGTAAATTCTCCAAAAGTTCTTGAGC
GTTATGGAAGTGGTGGAAATCAGATTGAATCGGCACAAATGCCAATGCCCATGGACAACTCCAAAAAAGACGAGTACGGTGATGCTTTGGGCGTGACCTTGAAAAATCTT
TCAATTAAGAGTCTTTCCTTGAACGCAAAGAACAATGACAAAATAAATTTACCTTCCAAAGCATGTCATGAAGGGGAACCTCCCTTGGAGAACAGTTTGTCTAGTGAAAG
TGTCAGCAATACAGATGAAGAGGTTGTAATGCACAACCCTCTAAATGTCGGATCTTCCAGTTCCATCCAGATTTCCAATGAAGGGGCCAGCCCATCATCCTTTCCAAGTC
CTGGCAAGCCTACACACCCCCAAGTACATACACAGTTTCACATGCATGAAACTGGGGACAGAAAGTGGCACCATAAACGTCATGCTGGTAACTTGCATCATGACCTCCAG
CATGATTTTCAAGGACACTCACGAAGAAGGCCTCATCGAACATGGAAAGATTCTCCTCAGGACTACCGAGGAATGCAATCTGGTCAAACATCAGGTGATCAAGATTATAC
CTCTGAAACTATTGCTTCACAAAAACCACAAGTTGAACGAATCAGCCAAGACCACAATCATATTCAATCTGCGCAGCAGCAGAACTTCCCCACTACTTCTCAGTCTCAAC
TTCCTTCTCAAGGTTTTACTCAAGAGAAATCTCAAAATACTACACCAAACTACGAGCAATATGGTCACATGCAGAGTAGTCAGGTGCCAAATACCTATGAACAAATGTGG
CAATATTATTGCTATCAGCAACAGCAGCAGTATCTTTTGCAGCAGCAACAACTTCAACAGTCACAGAATTTTCAGCAACAGTATTACCAGCGGCAAGTGCAAATGCAACA
ACAGTATTTTCAATCGCAATATCCTTACCAGCCTGTGGAATTACAACAGCAGTATCACATTCAGCAGCAATTGCAACAAACGCAGCAGCAGCAACAGTTACTTGGACTTC
AGCTGCAAGAAGCCTCCCAGACAGATCAGCTATCATTCCAACAACATGAGCATCAGCCAGAACTGGAGGAAGATGAACAAAAGCAACACACAAAACAAGTTTCTTCGTTG
TCTATTCAGATCCAGACTGGTGAACGTGATCATCTGGATTCTTGA
mRNA sequence
ATGGACTTTGTAGAAGGCTTACCAAAAGTGAATGGTATCGAGGTAATTCTGGTGGTGGTAGATCGCTTCAGTAAATATGGTCACTTTTTACCATTGAAGCATCCATATAA
TGCGAAGTCTGTATCAGAATTGTTTGTCAAAGAAGTGGTGCGACTACATGGCTTTCCAAAGTCCATTGTGTCAGATAGGGATAAAGTATTTCTGAGCTCATTCTGGAAAG
AGTTGTTCAGGCTAGCGGGCACAAGATTAAACCATAGTACAGCATACCATCCTCAATCGGATGGACAAACAGAAGTAGTGAATCGTGGAGTGGAGATTTATTTACGGTGT
TTTTGTGGAGAAAAACTGAAGGAGTGGGGTTTACCAAGTCATGAGGCAACTTGGGAGTTATATGAAGATTTGCAGCAACGATTTCCAGATTTTCACCTTGAGGACAAGGT
GAATTTGGAAAGGGAAAGTAATGATAGACCTCCAATACTATATCAATATAGTAGGAGAGGGAAGAAGGGTAGTGGCACGGCACGTGTTGGTGCTCGAGAAGGTTCCCCAG
TTCCTCGGCAACACCTTGAGGTTCATCTCAATTTTTGCAACAGTTGCTCAGAGGATGAAGGATTAAGTGAGAAAGTTCCTGTTATACATCTTTTCAATTCAAGGTTTAAG
GAACAAATAAGAGATTTATCGGGTGCACGTGCTGCTTTTCTTCAGCTTGATGGAGATTTAGATTCTAAGTTTGTGGAAAATATCATATTGAAGGCTAATATGGAGAAACG
AATGGGAAAATCTACAGAAGCTTTTAATATTTACCGAGATGCCCTGCAAATGGCTTTGATGAAGAAGAAATTGGATGTTTTACCAGCTCTGTATGTACATTTTTCTCGAC
TTAAACACATGATTACAGGAAGTGTTGATGCTGCTATGGAGGTCTTAATAGATGGGATCCGAAATGTACCTCTCTGCAAATTGCTTCTTGAGGAACTTATAAACTTCGTC
ATGGTGCATGGAGTGCCAAAGCTTATAAATTTAGTTGATCCCATCGTAGCTAATGCAATATCTCTCAAGGCAGACGTATCTGAAGGTTGGAGTGAGCAAGACAGAGAGGA
TATTTCAACTCTGTATTTAAAGGCTGTTGACTTGTGTGGAACCATCCATGATGTAATGAAGGTGTGGAATCGGCATATTAAATTGTTTCCACAGTCTATTAGAGCAATGC
CATATAAAGACCCCATCCCAGGGATAGAAGCCATAAAAAAGACCATGGGAGGAAAACAAACAGCAGATTCCACTGTAACCAACCAACCAATCAGAGATGACAATGTCAAT
CCATCAAATCAGCCTCCTTTAGAAGAAAATAAAGAGTCTCTGTTAGATAACCAAAACTTCAAGAATGACCAATCTTCCAATGGGAATGAACCAACATCCTGTTTACTCGT
TAAGCATAATATTGCTATGAAAGAGTCTACCATCGACAAGATTAATTTAGGAGATTCTGAAATTTGTGCAGAGGAAAGGGAGCAGGTAAATTCTCCAAAAGTTCTTGAGC
GTTATGGAAGTGGTGGAAATCAGATTGAATCGGCACAAATGCCAATGCCCATGGACAACTCCAAAAAAGACGAGTACGGTGATGCTTTGGGCGTGACCTTGAAAAATCTT
TCAATTAAGAGTCTTTCCTTGAACGCAAAGAACAATGACAAAATAAATTTACCTTCCAAAGCATGTCATGAAGGGGAACCTCCCTTGGAGAACAGTTTGTCTAGTGAAAG
TGTCAGCAATACAGATGAAGAGGTTGTAATGCACAACCCTCTAAATGTCGGATCTTCCAGTTCCATCCAGATTTCCAATGAAGGGGCCAGCCCATCATCCTTTCCAAGTC
CTGGCAAGCCTACACACCCCCAAGTACATACACAGTTTCACATGCATGAAACTGGGGACAGAAAGTGGCACCATAAACGTCATGCTGGTAACTTGCATCATGACCTCCAG
CATGATTTTCAAGGACACTCACGAAGAAGGCCTCATCGAACATGGAAAGATTCTCCTCAGGACTACCGAGGAATGCAATCTGGTCAAACATCAGGTGATCAAGATTATAC
CTCTGAAACTATTGCTTCACAAAAACCACAAGTTGAACGAATCAGCCAAGACCACAATCATATTCAATCTGCGCAGCAGCAGAACTTCCCCACTACTTCTCAGTCTCAAC
TTCCTTCTCAAGGTTTTACTCAAGAGAAATCTCAAAATACTACACCAAACTACGAGCAATATGGTCACATGCAGAGTAGTCAGGTGCCAAATACCTATGAACAAATGTGG
CAATATTATTGCTATCAGCAACAGCAGCAGTATCTTTTGCAGCAGCAACAACTTCAACAGTCACAGAATTTTCAGCAACAGTATTACCAGCGGCAAGTGCAAATGCAACA
ACAGTATTTTCAATCGCAATATCCTTACCAGCCTGTGGAATTACAACAGCAGTATCACATTCAGCAGCAATTGCAACAAACGCAGCAGCAGCAACAGTTACTTGGACTTC
AGCTGCAAGAAGCCTCCCAGACAGATCAGCTATCATTCCAACAACATGAGCATCAGCCAGAACTGGAGGAAGATGAACAAAAGCAACACACAAAACAAGTTTCTTCGTTG
TCTATTCAGATCCAGACTGGTGAACGTGATCATCTGGATTCTTGATGAAGTAGCGATAACAGCTGATCTGAGGCATAACAACCAAATGAGTCAAATGTGGCAG
Protein sequence
MDFVEGLPKVNGIEVILVVVDRFSKYGHFLPLKHPYNAKSVSELFVKEVVRLHGFPKSIVSDRDKVFLSSFWKELFRLAGTRLNHSTAYHPQSDGQTEVVNRGVEIYLRC
FCGEKLKEWGLPSHEATWELYEDLQQRFPDFHLEDKVNLERESNDRPPILYQYSRRGKKGSGTARVGAREGSPVPRQHLEVHLNFCNSCSEDEGLSEKVPVIHLFNSRFK
EQIRDLSGARAAFLQLDGDLDSKFVENIILKANMEKRMGKSTEAFNIYRDALQMALMKKKLDVLPALYVHFSRLKHMITGSVDAAMEVLIDGIRNVPLCKLLLEELINFV
MVHGVPKLINLVDPIVANAISLKADVSEGWSEQDREDISTLYLKAVDLCGTIHDVMKVWNRHIKLFPQSIRAMPYKDPIPGIEAIKKTMGGKQTADSTVTNQPIRDDNVN
PSNQPPLEENKESLLDNQNFKNDQSSNGNEPTSCLLVKHNIAMKESTIDKINLGDSEICAEEREQVNSPKVLERYGSGGNQIESAQMPMPMDNSKKDEYGDALGVTLKNL
SIKSLSLNAKNNDKINLPSKACHEGEPPLENSLSSESVSNTDEEVVMHNPLNVGSSSSIQISNEGASPSSFPSPGKPTHPQVHTQFHMHETGDRKWHHKRHAGNLHHDLQ
HDFQGHSRRRPHRTWKDSPQDYRGMQSGQTSGDQDYTSETIASQKPQVERISQDHNHIQSAQQQNFPTTSQSQLPSQGFTQEKSQNTTPNYEQYGHMQSSQVPNTYEQMW
QYYCYQQQQQYLLQQQQLQQSQNFQQQYYQRQVQMQQQYFQSQYPYQPVELQQQYHIQQQLQQTQQQQQLLGLQLQEASQTDQLSFQQHEHQPELEEDEQKQHTKQVSSL
SIQIQTGERDHLDS