CuGenDBv2

Cmc11g0301341 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc11g0301341
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr11:21566971..21568287
RNA-Seq Expression: Cmc11g0301341
Synteny: Cmc11g0301341
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily
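
The genome location field can be read as `seqid:start..end`. A minimal sketch of splitting that string and computing the span is shown here; the helper name `parse_location` is hypothetical, and the 1-based, inclusive coordinate convention is an assumption rather than something this record states.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("CMiso1.1chr11:21566971..21568287")
# Inclusive length, assuming 1-based coordinates (an assumption).
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1317 bp on the genome.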


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 2.9e-160; % identity: 74.94
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRK+KIDY+KM                              ELLQFRRNNVWTLVSKPEGVNVIGTKW+FKNKIDE GCVTKNKARLVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI
        +DFDETFA                        MDVKSAFLN YLNEEVY+AQPKGFVDSEHPKHVYKLNKA+YGLKQ  RAWYD LTV  R +GY REEI
Subjt:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI

Query:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL
        DKT FIH+KSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELS FLGLQIKQKND IFI+QEKYA+NM+KKF LEQARNKRT AATHVKL
Subjt:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL

Query:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG
        TKD +GAEVDHKLYRSIVG+LLYLT SRP+IAY VGI A YQAD RITHLEA+KRILKYVHGT+D  MMYSYDTTPTLVGY DAD AGS +DRKSTS G
Subjt:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 1.2e-171; % identity: 75.12
Query:  ISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA------------
        ++ +E  T+NSAL+DEYWLN MQEELLQFRRNNVWTL+SKPEGVNVIGTKW+FKNK DE GCVTKNKARLVA GYTQVEG+DFDETFA            
Subjt:  ISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA------------

Query:  -----------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYV
                    +DVKS FLNGYLNEEVY+AQPKGFVDSEHPKHVYKLNKA+YGLKQ LRAWYD LTV  R +GYSR EIDK LFIH+KSDQLLVAQIYV
Subjt:  -----------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYV

Query:  DDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGS
        DDIIFGGFP DL+NNFINIMQSEFEMSMVGELS FLGLQIKQKNDGIFI+QEKYARNM+KKFGL+QARNKRT AATHVKLTKD +GAEVDHKLYRSIVGS
Subjt:  DDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGS

Query:  LLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSL
        LLYLTASRP+IAY VGI A YQAD RIT LE +KRILKYVHGT+D GMMYSYDTT TLVGY DAD AGS DDRK+                         
Subjt:  LLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSL

Query:  STAEAEYIAAGSGCTQLI
           EAEYIAAGSGCTQLI
Subjt:  STAEAEYIAAGSGCTQLI

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 1.3e-165; % identity: 72.15
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRKEKIDYMKMVADLCYIST EPSTV+SALRDEYWLNAMQEELLQFR+NNVWTLVSKPEGVNVIGTKWVFKNK DEAGCVTKNKA+LVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFAS-----------------------MDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI
        IDFDETFAS                       MDVKSAFL+GYLNEEVY+AQPKGFVDSEHPKH+YKLNKA+YGLKQ  RAWYD LTV  R KGYSR EI
Subjt:  IDFDETFAS-----------------------MDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI

Query:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL
        DKTLFI +KSDQLLVAQIYVDDIIF GFP DLVNNFI     EFEMSMVGELS FLGLQIKQKND IFI+QEKYARNM+KKFGLEQARNKRT AATHVKL
Subjt:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL

Query:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGC
        TKD + +EVDHKLYRSI                         AD RITHLEA+KRILKYVHGT+D GMMYSYDTTPTLVGY DA+ AGSTDD K+     
Subjt:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGC

Query:  FFFRNNLISWLSKKQNCVSLSTAEAEYIAAGSGCTQLI
                               EA+Y+AAGSGCTQLI
Subjt:  FFFRNNLISWLSKKQNCVSLSTAEAEYIAAGSGCTQLI

TYK08028.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 2.4e-167; % identity: 74.7
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRK+KIDY+KMVA+LCYIST E  TV+SAL+DEYWLNAMQEELLQ+R+NNVWTLVSKPEGVNVIGTK++FKNK DE GCVTKNKARLVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFASMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI
        +DFDETF+ MDVKSAFLNGYLNEEVY+AQPKGFVDSEHPKHVYKLNKA+Y L                                KKS+QLL+AQIYVDDI
Subjt:  IDFDETFASMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI

Query:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY
        IFGGFPQDLVNNFIN MQSEF+MSM+GELS FLGLQIKQKNDGIFI+QEKYARNM+KKFGLEQARNKRT AATHVKLTKD  GAEVDHKLYRSIVGSLLY
Subjt:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY

Query:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA
        LTASRP+IAYA+GI A YQAD  ITHLE +KRILKYVHGT+D GMMYSYDTTPTLVGY DAD AGSTDDRK                             
Subjt:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA

Query:  EAEYIAAGSGCTQLI
        EAEYIAAGSGCTQLI
Subjt:  EAEYIAAGSGCTQLI

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value: 8.4e-160; % identity: 74.69
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRK+KIDY+KM                              ELLQFRRNNVWTLVSKPEGVNVIGTKW+FKNKIDE GCVTKNKARLVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI
        +DFDETFA                        MDVKSAFLN YLNEEVY+AQPKGFVDSEHPKHVYKLNKA+YGLKQ  RAWYD LT   R +GY REEI
Subjt:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI

Query:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL
        DKT FIH+KSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELS FLGLQIKQKND IFI+QEKYA+NM+KKF LEQARNKRT AATHVKL
Subjt:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL

Query:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG
        TKD +GAEVDHKLYRSIVG+LLYLT SRP+IAY VGI A YQAD RITHLEA+KRILKYVHGT+D  MMYSYDTTPTLVGY DAD AGS +DRKSTS G
Subjt:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7SN07 Gag-pol polyprotein; e value: 1.4e-160; % identity: 74.94
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRK+KIDY+KM                              ELLQFRRNNVWTLVSKPEGVNVIGTKW+FKNKIDE GCVTKNKARLVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI
        +DFDETFA                        MDVKSAFLN YLNEEVY+AQPKGFVDSEHPKHVYKLNKA+YGLKQ  RAWYD LTV  R +GY REEI
Subjt:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI

Query:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL
        DKT FIH+KSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELS FLGLQIKQKND IFI+QEKYA+NM+KKF LEQARNKRT AATHVKL
Subjt:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL

Query:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG
        TKD +GAEVDHKLYRSIVG+LLYLT SRP+IAY VGI A YQAD RITHLEA+KRILKYVHGT+D  MMYSYDTTPTLVGY DAD AGS +DRKSTS G
Subjt:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG

A0A5D3BPB3 Gag-pol polyprotein; e value: 6.0e-172; % identity: 75.12
Query:  ISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA------------
        ++ +E  T+NSAL+DEYWLN MQEELLQFRRNNVWTL+SKPEGVNVIGTKW+FKNK DE GCVTKNKARLVA GYTQVEG+DFDETFA            
Subjt:  ISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA------------

Query:  -----------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYV
                    +DVKS FLNGYLNEEVY+AQPKGFVDSEHPKHVYKLNKA+YGLKQ LRAWYD LTV  R +GYSR EIDK LFIH+KSDQLLVAQIYV
Subjt:  -----------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYV

Query:  DDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGS
        DDIIFGGFP DL+NNFINIMQSEFEMSMVGELS FLGLQIKQKNDGIFI+QEKYARNM+KKFGL+QARNKRT AATHVKLTKD +GAEVDHKLYRSIVGS
Subjt:  DDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGS

Query:  LLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSL
        LLYLTASRP+IAY VGI A YQAD RIT LE +KRILKYVHGT+D GMMYSYDTT TLVGY DAD AGS DDRK+                         
Subjt:  LLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSL

Query:  STAEAEYIAAGSGCTQLI
           EAEYIAAGSGCTQLI
Subjt:  STAEAEYIAAGSGCTQLI

A0A5D3CA21 Gag-pol polyprotein; e value: 1.2e-167; % identity: 74.7
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRK+KIDY+KMVA+LCYIST E  TV+SAL+DEYWLNAMQEELLQ+R+NNVWTLVSKPEGVNVIGTK++FKNK DE GCVTKNKARLVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFASMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI
        +DFDETF+ MDVKSAFLNGYLNEEVY+AQPKGFVDSEHPKHVYKLNKA+Y L                                KKS+QLL+AQIYVDDI
Subjt:  IDFDETFASMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI

Query:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY
        IFGGFPQDLVNNFIN MQSEF+MSM+GELS FLGLQIKQKNDGIFI+QEKYARNM+KKFGLEQARNKRT AATHVKLTKD  GAEVDHKLYRSIVGSLLY
Subjt:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY

Query:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA
        LTASRP+IAYA+GI A YQAD  ITHLE +KRILKYVHGT+D GMMYSYDTTPTLVGY DAD AGSTDDRK                             
Subjt:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA

Query:  EAEYIAAGSGCTQLI
        EAEYIAAGSGCTQLI
Subjt:  EAEYIAAGSGCTQLI

A0A5D3CJ17 Gag-pol polyprotein; e value: 4.1e-160; % identity: 74.69
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRK+KIDY+KM                              ELLQFRRNNVWTLVSKPEGVNVIGTKW+FKNKIDE GCVTKNKARLVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI
        +DFDETFA                        MDVKSAFLN YLNEEVY+AQPKGFVDSEHPKHVYKLNKA+YGLKQ  RAWYD LT   R +GY REEI
Subjt:  IDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI

Query:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL
        DKT FIH+KSDQLLVAQIYVDDIIFGGFP DLVNNFINIMQSEFEMS VGELS FLGLQIKQKND IFI+QEKYA+NM+KKF LEQARNKRT AATHVKL
Subjt:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL

Query:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG
        TKD +GAEVDHKLYRSIVG+LLYLT SRP+IAY VGI A YQAD RITHLEA+KRILKYVHGT+D  MMYSYDTTPTLVGY DAD AGS +DRKSTS G
Subjt:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRG

A0A5D3CXU0 Gag-pol polyprotein; e value: 6.5e-166; % identity: 72.15
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG
        MQTRRKEKIDYMKMVADLCYIST EPSTV+SALRDEYWLNAMQEELLQFR+NNVWTLVSKPEGVNVIGTKWVFKNK DEAGCVTKNKA+LVA GYTQVEG
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEG

Query:  IDFDETFAS-----------------------MDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI
        IDFDETFAS                       MDVKSAFL+GYLNEEVY+AQPKGFVDSEHPKH+YKLNKA+YGLKQ  RAWYD LTV  R KGYSR EI
Subjt:  IDFDETFAS-----------------------MDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEI

Query:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL
        DKTLFI +KSDQLLVAQIYVDDIIF GFP DLVNNFI     EFEMSMVGELS FLGLQIKQKND IFI+QEKYARNM+KKFGLEQARNKRT AATHVKL
Subjt:  DKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKL

Query:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGC
        TKD + +EVDHKLYRSI                         AD RITHLEA+KRILKYVHGT+D GMMYSYDTTPTLVGY DA+ AGSTDD K+     
Subjt:  TKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGC

Query:  FFFRNNLISWLSKKQNCVSLSTAEAEYIAAGSGCTQLI
                               EA+Y+AAGSGCTQLI
Subjt:  FFFRNNLISWLSKKQNCVSLSTAEAEYIAAGSGCTQLI

SwissProt top hits (columns: hit, e value, % identity, alignment)
P04146 Copia protein; e value: 7.8e-60; % identity: 33.17
Query:  WLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA-----------------------SMDVKS
        W  A+  EL   + NN WT+  +PE  N++ ++WVF  K +E G   + KARLVA G+TQ   ID++ETFA                        MDVK+
Subjt:  WLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA-----------------------SMDVKS

Query:  AFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKS--DQLLVAQIYVDDIIFGGFPQDLVNN
        AFLNG L EE+Y+  P+G   S +  +V KLNKA+YGLKQ  R W++      ++  +    +D+ ++I  K   ++ +   +YVDD++        +NN
Subjt:  AFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKS--DQLLVAQIYVDDIIFGGFPQDLVNN

Query:  FINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY-LTASRPNIAYA
        F   +  +F M+ + E+  F+G++I+ + D I+++Q  Y + ++ KF +E      T   + +     N   + +    RS++G L+Y +  +RP++  A
Subjt:  FINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY-LTASRPNIAYA

Query:  VGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTT--PTLVGYYDADLAGSTDDRKSTSRGCF-FFRNNLISWLSKKQNCVSLSTAEAEYIA
        V I + Y + +     + +KR+L+Y+ GT D+ +++  +      ++GY D+D AGS  DRKST+   F  F  NLI W +K+QN V+ S+ EAEY+A
Subjt:  VGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTT--PTLVGYYDADLAGSTDDRKSTSRGCF-FFRNNLISWLSKKQNCVSLSTAEAEYIA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.1e-61; % identity: 34.09
Query:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSAL---RDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQ
        +++RR    +Y+ +  D       EP ++   L        + AMQEE+   ++N  + LV  P+G   +  KWVFK K D    + + KARLV  G+ Q
Subjt:  MQTRRKEKIDYMKMVADLCYISTFEPSTVNSAL---RDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQ

Query:  VEGIDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSR
         +GIDFDE F+                        +DVK+AFL+G L EE+Y+ QP+GF  +     V KLNK++YGLKQ  R WY       + + Y +
Subjt:  VEGIDFDETFA-----------------------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSR

Query:  EEIDKTLFIHKKSD-QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDG--IFIAQEKYARNMIKKFGLEQARNKRTAA
           D  ++  + S+   ++  +YVDD++  G  + L+      +   F+M  +G     LG++I ++     ++++QEKY   ++++F ++ A+   T  
Subjt:  EEIDKTLFIHKKSD-QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDG--IFIAQEKYARNMIKKFGLEQARNKRTAA

Query:  ATHVKLTKDNDGAEVDHK------LYRSIVGSLLY-LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLA
        A H+KL+K      V+ K       Y S VGSL+Y +  +RP+IA+AVG+ + +  +    H EA+K IL+Y+ GT    + +   + P L GY DAD+A
Subjt:  ATHVKLTKDNDGAEVDHK------LYRSIVGSLLY-LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLA

Query:  GSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTAEAEYIAA
        G  D+RKS++   F F    ISW SK Q CV+LST EAEYIAA
Subjt:  GSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTAEAEYIAA

P25600 Putative transposon Ty5-1 protein YCL074W; e value: 1.7e-30; % identity: 28.24
Query:  MDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDL
        MDV +AFLN  ++E +Y+ QP GFV+  +P +V++L   +YGLKQ    W + +    +  G+ R E +  L+    SD  +   +YVDD++       +
Subjt:  MDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDG-IFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY-LTASRPN
         +     +   + M  +G++  FLGL I Q ++G I ++ + Y      +  +   +  +T       L +       D   Y+SIVG LL+     RP+
Subjt:  VNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDG-IFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY-LTASRPN

Query:  IAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKK-QNCVSLSTAEAEYIA
        I+Y V + + +  + R  HLE+ +R+L+Y++ T  + + Y   +   L  Y DA      D   ST           ++W SKK +  + + + EAEYI 
Subjt:  IAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKK-QNCVSLSTAEAEYIA

Query:  A
        A
Subjt:  A

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.2e-65; % identity: 35.27
Query:  EPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEG-VNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA---------------
        EP T   AL+DE W NAM  E+     N+ W LV  P   V ++G +W+F  K +  G + + KARLVA GY Q  G+D+ ETF+               
Subjt:  EPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEG-VNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA---------------

Query:  --------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI
                 +DV +AFL G L ++VY++QP GF+D + P +V KL KA+YGLKQ  RAWY  L       G+     D +LF+ ++   ++   +YVDDI
Subjt:  --------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI

Query:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY
        +  G    L++N ++ +   F +    EL +FLG++ K+   G+ ++Q +Y  +++ +  +  A+   T  A   KL+  +     D   YR IVGSL Y
Subjt:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY

Query:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA
        L  +RP+I+YAV   + +       HL+A+KRIL+Y+ GT + G+      T +L  Y DAD AG  DD  ST+    +  ++ ISW SKKQ  V  S+ 
Subjt:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA

Query:  EAEYIAAGSGCTQL
        EAEY +  +  +++
Subjt:  EAEYIAAGSGCTQL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 3.3e-66; % identity: 35.02
Query:  EPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLV-SKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA---------------
        EP T   A++D+ W  AM  E+     N+ W LV   P  V ++G +W+F  K +  G + + KARLVA GY Q  G+D+ ETF+               
Subjt:  EPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLV-SKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA---------------

Query:  --------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI
                 +DV +AFL G L +EVY++QP GFVD + P +V +L KA+YGLKQ  RAWY  L       G+     D +LF+ ++   ++   +YVDDI
Subjt:  --------SMDVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDI

Query:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY
        +  G    L+ + ++ +   F +    +L +FLG++ K+   G+ ++Q +Y  +++ +  +  A+   T  AT  KLT  +     D   YR IVGSL Y
Subjt:  IFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLY

Query:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA
        L  +RP+++YAV   + Y       H  A+KR+L+Y+ GT D G+      T +L  Y DAD AG TDD  ST+    +  ++ ISW SKKQ  V  S+ 
Subjt:  LTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTA

Query:  EAEYIAAGSGCTQL
        EAEY +  +  ++L
Subjt:  EAEYIAAGSGCTQL

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 8.3e-65; % identity: 35.34
Query:  LCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA---------
        +C     EPST N A     W  AM +E+      + W + + P     IG KWV+K K +  G + + KARLVA GYTQ EGIDF ETF+         
Subjt:  LCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFA---------

Query:  --------------SMDVKSAFLNGYLNEEVYIAQPKGFV----DSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQL
                       +D+ +AFLNG L+EE+Y+  P G+     DS  P  V  L K++YGLKQ  R W+   +V     G+ +   D T F+   +   
Subjt:  --------------SMDVKSAFLNGYLNEEVYIAQPKGFV----DSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQL

Query:  LVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKL
        L   +YVDDII        V+   + ++S F++  +G L +FLGL+I +   GI I Q KYA +++ + GL   +         V  +  + G  VD K 
Subjt:  LVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKL

Query:  YRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSK
        YR ++G L+YL  +R +I++AV   + +    R+ H +A+ +IL Y+ GT   G+ YS      L  + DA      D R+ST+  C F   +LISW SK
Subjt:  YRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSK

Query:  KQNCVSLSTAEAEYIA
        KQ  VS S+AEAEY A
Subjt:  KQNCVSLSTAEAEYIA

ATMG00240.1 Gag-Pol-related retrotransposon family protein; e value: 1.6e-07; % identity: 32.91
Query:  LYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGC
        +YLT +RP++ +AV   + + + SR   ++A+ ++L YV GT   G+ YS  +   L  + D+D A   D R+S +  C
Subjt:  LYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGC

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value: 1.0e-30; % identity: 35.75
Query:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEV-DHKLYRS
        +YVDDI+  G    L+N  I  + S F M  +G + +FLG+QIK    G+F++Q KYA  ++   G+   +   T     +KL      A+  D   +RS
Subjt:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEV-DHKLYRS

Query:  IVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQN
        IVG+L YLT +RP+I+YAV I      +  +   + +KR+L+YV GT   G+    ++   +  + D+D AG T  R+ST+  C F   N+ISW +K+Q 
Subjt:  IVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAIKRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQN

Query:  CVSLSTAEAEYIAAGSGCTQL
         VS S+ E EY A      +L
Subjt:  CVSLSTAEAEYIAAGSGCTQL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value: 4.5e-18; % identity: 42.74
Query:  MQTRRKEKIDYMKMVADLCYISTF--EPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQV
        M TR K  I+ +     L   +T   EP +V  AL+D  W  AMQEEL    RN  W LV  P   N++G KWVFK K+   G + + KARLVA G+ Q 
Subjt:  MQTRRKEKIDYMKMVADLCYISTF--EPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQV

Query:  EGIDFDETFASMDVKSAFLNGYLN
        EGI F ET++ + V++A +   LN
Subjt:  EGIDFDETFASMDVKSAFLNGYLN


Sequences
CDS sequence
ATGCAGACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGTTATATTTCCACCTTTGAACCTTCTACTGTTAACTCTGCTCTCAGGGATGAATA
TTGGCTCAATGCTATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATGTCTGGACATTAGTGTCAAAACCAGAAGGTGTAAACGTTATCGGTACTAAATGGGTGTTCA
AAAATAAAATTGATGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTGGCTCTAGGGTATACTCAAGTTGAAGGTATTGACTTTGATGAAACGTTTGCTTCGATG
GATGTAAAGAGTGCCTTCTTAAATGGATATTTAAATGAGGAGGTTTATATTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCGAAGCATGTGTATAAGCTCAACAA
AGCCGTATATGGACTAAAGCAAACTCTGAGAGCTTGGTATGACTGGCTAACTGTGTGTTGGAGAGATAAAGGATATTCCAGAGAAGAAATTGACAAGACCTTGTTCATAC
ACAAGAAATCTGACCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGGTTTCCTCAAGATCTAGTAAATAATTTCATTAACATCATGCAGTCAGAG
TTCGAAATGAGCATGGTTGGAGAACTTTCATGGTTTCTGGGACTTCAAATTAAGCAAAAGAATGATGGCATTTTCATAGCTCAGGAAAAGTATGCCAGGAATATGATCAA
AAAGTTTGGCTTGGAACAGGCTCGAAATAAGCGAACTGCAGCTGCGACACATGTTAAACTTACAAAAGACAATGATGGTGCTGAAGTTGATCACAAACTTTACAGGAGTA
TAGTAGGCAGCCTATTATATTTAACAGCAAGCCGACCTAACATAGCTTATGCTGTGGGAATATATGCTTCTTATCAGGCAGATTCCCGCATCACTCACCTAGAAGCTATT
AAACGAATTCTTAAGTATGTTCATGGGACCAATGACATTGGAATGATGTATTCCTATGATACCACTCCCACTCTAGTTGGATATTATGATGCTGACTTGGCAGGTTCAAC
TGATGATCGTAAAAGTACATCTAGAGGTTGCTTCTTTTTTAGAAACAATTTAATCTCTTGGTTAAGTAAGAAGCAAAACTGTGTTTCTTTATCTACAGCTGAAGCTGAAT
ATATAGCTGCAGGTAGTGGTTGTACACAATTGATTTGA
mRNA sequence
ATGCAGACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGTTATATTTCCACCTTTGAACCTTCTACTGTTAACTCTGCTCTCAGGGATGAATA
TTGGCTCAATGCTATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATGTCTGGACATTAGTGTCAAAACCAGAAGGTGTAAACGTTATCGGTACTAAATGGGTGTTCA
AAAATAAAATTGATGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTGGCTCTAGGGTATACTCAAGTTGAAGGTATTGACTTTGATGAAACGTTTGCTTCGATG
GATGTAAAGAGTGCCTTCTTAAATGGATATTTAAATGAGGAGGTTTATATTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCGAAGCATGTGTATAAGCTCAACAA
AGCCGTATATGGACTAAAGCAAACTCTGAGAGCTTGGTATGACTGGCTAACTGTGTGTTGGAGAGATAAAGGATATTCCAGAGAAGAAATTGACAAGACCTTGTTCATAC
ACAAGAAATCTGACCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGGTTTCCTCAAGATCTAGTAAATAATTTCATTAACATCATGCAGTCAGAG
TTCGAAATGAGCATGGTTGGAGAACTTTCATGGTTTCTGGGACTTCAAATTAAGCAAAAGAATGATGGCATTTTCATAGCTCAGGAAAAGTATGCCAGGAATATGATCAA
AAAGTTTGGCTTGGAACAGGCTCGAAATAAGCGAACTGCAGCTGCGACACATGTTAAACTTACAAAAGACAATGATGGTGCTGAAGTTGATCACAAACTTTACAGGAGTA
TAGTAGGCAGCCTATTATATTTAACAGCAAGCCGACCTAACATAGCTTATGCTGTGGGAATATATGCTTCTTATCAGGCAGATTCCCGCATCACTCACCTAGAAGCTATT
AAACGAATTCTTAAGTATGTTCATGGGACCAATGACATTGGAATGATGTATTCCTATGATACCACTCCCACTCTAGTTGGATATTATGATGCTGACTTGGCAGGTTCAAC
TGATGATCGTAAAAGTACATCTAGAGGTTGCTTCTTTTTTAGAAACAATTTAATCTCTTGGTTAAGTAAGAAGCAAAACTGTGTTTCTTTATCTACAGCTGAAGCTGAAT
ATATAGCTGCAGGTAGTGGTTGTACACAATTGATTTGA
Protein sequence
MQTRRKEKIDYMKMVADLCYISTFEPSTVNSALRDEYWLNAMQEELLQFRRNNVWTLVSKPEGVNVIGTKWVFKNKIDEAGCVTKNKARLVALGYTQVEGIDFDETFASM
DVKSAFLNGYLNEEVYIAQPKGFVDSEHPKHVYKLNKAVYGLKQTLRAWYDWLTVCWRDKGYSREEIDKTLFIHKKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSE
FEMSMVGELSWFLGLQIKQKNDGIFIAQEKYARNMIKKFGLEQARNKRTAAATHVKLTKDNDGAEVDHKLYRSIVGSLLYLTASRPNIAYAVGIYASYQADSRITHLEAI
KRILKYVHGTNDIGMMYSYDTTPTLVGYYDADLAGSTDDRKSTSRGCFFFRNNLISWLSKKQNCVSLSTAEAEYIAAGSGCTQLI
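
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code; the `translate` helper is hypothetical and is not the pipeline that produced this record. It is applied only to the first 30 and the last 12 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS given in this record.
print(translate("ATGCAGACCAGAAGGAAAGAAAAGATTGAT"))  # MQTRRKEKID
print(translate("CAATTGATTTGA"))                    # QLI
```

The two outputs agree with the start (`MQTRRKEKID`) and the end (`QLI`) of the protein sequence listed above, with the final `TGA` read as a stop codon.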