; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0025201 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0025201
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:25870803..25872020
RNA-Seq ExpressionCmc01g0025201
SyntenyCmc01g0025201
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.6e-17987.9Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRKDKIDYLKM                              ELLQFRRNNVWTLVSK EGVNVIGTKWIFKNK DETGCVTKNKARLVAQGYTQVEGVD
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFAPVARL+AIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGY R EIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        T FIHRKSDQLLVAQIYVDDIIFGGFP+DLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRTLAATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        DT+GAEVD KLYRSIVGNLLYLTTSRPDIAY VGICA YQADPRITHLE VKRILKYVHGTSDF MMYSYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

KAA0033021.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.8e-17083.33Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRK+KIDY+KMVADLCYIST EPSTVDSA +DEYWLNAMQ+ELL+FRRNNVWTLVSK EGVNVI                                G+D
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLN YLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GYS+GEIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        TLFIHRKSDQLLVAQIYVDDIIFG FP DL+NNFINIMQSE EMS VGELSCFLGLQIKQKNDDI ISQEKYARNMVKKFGLEQARNKRT AATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        +T+GAEVD KLY+SIVGNLLYLT SRPDIAY VGI ARYQADPRITHLE VK+I+KYVHGTSDFGMM SYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.4e-18391.81Show/hide
Query:  ISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLL
        ++  E  T++SALKDEYWLN MQEELLQFRRNNVWTL+SK EGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLL
Subjt:  ISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLL

Query:  GISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYV
        GISCIQKFKLYQ+DVKS FLN YLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQA RAWYDRLTVYLRGRGYSRGEIDK LFIHRKSDQLLVAQIYV
Subjt:  GISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYV

Query:  DDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGN
        DDIIFGGFP DL+NNFINIMQSEFEMS VGELSCFLGLQIKQKND IFISQEKYARNMVKKFGL+QARNKRT AATHVKLTKDT+GAEVD KLYRSIVG+
Subjt:  DDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGN

Query:  LLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        LLYLT SRPDIAY VGICARYQADPRIT LE VKRILKYVHGTSDFGMMYSYDT
Subjt:  LLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.0e-17484.95Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRK+KIDY+KMVADLCYIST+EPSTVDSAL+DEYWLNAMQEELLQFR+NNVWTLVSK EGVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+D
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFA VARLEAIRLLLGISCIQKFKLYQMDVKSAFL+ YLNEEVYVAQPKGFVDSEHPKH+YKLNKALYGLKQA RAWYD+LTVYLRG+GYSRGEIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        TLFI RKSDQLLVAQIYVDDIIF GFPHDLVNNFI     EFEMS VGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRT AATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        DT+ +EVD KLYRSI                         ADPRITHLE VKRILKYVHGTSDFGMMYSYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.6e-17987.63Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRKDKIDYLKM                              ELLQFRRNNVWTLVSK EGVNVIGTKWIFKNK DETGCVTKNKARLVAQGYTQVEGVD
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFAPVARL+AIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLT YLRGRGY R EIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        T FIHRKSDQLLVAQIYVDDIIFGGFP+DLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRTLAATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        DT+GAEVD KLYRSIVGNLLYLTTSRPDIAY VGICA YQADPRITHLE VKRILKYVHGTSDF MMYSYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

TrEMBL top hitse value%identityAlignment
A0A5A7SN07 Gag-pol polyprotein7.7e-18087.9Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRKDKIDYLKM                              ELLQFRRNNVWTLVSK EGVNVIGTKWIFKNK DETGCVTKNKARLVAQGYTQVEGVD
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFAPVARL+AIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGY R EIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        T FIHRKSDQLLVAQIYVDDIIFGGFP+DLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRTLAATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        DT+GAEVD KLYRSIVGNLLYLTTSRPDIAY VGICA YQADPRITHLE VKRILKYVHGTSDF MMYSYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

A0A5D3BPB3 Gag-pol polyprotein1.1e-18391.81Show/hide
Query:  ISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLL
        ++  E  T++SALKDEYWLN MQEELLQFRRNNVWTL+SK EGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLL
Subjt:  ISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLL

Query:  GISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYV
        GISCIQKFKLYQ+DVKS FLN YLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQA RAWYDRLTVYLRGRGYSRGEIDK LFIHRKSDQLLVAQIYV
Subjt:  GISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYV

Query:  DDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGN
        DDIIFGGFP DL+NNFINIMQSEFEMS VGELSCFLGLQIKQKND IFISQEKYARNMVKKFGL+QARNKRT AATHVKLTKDT+GAEVD KLYRSIVG+
Subjt:  DDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGN

Query:  LLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        LLYLT SRPDIAY VGICARYQADPRIT LE VKRILKYVHGTSDFGMMYSYDT
Subjt:  LLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

A0A5D3BWU5 Gag-pol polyprotein8.5e-17183.33Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRK+KIDY+KMVADLCYIST EPSTVDSA +DEYWLNAMQ+ELL+FRRNNVWTLVSK EGVNVI                                G+D
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLN YLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GYS+GEIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        TLFIHRKSDQLLVAQIYVDDIIFG FP DL+NNFINIMQSE EMS VGELSCFLGLQIKQKNDDI ISQEKYARNMVKKFGLEQARNKRT AATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        +T+GAEVD KLY+SIVGNLLYLT SRPDIAY VGI ARYQADPRITHLE VK+I+KYVHGTSDFGMM SYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

A0A5D3CJ17 Gag-pol polyprotein2.2e-17987.63Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRKDKIDYLKM                              ELLQFRRNNVWTLVSK EGVNVIGTKWIFKNK DETGCVTKNKARLVAQGYTQVEGVD
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFAPVARL+AIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLT YLRGRGY R EIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        T FIHRKSDQLLVAQIYVDDIIFGGFP+DLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRTLAATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        DT+GAEVD KLYRSIVGNLLYLTTSRPDIAY VGICA YQADPRITHLE VKRILKYVHGTSDF MMYSYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

A0A5D3CXU0 Gag-pol polyprotein9.7e-17584.95Show/hide
Query:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD
        TRRK+KIDY+KMVADLCYIST+EPSTVDSAL+DEYWLNAMQEELLQFR+NNVWTLVSK EGVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+D
Subjt:  TRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVD

Query:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK
        FDETFA VARLEAIRLLLGISCIQKFKLYQMDVKSAFL+ YLNEEVYVAQPKGFVDSEHPKH+YKLNKALYGLKQA RAWYD+LTVYLRG+GYSRGEIDK
Subjt:  FDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDK

Query:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK
        TLFI RKSDQLLVAQIYVDDIIF GFPHDLVNNFI     EFEMS VGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRT AATHVKLTK
Subjt:  TLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTK

Query:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT
        DT+ +EVD KLYRSI                         ADPRITHLE VKRILKYVHGTSDFGMMYSYDT
Subjt:  DTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDT

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.4e-5533.83Show/hide
Query:  WLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKS
        W  A+  EL   + NN WT+  + E  N++ ++W+F  K +E G   + KARLVA+G+TQ   +D++ETFAPVAR+ + R +L +      K++QMDVK+
Subjt:  WLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKS

Query:  AFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKS--DQLLVAQIYVDDIIFGGFPHDLVNN
        AFLN  L EE+Y+  P+G   S +  +V KLNKA+YGLKQA R W++     L+   +    +D+ ++I  K   ++ +   +YVDD++        +NN
Subjt:  AFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKS--DQLLVAQIYVDDIIFGGFPHDLVNN

Query:  FINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKL-YRSIVGNLLY-LTTSRPDIAY
        F   +  +F M+ + E+  F+G++I+ + D I++SQ  Y + ++ KF +E      T   +  K+  +   ++ D     RS++G L+Y +  +RPD+  
Subjt:  FINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKL-YRSIVGNLLY-LTTSRPDIAY

Query:  AVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMY
        AV I +RY +       +++KR+L+Y+ GT D  +++
Subjt:  AVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-5535.71Show/hide
Query:  LNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSA
        + AMQEE+   ++N  + LV   +G   +  KW+FK K D    + + KARLV +G+ Q +G+DFDE F+PV ++ +IR +L ++     ++ Q+DVK+A
Subjt:  LNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSA

Query:  FLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD-QLLVAQIYVDDIIFGGFPHDLVNNFI
        FL+  L EE+Y+ QP+GF  +     V KLNK+LYGLKQAPR WY +   +++ + Y +   D  ++  R S+   ++  +YVDD++  G    L+    
Subjt:  FLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD-QLLVAQIYVDDIIFGGFPHDLVNNFI

Query:  NIMQSEFEMSKVGELSCFLGLQI--KQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLK------LYRSIVGNLLY-LTTSR
          +   F+M  +G     LG++I  ++ +  +++SQEKY   ++++F ++ A+   T  A H+KL+K      V+ K       Y S VG+L+Y +  +R
Subjt:  NIMQSEFEMSKVGELSCFLGLQI--KQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLK------LYRSIVGNLLY-LTTSR

Query:  PDIAYAVGICARYQADPRITHLEDVKRILKYVHGTS
        PDIA+AVG+ +R+  +P   H E VK IL+Y+ GT+
Subjt:  PDIAYAVGICARYQADPRITHLEDVKRILKYVHGTS

P25600 Putative transposon Ty5-1 protein YCL074W1.4e-3232.92Show/hide
Query:  MDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPHDL
        MDV +AFLN  ++E +YV QP GFV+  +P +V++L   +YGLKQAP  W + +   L+  G+ R E +  L+    SD  +   +YVDD++       +
Subjt:  MDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPHDL

Query:  VNNFINIMQSEFEMSKVGELSCFLGLQIKQ-KNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLY-LTTSRPD
         +     +   + M  +G++  FLGL I Q  N DI +S + Y      +  +   +  +T       L + T     D+  Y+SIVG LL+   T RPD
Subjt:  VNNFINIMQSEFEMSKVGELSCFLGLQIKQ-KNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLY-LTTSRPD

Query:  IAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMY
        I+Y V + +R+  +PR  HLE  +R+L+Y++ T    + Y
Subjt:  IAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.9e-6538.55Show/hide
Query:  EPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEG-VNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGIS
        EP T   ALKDE W NAM  E+     N+ W LV      V ++G +WIF  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++LG++
Subjt:  EPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEG-VNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGIS

Query:  CIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI
          + + + Q+DV +AFL   L ++VY++QP GF+D + P +V KL KALYGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI
Subjt:  CIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDI

Query:  IFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLY
        +  G    L++N ++ +   F +    EL  FLG++ K+    + +SQ +Y  +++ +  +  A+   T  A   KL+  +     D   YR IVG+L Y
Subjt:  IFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLY

Query:  LTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGM
        L  +RPDI+YAV   +++   P   HL+ +KRIL+Y+ GT + G+
Subjt:  LTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.0e-6436.41Show/hide
Query:  IIQ-QALSLVNHQLGCTRRKD-------KIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLV-SKLEGVNVIGTKWIFKNKTD
        IIQ  A + VN     TR KD       K  Y   +A     +  EP T   A+KD+ W  AM  E+     N+ W LV      V ++G +WIF  K +
Subjt:  IIQ-QALSLVNHQLGCTRRKD-------KIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLV-SKLEGVNVIGTKWIFKNKTD

Query:  ETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQA
          G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  + + + Q+DV +AFL   L +EVY++QP GFVD + P +V +L KA+YGLKQA
Subjt:  ETGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQA

Query:  PRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNM
        PRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+  G    L+ + ++ +   F + +  +L  FLG++ K+    + +SQ +Y  ++
Subjt:  PRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNM

Query:  VKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGM
        + +  +  A+   T  AT  KLT  +     D   YR IVG+L YL  +RPD++YAV   ++Y   P   H   +KR+L+Y+ GT D G+
Subjt:  VKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-6335.6Show/hide
Query:  SLVNHQLGCTRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQ
        SL  H +      +K+  L     +C     EPST + A +   W  AM +E+      + W + +       IG KW++K K +  G + + KARLVA+
Subjt:  SLVNHQLGCTRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQ

Query:  GYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFV----DSEHPKHVYKLNKALYGLKQAPRAWYDRLTVY
        GYTQ EG+DF ETF+PV +L +++L+L IS I  F L+Q+D+ +AFLN  L+EE+Y+  P G+     DS  P  V  L K++YGLKQA R W+ + +V 
Subjt:  GYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFV----DSEHPKHVYKLNKALYGLKQAPRAWYDRLTVY

Query:  LRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARN
        L G G+ +   D T F+   +   L   +YVDDII        V+   + ++S F++  +G L  FLGL+I +    I I Q KYA +++ + GL   + 
Subjt:  LRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARN

Query:  KRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYS
                V  +  + G  VD K YR ++G L+YL  +R DI++AV   +++   PR+ H + V +IL Y+ GT   G+ YS
Subjt:  KRTLAATHVKLTKDTKGAEVDLKLYRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYS

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.7e-0436Show/hide
Query:  LYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYS
        +YLT +RPD+ +AV   +++ +  R   ++ V ++L YV GT   G+ YS
Subjt:  LYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYS

ATMG00810.1 DNA/RNA polymerases superfamily protein4.7e-2037.5Show/hide
Query:  IYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEV-DLKLYRS
        +YVDDI+  G  + L+N  I  + S F M  +G +  FLG+QIK     +F+SQ KYA  ++   G+   +   T     +KL      A+  D   +RS
Subjt:  IYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEV-DLKLYRS

Query:  IVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGM
        IVG L YLT +RPDI+YAV I  +   +P +   + +KR+L+YV GT   G+
Subjt:  IVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGM

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.6e-2043.9Show/hide
Query:  TRRKDKIDYLKMVADLCYISTI--EPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEG
        TR K  I+ L     L   +TI  EP +V  ALKD  W  AMQEEL    RN  W LV      N++G KW+FK K    G + + KARLVA+G+ Q EG
Subjt:  TRRKDKIDYLKMVADLCYISTI--EPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQGYTQVEG

Query:  VDFDETFAPVARLEAIRLLLGIS
        + F ET++PV R   IR +L ++
Subjt:  VDFDETFAPVARLEAIRLLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGAAAAATCATCCAACAAGCTTTATCATTGGTGAACCATCAGCTGGGATGCACCAGAAGGAAAGATAAGATTGACTATTTGAAGATGGTTGCTGACTTGTGCTA
TATTTCCACCATTGAACCTTCGACTGTTGACTCTGCTCTCAAGGATGAGTATTGGTTAAATGCTATGCAAGAGGAGCTACTGCAATTTAGACGAAACAATGTCTGGACAT
TAGTCTCAAAGCTAGAAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAAACTGATGAAACTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAA
GGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCTGTTGCTCGGCTTGAAGCCATTCGACTTTTACTGGGCATATCATGCATACAGAAATTTAAATT
GTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGAGTACTTGAATGAGGAGGTTTATGTTGCCCAACCAAAAGGTTTTGTAGATTCCGAGCACCCGAAGCATGTGTATA
AGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGGCTAACTGTGTACTTGAGAGGTAGAGGATATTCCAGAGGAGAAATTGACAAGACC
TTGTTCATACATAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTCATGATCTAGTAAATAATTTCATTAATATTAT
GCAGTCAGAATTCGAAATGAGCAAGGTTGGAGAGCTTTCATGTTTTTTGGGACTTCAAATTAAGCAAAAGAATGATGACATTTTTATATCACAAGAAAAGTACGCTAGGA
ATATGGTTAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGGACTCTAGCTGCGACACATGTTAAACTTACAAAAGACACTAAAGGTGCTGAAGTTGATCTCAAACTT
TACAGGAGTATAGTAGGCAACCTATTATACTTAACAACAAGTCGACCTGACATAGCTTATGCTGTGGGAATATGTGCTCGTTATCAGGCAGATCCTCGCATCACTCACCT
AGAAGATGTTAAACGAATTCTTAAATATGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCTGTAACGCCCCGAATTTCGAGGTAAAATTTCAGCATT
TTATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGAAAAATCATCCAACAAGCTTTATCATTGGTGAACCATCAGCTGGGATGCACCAGAAGGAAAGATAAGATTGACTATTTGAAGATGGTTGCTGACTTGTGCTA
TATTTCCACCATTGAACCTTCGACTGTTGACTCTGCTCTCAAGGATGAGTATTGGTTAAATGCTATGCAAGAGGAGCTACTGCAATTTAGACGAAACAATGTCTGGACAT
TAGTCTCAAAGCTAGAAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAAACTGATGAAACTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAA
GGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCTGTTGCTCGGCTTGAAGCCATTCGACTTTTACTGGGCATATCATGCATACAGAAATTTAAATT
GTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGAGTACTTGAATGAGGAGGTTTATGTTGCCCAACCAAAAGGTTTTGTAGATTCCGAGCACCCGAAGCATGTGTATA
AGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGGCTAACTGTGTACTTGAGAGGTAGAGGATATTCCAGAGGAGAAATTGACAAGACC
TTGTTCATACATAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTCATGATCTAGTAAATAATTTCATTAATATTAT
GCAGTCAGAATTCGAAATGAGCAAGGTTGGAGAGCTTTCATGTTTTTTGGGACTTCAAATTAAGCAAAAGAATGATGACATTTTTATATCACAAGAAAAGTACGCTAGGA
ATATGGTTAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGGACTCTAGCTGCGACACATGTTAAACTTACAAAAGACACTAAAGGTGCTGAAGTTGATCTCAAACTT
TACAGGAGTATAGTAGGCAACCTATTATACTTAACAACAAGTCGACCTGACATAGCTTATGCTGTGGGAATATGTGCTCGTTATCAGGCAGATCCTCGCATCACTCACCT
AGAAGATGTTAAACGAATTCTTAAATATGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCTGTAACGCCCCGAATTTCGAGGTAAAATTTCAGCATT
TTATGTGA
Protein sequenceShow/hide protein sequence
MKRKIIQQALSLVNHQLGCTRRKDKIDYLKMVADLCYISTIEPSTVDSALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNVIGTKWIFKNKTDETGCVTKNKARLVAQ
GYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSAFLNEYLNEEVYVAQPKGFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKT
LFIHRKSDQLLVAQIYVDDIIFGGFPHDLVNNFINIMQSEFEMSKVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTLAATHVKLTKDTKGAEVDLKL
YRSIVGNLLYLTTSRPDIAYAVGICARYQADPRITHLEDVKRILKYVHGTSDFGMMYSYDTCNAPNFEVKFQHFM