; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0225731 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0225731
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:17039055..17041278
RNA-Seq ExpressionCmc08g0225731
SyntenyCmc08g0225731
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.3e-28674.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

KAA0037582.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.3e-28674.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

KAA0041108.1 reverse transcriptase [Cucumis melo var. makuwa]1.3e-28674.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.3e-28674.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

KAA0067829.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]0.0e+0086.15Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
        RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRFNAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGKVIAYASRR
        VSVDPQKVEAVVNWERPNSVTE              NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGKVIAYASRR
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRFNAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGKVIAYASRR

Query:  LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANV---------------------
        LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPG +N                      
Subjt:  LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANV---------------------

Query:  ---------------------------VVDALSRKSRLLKSALCGIRVVLLNELRGSK-VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHD
                                     +A+ ++ RL  S +  ++  +L E   S  VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHD
Subjt:  ---------------------------VVDALSRKSRLLKSALCGIRVVLLNELRGSK-VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHD

Query:  GIWVIVNKITKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLR
        GIWVIVNKITKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLR
Subjt:  GIWVIVNKITKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLR

Query:  LKEVWIPTCHLWSLLIIIAISLVSVWHHMKLYTGDHAELLCVGMKWESEC
        LKEVWIPTCHLWSLLIIIAISLVSVWHHMKLYTGDHAELLCVGMKWESEC
Subjt:  LKEVWIPTCHLWSLLIIIAISLVSVWHHMKLYTGDHAELLCVGMKWESEC

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase6.3e-28774.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

A0A5A7U2V7 Reverse transcriptase6.3e-28774.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

A0A5A7UUL6 Reverse transcriptase6.3e-28774.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

A0A5A7VRP7 Retrotransposon protein, putative, Ty3-gypsy subclass0.0e+0086.15Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
        RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRFNAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGKVIAYASRR
        VSVDPQKVEAVVNWERPNSVTE              NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGKVIAYASRR
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRFNAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGKVIAYASRR

Query:  LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANV---------------------
        LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPG +N                      
Subjt:  LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANV---------------------

Query:  ---------------------------VVDALSRKSRLLKSALCGIRVVLLNELRGSK-VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHD
                                     +A+ ++ RL  S +  ++  +L E   S  VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHD
Subjt:  ---------------------------VVDALSRKSRLLKSALCGIRVVLLNELRGSK-VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHD

Query:  GIWVIVNKITKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLR
        GIWVIVNKITKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLR
Subjt:  GIWVIVNKITKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLR

Query:  LKEVWIPTCHLWSLLIIIAISLVSVWHHMKLYTGDHAELLCVGMKWESEC
        LKEVWIPTCHLWSLLIIIAISLVSVWHHMKLYTGDHAELLCVGMKWESEC
Subjt:  LKEVWIPTCHLWSLLIIIAISLVSVWHHMKLYTGDHAELLCVGMKWESEC

A0A5D3BHI1 Reverse transcriptase6.3e-28774.43Show/hide
Query:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK
        MA SELKELKMQLQELVDKG IRPSVSPWGAPVLFVKKKDGTLRLCI+YRQLNKVTIRNKYPLPRI+DLFDQ RGAALFSKIDLRSGYHQLKVRESDIAK
Subjt:  MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAK

Query:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG
         AFRTRYGHYEFRVMPFGLTNAP  FMDLMNRIFH+YLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLR+KQLYAKFSKCEFWLEQVVFLGHVVSAKG
Subjt:  RAFRTRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKG

Query:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERP S TEVRSFLGLAGYYRRF                 N KFEWSDKCEQ FQELKKRLVT PILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA
        GCVLMQDG VIAYASR+LKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIK YDCTIEYHPGKANVV DA
Subjt:  GCVLMQDGKVIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDA

Query:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------
        LSRKSRL KSALCGIRV LLNELRGSK                                                                         
Subjt:  LSRKSRLLKSALCGIRVVLLNELRGSK-------------------------------------------------------------------------

Query:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI
                                                          VKP+RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTS+GHDGIWVIV+++
Subjt:  --------------------------------------------------VKPIRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKI

Query:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        TKT  FI IK TS LDQLARLYVDKIVSQYGV VSIV DRD RFTSKFW SLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVL+LK  W
Subjt:  TKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.1e-8128.57Show/hide
Query:  ELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFR
        +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ ++Y+ LNK    N YPLP I  L  + +G+ +F+K+DL+S YH ++VR+ D  K AFR
Subjt:  ELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFR

Query:  TRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVD
           G +E+ VMP+G++ AP  F   +N I  +  +  V+ ++DDIL++S     H +H++ VLQ L++  L    +KCEF   QV F+G+ +S KG +  
Subjt:  TRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVD

Query:  PQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL
         + ++ V+ W++P +  E+R FLG   Y R+F                 + +++W+    Q  + +K+ LV+ P+L      K  ++  DAS + +G VL
Subjt:  PQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL

Query:  MQ---DGKV--IAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKAN
         Q   D K   + Y S ++ + + NY   D E+ A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++ ++  I Y PG AN
Subjt:  MQ---DGKV--IAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKAN

Query:  VVVDALSR-------------------------------------------------------KSRLLKSALC---------------------------
         + DALSR                                                       ++  LK  L                            
Subjt:  VVVDALSR-------------------------------------------------------KSRLLKSALC---------------------------

Query:  -----GIRVVLLNELRGSKVKPIRQ--------------------RPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISIKA
             GI ++    LR    K IR+                    +P G L P+P  E  WE ++MDF+  LP  S+G++ ++V+V++ +K A  +    
Subjt:  -----GIRVVLLNELRGSKVKPIRQ--------------------RPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISIKA

Query:  TSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVWI
        +   +Q AR++  ++++ +G    I+ D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR         W+
Subjt:  TSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVWI

P0CT35 Transposon Tf2-2 polyprotein1.1e-8128.57Show/hide
Query:  ELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFR
        +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ ++Y+ LNK    N YPLP I  L  + +G+ +F+K+DL+S YH ++VR+ D  K AFR
Subjt:  ELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFR

Query:  TRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVD
           G +E+ VMP+G++ AP  F   +N I  +  +  V+ ++DDIL++S     H +H++ VLQ L++  L    +KCEF   QV F+G+ +S KG +  
Subjt:  TRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVD

Query:  PQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL
         + ++ V+ W++P +  E+R FLG   Y R+F                 + +++W+    Q  + +K+ LV+ P+L      K  ++  DAS + +G VL
Subjt:  PQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL

Query:  MQ---DGKV--IAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKAN
         Q   D K   + Y S ++ + + NY   D E+ A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++ ++  I Y PG AN
Subjt:  MQ---DGKV--IAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKAN

Query:  VVVDALSR-------------------------------------------------------KSRLLKSALC---------------------------
         + DALSR                                                       ++  LK  L                            
Subjt:  VVVDALSR-------------------------------------------------------KSRLLKSALC---------------------------

Query:  -----GIRVVLLNELRGSKVKPIRQ--------------------RPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISIKA
             GI ++    LR    K IR+                    +P G L P+P  E  WE ++MDF+  LP  S+G++ ++V+V++ +K A  +    
Subjt:  -----GIRVVLLNELRGSKVKPIRQ--------------------RPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISIKA

Query:  TSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVWI
        +   +Q AR++  ++++ +G    I+ D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR         W+
Subjt:  TSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVWI

P0CT41 Transposon Tf2-12 polyprotein1.1e-8128.57Show/hide
Query:  ELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFR
        +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ ++Y+ LNK    N YPLP I  L  + +G+ +F+K+DL+S YH ++VR+ D  K AFR
Subjt:  ELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFR

Query:  TRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVD
           G +E+ VMP+G++ AP  F   +N I  +  +  V+ ++DDIL++S     H +H++ VLQ L++  L    +KCEF   QV F+G+ +S KG +  
Subjt:  TRYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVD

Query:  PQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL
         + ++ V+ W++P +  E+R FLG   Y R+F                 + +++W+    Q  + +K+ LV+ P+L      K  ++  DAS + +G VL
Subjt:  PQKVEAVVNWERPNSVTEVRSFLGLAGYYRRF-----------------NAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL

Query:  MQ---DGKV--IAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKAN
         Q   D K   + Y S ++ + + NY   D E+ A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++ ++  I Y PG AN
Subjt:  MQ---DGKV--IAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKAN

Query:  VVVDALSR-------------------------------------------------------KSRLLKSALC---------------------------
         + DALSR                                                       ++  LK  L                            
Subjt:  VVVDALSR-------------------------------------------------------KSRLLKSALC---------------------------

Query:  -----GIRVVLLNELRGSKVKPIRQ--------------------RPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISIKA
             GI ++    LR    K IR+                    +P G L P+P  E  WE ++MDF+  LP  S+G++ ++V+V++ +K A  +    
Subjt:  -----GIRVVLLNELRGSKVKPIRQ--------------------RPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISIKA

Query:  TSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVWI
        +   +Q AR++  ++++ +G    I+ D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR         W+
Subjt:  TSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVWI

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.5e-9333.19Show/hide
Query:  KELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFRTR
        +E+   +Q+L+D   I PS SP  +PV+ V KKDGT RLC++YR LNK TI + +PLPRI++L  +   A +F+ +DL SGYHQ+ +   D  K AF T 
Subjt:  KELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFRTR

Query:  YGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQ
         G YE+ VMPFGL NAP+ F   M   F     +FV V++DDIL++S   E H +HL  VL+ L+++ L  K  KC+F  E+  FLG+ +  + ++    
Subjt:  YGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQ

Query:  KVEAVVNWERPNSVTEVRSFLGLAGYYRRFNA---------------KFEWSDKCEQCFQELKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQ-
        K  A+ ++  P +V + + FLG+  YYRRF                 K +W++K ++  ++LK  L  +P+L +P   K +Y +  DAS+ G+G VL + 
Subjt:  KVEAVVNWERPNSVTEVRSFLGLAGYYRRFNA---------------KFEWSDKCEQCFQELKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQ-

Query:  DGK-----VIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDAL
        D K     V+ Y S+ L+  + NYP  +LEL  ++ AL  +R+ L G+   + TDH SL  + ++ E   R +RWL+ +  YD T+EY  G  NVV DA+
Subjt:  DGK-----VIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDAL

Query:  SR------------------KSRLLKSALCGIRVVLLNELRG----------------------------------------------------------
        SR                  KS      LC   ++ + EL                                                            
Subjt:  SR------------------KSRLLKSALCGIRVVLLNELRG----------------------------------------------------------

Query:  -------------SKVKPI--------------------------RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISI
                     +K+ PI                          R R  G L PLP+ E +W  I+MDF+ GLP TSN  + I V+V++ +K A FI+ 
Subjt:  -------------SKVKPI--------------------------RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISI

Query:  KATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW
        + T    QL  L    I S +G   +I  DRD+R T+  +  L K +G     S++ HPQTDGQSERTIQTL  +LRA V    + W
Subjt:  KATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLRLKEVW

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.0e-9233.33Show/hide
Query:  KELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFRTR
        +E+   +Q+L+D   I PS SP  +PV+ V KKDGT RLC++YR LNK TI + +PLPRI++L  +   A +F+ +DL SGYHQ+ +   D  K AF T 
Subjt:  KELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFRTR

Query:  YGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQ
         G YE+ VMPFGL NAP+ F   M   F     +FV V++DDIL++S   E H +HL  VL+ L+++ L  K  KC+F  E+  FLG+ +  + ++    
Subjt:  YGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQ

Query:  KVEAVVNWERPNSVTEVRSFLGLAGYYRRFNA---------------KFEWSDKCEQCFQELKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQ-
        K  A+ ++  P +V + + FLG+  YYRRF                 K +W++K ++   +LK  L  +P+L +P   K +Y +  DAS+ G+G VL + 
Subjt:  KVEAVVNWERPNSVTEVRSFLGLAGYYRRFNA---------------KFEWSDKCEQCFQELKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQ-

Query:  DGK-----VIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDAL
        D K     V+ Y S+ L+  + NYP  +LEL  ++ AL  +R+ L G+   + TDH SL  + ++ E   R +RWL+ +  YD T+EY  G  NVV DA+
Subjt:  DGK-----VIAYASRRLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDAL

Query:  SR------------------KSRLLKSALCGIRVVLLNELRG----------------------------------------------------------
        SR                  KS      LC   ++ + EL                                                            
Subjt:  SR------------------KSRLLKSALCGIRVVLLNELRG----------------------------------------------------------

Query:  -------------SKVKPI--------------------------RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISI
                     +K+ PI                          R R  G L PLP+ E +W  I+MDF+ GLP TSN  + I V+V++ +K A FI+ 
Subjt:  -------------SKVKPI--------------------------RQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISI

Query:  KATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRA
        + T    QL  L    I S +G   +I  DRD+R T+  +  L K +G     S++ HPQTDGQSERTIQTL  +LRA
Subjt:  KATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRA

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.4e-2040.65Show/hide
Query:  HLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLG--HVVSAKGVSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRFNAKF----------------EWS
        HL +VLQ     Q YA   KC F   Q+ +LG  H++S +GVS DP K+EA+V W  P + TE+R FLGL GYYRRF   +                +W+
Subjt:  HLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLG--HVVSAKGVSVDPQKVEAVVNWERPNSVTEVRSFLGLAGYYRRFNAKF----------------EWS

Query:  DKCEQCFQELKKRLVTTPILALP
        +     F+ LK  + T P+LALP
Subjt:  DKCEQCFQELKKRLVTTPILALP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAAGCGAGCTAAAAGAATTGAAGATGCAGTTACAAGAACTAGTTGACAAGGGATGCATCAGGCCTAGTGTTTCGCCGTGGGGAGCACCAGTGCTATTT
GTGAAAAAGAAAGATGGTACCCTCAGATTATGTATTGAATATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCAATGACTTATTT
GATCAACCAAGGGGAGCAGCGTTGTTCTCTAAGATTGACTTAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGAGAGCATTTAGAACA
AGGTATGGACATTATGAGTTTCGAGTTATGCCATTCGGTTTAACGAATGCGCCAACGGCTTTCATGGATCTCATGAACAGGATCTTCCATCAGTATTTAGATCAG
TTTGTGATTGTGTTCATTGATGATATATTAGTTTACTCGGTTGACAGAGAATCCCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGATAAACAGTTG
TACGCTAAGTTCAGCAAATGTGAGTTCTGGTTGGAACAAGTAGTATTTTTGGGGCATGTAGTTTCAGCAAAAGGAGTTAGTGTGGATCCACAAAAAGTAGAAGCG
GTTGTCAATTGGGAAAGACCAAATAGTGTGACAGAAGTACGTAGTTTTCTGGGTTTGGCAGGATACTATAGGCGTTTTAATGCTAAGTTTGAGTGGTCAGATAAA
TGCGAGCAATGTTTTCAGGAATTGAAAAAAAGACTAGTTACAACACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATTTATTGTGATGCTTCAAGG
CTAGGATTAGGTTGTGTGCTTATGCAAGATGGGAAGGTAATAGCTTATGCTTCAAGGCGGTTGAAGGAGCATGAGTGTAATTACCCTACCCATGATCTTGAGCTA
GCAGCAGTTGTTTTAGCACTAAAAATCTGGAGACACTATTTGTTTGGGGAAAAGTGCCATATTTTCACAGATCATAAGAGTCTGAAGTATATTTTTGATCAGAAA
GAGCTAAATTTGAGACAAAGGCGATGGCTAGAACTGATTAAATATTATGATTGTACTATAGAGTATCATCCAGGTAAGGCTAATGTAGTAGTAGATGCATTAAGT
AGGAAGTCAAGACTTCTGAAGAGTGCCTTGTGTGGTATTCGAGTAGTTTTGTTGAATGAGTTAAGAGGTTCCAAGGTTAAACCAATAAGACAGAGGCCAGGAGGA
TTTCTTAATCCTTTGCCAGTGCCAGAGTGGAAATGGGAGCATATTACTATGGATTTTCTATTTGGATTACCTCGTACATCCAATGGACATGATGGTATATGGGTA
ATAGTAAACAAAATCACCAAGACGGCAGGATTTATATCGATTAAAGCGACATCTATGTTAGACCAGCTAGCTAGATTATATGTTGATAAGATTGTGAGTCAGTAT
GGAGTACTAGTGTCCATAGTTTTAGATAGGGATCTGAGGTTTACTTCTAAATTCTGGCATAGTTTACAGAAAGCAATGGGAACAGGGCTAAAGTTTAGTACATCA
TTTCATCCCCAAACAGATGGTCAGTCCGAGAGGACCATCCAAACTTTAGAGGACATGTTGAGAGCATGTGTCCTTCGACTTAAGGAAGTTTGGATACCCACTTGC
CACTTATGGAGTTTGCTTATAATAATAGCTATCAGTCTAGTATCGGTATGGCACCATATGAAGCTTTATACGGGAGACCATGCAGAACTCCTGTGTGTTGGAATG
AAGTGGGAGAGCGAATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAAGCGAGCTAAAAGAATTGAAGATGCAGTTACAAGAACTAGTTGACAAGGGATGCATCAGGCCTAGTGTTTCGCCGTGGGGAGCACCAGTGCTATTT
GTGAAAAAGAAAGATGGTACCCTCAGATTATGTATTGAATATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCAATGACTTATTT
GATCAACCAAGGGGAGCAGCGTTGTTCTCTAAGATTGACTTAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGAGAGCATTTAGAACA
AGGTATGGACATTATGAGTTTCGAGTTATGCCATTCGGTTTAACGAATGCGCCAACGGCTTTCATGGATCTCATGAACAGGATCTTCCATCAGTATTTAGATCAG
TTTGTGATTGTGTTCATTGATGATATATTAGTTTACTCGGTTGACAGAGAATCCCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGATAAACAGTTG
TACGCTAAGTTCAGCAAATGTGAGTTCTGGTTGGAACAAGTAGTATTTTTGGGGCATGTAGTTTCAGCAAAAGGAGTTAGTGTGGATCCACAAAAAGTAGAAGCG
GTTGTCAATTGGGAAAGACCAAATAGTGTGACAGAAGTACGTAGTTTTCTGGGTTTGGCAGGATACTATAGGCGTTTTAATGCTAAGTTTGAGTGGTCAGATAAA
TGCGAGCAATGTTTTCAGGAATTGAAAAAAAGACTAGTTACAACACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATTTATTGTGATGCTTCAAGG
CTAGGATTAGGTTGTGTGCTTATGCAAGATGGGAAGGTAATAGCTTATGCTTCAAGGCGGTTGAAGGAGCATGAGTGTAATTACCCTACCCATGATCTTGAGCTA
GCAGCAGTTGTTTTAGCACTAAAAATCTGGAGACACTATTTGTTTGGGGAAAAGTGCCATATTTTCACAGATCATAAGAGTCTGAAGTATATTTTTGATCAGAAA
GAGCTAAATTTGAGACAAAGGCGATGGCTAGAACTGATTAAATATTATGATTGTACTATAGAGTATCATCCAGGTAAGGCTAATGTAGTAGTAGATGCATTAAGT
AGGAAGTCAAGACTTCTGAAGAGTGCCTTGTGTGGTATTCGAGTAGTTTTGTTGAATGAGTTAAGAGGTTCCAAGGTTAAACCAATAAGACAGAGGCCAGGAGGA
TTTCTTAATCCTTTGCCAGTGCCAGAGTGGAAATGGGAGCATATTACTATGGATTTTCTATTTGGATTACCTCGTACATCCAATGGACATGATGGTATATGGGTA
ATAGTAAACAAAATCACCAAGACGGCAGGATTTATATCGATTAAAGCGACATCTATGTTAGACCAGCTAGCTAGATTATATGTTGATAAGATTGTGAGTCAGTAT
GGAGTACTAGTGTCCATAGTTTTAGATAGGGATCTGAGGTTTACTTCTAAATTCTGGCATAGTTTACAGAAAGCAATGGGAACAGGGCTAAAGTTTAGTACATCA
TTTCATCCCCAAACAGATGGTCAGTCCGAGAGGACCATCCAAACTTTAGAGGACATGTTGAGAGCATGTGTCCTTCGACTTAAGGAAGTTTGGATACCCACTTGC
CACTTATGGAGTTTGCTTATAATAATAGCTATCAGTCTAGTATCGGTATGGCACCATATGAAGCTTTATACGGGAGACCATGCAGAACTCCTGTGTGTTGGAATG
AAGTGGGAGAGCGAATGTTAG
Protein sequenceShow/hide protein sequence
MASSELKELKMQLQELVDKGCIRPSVSPWGAPVLFVKKKDGTLRLCIEYRQLNKVTIRNKYPLPRINDLFDQPRGAALFSKIDLRSGYHQLKVRESDIAKRAFRT
RYGHYEFRVMPFGLTNAPTAFMDLMNRIFHQYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLRDKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
VVNWERPNSVTEVRSFLGLAGYYRRFNAKFEWSDKCEQCFQELKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGKVIAYASRRLKEHECNYPTHDLEL
AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKYYDCTIEYHPGKANVVVDALSRKSRLLKSALCGIRVVLLNELRGSKVKPIRQRPGG
FLNPLPVPEWKWEHITMDFLFGLPRTSNGHDGIWVIVNKITKTAGFISIKATSMLDQLARLYVDKIVSQYGVLVSIVLDRDLRFTSKFWHSLQKAMGTGLKFSTS
FHPQTDGQSERTIQTLEDMLRACVLRLKEVWIPTCHLWSLLIIIAISLVSVWHHMKLYTGDHAELLCVGMKWESEC