; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026522 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026522
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr10:38386103..38406533
RNA-Seq ExpressionLag0026522
SyntenyLag0026522
Gene Ontology termsGO:0006265 - DNA topological change (biological process)
GO:0016020 - membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003916 - DNA topoisomerase activity (molecular function)
InterPro domainsIPR003602 - DNA topoisomerase, type IA, DNA-binding domain
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR013497 - DNA topoisomerase, type IA, central
IPR013824 - DNA topoisomerase, type IA, central region, subdomain 1
IPR023405 - DNA topoisomerase, type IA, core domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW35064.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.9e-20460.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

RVW70260.1 Copia protein [Vitis vinifera]1.1e-20460.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

RVW86492.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.9e-20460.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

RVX02146.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.7e-20560.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

RVX19036.1 Copia protein [Vitis vinifera]1.9e-20460.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

TrEMBL top hitse value%identityAlignment
A0A438DHV4 Retrovirus-related Pol polyprotein from transposon TNT 1-949.4e-20560.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

A0A438GDK0 Copia protein5.5e-20560.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

A0A438IZK5 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-20560.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

A0A438KCU4 Copia protein9.4e-20560.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

A0A438KPB7 Copia protein9.4e-20560.5Show/hide
Query:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA
        VY RR++     H  Q         NS+  +ENI        + P IDD TLPIA RKGVR CT+HPI  YV Y  L P Y+AF + LD   +P TI +A
Subjt:  VYTRRKEVTLQNHECQPPEPDSSSLNST--DENITS------MAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDA

Query:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ
        LK  +WKK V +EI ALE NGTWT+T LP GK PVGCKWIFT+KYK+DGSVERF+ARLVA+GFTQSYGIDYQETFAPVAKLNT+R+LLSLAVNQDW L Q
Subjt:  LKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQ

Query:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD
        LD+KNAFLNGDLEEEVYM+IPPG E+    N VCKL+KSLYGLKQSPRAWFDRF K ++K GY Q QADHTLFVK S+  K AILIVYVDDII++GND+ 
Subjt:  LDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSN-NKTAILIVYVDDIIITGNDVD

Query:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI
        E+ NLK+ L+ EFE+KDLG+L+YFLGMEVARS  GI++SQR YILDLL                                                RPDI
Subjt:  EILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDI

Query:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA
         ++VS  SQ+M++P E H+ AV RIL+YLK TPG GL F+K+ +R  E+Y+DA WAG + DR+ T+GYCS+VWGNLVTWRSKKQ+VVARSSAEAEYRALA
Subjt:  AYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALA

Query:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS
         GICEGIWI+++L+EL   + + + M CD+Q+AISIAKNP+HHDRTKH+EIDRHFI EKV +  V+LNYVP+++Q ADILTKALPR +FE+   KLGL  
Subjt:  LGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLIS

Query:  LLEKA
        +   A
Subjt:  LLEKA

SwissProt top hitse value%identityAlignment
C7J0A2 DNA topoisomerase 3-alpha4.4e-6679.19Show/hide
Query:  VDIKRNYLDVYRFESWGGSTIPTYVSGQQFVPTTLTLDSGVTRPPPLLSEADLLSCMDKEGIGTDATMHDHIKKLLDRFYATKDANTRFTPTNLGEALVM
        V + +NYLDVYRF+SWGG+ +PTY+ GQQFVPTTLTLDSG+TRPPPLL+EADLL CMDK GIGTDATMHDHIKKLLDR YATKDANTRF+PTNLGEALVM
Subjt:  VDIKRNYLDVYRFESWGGSTIPTYVSGQQFVPTTLTLDSGVTRPPPLLSEADLLSCMDKEGIGTDATMHDHIKKLLDRFYATKDANTRFTPTNLGEALVM

Query:  GYDDMGYKLWKPHLRAVMEHEMKSVSEGTKRKDEFLATSLQQMKTCFLD
        GYD+MGY+LWKP+LR++ME +MKSVS GTK K E L   LQQMK CFLD
Subjt:  GYDDMGYKLWKPHLRAVMEHEMKSVSEGTKRKDEFLATSLQQMKTCFLD

P04146 Copia protein1.2e-10038.55Show/hide
Query:  WKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKN
        W++ ++ E+ A + N TWT+T+ P  KN V  +W+F+VKY   G+  R++ARLVA+GFTQ Y IDY+ETFAPVA++++ R +LSL +  +  +HQ+DVK 
Subjt:  WKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKN

Query:  AFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFV--KSSNNKTAILIVYVDDIIITGNDVDEILN
        AFLNG L+EE+YM++P G+   S+N  VCKL K++YGLKQ+ R WF+ F + L +  +     D  +++  K + N+   +++YVDD++I   D+  + N
Subjt:  AFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFV--KSSNNKTAILIVYVDDIIITGNDVDEILN

Query:  LKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDIAYSV
         KR L  +F + DL  +++F+G+ +    + I +SQ  Y+  +L                                                RPD+  +V
Subjt:  LKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDIAYSV

Query:  SIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSS--RMVELYTDASWAGELTDRKCTTGYCSYVWG-NLVTWRSKKQAVVARSSAEAEYRALAL
        +I S+Y +  N      + R+L+YLKGT    L+FKK+ +    +  Y D+ WAG   DRK TTGY   ++  NL+ W +K+Q  VA SS EAEY AL  
Subjt:  SIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSS--RMVELYTDASWAGELTDRKCTTGYCSYVWG-NLVTWRSKKQAVVARSSAEAEYRALAL

Query:  GICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLI
         + E +W++ LLT + I  +N +K++ D+Q  ISIA NP  H R KHI+I  HF  E+V N ++ L Y+P+ NQ+ADI TK LP A F E   KLGL+
Subjt:  GICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.4e-9437.57Show/hide
Query:  PQTIHDALKDP---KWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSL
        P+++ + L  P   +  K + EE+++L+ NGT+ L +LP GK P+ CKW+F +K   D  + R++ARLV KGF Q  GID+ E F+PV K+ ++R +LSL
Subjt:  PQTIHDALKDP---KWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSL

Query:  AVNQDWPLHQLDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKS-SNNKTAILIVYVD
        A + D  + QLDVK AFL+GDLEEE+YM+ P G E     ++VCKL KSLYGLKQ+PR W+ +F   +    Y +  +D  ++ K  S N   IL++YVD
Subjt:  AVNQDWPLHQLDVKNAFLNGDLEEEVYMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKS-SNNKTAILIVYVD

Query:  DIIITGNDVDEILNLKRMLATEFEIKDLGSLRYFLGMEVA--RSNNGIIISQRNYILDLL----------------------------------------
        D++I G D   I  LK  L+  F++KDLG  +  LGM++   R++  + +SQ  YI  +L                                        
Subjt:  DIIITGNDVDEILNLKRMLATEFEIKDLGSLRYFLGMEVA--RSNNGIIISQRNYILDLL----------------------------------------

Query:  ---------------RPDIAYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRS
                       RPDIA++V + S+++ NP + H  AV  IL+YL+GT G  L F   S  +++ YTDA  AG++ +RK +TGY     G  ++W+S
Subjt:  ---------------RPDIAYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRS

Query:  KKQAVVARSSAEAEYRALALGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILT
        K Q  VA S+ EAEY A      E IW+++ L EL +  +  V ++CDSQSAI ++KN ++H RTKHI++  H+I E V++  +++  + +    AD+LT
Subjt:  KKQAVVARSSAEAEYRALALGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILT

Query:  KALPRASFEEFNSKLGLIS
        K +PR  FE     +G+ S
Subjt:  KALPRASFEEFNSKLGLIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.1e-11339.59Show/hide
Query:  QNHECQPPEPDSSSLNSTDENITSMAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFIS-QLDSIHIPQTIHDALKDPKWKKTVDEEIKAL
        Q+    P    S+S +ST     S+          I           H +      G + P+ K  ++  L +   P+T   ALKD +W+  +  EI A 
Subjt:  QNHECQPPEPDSSSLNSTDENITSMAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFIS-QLDSIHIPQTIHDALKDPKWKKTVDEEIKAL

Query:  ESNGTWTLTQLPHGK-NPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKNAFLNGDLEEEV
          N TW L   P      VGC+WIFT KY SDGS+ R++ARLVAKG+ Q  G+DY ETF+PV K  ++R++L +AV++ WP+ QLDV NAFL G L ++V
Subjt:  ESNGTWTLTQLPHGK-NPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKNAFLNGDLEEEV

Query:  YMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSNNKTAILIVYVDDIIITGNDVDEILNLKRMLATEFEIKD
        YM  PPG  DK   N VCKLRK+LYGLKQ+PRAW+      L+  G+    +D +LFV         ++VYVDDI+ITGND   + N    L+  F +KD
Subjt:  YMKIPPGMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSNNKTAILIVYVDDIIITGNDVDEILNLKRMLATEFEIKD

Query:  LGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDIAYSVSIGSQYMNNPNER
           L YFLG+E  R   G+ +SQR YILDLL                                                RPDI+Y+V+  SQ+M+ P E 
Subjt:  LGSLRYFLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDIAYSVSIGSQYMNNPNER

Query:  HLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALALGICEGIWIQKLLTELK
        HL A+ RIL+YL GTP HG+  KK ++  +  Y+DA WAG+  D   T GY  Y+  + ++W SKKQ  V RSS EAEYR++A    E  WI  LLTEL 
Subjt:  HLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALALGICEGIWIQKLLTELK

Query:  IDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGL
        I       ++CD+  A  +  NP+ H R KHI ID HFI  +V +  + + +V + +Q+AD LTK L R +F+ F SK+G+
Subjt:  IDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.8e-11138.24Show/hide
Query:  PEPDSSSLNSTDENITSMAPVIDDLTLPIAKRKGVRSCTN-HPIEKYVAYGKLLPDYK-AFISQLDSIHIPQTIHDALKDPKWKKTVDEEIKALESNGTW
        P    S  NS   + TS  P+   L  P   +   ++  N H +      G   P+ K ++ + L +   P+T   A+KD +W++ +  EI A   N TW
Subjt:  PEPDSSSLNSTDENITSMAPVIDDLTLPIAKRKGVRSCTN-HPIEKYVAYGKLLPDYK-AFISQLDSIHIPQTIHDALKDPKWKKTVDEEIKALESNGTW

Query:  TLT-QLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKNAFLNGDLEEEVYMKIPP
         L    P     VGC+WIFT K+ SDGS+ R++ARLVAKG+ Q  G+DY ETF+PV K  ++R++L +AV++ WP+ QLDV NAFL G L +EVYM  PP
Subjt:  TLT-QLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKNAFLNGDLEEEVYMKIPP

Query:  GMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSNNKTAILIVYVDDIIITGNDVDEILNLKRMLATEFEIKDLGSLRY
        G  DK   + VC+LRK++YGLKQ+PRAW+      L+  G+    +D +LFV         ++VYVDDI+ITGND   + +    L+  F +K+   L Y
Subjt:  GMEDKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSNNKTAILIVYVDDIIITGNDVDEILNLKRMLATEFEIKDLGSLRY

Query:  FLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDIAYSVSIGSQYMNNPNERHLGAVN
        FLG+E  R   G+ +SQR Y LDLL                                                RPD++Y+V+  SQYM+ P + H  A+ 
Subjt:  FLGMEVARSNNGIIISQRNYILDLL------------------------------------------------RPDIAYSVSIGSQYMNNPNERHLGAVN

Query:  RILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALALGICEGIWIQKLLTELKIDTQNT
        R+L+YL GTP HG+  KK ++  +  Y+DA WAG+  D   T GY  Y+  + ++W SKKQ  V RSS EAEYR++A    E  WI  LLTEL I   + 
Subjt:  RILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALALGICEGIWIQKLLTELKIDTQNT

Query:  VKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLISL
          ++CD+  A  +  NP+ H R KHI +D HFI  +V +  + + +V + +Q+AD LTK L R +F+ F+ K+G+I +
Subjt:  VKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLISL

Arabidopsis top hitse value%identityAlignment
AT2G32000.1 DNA topoisomerase, type IA, core5.2e-1430.71Show/hide
Query:  IPTYVSGQQFVPTTLTLDSGVTRPPPLLSEADLLSCMDKEGIGTDATMHDHIKKLLDRFYATKDANTRFTPTNLGEALVMGYDDMGYKLWKPHLRAVMEH
        +P+++ G++     + L  G T PP  L+E++L+S M+K GIGTDA++  HI  + +R Y    +  +  PT LG  L+ GY  +   L  P +R+ +E 
Subjt:  IPTYVSGQQFVPTTLTLDSGVTRPPPLLSEADLLSCMDKEGIGTDATMHDHIKKLLDRFYATKDANTRFTPTNLGEALVMGYDDMGYKLWKPHLRAVMEH

Query:  EMKSVSEGTKRKDEFLATSLQQMKTCF
        ++  V++G       +   +QQ +  F
Subjt:  EMKSVSEGTKRKDEFLATSLQQMKTCF

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-11140.76Show/hide
Query:  VRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDALKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLV
        V S T H I ++++Y K+ P Y +F+  +     P T ++A +   W   +D+EI A+E+  TW +  LP  K P+GCKW++ +KY SDG++ER++ARLV
Subjt:  VRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDALKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLV

Query:  AKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKNAFLNGDLEEEVYMKIPPGMEDKSNN----NLVCKLRKSLYGLKQSPRAWFDRFA
        AKG+TQ  GID+ ETF+PV KL +V+++L+++   ++ LHQLD+ NAFLNGDL+EE+YMK+PPG   +  +    N VC L+KS+YGLKQ+ R WF +F+
Subjt:  AKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKNAFLNGDLEEEVYMKIPPGMEDKSNN----NLVCKLRKSLYGLKQSPRAWFDRFA

Query:  KTLIKSGYYQCQADHTLFVKSSNNKTAILIVYVDDIIITGNDVDEILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL--------
         TLI  G+ Q  +DHT F+K +      ++VYVDDIII  N+   +  LK  L + F+++DLG L+YFLG+E+ARS  GI I QR Y LDLL        
Subjt:  KTLIKSGYYQCQADHTLFVKSSNNKTAILIVYVDDIIITGNDVDEILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNYILDLL--------

Query:  ----------------------------------------RPDIAYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWA
                                                R DI+++V+  SQ+   P   H  AV +IL Y+KGT G GL +   +   +++++DAS+ 
Subjt:  ----------------------------------------RPDIAYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWA

Query:  GELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALALGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFI
             R+ T GYC ++  +L++W+SKKQ VV++SSAEAEYRAL+    E +W+ +   EL++       +FCD+ +AI IA N + H+RTKHIE D H +
Subjt:  GELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEAEYRALALGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFI

Query:  FEK
         E+
Subjt:  FEK

AT5G63920.1 topoisomerase 3alpha1.2e-6680.27Show/hide
Query:  IKRNYLDVYRFESWGGSTIPTYVSGQQFVPTTLTLDSGVTRPPPLLSEADLLSCMDKEGIGTDATMHDHIKKLLDRFYATKDANTRFTPTNLGEALVMGY
        + +NYL+VYRFESWGGS IP Y  GQQF+PTTLTLD+ VTRPPPLL EADLLSCMDK GIGTDATMHDHIKKLLDR YATKDANTRF+PTNLGEALVMGY
Subjt:  IKRNYLDVYRFESWGGSTIPTYVSGQQFVPTTLTLDSGVTRPPPLLSEADLLSCMDKEGIGTDATMHDHIKKLLDRFYATKDANTRFTPTNLGEALVMGY

Query:  DDMGYKLWKPHLRAVMEHEMKSVSEGTKRKDEFLATSLQQMKTCFLD
        DDMGY+LWKP+LRA+MEH+M  VS G K K E L T LQQMK CFLD
Subjt:  DDMGYKLWKPHLRAVMEHEMKSVSEGTKRKDEFLATSLQQMKTCFLD

ATMG00810.1 DNA/RNA polymerases superfamily protein1.8e-3033.18Show/hide
Query:  LIVYVDDIIITGNDVDEILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNY-----------------------------------------
        L++YVDDI++TG+    +  L   L++ F +KDLG + YFLG+++    +G+ +SQ  Y                                         
Subjt:  LIVYVDDIIITGNDVDEILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGIIISQRNY-----------------------------------------

Query:  ------ILDLLRPDIAYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQA
               L L RPDI+Y+V+I  Q M+ P       + R+L+Y+KGT  HGL   K+S   V+ + D+ WAG  + R+ TTG+C+++  N+++W +K+Q 
Subjt:  ------ILDLLRPDIAYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQA

Query:  VVARSSAEAEYRALALGICEGIW
         V+RSS E EYRALAL   E  W
Subjt:  VVARSSAEAEYRALALGICEGIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.0e-2243.97Show/hide
Query:  KLLPDYKAFISQLDSIHIPQTIHDALKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETF
        KL P Y   I+       P+++  ALKDP W + + EE+ AL  N TW L   P  +N +GCKW+F  K  SDG+++R +ARLVAKGF Q  GI + ET+
Subjt:  KLLPDYKAFISQLDSIHIPQTIHDALKDPKWKKTVDEEIKALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETF

Query:  APVAKLNTVRVLLSLA
        +PV +  T+R +L++A
Subjt:  APVAKLNTVRVLLSLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTATTGTTTATACAAGAAGAAAAGAGGTTACCTTGCAAAATCACGAATGTCAACCACCTGAACCGGATTCATCATCGCTGAATTCCACTGATGAGAACATCACATC
TATGGCACCTGTGATCGATGATCTAACCCTACCAATTGCTAAGAGAAAAGGTGTTAGAAGTTGCACTAACCATCCCATTGAAAAATATGTTGCATATGGGAAATTGCTTC
CTGATTATAAAGCATTTATTAGTCAACTGGATAGTATTCATATTCCACAAACCATTCATGATGCACTCAAAGATCCGAAATGGAAAAAGACAGTGGATGAAGAAATCAAA
GCTCTTGAAAGCAATGGAACGTGGACTCTTACTCAACTTCCTCATGGAAAGAATCCAGTTGGTTGTAAATGGATTTTCACAGTTAAATATAAATCTGATGGTAGTGTGGA
ACGATTTAGAGCCAGACTTGTTGCGAAGGGCTTCACTCAGTCATACGGTATAGATTACCAAGAAACTTTTGCCCCAGTTGCAAAATTAAATACTGTGCGTGTTTTGTTGT
CTCTAGCTGTGAACCAGGATTGGCCTCTTCACCAACTGGATGTTAAAAATGCATTCTTAAATGGTGATCTCGAAGAAGAAGTTTATATGAAAATTCCCCCTGGAATGGAA
GACAAGTCCAATAATAACTTGGTGTGTAAGTTGAGAAAGTCTCTATATGGATTGAAACAATCTCCACGTGCTTGGTTTGATAGATTTGCTAAAACACTGATTAAAAGCGG
CTATTATCAATGTCAAGCTGATCATACTTTGTTTGTGAAATCCTCGAATAACAAAACTGCAATTTTGATTGTATATGTGGATGACATCATCATTACAGGGAATGATGTAG
ATGAGATCCTCAACCTCAAAAGGATGCTTGCAACTGAGTTTGAAATCAAAGATCTGGGAAGCTTGAGATATTTTCTAGGCATGGAAGTGGCACGATCCAATAATGGGATT
ATAATTTCTCAGAGAAATTATATCCTAGATTTATTAAGACCAGACATTGCATACTCTGTAAGTATTGGTAGCCAGTATATGAATAATCCAAATGAACGCCATCTTGGAGC
TGTGAACAGAATTTTGAAATACTTGAAAGGCACCCCAGGTCATGGTCTATTATTCAAAAAATCTTCAAGCAGAATGGTAGAACTTTATACTGATGCTAGTTGGGCTGGAG
AATTAACTGATCGAAAATGTACGACTGGCTATTGCTCCTATGTTTGGGGTAACTTGGTCACTTGGAGAAGTAAAAAACAAGCTGTTGTTGCAAGAAGCAGTGCAGAAGCT
GAATACAGAGCTCTTGCTTTAGGAATATGTGAAGGAATATGGATTCAAAAACTTTTGACAGAACTGAAAATTGACACTCAGAACACAGTTAAAATGTTTTGTGATAGTCA
ATCAGCCATCAGTATTGCAAAGAATCCTATTCATCATGACAGAACAAAGCACATAGAGATAGATCGACATTTCATATTTGAGAAGGTCAACAATAGAATAGTGGAACTAA
ACTATGTTCCATCTCGTAATCAAGTTGCTGACATTCTCACCAAAGCATTACCAAGGGCAAGCTTTGAAGAATTCAACTCCAAGCTTGGCTTAATTTCCCTTCTAGAGAAA
GCATACTGGGTGATGCATGGTGATGATGTTGACATAAAAAGAAATTACCTGGACGTTTACCGCTTTGAATCATGGGGAGGTTCAACTATTCCAACTTACGTTTCTGGGCA
ACAGTTTGTTCCAACAACATTGACTCTTGATTCAGGAGTCACGAGACCACCCCCCCTTCTGAGTGAAGCTGATTTGCTTTCGTGTATGGACAAGGAGGGAATTGGTACAG
ATGCAACAATGCATGACCACATCAAGAAGCTGTTAGATAGATTTTATGCAACAAAGGATGCAAACACTCGCTTCACACCAACTAACCTAGGTGAGGCTCTGGTAATGGGA
TATGACGACATGGGGTATAAACTGTGGAAACCACATCTTAGAGCTGTCATGGAGCATGAGATGAAATCAGTTAGCGAGGGCACTAAGAGAAAAGATGAATTTTTGGCTAC
ATCCTTGCAGCAGATGAAAACTTGCTTCTTAGATGATGCATACGAGATAGATAAAGCAGTCTTCCAACAAGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTATTGTTTATACAAGAAGAAAAGAGGTTACCTTGCAAAATCACGAATGTCAACCACCTGAACCGGATTCATCATCGCTGAATTCCACTGATGAGAACATCACATC
TATGGCACCTGTGATCGATGATCTAACCCTACCAATTGCTAAGAGAAAAGGTGTTAGAAGTTGCACTAACCATCCCATTGAAAAATATGTTGCATATGGGAAATTGCTTC
CTGATTATAAAGCATTTATTAGTCAACTGGATAGTATTCATATTCCACAAACCATTCATGATGCACTCAAAGATCCGAAATGGAAAAAGACAGTGGATGAAGAAATCAAA
GCTCTTGAAAGCAATGGAACGTGGACTCTTACTCAACTTCCTCATGGAAAGAATCCAGTTGGTTGTAAATGGATTTTCACAGTTAAATATAAATCTGATGGTAGTGTGGA
ACGATTTAGAGCCAGACTTGTTGCGAAGGGCTTCACTCAGTCATACGGTATAGATTACCAAGAAACTTTTGCCCCAGTTGCAAAATTAAATACTGTGCGTGTTTTGTTGT
CTCTAGCTGTGAACCAGGATTGGCCTCTTCACCAACTGGATGTTAAAAATGCATTCTTAAATGGTGATCTCGAAGAAGAAGTTTATATGAAAATTCCCCCTGGAATGGAA
GACAAGTCCAATAATAACTTGGTGTGTAAGTTGAGAAAGTCTCTATATGGATTGAAACAATCTCCACGTGCTTGGTTTGATAGATTTGCTAAAACACTGATTAAAAGCGG
CTATTATCAATGTCAAGCTGATCATACTTTGTTTGTGAAATCCTCGAATAACAAAACTGCAATTTTGATTGTATATGTGGATGACATCATCATTACAGGGAATGATGTAG
ATGAGATCCTCAACCTCAAAAGGATGCTTGCAACTGAGTTTGAAATCAAAGATCTGGGAAGCTTGAGATATTTTCTAGGCATGGAAGTGGCACGATCCAATAATGGGATT
ATAATTTCTCAGAGAAATTATATCCTAGATTTATTAAGACCAGACATTGCATACTCTGTAAGTATTGGTAGCCAGTATATGAATAATCCAAATGAACGCCATCTTGGAGC
TGTGAACAGAATTTTGAAATACTTGAAAGGCACCCCAGGTCATGGTCTATTATTCAAAAAATCTTCAAGCAGAATGGTAGAACTTTATACTGATGCTAGTTGGGCTGGAG
AATTAACTGATCGAAAATGTACGACTGGCTATTGCTCCTATGTTTGGGGTAACTTGGTCACTTGGAGAAGTAAAAAACAAGCTGTTGTTGCAAGAAGCAGTGCAGAAGCT
GAATACAGAGCTCTTGCTTTAGGAATATGTGAAGGAATATGGATTCAAAAACTTTTGACAGAACTGAAAATTGACACTCAGAACACAGTTAAAATGTTTTGTGATAGTCA
ATCAGCCATCAGTATTGCAAAGAATCCTATTCATCATGACAGAACAAAGCACATAGAGATAGATCGACATTTCATATTTGAGAAGGTCAACAATAGAATAGTGGAACTAA
ACTATGTTCCATCTCGTAATCAAGTTGCTGACATTCTCACCAAAGCATTACCAAGGGCAAGCTTTGAAGAATTCAACTCCAAGCTTGGCTTAATTTCCCTTCTAGAGAAA
GCATACTGGGTGATGCATGGTGATGATGTTGACATAAAAAGAAATTACCTGGACGTTTACCGCTTTGAATCATGGGGAGGTTCAACTATTCCAACTTACGTTTCTGGGCA
ACAGTTTGTTCCAACAACATTGACTCTTGATTCAGGAGTCACGAGACCACCCCCCCTTCTGAGTGAAGCTGATTTGCTTTCGTGTATGGACAAGGAGGGAATTGGTACAG
ATGCAACAATGCATGACCACATCAAGAAGCTGTTAGATAGATTTTATGCAACAAAGGATGCAAACACTCGCTTCACACCAACTAACCTAGGTGAGGCTCTGGTAATGGGA
TATGACGACATGGGGTATAAACTGTGGAAACCACATCTTAGAGCTGTCATGGAGCATGAGATGAAATCAGTTAGCGAGGGCACTAAGAGAAAAGATGAATTTTTGGCTAC
ATCCTTGCAGCAGATGAAAACTTGCTTCTTAGATGATGCATACGAGATAGATAAAGCAGTCTTCCAACAAGTCTGA
Protein sequenceShow/hide protein sequence
MPIVYTRRKEVTLQNHECQPPEPDSSSLNSTDENITSMAPVIDDLTLPIAKRKGVRSCTNHPIEKYVAYGKLLPDYKAFISQLDSIHIPQTIHDALKDPKWKKTVDEEIK
ALESNGTWTLTQLPHGKNPVGCKWIFTVKYKSDGSVERFRARLVAKGFTQSYGIDYQETFAPVAKLNTVRVLLSLAVNQDWPLHQLDVKNAFLNGDLEEEVYMKIPPGME
DKSNNNLVCKLRKSLYGLKQSPRAWFDRFAKTLIKSGYYQCQADHTLFVKSSNNKTAILIVYVDDIIITGNDVDEILNLKRMLATEFEIKDLGSLRYFLGMEVARSNNGI
IISQRNYILDLLRPDIAYSVSIGSQYMNNPNERHLGAVNRILKYLKGTPGHGLLFKKSSSRMVELYTDASWAGELTDRKCTTGYCSYVWGNLVTWRSKKQAVVARSSAEA
EYRALALGICEGIWIQKLLTELKIDTQNTVKMFCDSQSAISIAKNPIHHDRTKHIEIDRHFIFEKVNNRIVELNYVPSRNQVADILTKALPRASFEEFNSKLGLISLLEK
AYWVMHGDDVDIKRNYLDVYRFESWGGSTIPTYVSGQQFVPTTLTLDSGVTRPPPLLSEADLLSCMDKEGIGTDATMHDHIKKLLDRFYATKDANTRFTPTNLGEALVMG
YDDMGYKLWKPHLRAVMEHEMKSVSEGTKRKDEFLATSLQQMKTCFLDDAYEIDKAVFQQV