; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001616 (gene) of Snake gourd v1 genome

Gene IDTan0001616
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:12793843..12797625
RNA-Seq ExpressionTan0001616
SyntenyTan0001616
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-11135.49Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL E+CPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL+MMVHFN+A  N A IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------
         + KK   KK G+G + + AAA+  KK K  A KG CF CN++ HWKR C KY+A+KKK                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------DKPDEVKPIGCKWIYKRKRGVDE--------------------------------KSIRILLVIAAYYDYEVWQ
                                  D+PD VKPIGCKWIYKRKRG D                                 KSIRILL IAAY+DYE+WQ
Subjt:  --------------------------DKPDEVKPIGCKWIYKRKRGVDE--------------------------------KSIRILLVIAAYYDYEVWQ

Query:  MDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGY
        MDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSIYG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I+N SVAFL LYVDDILLIGND+G 
Subjt:  MDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGY

Query:  LTDIKEWLATQFQMKDLGDA
        LTDIK+WLATQFQMKDLG+A
Subjt:  LTDIKEWLATQFQMKDLGDA

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-11036.79Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL+MMVHFN+AE NGA IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------
         + KK   KK G+G + + AAA+  KK K  A KG CFHCN++ HWKR C KY+AEKKK                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------EDKPDEVKP----------------------------------
                                                                 E+ P +++P                                  
Subjt:  ---------------------------------------------------------EDKPDEVKP----------------------------------

Query:  --------------------------------------------IGC-----------------------------------------------------
                                                    +G                                                      
Subjt:  --------------------------------------------IGC-----------------------------------------------------

Query:  ----KWI---------------YKRKRGVDE----------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSI
            +WI               Y +  GVD           KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSI
Subjt:  ----KWI---------------YKRKRGVDE----------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSI

Query:  YGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
        YG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I+N SVAFL LYVDDILLIGND+G LTDIK+WLATQFQMKDLG+A
Subjt:  YGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]8.3e-12143.32Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA + VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL++MVHFN+AE NGA IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKK-----NMKKVGKGKQTDKAAAQKG----------------------------------------------------------------------
         + KKK     N   +   K T KA A KG                                                                      
Subjt:  LRGKKK-----NMKKVGKGKQTDKAAAQKG----------------------------------------------------------------------

Query:  ------------KKVKDFADKGKCF----HCNEDEHWKRKCLKYIAEKKKE-------------------------------------------------
                    K  K F      F    H  E +   +  L  ++++  +                                                 
Subjt:  ------------KKVKDFADKGKCF----HCNEDEHWKRKCLKYIAEKKKE-------------------------------------------------

Query:  --------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE---------------------------
                                                          D+PD VKPIGCKWIYKRKRG D                            
Subjt:  --------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE---------------------------

Query:  -----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKI
             KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSIYG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I
Subjt:  -----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKI

Query:  LNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
        +N  VAFL LYVDDILLIGND+G LTDIK+WLATQFQMKDLG+A
Subjt:  LNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

KAA0066490.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-12038.65Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE ++TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL+MMVHF++AE NGA IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------
         + KK   KK G+G + + AAA+  KK K  A KG CFHCN++ HWKR C KY+AEKKK                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE-------------
                                                                        D+PD VKPIGCKWIYKRKRG D              
Subjt:  ----------------------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE-------------

Query:  -------------------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYG
                           KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSIYG+KQASRSWNIRFD AI+SYG
Subjt:  -------------------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYG

Query:  VDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
         D   DEPCVYK+I+N SVAFL LYVDDILLI ND+G LTDIK+WLATQFQMKDLG+A
Subjt:  VDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

KAA0067084.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-12744.32Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM +     +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL MMVHFN+ + NGA IDE+S V+FILE+LP+S LQFRSNAVMN + + LT+LL+ELQTF+SLMKI+G KGE NVA  +R +HKGSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------
         + KK   KK G+G + +  AA+  KK K  A KG CFHCN++ HWKR C KY+AEK K                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EDKPDEVKPIGCKWIYKRKRGVDE----------------------------
                                                         ++PDE K IGCKWIYKRKRG D                             
Subjt:  ------------------------------------------------EDKPDEVKPIGCKWIYKRKRGVDE----------------------------

Query:  ----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKIL
            KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+ GQEQKVC+L RSIYG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I+
Subjt:  ----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKIL

Query:  NSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
        N+S+AFL LYVDDILLIGND G LTDIK+WL TQFQMK LG+A
Subjt:  NSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

TrEMBL top hitse value%identityAlignment
A0A5A7SNP8 Gag/pol protein1.7e-11135.49Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL E+CPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL+MMVHFN+A  N A IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------
         + KK   KK G+G + + AAA+  KK K  A KG CF CN++ HWKR C KY+A+KKK                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------DKPDEVKPIGCKWIYKRKRGVDE--------------------------------KSIRILLVIAAYYDYEVWQ
                                  D+PD VKPIGCKWIYKRKRG D                                 KSIRILL IAAY+DYE+WQ
Subjt:  --------------------------DKPDEVKPIGCKWIYKRKRGVDE--------------------------------KSIRILLVIAAYYDYEVWQ

Query:  MDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGY
        MDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSIYG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I+N SVAFL LYVDDILLIGND+G 
Subjt:  MDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGY

Query:  LTDIKEWLATQFQMKDLGDA
        LTDIK+WLATQFQMKDLG+A
Subjt:  LTDIKEWLATQFQMKDLGDA

A0A5A7U869 Gag/pol protein4.9e-11136.79Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL+MMVHFN+AE NGA IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------
         + KK   KK G+G + + AAA+  KK K  A KG CFHCN++ HWKR C KY+AEKKK                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------EDKPDEVKP----------------------------------
                                                                 E+ P +++P                                  
Subjt:  ---------------------------------------------------------EDKPDEVKP----------------------------------

Query:  --------------------------------------------IGC-----------------------------------------------------
                                                    +G                                                      
Subjt:  --------------------------------------------IGC-----------------------------------------------------

Query:  ----KWI---------------YKRKRGVDE----------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSI
            +WI               Y +  GVD           KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSI
Subjt:  ----KWI---------------YKRKRGVDE----------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSI

Query:  YGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
        YG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I+N SVAFL LYVDDILLIGND+G LTDIK+WLATQFQMKDLG+A
Subjt:  YGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

A0A5A7V6N0 Gag/pol protein4.0e-12143.32Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA + VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL++MVHFN+AE NGA IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKK-----NMKKVGKGKQTDKAAAQKG----------------------------------------------------------------------
         + KKK     N   +   K T KA A KG                                                                      
Subjt:  LRGKKK-----NMKKVGKGKQTDKAAAQKG----------------------------------------------------------------------

Query:  ------------KKVKDFADKGKCF----HCNEDEHWKRKCLKYIAEKKKE-------------------------------------------------
                    K  K F      F    H  E +   +  L  ++++  +                                                 
Subjt:  ------------KKVKDFADKGKCF----HCNEDEHWKRKCLKYIAEKKKE-------------------------------------------------

Query:  --------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE---------------------------
                                                          D+PD VKPIGCKWIYKRKRG D                            
Subjt:  --------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE---------------------------

Query:  -----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKI
             KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSIYG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I
Subjt:  -----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKI

Query:  LNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
        +N  VAFL LYVDDILLIGND+G LTDIK+WLATQFQMKDLG+A
Subjt:  LNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

A0A5A7VH46 Gag/pol protein2.6e-12038.65Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE ++TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM       +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL+MMVHF++AE NGA IDE+S V+FILE+LP+SFLQFRSNAVMN + + LT+LLNELQTF+SLMKI+G KGEANVA  +R +H+GSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------
         + KK   KK G+G + + AAA+  KK K  A KG CFHCN++ HWKR C KY+AEKKK                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKE----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE-------------
                                                                        D+PD VKPIGCKWIYKRKRG D              
Subjt:  ----------------------------------------------------------------DKPDEVKPIGCKWIYKRKRGVDE-------------

Query:  -------------------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYG
                           KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+PGQEQK+C+L RSIYG+KQASRSWNIRFD AI+SYG
Subjt:  -------------------KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYG

Query:  VDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
         D   DEPCVYK+I+N SVAFL LYVDDILLI ND+G LTDIK+WLATQFQMKDLG+A
Subjt:  VDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

A0A5A7VI97 Gag/pol protein9.9e-12844.32Show/hide
Query:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH
        +DDL+FVL EECPQV  +NA R VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM +     +H
Subjt:  VDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQARHNALKYIFNSRMMR-VHHWDH

Query:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN
        VL MMVHFN+ + NGA IDE+S V+FILE+LP+S LQFRSNAVMN + + LT+LL+ELQTF+SLMKI+G KGE NVA  +R +HKGSTSGTK++ SSS N
Subjt:  VLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVA--SRSYHKGSTSGTKTVASSSSN

Query:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------
         + KK   KK G+G + +  AA+  KK K  A KG CFHCN++ HWKR C KY+AEK K                                         
Subjt:  LRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKK-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EDKPDEVKPIGCKWIYKRKRGVDE----------------------------
                                                         ++PDE K IGCKWIYKRKRG D                             
Subjt:  ------------------------------------------------EDKPDEVKPIGCKWIYKRKRGVDE----------------------------

Query:  ----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKIL
            KSIRILL IAAY+DYE+WQMDVKT+FLN NLEETIYM QP+GFI+ GQEQKVC+L RSIYG+KQASRSWNIRFD AI+SYG D   DEPCVYK+I+
Subjt:  ----KSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKIL

Query:  NSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
        N+S+AFL LYVDDILLIGND G LTDIK+WL TQFQMK LG+A
Subjt:  NSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.5e-1935.71Show/hide
Query:  SIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVY---KKILNS
        S R +L +   Y+ +V QMDVKT+FLN  L+E IYM  P+G  +      VC+L ++IYG+KQA+R W   F++A++     ++  + C+Y   K  +N 
Subjt:  SIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVY---KKILNS

Query:  SVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGD
        ++ ++ LYVDD+++   D+  + + K +L  +F+M DL +
Subjt:  SVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-3033.89Show/hide
Query:  TDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKEDKPDEVKPIGCKWIYK----------------------RKRGVD----------EK
        +D    +  K+V    +K +     ++E    + L+     K  + P   +P+ CKW++K                      +K+G+D            
Subjt:  TDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKEDKPDEVKPIGCKWIYK----------------------RKRGVD----------EK

Query:  SIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVY-KKILNSSV
        SIR +L +AA  D EV Q+DVKT+FL+ +LEE IYM+QP+GF + G++  VC+L +S+YG+KQA R W ++FD  ++S        +PCVY K+   ++ 
Subjt:  SIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVY-KKILNSSV

Query:  AFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA
          L LYVDD+L++G D G +  +K  L+  F MKDLG A
Subjt:  AFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDA

P25600 Putative transposon Ty5-1 protein YCL074W6.0e-1332.2Show/hide
Query:  MDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGY
        MDV T+FLN  ++E IY+ QP GF+       V  L   +YG+KQA   WN   +  ++  G   ++ E  +Y +  +    ++ +YVDD+L+       
Subjt:  MDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGY

Query:  LTDIKEWLATQFQMKDLG
           +K+ L   + MKDLG
Subjt:  LTDIKEWLATQFQMKDLG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-1928.19Show/hide
Query:  PDEVKPIGCKWIYKRKRGVD--------------------------------EKSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPG
        P  V  +GC+WI+ +K   D                                  SIRI+L +A    + + Q+DV  +FL   L + +YM QP GFI   
Subjt:  PDEVKPIGCKWIYKRKRGVD--------------------------------EKSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPG

Query:  QEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKD
        +   VC+L++++YG+KQA R+W +     + + G  ++  +  ++      S+ ++ +YVDDIL+ GND   L +  + L+ +F +KD
Subjt:  QEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-1928.27Show/hide
Query:  PDEVKPIGCKWIYKRK----------------RGVDEK----------------SIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPG
        P  V  +GC+WI+ +K                +G +++                SIRI+L +A    + + Q+DV  +FL   L + +YM QP GF+   
Subjt:  PDEVKPIGCKWIYKRK----------------RGVDEK----------------SIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFILPG

Query:  QEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGD
        +   VCRL+++IYG+KQA R+W +     + + G  ++  +  ++      S+ ++ +YVDDIL+ GND   L    + L+ +F +K+  D
Subjt:  QEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.0e-2331.12Show/hide
Query:  PDEVKPIGCKWIYK----------------------RKRGVD----------EKSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFI---
        P   KPIGCKW+YK                      ++ G+D            S++++L I+A Y++ + Q+D+  +FLN +L+E IYM  P G+    
Subjt:  PDEVKPIGCKWIYK----------------------RKRGVD----------EKSIRILLVIAAYYDYEVWQMDVKTSFLNDNLEETIYMDQPKGFI---

Query:  ---LPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLG
           LP     VC LK+SIYG+KQASR W ++F   +  +G   +  +   + KI  +    + +YVDDI++  N+   + ++K  L + F+++DLG
Subjt:  ---LPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWLATQFQMKDLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATCTCCAGGAAGTTCACACCGTGAGACTTACTTGGACTCGTGGCGCAAACGGGAGCGTCTCCCTTCGGATGGTGTTTGTATGAGTCAATATCAAGTGGATGATCT
GAAGTTTGTGCTAACTGAGGAGTGTCCTCAGGTGCTAGGGTCGAATGCGTTACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCT
ACATGTTGGCAAGTATGTCTGACATATTAGCCAAGAAGCATGAGGGCATGATTACCGCCAAGGAAATTATGGATTTTGTGCAGGGTATGTTTGGACAACAGTCCACACAA
GCCCGACATAATGCCCTAAAGTACATATTCAACTCGAGGATGATGCGGGTACATCATTGGGATCATGTCCTGGATATGATGGTGCACTTTAACATCGCAGAGTCGAATGG
TGCTTCCATCGATGAGTCGAGTCATGTCAACTTCATTCTGGAAACCCTTCCAGATAGTTTCCTGCAGTTTAGAAGTAATGCTGTTATGAATAATCTTATTTTTAATCTTA
CCTCCCTTCTGAATGAACTCCAGACCTTTCAATCTTTGATGAAAATTCAAGGATTGAAAGGTGAGGCAAATGTTGCCAGTAGGAGTTATCACAAAGGTTCGACCTCTGGG
ACAAAAACAGTGGCTTCATCTTCTTCTAACCTGAGAGGAAAGAAGAAGAACATGAAAAAAGTTGGTAAAGGGAAACAGACTGACAAAGCTGCCGCCCAGAAAGGCAAGAA
AGTCAAAGACTTTGCTGACAAAGGAAAGTGTTTCCACTGCAACGAAGACGAGCATTGGAAACGGAAATGTCTGAAGTACATTGCAGAAAAGAAGAAGGAAGATAAGCCTG
ATGAGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAACGTGGTGTAGATGAGAAGTCTATCCGTATCCTTCTTGTCATTGCCGCATATTATGACTATGAGGTA
TGGCAAATGGATGTCAAGACATCCTTTTTGAATGACAATCTTGAGGAAACCATCTACATGGACCAACCCAAAGGGTTCATTCTACCAGGACAAGAGCAAAAGGTTTGCAG
GCTTAAAAGGTCAATTTATGGAGTGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTGATGAGGCAATTAGATCTTATGGTGTTGACCACAATGATGATGAACCTTGTG
TCTACAAAAAAATCCTCAACAGTTCTGTTGCATTCCTATTTCTCTATGTGGATGATATCTTACTCATTGGGAATGATGTGGGTTATCTTACTGACATCAAGGAGTGGCTA
GCTACGCAGTTCCAAATGAAAGATTTGGGTGATGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTATCTCCAGGAAGTTCACACCGTGAGACTTACTTGGACTCGTGGCGCAAACGGGAGCGTCTCCCTTCGGATGGTGTTTGTATGAGTCAATATCAAGTGGATGATCT
GAAGTTTGTGCTAACTGAGGAGTGTCCTCAGGTGCTAGGGTCGAATGCGTTACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCT
ACATGTTGGCAAGTATGTCTGACATATTAGCCAAGAAGCATGAGGGCATGATTACCGCCAAGGAAATTATGGATTTTGTGCAGGGTATGTTTGGACAACAGTCCACACAA
GCCCGACATAATGCCCTAAAGTACATATTCAACTCGAGGATGATGCGGGTACATCATTGGGATCATGTCCTGGATATGATGGTGCACTTTAACATCGCAGAGTCGAATGG
TGCTTCCATCGATGAGTCGAGTCATGTCAACTTCATTCTGGAAACCCTTCCAGATAGTTTCCTGCAGTTTAGAAGTAATGCTGTTATGAATAATCTTATTTTTAATCTTA
CCTCCCTTCTGAATGAACTCCAGACCTTTCAATCTTTGATGAAAATTCAAGGATTGAAAGGTGAGGCAAATGTTGCCAGTAGGAGTTATCACAAAGGTTCGACCTCTGGG
ACAAAAACAGTGGCTTCATCTTCTTCTAACCTGAGAGGAAAGAAGAAGAACATGAAAAAAGTTGGTAAAGGGAAACAGACTGACAAAGCTGCCGCCCAGAAAGGCAAGAA
AGTCAAAGACTTTGCTGACAAAGGAAAGTGTTTCCACTGCAACGAAGACGAGCATTGGAAACGGAAATGTCTGAAGTACATTGCAGAAAAGAAGAAGGAAGATAAGCCTG
ATGAGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAACGTGGTGTAGATGAGAAGTCTATCCGTATCCTTCTTGTCATTGCCGCATATTATGACTATGAGGTA
TGGCAAATGGATGTCAAGACATCCTTTTTGAATGACAATCTTGAGGAAACCATCTACATGGACCAACCCAAAGGGTTCATTCTACCAGGACAAGAGCAAAAGGTTTGCAG
GCTTAAAAGGTCAATTTATGGAGTGAAACAAGCCTCTAGGTCCTGGAATATAAGATTTGATGAGGCAATTAGATCTTATGGTGTTGACCACAATGATGATGAACCTTGTG
TCTACAAAAAAATCCTCAACAGTTCTGTTGCATTCCTATTTCTCTATGTGGATGATATCTTACTCATTGGGAATGATGTGGGTTATCTTACTGACATCAAGGAGTGGCTA
GCTACGCAGTTCCAAATGAAAGATTTGGGTGATGCGTAA
Protein sequenceShow/hide protein sequence
MLSPGSSHRETYLDSWRKRERLPSDGVCMSQYQVDDLKFVLTEECPQVLGSNALRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMITAKEIMDFVQGMFGQQSTQ
ARHNALKYIFNSRMMRVHHWDHVLDMMVHFNIAESNGASIDESSHVNFILETLPDSFLQFRSNAVMNNLIFNLTSLLNELQTFQSLMKIQGLKGEANVASRSYHKGSTSG
TKTVASSSSNLRGKKKNMKKVGKGKQTDKAAAQKGKKVKDFADKGKCFHCNEDEHWKRKCLKYIAEKKKEDKPDEVKPIGCKWIYKRKRGVDEKSIRILLVIAAYYDYEV
WQMDVKTSFLNDNLEETIYMDQPKGFILPGQEQKVCRLKRSIYGVKQASRSWNIRFDEAIRSYGVDHNDDEPCVYKKILNSSVAFLFLYVDDILLIGNDVGYLTDIKEWL
ATQFQMKDLGDA