; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0003172 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0003172
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrotransposon gag protein
Genome locationchr06:12836497..12843060
RNA-Seq ExpressionPay0003172
SyntenyPay0003172
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025369.1 uncharacterized protein E6C27_scaffold1204G00530 [Cucumis melo var. makuwa]1.0e-15562.69Show/hide
Query:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI
        MTRSSNL+FSYFEDLNR VRRIRRERREENSI NLSNQEPLRGLE +LDSPLD NLGRGN+GEV+EKTLRELAEPDEDQRPL IVIP TTQPFELKS LI
Subjt:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI

Query:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES
        HLL IFKGS GED +KHLKDFHMVC SM+ H ISEEQLNLR FPF LTD                      +FL+KFFPASRANNIRKEIY IRQ FGES
Subjt:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES

Query:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL
        LSEY ER KEL ASFPHHHISDPSLIQY Y  LLSSDRNTVD  AGGALADK   E RELIS+M ENSQSF NRASELDNSLTKEVS             
Subjt:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL

Query:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPHAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYA
              TPLKVTKCGVCGLVGHPNDKCPEVIEDVNIV++YDP                              MKQQ+TQLTTAISKM+GKGKL AQ ++A
Subjt:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPHAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYA

Query:  NVSAISLRS---------------------------------------------------------------------------EILEMFRKVQINLPLL
        NVSAISLRS                                                                           E+LEM RKVQINLPLL
Subjt:  NVSAISLRS---------------------------------------------------------------------------EILEMFRKVQINLPLL

Query:  DAIQQVPRYAKFLKELCTNKRKTKERPM
        DAIQQVPR AKFLKELCTNKRKTKERPM
Subjt:  DAIQQVPRYAKFLKELCTNKRKTKERPM

KAA0025369.1 uncharacterized protein E6C27_scaffold1204G00530 [Cucumis melo var. makuwa]6.3e-1293.18Show/hide
Query:  PKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE
        P KL IELKTLPPH+KYIFLGKKNTFPVIISRELNQKQEERLIE
Subjt:  PKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE

KAA0025369.1 uncharacterized protein E6C27_scaffold1204G00530 [Cucumis melo var. makuwa]1.5e-14655.23Show/hide
Query:  NTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVR
        NTVDA AGGALADK PT ARELIS+MAENSQSF N+ASELDNSL KEVSELKSQMLNMTTLLTSFV GTPLKVTKCGVCGLVGH NDKCPEVIEDVNIVR
Subjt:  NTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVR

Query:  KYDP-------------------------HAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS----
        +YDP                         HAPSTSS QGTNLEDIIKALATNTLSFQQEMKQQMTQLTT  SKMDGKGKLSAQ  +ANVS ISLRS    
Subjt:  KYDP-------------------------HAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS----

Query:  -----------------------------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFL
                                                                               E+LEMFRKVQINLPLLDAIQQVP YAKFL
Subjt:  -----------------------------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFL

Query:  KELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYI
        KELCTNKRKTKERPMV+QNVSALLKSNIPEKCN                                                                   
Subjt:  KELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYI

Query:  LEMNEACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALE
                                      D GSL VEFDGDIVTFNIFESMRYSDECLSLCS ELHDEVDELSIH EF EQEIVENRELLEPNV YALE
Subjt:  LEMNEACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALE

Query:  KNNCDNLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNP
        KNN DNLFFSP+KL IELKTLPPHL YIFLGKKN FPVIISRELNQKQEERLI+                K I    C++ +++  + +  I+P  RLNP
Subjt:  KNNCDNLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNP

Query:  RL-ELAKPELTK
         L E+   E+ K
Subjt:  RL-ELAKPELTK

KAA0031967.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.1e-18667.45Show/hide
Query:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI
        MTRSSNLEF YFEDLN+ VRRIRRERREEN+IPNLSNQEPLRGLE +LDSPLD NLGRGNMGEV+EKT+R+L E DEDQRPL IVI  TTQPFELK RLI
Subjt:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI

Query:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES
        HLLPIFKG+SGEDP+KHLKDFHMVCDSM+ HGISEEQLNLR FPFSLTD                      +FL+KFFPASR NNIRKEIY IRQ FGES
Subjt:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES

Query:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL
        L +Y E+FKELCA+FPHHHI  PSLIQY YF LLSSDRNTVDA AGGALA+K PTEARELIS+MA+NSQ F NRASEL+NSLTKEVSELKSQMLNMTTLL
Subjt:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL

Query:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQ
        TSFVQGTPLKVTKC VCGLVGHPNDKCPEVIE++NIV+KYDP+                         AP+TSSNQ TNL+DIIKALATNTLSFQQEMKQ
Subjt:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQ

Query:  QMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS------------------------------------------------------------------
        QMTQLTT ISKMDGKGKL AQ N+ANVS ISLRS                                                                  
Subjt:  QMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS------------------------------------------------------------------

Query:  ---------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCTNKRKTKERPM
                 E+LEMFRKVQINLPLLDAIQQVPRYAKFLKELCTNKRKTKER M
Subjt:  ---------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCTNKRKTKERPM

KAA0060435.1 reverse transcriptase [Cucumis melo var. makuwa]1.6e-20456.5Show/hide
Query:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI
        MTRSSNLEFSYFEDLNR V RIRRE REENSIP+LSNQEPLRGLE +LDSPL  NLGRGNMGEV+EKTLREL +PDEDQRPL I+IPPTTQPFELKS LI
Subjt:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI

Query:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTDQFLK-KFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHIS
        HLLPIFKGSSGEDP+KHLKDFHMVCDSM+ HGIS+E           T +F    FFP       RKE+                               
Subjt:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTDQFLK-KFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHIS

Query:  DPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVG
                   LL   RNTVDA  GGALADK PTEARELIS+MAENSQSF NRASE DNSLTKE                            CGVCGLVG
Subjt:  DPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVG

Query:  HPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQ
        HPNDKCPEVI DVNIVR+YDPH                         AP TSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTT ISKMDGKGKL AQ
Subjt:  HPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQ

Query:  SNYANVSAISLRS-----------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCT
         ++ANVSAI LRS                                                     E+ EMFRKVQINLPLLDAIQQVPRYAKFLKELCT
Subjt:  SNYANVSAISLRS-----------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCT

Query:  NKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYILEMNE
        NKRKTKER M+SQNVSALLKSNIPEKCN                                                                        
Subjt:  NKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYILEMNE

Query:  ACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALEKNNCD
                   GRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRY DECLSLCSLELHD VDE SIH EF EQE+VENR+LLE NVDYALEKNNCD
Subjt:  ACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALEKNNCD

Query:  NLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNPRLELA
        NLFFSP+KL IELKTLPP+LKYIFLGKKNTFPVIIS+ELNQKQEERLIE                K I    C++ +++    +  I+P  RLNP L+ A
Subjt:  NLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNPRLELA

KAA0061315.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-16749.03Show/hide
Query:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI
        MT SSNLEFSYFEDLNR VRRIRRER+EENSIPNLSNQEPLRGLE +LDSPLD NL R NMGEV+EKTL+ELAEP+EDQRPLY                 
Subjt:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI

Query:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTDQFLKKFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHISD
                                                                                                            
Subjt:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTDQFLKKFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHISD

Query:  PSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGH
                            A AGGALADK PT ARELIS+MAENSQSF N+ASELDNSL KEVSELKSQMLNMTTLLTSFV GTPLKVTKCGVCGLVGH
Subjt:  PSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGH

Query:  PNDKCPEVIEDVNIVRKYDP-------------------------HAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQS
         NDKCPEVIEDVNIVR+YDP                         HAPSTSS QGTNLEDIIKALATNTLSFQQEMKQQMTQLTT  SKMDGKGKLSAQ 
Subjt:  PNDKCPEVIEDVNIVRKYDP-------------------------HAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQS

Query:  NYANVSAISLRS---------------------------------------------------------------------------EILEMFRKVQINL
         +ANVS ISLRS                                                                           E+LEMFRKVQINL
Subjt:  NYANVSAISLRS---------------------------------------------------------------------------EILEMFRKVQINL

Query:  PLLDAIQQVPRYAKFLKELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVED
        PLLDAIQQVP YAKFLKELCTNKRKTKERPMV+QNVSALLKSNIPEKCN                                                   
Subjt:  PLLDAIQQVPRYAKFLKELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVED

Query:  VLVKIDKLIFPVDFYILEMNEACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEI
                                                      D GSL VEFDGDIVTFNIFESMRYSDECLSLCS ELHDEVDELSIH EF EQEI
Subjt:  VLVKIDKLIFPVDFYILEMNEACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEI

Query:  VENRELLEPNVDYALEKNNCDNLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVV
        VENRELLEPNV YALEKNN DNLFFSP+KL IELKTLPPHL YIFLGKKN FPVIISRELNQKQEERLI+                K I    C++ +++
Subjt:  VENRELLEPNVDYALEKNNCDNLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVV

Query:  TSKERLIIEP-NRLNPRL-ELAKPELTK
          + +  I+P  RLNP L E+   E+ K
Subjt:  TSKERLIIEP-NRLNPRL-ELAKPELTK

TrEMBL top hitse value%identityAlignment
A0A5A7SRF5 Retrotransposon gag protein5.5e-18767.45Show/hide
Query:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI
        MTRSSNLEF YFEDLN+ VRRIRRERREEN+IPNLSNQEPLRGLE +LDSPLD NLGRGNMGEV+EKT+R+L E DEDQRPL IVI  TTQPFELK RLI
Subjt:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI

Query:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES
        HLLPIFKG+SGEDP+KHLKDFHMVCDSM+ HGISEEQLNLR FPFSLTD                      +FL+KFFPASR NNIRKEIY IRQ FGES
Subjt:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES

Query:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL
        L +Y E+FKELCA+FPHHHI  PSLIQY YF LLSSDRNTVDA AGGALA+K PTEARELIS+MA+NSQ F NRASEL+NSLTKEVSELKSQMLNMTTLL
Subjt:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL

Query:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQ
        TSFVQGTPLKVTKC VCGLVGHPNDKCPEVIE++NIV+KYDP+                         AP+TSSNQ TNL+DIIKALATNTLSFQQEMKQ
Subjt:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQ

Query:  QMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS------------------------------------------------------------------
        QMTQLTT ISKMDGKGKL AQ N+ANVS ISLRS                                                                  
Subjt:  QMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS------------------------------------------------------------------

Query:  ---------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCTNKRKTKERPM
                 E+LEMFRKVQINLPLLDAIQQVPRYAKFLKELCTNKRKTKER M
Subjt:  ---------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCTNKRKTKERPM

A0A5A7USL5 Retrotrans_gag domain-containing protein2.8e-14661.21Show/hide
Query:  MGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLIHLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD-----------
        MGEV+EK LRELAEP+ED RPL IVIPPTTQPFELK  LIHLLPIFKGSSGEDP+KHLKDFHMVCDSM+ + ISEEQLNLR FPF LTD           
Subjt:  MGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLIHLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD-----------

Query:  -----------QFLKKFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEAREL
                   +FL+KFFPASRANNIRKEIY IRQ FGESLS+Y ERFKELCAS PH+HI DPSLIQY Y  LLS DRNTVDA  GGALADK PTEAR+L
Subjt:  -----------QFLKKFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEAREL

Query:  ISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPHAPSTSSNQGTNLEDIIK
        IS+M ENSQSF NRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVR+YDPH        GTNLEDIIK
Subjt:  ISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPHAPSTSSNQGTNLEDIIK

Query:  ALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYAN-------------------------------VSAISLR-------SEILEMFRKVQINL
        ALATNTLSFQQEMKQQMTQLTTAISKMDGKGKL AQ ++AN                                S ++ +        E+LEMFRKVQIN 
Subjt:  ALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYAN-------------------------------VSAISLR-------SEILEMFRKVQINL

Query:  PLLDAIQQVPRYAKFLKELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVED
        PLLDAIQQV R     +E    K   K++  +   +   +K   P  C H                  +L+ GA   + P        LN   K  ++++
Subjt:  PLLDAIQQVPRYAKFLKELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVED

Query:  VL-VKIDKLIFPV
        VL +K   +I+PV
Subjt:  VL-VKIDKLIFPV

A0A5A7V3J9 Reverse transcriptase7.6e-20556.5Show/hide
Query:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI
        MTRSSNLEFSYFEDLNR V RIRRE REENSIP+LSNQEPLRGLE +LDSPL  NLGRGNMGEV+EKTLREL +PDEDQRPL I+IPPTTQPFELKS LI
Subjt:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI

Query:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTDQFLK-KFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHIS
        HLLPIFKGSSGEDP+KHLKDFHMVCDSM+ HGIS+E           T +F    FFP       RKE+                               
Subjt:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTDQFLK-KFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHIS

Query:  DPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVG
                   LL   RNTVDA  GGALADK PTEARELIS+MAENSQSF NRASE DNSLTKE                            CGVCGLVG
Subjt:  DPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVG

Query:  HPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQ
        HPNDKCPEVI DVNIVR+YDPH                         AP TSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTT ISKMDGKGKL AQ
Subjt:  HPNDKCPEVIEDVNIVRKYDPH-------------------------APSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQ

Query:  SNYANVSAISLRS-----------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCT
         ++ANVSAI LRS                                                     E+ EMFRKVQINLPLLDAIQQVPRYAKFLKELCT
Subjt:  SNYANVSAISLRS-----------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFLKELCT

Query:  NKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYILEMNE
        NKRKTKER M+SQNVSALLKSNIPEKCN                                                                        
Subjt:  NKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYILEMNE

Query:  ACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALEKNNCD
                   GRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRY DECLSLCSLELHD VDE SIH EF EQE+VENR+LLE NVDYALEKNNCD
Subjt:  ACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALEKNNCD

Query:  NLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNPRLELA
        NLFFSP+KL IELKTLPP+LKYIFLGKKNTFPVIIS+ELNQKQEERLIE                K I    C++ +++    +  I+P  RLNP L+ A
Subjt:  NLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNPRLELA

A0A5D3DIP6 Retrotrans_gag domain-containing protein5.0e-15662.69Show/hide
Query:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI
        MTRSSNL+FSYFEDLNR VRRIRRERREENSI NLSNQEPLRGLE +LDSPLD NLGRGN+GEV+EKTLRELAEPDEDQRPL IVIP TTQPFELKS LI
Subjt:  MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLI

Query:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES
        HLL IFKGS GED +KHLKDFHMVC SM+ H ISEEQLNLR FPF LTD                      +FL+KFFPASRANNIRKEIY IRQ FGES
Subjt:  HLLPIFKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTD----------------------QFLKKFFPASRANNIRKEIYEIRQTFGES

Query:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL
        LSEY ER KEL ASFPHHHISDPSLIQY Y  LLSSDRNTVD  AGGALADK   E RELIS+M ENSQSF NRASELDNSLTKEVS             
Subjt:  LSEYRERFKELCASFPHHHISDPSLIQYSYFDLLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLL

Query:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPHAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYA
              TPLKVTKCGVCGLVGHPNDKCPEVIEDVNIV++YDP                              MKQQ+TQLTTAISKM+GKGKL AQ ++A
Subjt:  TSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVRKYDPHAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYA

Query:  NVSAISLRS---------------------------------------------------------------------------EILEMFRKVQINLPLL
        NVSAISLRS                                                                           E+LEM RKVQINLPLL
Subjt:  NVSAISLRS---------------------------------------------------------------------------EILEMFRKVQINLPLL

Query:  DAIQQVPRYAKFLKELCTNKRKTKERPM
        DAIQQVPR AKFLKELCTNKRKTKERPM
Subjt:  DAIQQVPRYAKFLKELCTNKRKTKERPM

A0A5D3DIP6 Retrotrans_gag domain-containing protein3.1e-1293.18Show/hide
Query:  PKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE
        P KL IELKTLPPH+KYIFLGKKNTFPVIISRELNQKQEERLIE
Subjt:  PKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE

A0A5D3DIP6 Retrotrans_gag domain-containing protein7.3e-14755.23Show/hide
Query:  NTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVR
        NTVDA AGGALADK PT ARELIS+MAENSQSF N+ASELDNSL KEVSELKSQMLNMTTLLTSFV GTPLKVTKCGVCGLVGH NDKCPEVIEDVNIVR
Subjt:  NTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIVR

Query:  KYDP-------------------------HAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS----
        +YDP                         HAPSTSS QGTNLEDIIKALATNTLSFQQEMKQQMTQLTT  SKMDGKGKLSAQ  +ANVS ISLRS    
Subjt:  KYDP-------------------------HAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYANVSAISLRS----

Query:  -----------------------------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFL
                                                                               E+LEMFRKVQINLPLLDAIQQVP YAKFL
Subjt:  -----------------------------------------------------------------------EILEMFRKVQINLPLLDAIQQVPRYAKFL

Query:  KELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYI
        KELCTNKRKTKERPMV+QNVSALLKSNIPEKCN                                                                   
Subjt:  KELCTNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYI

Query:  LEMNEACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALE
                                      D GSL VEFDGDIVTFNIFESMRYSDECLSLCS ELHDEVDELSIH EF EQEIVENRELLEPNV YALE
Subjt:  LEMNEACLKPSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALE

Query:  KNNCDNLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNP
        KNN DNLFFSP+KL IELKTLPPHL YIFLGKKN FPVIISRELNQKQEERLI+                K I    C++ +++  + +  I+P  RLNP
Subjt:  KNNCDNLFFSPKKLRIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIE----------------KRIE---CLNSLVVTSKERLIIEP-NRLNP

Query:  RL-ELAKPELTK
         L E+   E+ K
Subjt:  RL-ELAKPELTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCGTTCTTCAAATTTAGAATTTTCTTACTTTGAAGATTTAAATAGAGGGGTTCGTAGAATTAGAAGAGAAAGGAGAGAAGAGAATAGCATCCCCAATCTT
TCTAACCAAGAACCTTTAAGAGGTTTAGAATCTAACCTAGATTCTCCTTTAGATTCAAACCTTGGTAGAGGGAATATGGGCGAAGTTAAAGAAAAGACCCTTAGA
GAGCTTGCTGAGCCCGATGAAGACCAAAGACCTCTTTATATAGTCATACCCCCAACCACTCAACCTTTTGAATTAAAATCGAGACTTATCCATCTTTTGCCCATC
TTTAAAGGAAGTTCCGGAGAGGACCCATACAAACATCTTAAGGATTTTCATATGGTTTGTGATTCCATGAAGATTCACGGCATCTCGGAAGAACAACTAAATTTA
CGAGTCTTTCCATTTTCCCTTACAGATCAATTCTTAAAGAAATTTTTCCCCGCCTCTAGAGCAAACAATATTAGGAAAGAAATTTATGAGATAAGACAAACCTTT
GGAGAATCCCTCTCGGAATACAGGGAAAGATTCAAAGAGCTTTGTGCTAGCTTTCCTCACCATCATATTTCCGACCCTTCTTTAATTCAATATTCTTACTTCGAT
CTTTTGTCTTCCGACAGGAATACGGTAGATGCAGTGGCGGGAGGAGCTCTAGCCGATAAGGCACCCACCGAAGCACGGGAGCTCATTTCAAAAATGGCAGAAAAT
TCTCAAAGCTTTGAGAATAGAGCATCGGAGCTTGACAATTCTCTAACAAAAGAGGTAAGTGAGTTAAAATCACAAATGTTAAATATGACTACTCTTCTTACTTCT
TTTGTACAAGGTACTCCTCTTAAAGTAACCAAGTGTGGAGTTTGTGGCTTGGTTGGTCATCCAAATGACAAATGTCCCGAGGTGATCGAGGATGTAAACATTGTT
CGAAAATATGACCCCCACGCACCATCCACATCTTCAAACCAAGGTACGAATCTTGAAGATATTATCAAAGCTTTGGCAACTAACACTCTTTCTTTTCAACAAGAG
ATGAAACAACAAATGACTCAACTTACTACCGCCATAAGCAAGATGGATGGGAAAGGCAAACTTTCGGCTCAATCGAACTATGCTAATGTAAGTGCCATTTCACTA
AGGAGTGAAATCTTGGAGATGTTTAGAAAGGTGCAAATCAACTTGCCCCTTCTCGATGCAATCCAACAAGTCCCGAGGTATGCAAAGTTTCTAAAGGAGTTGTGT
ACCAACAAGAGGAAAACAAAAGAAAGACCAATGGTAAGTCAAAATGTTTCGGCTCTTCTTAAGAGTAATATTCCGGAAAAATGCAATCACCTCGGTATGTTTTCT
CTACCTTGTGTAATAGGAAATAGGCTAATTTCTCATGCAATGCTTGACTTAGGAGCATCCATAAATGTCATGCCATACAACGTCTTTAAAGATCTAGAACTCAAT
AATTTGCAAAAGACTAGAATAGTTGAAGATGTCCTTGTGAAAATTGACAAACTAATTTTTCCGGTGGATTTTTACATATTAGAAATGAATGAAGCATGCCTAAAA
CCATCTCACTCTATTTTATTAGGAAGACCTTTTCTTAAAACTGCTAAAGCCATTATAAATGTTGATAAAGGTTCCTTGAGTGTAGAGTTTGATGGAGACATTGTC
ACATTCAATATTTTTGAATCCATGAGATATTCGGATGAATGCTTGTCTTTATGTTCATTAGAATTGCATGATGAAGTTGATGAATTGTCTATACATTATGAATTT
CTTGAACAAGAAATAGTTGAAAATAGAGAATTGTTAGAACCCAATGTTGACTATGCTTTAGAAAAAAATAATTGTGATAATCTTTTCTTTTCTCCAAAGAAACTA
CGGATTGAGCTCAAAACTCTCCCCCCACACTTGAAATACATATTCTTAGGAAAAAAGAATACGTTTCCCGTAATCATCTCAAGGGAACTTAACCAAAAACAAGAA
GAAAGACTTATCGAAAAAAGGATAGAATGCTTAAATTCGCTTGTTGTCACCTCGAAAGAACGGTTAATCATAGAACCTAATAGATTAAATCCAAGATTAGAGTTA
GCCAAGCCGGAGCTAACAAAAAGCTTGCTACCAAAAATAGAATCCCTAAAAAAAAAAGTAGCAAAATGGTGTCCGCTCCTATGCACGACACAAATCCCTGCATGG
ATCTCACGGTGTGAATTCTTAGGGACGCGAGAGCTAAAAGTACACTACTTCTCTCTAGCTGAAGTGTTCTATAATGATTTTAAGGGTATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCCGTTCTTCAAATTTAGAATTTTCTTACTTTGAAGATTTAAATAGAGGGGTTCGTAGAATTAGAAGAGAAAGGAGAGAAGAGAATAGCATCCCCAATCTT
TCTAACCAAGAACCTTTAAGAGGTTTAGAATCTAACCTAGATTCTCCTTTAGATTCAAACCTTGGTAGAGGGAATATGGGCGAAGTTAAAGAAAAGACCCTTAGA
GAGCTTGCTGAGCCCGATGAAGACCAAAGACCTCTTTATATAGTCATACCCCCAACCACTCAACCTTTTGAATTAAAATCGAGACTTATCCATCTTTTGCCCATC
TTTAAAGGAAGTTCCGGAGAGGACCCATACAAACATCTTAAGGATTTTCATATGGTTTGTGATTCCATGAAGATTCACGGCATCTCGGAAGAACAACTAAATTTA
CGAGTCTTTCCATTTTCCCTTACAGATCAATTCTTAAAGAAATTTTTCCCCGCCTCTAGAGCAAACAATATTAGGAAAGAAATTTATGAGATAAGACAAACCTTT
GGAGAATCCCTCTCGGAATACAGGGAAAGATTCAAAGAGCTTTGTGCTAGCTTTCCTCACCATCATATTTCCGACCCTTCTTTAATTCAATATTCTTACTTCGAT
CTTTTGTCTTCCGACAGGAATACGGTAGATGCAGTGGCGGGAGGAGCTCTAGCCGATAAGGCACCCACCGAAGCACGGGAGCTCATTTCAAAAATGGCAGAAAAT
TCTCAAAGCTTTGAGAATAGAGCATCGGAGCTTGACAATTCTCTAACAAAAGAGGTAAGTGAGTTAAAATCACAAATGTTAAATATGACTACTCTTCTTACTTCT
TTTGTACAAGGTACTCCTCTTAAAGTAACCAAGTGTGGAGTTTGTGGCTTGGTTGGTCATCCAAATGACAAATGTCCCGAGGTGATCGAGGATGTAAACATTGTT
CGAAAATATGACCCCCACGCACCATCCACATCTTCAAACCAAGGTACGAATCTTGAAGATATTATCAAAGCTTTGGCAACTAACACTCTTTCTTTTCAACAAGAG
ATGAAACAACAAATGACTCAACTTACTACCGCCATAAGCAAGATGGATGGGAAAGGCAAACTTTCGGCTCAATCGAACTATGCTAATGTAAGTGCCATTTCACTA
AGGAGTGAAATCTTGGAGATGTTTAGAAAGGTGCAAATCAACTTGCCCCTTCTCGATGCAATCCAACAAGTCCCGAGGTATGCAAAGTTTCTAAAGGAGTTGTGT
ACCAACAAGAGGAAAACAAAAGAAAGACCAATGGTAAGTCAAAATGTTTCGGCTCTTCTTAAGAGTAATATTCCGGAAAAATGCAATCACCTCGGTATGTTTTCT
CTACCTTGTGTAATAGGAAATAGGCTAATTTCTCATGCAATGCTTGACTTAGGAGCATCCATAAATGTCATGCCATACAACGTCTTTAAAGATCTAGAACTCAAT
AATTTGCAAAAGACTAGAATAGTTGAAGATGTCCTTGTGAAAATTGACAAACTAATTTTTCCGGTGGATTTTTACATATTAGAAATGAATGAAGCATGCCTAAAA
CCATCTCACTCTATTTTATTAGGAAGACCTTTTCTTAAAACTGCTAAAGCCATTATAAATGTTGATAAAGGTTCCTTGAGTGTAGAGTTTGATGGAGACATTGTC
ACATTCAATATTTTTGAATCCATGAGATATTCGGATGAATGCTTGTCTTTATGTTCATTAGAATTGCATGATGAAGTTGATGAATTGTCTATACATTATGAATTT
CTTGAACAAGAAATAGTTGAAAATAGAGAATTGTTAGAACCCAATGTTGACTATGCTTTAGAAAAAAATAATTGTGATAATCTTTTCTTTTCTCCAAAGAAACTA
CGGATTGAGCTCAAAACTCTCCCCCCACACTTGAAATACATATTCTTAGGAAAAAAGAATACGTTTCCCGTAATCATCTCAAGGGAACTTAACCAAAAACAAGAA
GAAAGACTTATCGAAAAAAGGATAGAATGCTTAAATTCGCTTGTTGTCACCTCGAAAGAACGGTTAATCATAGAACCTAATAGATTAAATCCAAGATTAGAGTTA
GCCAAGCCGGAGCTAACAAAAAGCTTGCTACCAAAAATAGAATCCCTAAAAAAAAAAGTAGCAAAATGGTGTCCGCTCCTATGCACGACACAAATCCCTGCATGG
ATCTCACGGTGTGAATTCTTAGGGACGCGAGAGCTAAAAGTACACTACTTCTCTCTAGCTGAAGTGTTCTATAATGATTTTAAGGGTATTTAA
Protein sequenceShow/hide protein sequence
MTRSSNLEFSYFEDLNRGVRRIRRERREENSIPNLSNQEPLRGLESNLDSPLDSNLGRGNMGEVKEKTLRELAEPDEDQRPLYIVIPPTTQPFELKSRLIHLLPI
FKGSSGEDPYKHLKDFHMVCDSMKIHGISEEQLNLRVFPFSLTDQFLKKFFPASRANNIRKEIYEIRQTFGESLSEYRERFKELCASFPHHHISDPSLIQYSYFD
LLSSDRNTVDAVAGGALADKAPTEARELISKMAENSQSFENRASELDNSLTKEVSELKSQMLNMTTLLTSFVQGTPLKVTKCGVCGLVGHPNDKCPEVIEDVNIV
RKYDPHAPSTSSNQGTNLEDIIKALATNTLSFQQEMKQQMTQLTTAISKMDGKGKLSAQSNYANVSAISLRSEILEMFRKVQINLPLLDAIQQVPRYAKFLKELC
TNKRKTKERPMVSQNVSALLKSNIPEKCNHLGMFSLPCVIGNRLISHAMLDLGASINVMPYNVFKDLELNNLQKTRIVEDVLVKIDKLIFPVDFYILEMNEACLK
PSHSILLGRPFLKTAKAIINVDKGSLSVEFDGDIVTFNIFESMRYSDECLSLCSLELHDEVDELSIHYEFLEQEIVENRELLEPNVDYALEKNNCDNLFFSPKKL
RIELKTLPPHLKYIFLGKKNTFPVIISRELNQKQEERLIEKRIECLNSLVVTSKERLIIEPNRLNPRLELAKPELTKSLLPKIESLKKKVAKWCPLLCTTQIPAW
ISRCEFLGTRELKVHYFSLAEVFYNDFKGI