; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006276 (gene) of Snake gourd v1 genome

Gene IDTan0006276
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG04:31664264..31669905
RNA-Seq ExpressionTan0006276
SyntenyTan0006276
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05005.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.2e-7232.7Show/hide
Query:  SRDATESTSLIQFGTLEPVEIWLQREFL-NTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKRKNVPRKKNKKIKSWRKCKIVKEDNEDFV
        +R+      L +FGT EPV +   +E     S  + +  E +DE WT+VTRRKK+K   IQ+E   +RNY+R N   +KNKK K  RK K++ ++++DF 
Subjt:  SRDATESTSLIQFGTLEPVEIWLQREFL-NTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKRKNVPRKKNKKIKSWRKCKIVKEDNEDFV

Query:  RPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNVSAE--------------------------------------------------------------
        R ++ ITL DF P  FL ++  + P +  CH +N + E                                                              
Subjt:  RPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNVSAE--------------------------------------------------------------

Query:  --------------------------------------------------------------------GENQ----------------------------
                                                                            G NQ                            
Subjt:  --------------------------------------------------------------------GENQ----------------------------

Query:  --ILIKKESIKEVDACEV----------------------------KTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVE
            +K +S  EV + EV                            + S+ ST++A   +  +EK+SN  +LRY+PLSRRKKGESPF E  + LK GD+E
Subjt:  --ILIKKESIKEVDACEV----------------------------KTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVE

Query:  ILKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEP
        +LKESF TPLT I KQEI   K++  + + P+RRTK+ FDPKAYKL++ AGYDFTTHT FKS KI  E+P+LSSTQKKLL+EG+ IP SRKGLGYK PEP
Subjt:  ILKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEP

Query:  IRITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSA
        IRITRKGKEKM D+NHIT++EVD   ++    +RTS F R+ P   R    +R+ VT  E      T +  R S  +RL +   K  +T   P ++    
Subjt:  IRITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSA

Query:  LQQLGTFTKERKNIPLTSNVKRPSAFHRLSVATSKSEGPSMSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVN
           L   T    +I    +  R S +HR+                       +E+  GK    E K E+EI    PSRMKRKTFVT+N
Subjt:  LQQLGTFTKERKNIPLTSNVKRPSAFHRLSVATSKSEGPSMSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVN

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]8.3e-1643.41Show/hide
Query:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN
        K +SKS+A++ +Y GPITRSRS+ I   + Q   IA  IL+Q+ ES +    I EN ++      S    + ++P VMSVMM D  + E  M E E+KIN
Subjt:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN

Query:  MLLKAVEERDYEISSLKNQIESRDATEST
        +L+K V+ERD+EI++LK Q+++R+  ES+
Subjt:  MLLKAVEERDYEISSLKNQIESRDATEST

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]8.3e-1643.41Show/hide
Query:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN
        K +SKS+A++ +Y GPITRSRS+ I   + Q   IA  IL+Q+ ES +    I EN ++      S    + ++P VMSVMM D  + E  M E E+KIN
Subjt:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN

Query:  MLLKAVEERDYEISSLKNQIESRDATEST
        +L+K V+ERD+EI++LK Q+++R+  ES+
Subjt:  MLLKAVEERDYEISSLKNQIESRDATEST

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]1.1e-7130.85Show/hide
Query:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR
        EK+I + L+ V + ++ E++ +     SR   E   SL+QFGT EP+ + + Q        G+ +  E +DEGW +VT RKK++    QRE  +++NY+R
Subjt:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR

Query:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------
         N   +KNKK K   K K+V  ++ +F RP++ +TL DFLP+ FL ++  + PE+  CH +N          S EGE                       
Subjt:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------

Query:  -----------------------------------------------------------NQILIK-----------------------------------
                                                                   ++ILI                                    
Subjt:  -----------------------------------------------------------NQILIK-----------------------------------

Query:  -----------------------------------------------------------KESIKEVDA--------------------------------
                                                                   ++ +K+V+A                                
Subjt:  -----------------------------------------------------------KESIKEVDA--------------------------------

Query:  -----------CEVKTSKPS--------------TNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI
                     + T++P               T+S    + K+E ++N PVLRY+PLSRRKKGESPF E  + LK GD+EI+KESF TPLT IAKQE+
Subjt:  -----------CEVKTSKPS--------------TNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI

Query:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT
           KV+  +   P+RRTK+GFDPKAYKL++ AGYDFT HT FKS +I D RPELSSTQKKLL+EG+ IP SRKGLGYKSPEPIRIT+KGKEK+ D NHIT
Subjt:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT

Query:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----
        IEE D+++ +    +R SVF R+ P   R    +R+ +T  E        +  R S+ +RL     KE+ T    T  +PSA ++LG    ++KN+    
Subjt:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----

Query:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT
         P+ +++    +   +  ++ T K E  S + V+  ++H  +      K   ETK+  EI    PSRMKRKTFVT+NT
Subjt:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]8.3e-1643.41Show/hide
Query:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN
        K +SKS+A++ +Y GPITRSRS+ I   + Q   IA  IL+Q+ ES +    I EN ++      S    + ++P VMSVMM D  + E  M E E+KIN
Subjt:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN

Query:  MLLKAVEERDYEISSLKNQIESRDATEST
        +L+K V+ERD+EI++LK Q+++R+  ES+
Subjt:  MLLKAVEERDYEISSLKNQIESRDATEST

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]1.4e-7130.85Show/hide
Query:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR
        EK+I + L+ V + ++ E++ +     SR   E   SL+QFGT EP+ + + Q        G+ +  E +DEGW +VT RKK++    QRE  +++NY+R
Subjt:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR

Query:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------
         N   +KNKK K   K K+V  ++ +F RP++ +TL DFLP+ FL ++  + PE+  CH +N          S EGE                       
Subjt:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------

Query:  -----------------------------------------------------------NQILIK-----------------------------------
                                                                   ++ILI                                    
Subjt:  -----------------------------------------------------------NQILIK-----------------------------------

Query:  -----------------------------------------------------------KESIKEVDA--------------------------------
                                                                   ++ +K+V+A                                
Subjt:  -----------------------------------------------------------KESIKEVDA--------------------------------

Query:  -----------CEVKTSKPS--------------TNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI
                     + T++P               T+S    + K+E ++N PVLRY+PLSRRKKGESPF E  + LK GD+EI+KESF TPLT IAKQE+
Subjt:  -----------CEVKTSKPS--------------TNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI

Query:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT
           KV+  +   P+RRTK+GFDPKAYKL++ AGYDFT HT FKS +I D RPELSSTQKKLL+EG+ IP SRKGLGYKSPEPIRIT+KGKEK+ D NHIT
Subjt:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT

Query:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----
        IEE D+++ +    +R SVF R+ P   R    +R+ +T  E           R S+ +RL     KE+ T    T  +PSA ++LG    ++KN+    
Subjt:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----

Query:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT
         P+ +++    +   +  ++ T K E  S + V+  ++H  +      K   ETK+  EI    PSRMKRKTFVT+NT
Subjt:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]8.1e-7231.23Show/hide
Query:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR
        EK+I + L+ V + ++ E++ +     SR   E   SL+QFGT EP+ + + Q        G+ +  E +DEGW +VT RKK++    QRE  +++NY+R
Subjt:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR

Query:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------
         N   +KNKK K   K K+V  ++ +F RP++ +TL DFLP+ FL ++  + PE+  CH +N          S EGE                       
Subjt:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------

Query:  -----------------------------------------------------------NQILI------------------------------------
                                                                   ++ILI                                    
Subjt:  -----------------------------------------------------------NQILI------------------------------------

Query:  ---------------------------------------------------------------------------------------KKESIKEVDACEV
                                                                                               K  +I EV   E 
Subjt:  ---------------------------------------------------------------------------------------KKESIKEVDACEV

Query:  KTSKPSTNS--------------------------ANEK--VSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI
          +K   NS                          +N K  + K+E ++N PVLRY+PLSRRKKGESPF E  + LK GD+EI+KESF TPLT IAKQE+
Subjt:  KTSKPSTNS--------------------------ANEK--VSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI

Query:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT
           KV+  +   P+RRTK+GFDPKAYKL++ AGYDFT HT FKS +I D RPELSSTQKKLL+EG+ IP SRKGLGYKSPEPIRIT+KGKEK+ D NHIT
Subjt:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT

Query:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----
        IEE D+++ +    +R SVF R+ P   R    +R+ +T  E        +  R S+ +RL     KE+ T    T  +PSA ++LG    ++KN+    
Subjt:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----

Query:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT
         P+ +++    +   +  ++ T K E  S + V+  ++H  +      K   ETK+  EI    PSRMKRKTFVT+NT
Subjt:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]8.3e-1643.41Show/hide
Query:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN
        K +SKS+A++ +Y GPITRSRS+ I   + Q   IA  IL+Q+ ES +    I EN ++      S    + ++P VMSVMM D  + E  M E E+KIN
Subjt:  KISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFEKKIN

Query:  MLLKAVEERDYEISSLKNQIESRDATEST
        +L+K V+ERD+EI++LK Q+++R+  ES+
Subjt:  MLLKAVEERDYEISSLKNQIESRDATEST

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]1.1e-7130.85Show/hide
Query:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR
        EK+I + L+ V + ++ E++ +     SR   E   SL+QFGT EP+ + + Q        G+ +  E +DEGW +VT RKK++    QRE  +++NY+R
Subjt:  EKKINMLLKAVEERDY-EISSLKNQIESRDATES-TSLIQFGTLEPVEI-WLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR

Query:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------
         N   +KNKK K   K K+V  ++ +F RP++ +TL DFLP+ FL ++  + PE+  CH +N          S EGE                       
Subjt:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNV---------SAEGE-----------------------

Query:  -----------------------------------------------------------NQILIK-----------------------------------
                                                                   ++ILI                                    
Subjt:  -----------------------------------------------------------NQILIK-----------------------------------

Query:  -----------------------------------------------------------KESIKEVDA--------------------------------
                                                                   ++ +K+V+A                                
Subjt:  -----------------------------------------------------------KESIKEVDA--------------------------------

Query:  -----------CEVKTSKPS--------------TNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI
                     + T++P               T+S    + K+E ++N PVLRY+PLSRRKKGESPF E  + LK GD+EI+KESF TPLT IAKQE+
Subjt:  -----------CEVKTSKPS--------------TNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEI

Query:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT
           KV+  +   P+RRTK+GFDPKAYKL++ AGYDFT HT FKS +I D RPELSSTQKKLL+EG+ IP SRKGLGYKSPEPIRIT+KGKEK+ D NHIT
Subjt:  RNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHIT

Query:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----
        IEE D+++ +    +R SVF R+ P   R    +R+ +T  E        +  R S+ +RL     KE+ T    T  +PSA ++LG    ++KN+    
Subjt:  IEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNI----

Query:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT
         P+ +++    +   +  ++ T K E  S + V+  ++H  +      K   ETK+  EI    PSRMKRKTFVT+NT
Subjt:  -PLTSNVKRPSAFHRL--SVATSKSEGPS-MSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTVNT

TrEMBL top hitse value%identityAlignment
A0A5A7UD46 Uncharacterized protein1.3e-6748.87Show/hide
Query:  EITTCHVVNVSAEGENQI--LIKKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLT
        E+ +  V+ V+ E   Q+  L  KE  K +       S+ STN+A   +  +EK+SN P+LRY+PLSRRKKGESPF E  + LK G++E+LKESF TPLT
Subjt:  EITTCHVVNVSAEGENQI--LIKKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLT

Query:  TIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKM
         I KQEI   K++  + + P+RRTK+GFDPKAYKL++ AGYDFTTHT FKS KI+ E+P+LSSTQKKLL+EG++IP SRKGLGYKSPEPIRITRKGKEK+
Subjt:  TIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKM

Query:  ADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKER
         D+NHIT++E D   ++    +RTS F R+ P   R    +++ +T  E      T +  R S  QRL +   +EK    T    +PSA ++L    K+ 
Subjt:  ADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKER

Query:  KNIPLTSNVKR
           PL   + R
Subjt:  KNIPLTSNVKR

A0A5D3BSG5 Uncharacterized protein2.2e-7030.82Show/hide
Query:  EKKINMLLKAVEERDYEISSLKNQIESRDA--TESTSLIQFGTLEPVEIWLQREFL-NTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR
        EKKI + L+ V + ++  + + ++  S      +  SL+QFGT EPV +   +E     S  + +  E +DEGWT+VTRRKK+K   IQ+E   +RNY++
Subjt:  EKKINMLLKAVEERDYEISSLKNQIESRDA--TESTSLIQFGTLEPVEIWLQREFL-NTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKR

Query:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVN-----------------------------VSAEGENQI
         N   +KNKK K  RK K++ E+++DF + ++ +TL DF P  FL ++  + P +  CH +N                             +S   E +I
Subjt:  KNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVN-----------------------------VSAEGENQI

Query:  LI--------------------------------------------------------------------------------------------------
        ++                                                                                                  
Subjt:  LI--------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------KKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYI
                                                                   ++ +K+V+A     S+  ++ A+ K   +  SS+  V   +
Subjt:  ----------------------------------------------------------KKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYI

Query:  PL------------------------------------SRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGF
        PL                                     +RKKGESPF E  + LK GD+E+LKESFITPL  I KQEI   K++  K + P+RRTK+GF
Subjt:  PL------------------------------------SRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGF

Query:  DPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFR
        DPKAYKL++ AGYDFTTHT FKS KI  E+P+LSSTQKKLL+EG++IP SRKGLGYKSPEPIRITRKGKEK+ D+NHIT++EVD   +     +RTS F 
Subjt:  DPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFR

Query:  RVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNIPLTSNVKR--PSAFH---RLSVATS
        R+ P   R    +R+ +T  E      T +  R S+ QRL +   +EK    T    +PSA ++L    K+    P    + R      H     S+ T 
Subjt:  RVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNIPLTSNVKR--PSAFH---RLSVATS

Query:  KSEGPS-MSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVNT
        K E  S +SV+  ++H   +E+  GK    E K E+EIR   PSR KRKTFVT+NT
Subjt:  KSEGPS-MSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVNT

A0A5D3BY54 Ty3-gypsy retrotransposon protein4.1e-6938.74Show/hide
Query:  ENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKRKNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKD----------------
        E +DEGWT+VTRRKK+K   IQ+E   + NY+R N   +KNKK K  RK K+V E ++DF R ++ +TL DF P  FL ++  +                
Subjt:  ENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKRKNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKD----------------

Query:  -------------------------------------------------------------------------------------------------VPE
                                                                                                          PE
Subjt:  -------------------------------------------------------------------------------------------------VPE

Query:  ITTCHVVNVSAEGENQI--LIKKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTT
        + +  V  V+ E   Q+  L  +ES K         S+ ST++A   +  +EK+SN P+LRY+PLSRRKKGESPF E  + LK GD+E+LKESF TPLT 
Subjt:  ITTCHVVNVSAEGENQI--LIKKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTT

Query:  IAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMA
        I KQEI   K++  + + P+RRTK+GFDPKAYK ++ AGYDFTTHT FKS KI  E+P+LSSTQKKLL+EG+ IP SRKGLGYK PEPIRITRKGKEK+ 
Subjt:  IAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMA

Query:  DANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEK
        D+NHIT++EVD   ++    +RTS F R+ P   R    +R+ +   E      T    R S  QRL M    EK
Subjt:  DANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEK

A0A5D3BY54 Ty3-gypsy retrotransposon protein3.4e-1544.36Show/hide
Query:  ISSKSAASNTS-----YMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFE
        ++SK AASN+S     Y GPITRSRS+ I  E+ Q   +A  IL+Q+ ES +    I EN ++      S +  + ++P VMSVMM D    E  M E E
Subjt:  ISSKSAASNTS-----YMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLEPSETSNPGVMSVMMPDAGISEERMVEFE

Query:  KKINMLLKAVEERDYEISSLKNQIESRDATEST
        +KIN L+KAVEERD+EI +L+ Q+ +R+  ES+
Subjt:  KKINMLLKAVEERDYEISSLKNQIESRDATEST

A0A5D3BY54 Ty3-gypsy retrotransposon protein4.5e-6845.88Show/hide
Query:  FLNNYSKDVPEITTCHVVNVSAEGENQI--LIKKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEI
        +L N S   PE+    V  V+ E   Q+  L  KE  K +     + S+ ST++A   +  +EK+SN P+L Y+PLSRRKKGESPF E S+ LK GD+E+
Subjt:  FLNNYSKDVPEITTCHVVNVSAEGENQI--LIKKESIKEVDACEVKTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEI

Query:  LKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPI
        LKESF TPLT I KQEI   K++  + + P+RRTK+ FDPKAYKL++ AGYDFTTHT FKS KI  E+P++SSTQKKLL+EG+ IP SRKGLGYK PEPI
Subjt:  LKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPI

Query:  RITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSAL
        RITRKGKEKM D+NHI ++EVD   ++    +RTS F R+ P   R    +R+ V   E      T +  R S  +RL +   K  +T   P       +
Subjt:  RITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSAL

Query:  QQLGTFTKERKNIPLTSNVKRPSAFHRLSVATSKSEGPS-MSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVN
         +LG             +V+  S     S+ T K E  S +SV+  ++H   +E+  GK    E K E+EI    PSRMKRKTFVT N
Subjt:  QQLGTFTKERKNIPLTSNVKRPSAFHRLSVATSKSEGPS-MSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVN

A0A5D3C0W6 Ty3-gypsy retrotransposon protein3.0e-7232.7Show/hide
Query:  SRDATESTSLIQFGTLEPVEIWLQREFL-NTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKRKNVPRKKNKKIKSWRKCKIVKEDNEDFV
        +R+      L +FGT EPV +   +E     S  + +  E +DE WT+VTRRKK+K   IQ+E   +RNY+R N   +KNKK K  RK K++ ++++DF 
Subjt:  SRDATESTSLIQFGTLEPVEIWLQREFL-NTSYGQPKHTENEDEGWTLVTRRKKQKQRCIQREPHAFRNYKRKNVPRKKNKKIKSWRKCKIVKEDNEDFV

Query:  RPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNVSAE--------------------------------------------------------------
        R ++ ITL DF P  FL ++  + P +  CH +N + E                                                              
Subjt:  RPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNVSAE--------------------------------------------------------------

Query:  --------------------------------------------------------------------GENQ----------------------------
                                                                            G NQ                            
Subjt:  --------------------------------------------------------------------GENQ----------------------------

Query:  --ILIKKESIKEVDACEV----------------------------KTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVE
            +K +S  EV + EV                            + S+ ST++A   +  +EK+SN  +LRY+PLSRRKKGESPF E  + LK GD+E
Subjt:  --ILIKKESIKEVDACEV----------------------------KTSKPSTNSANEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVE

Query:  ILKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEP
        +LKESF TPLT I KQEI   K++  + + P+RRTK+ FDPKAYKL++ AGYDFTTHT FKS KI  E+P+LSSTQKKLL+EG+ IP SRKGLGYK PEP
Subjt:  ILKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIFDERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEP

Query:  IRITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSA
        IRITRKGKEKM D+NHIT++EVD   ++    +RTS F R+ P   R    +R+ VT  E      T +  R S  +RL +   K  +T   P ++    
Subjt:  IRITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIRQRLRMPTDKEKRTSITPTLLQPSA

Query:  LQQLGTFTKERKNIPLTSNVKRPSAFHRLSVATSKSEGPSMSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVN
           L   T    +I    +  R S +HR+                       +E+  GK    E K E+EI    PSRMKRKTFVT+N
Subjt:  LQQLGTFTKERKNIPLTSNVKRPSAFHRLSVATSKSEGPSMSVFDLLQHDCLIETSKGKS-SVETKDEQEIRCAFPSRMKRKTFVTVN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTGGATTAAGCCAAATAAGACTAAGTCCAAGAATTGGGTCCAAAGTCCACCAGGCCCAAGCCCATACAAGTCCAGAGAATTCAAAAGATCCAAAGATCCGAACTC
GAAGATTCAACGACTTAAAATTTCTTCAAAGTCTGCCGCTTCAAATACCTCCTACATGGGGCCAATCACGCGCAGTCGCTCTAGAGAAATCCAACCTGAAGAAAGTCAAA
CACCTGTCATCGCAAACAAAATTCTTAGACAAATGACAGAATCTGCTCAAGTTGATGCCACCATCGCAGAAAACCTCGTGTTTGGAAAAATCACTTTTCCTTCTCTTGAA
CCAAGTGAAACATCAAACCCTGGCGTGATGTCCGTTATGATGCCTGATGCAGGCATTAGTGAAGAAAGAATGGTTGAATTTGAAAAAAAGATCAACATGTTATTGAAGGC
AGTTGAAGAAAGAGATTATGAAATCTCTTCTCTCAAGAATCAGATTGAAAGTCGGGATGCTACTGAATCAACAAGCTTGATCCAATTTGGAACTTTAGAGCCTGTAGAAA
TTTGGCTGCAGCGAGAATTTCTAAATACCTCCTATGGCCAGCCTAAGCATACTGAAAATGAAGATGAAGGATGGACTTTAGTAACGCGTCGAAAGAAGCAAAAACAACGA
TGCATCCAGAGAGAACCACACGCGTTTCGAAACTACAAGAGGAAAAATGTGCCACGAAAGAAGAACAAAAAGATCAAAAGTTGGAGGAAATGCAAGATAGTTAAGGAAGA
TAACGAAGACTTTGTTCGACCTCGACAACCAATAACGTTGAACGATTTCTTGCCAAGGGGCTTCCTTAATAACTATAGCAAAGATGTCCCTGAAATTACCACGTGCCATG
TGGTTAACGTATCTGCAGAAGGGGAGAACCAAATACTCATAAAGAAAGAATCGATTAAAGAGGTAGACGCTTGTGAAGTTAAAACAAGTAAGCCATCTACCAACTCTGCG
AACGAAAAGGTTTCAAAAGAAGAAAAATCATCAAATCTTCCAGTACTGCGTTACATCCCTTTATCACGACGTAAAAAGGGAGAATCACCATTTACAGAATGCTCAGAAAA
ATTAAAATCCGGCGATGTGGAAATCCTAAAAGAAAGTTTCATCACACCTCTAACAACAATCGCCAAACAAGAAATTCGAAATTCGAAGGTTAATAATGCAAAAATAGCCT
TCCCCGAAAGAAGAACGAAGAATGGGTTTGATCCTAAAGCATACAAACTTCTATCAAATGCGGGTTACGACTTCACAACTCACACTACGTTCAAAAGTTCAAAAATCTTC
GATGAGAGGCCTGAGCTTTCTTCAACACAAAAGAAACTTTTAAAGGAAGGTTATATGATACCCACGTCAAGAAAAGGACTTGGATACAAGTCCCCTGAGCCAATCCGTAT
AACGAGGAAAGGAAAGGAGAAGATGGCAGACGCAAACCACATAACAATTGAGGAAGTAGATGATTCGAACAAAGAAGGAAATGTCATCCGAAGGACTTCTGTTTTTAGGC
GTGTTGGGCCATTGGCAACACGACTCTCGACCCTCCAGAGGATAGGGGTGACTGCTGAAGAAAGTGTTTTACCAGGTCCTACTTTTGACTCCACCCGATCTTCAATCCGT
CAGAGGTTAAGAATGCCCACTGATAAAGAAAAAAGAACGTCAATCACTCCAACTTTGCTGCAACCATCAGCGTTGCAACAATTAGGCACATTTACGAAAGAGAGGAAAAA
TATACCTTTGACTTCAAATGTGAAGCGCCCTTCAGCTTTTCACAGGTTAAGTGTAGCCACGTCAAAAAGTGAGGGTCCTTCCATGTCAGTTTTTGACCTACTTCAACATG
ATTGCCTGATTGAAACCTCGAAGGGTAAATCCTCTGTTGAAACAAAAGATGAGCAAGAAATTCGCTGTGCCTTTCCTTCCCGTATGAAAAGAAAAACCTTCGTTACAGTT
AATACAGAGGGTGTTGATGAAGAAAGATACAATATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTGGATTAAGCCAAATAAGACTAAGTCCAAGAATTGGGTCCAAAGTCCACCAGGCCCAAGCCCATACAAGTCCAGAGAATTCAAAAGATCCAAAGATCCGAACTC
GAAGATTCAACGACTTAAAATTTCTTCAAAGTCTGCCGCTTCAAATACCTCCTACATGGGGCCAATCACGCGCAGTCGCTCTAGAGAAATCCAACCTGAAGAAAGTCAAA
CACCTGTCATCGCAAACAAAATTCTTAGACAAATGACAGAATCTGCTCAAGTTGATGCCACCATCGCAGAAAACCTCGTGTTTGGAAAAATCACTTTTCCTTCTCTTGAA
CCAAGTGAAACATCAAACCCTGGCGTGATGTCCGTTATGATGCCTGATGCAGGCATTAGTGAAGAAAGAATGGTTGAATTTGAAAAAAAGATCAACATGTTATTGAAGGC
AGTTGAAGAAAGAGATTATGAAATCTCTTCTCTCAAGAATCAGATTGAAAGTCGGGATGCTACTGAATCAACAAGCTTGATCCAATTTGGAACTTTAGAGCCTGTAGAAA
TTTGGCTGCAGCGAGAATTTCTAAATACCTCCTATGGCCAGCCTAAGCATACTGAAAATGAAGATGAAGGATGGACTTTAGTAACGCGTCGAAAGAAGCAAAAACAACGA
TGCATCCAGAGAGAACCACACGCGTTTCGAAACTACAAGAGGAAAAATGTGCCACGAAAGAAGAACAAAAAGATCAAAAGTTGGAGGAAATGCAAGATAGTTAAGGAAGA
TAACGAAGACTTTGTTCGACCTCGACAACCAATAACGTTGAACGATTTCTTGCCAAGGGGCTTCCTTAATAACTATAGCAAAGATGTCCCTGAAATTACCACGTGCCATG
TGGTTAACGTATCTGCAGAAGGGGAGAACCAAATACTCATAAAGAAAGAATCGATTAAAGAGGTAGACGCTTGTGAAGTTAAAACAAGTAAGCCATCTACCAACTCTGCG
AACGAAAAGGTTTCAAAAGAAGAAAAATCATCAAATCTTCCAGTACTGCGTTACATCCCTTTATCACGACGTAAAAAGGGAGAATCACCATTTACAGAATGCTCAGAAAA
ATTAAAATCCGGCGATGTGGAAATCCTAAAAGAAAGTTTCATCACACCTCTAACAACAATCGCCAAACAAGAAATTCGAAATTCGAAGGTTAATAATGCAAAAATAGCCT
TCCCCGAAAGAAGAACGAAGAATGGGTTTGATCCTAAAGCATACAAACTTCTATCAAATGCGGGTTACGACTTCACAACTCACACTACGTTCAAAAGTTCAAAAATCTTC
GATGAGAGGCCTGAGCTTTCTTCAACACAAAAGAAACTTTTAAAGGAAGGTTATATGATACCCACGTCAAGAAAAGGACTTGGATACAAGTCCCCTGAGCCAATCCGTAT
AACGAGGAAAGGAAAGGAGAAGATGGCAGACGCAAACCACATAACAATTGAGGAAGTAGATGATTCGAACAAAGAAGGAAATGTCATCCGAAGGACTTCTGTTTTTAGGC
GTGTTGGGCCATTGGCAACACGACTCTCGACCCTCCAGAGGATAGGGGTGACTGCTGAAGAAAGTGTTTTACCAGGTCCTACTTTTGACTCCACCCGATCTTCAATCCGT
CAGAGGTTAAGAATGCCCACTGATAAAGAAAAAAGAACGTCAATCACTCCAACTTTGCTGCAACCATCAGCGTTGCAACAATTAGGCACATTTACGAAAGAGAGGAAAAA
TATACCTTTGACTTCAAATGTGAAGCGCCCTTCAGCTTTTCACAGGTTAAGTGTAGCCACGTCAAAAAGTGAGGGTCCTTCCATGTCAGTTTTTGACCTACTTCAACATG
ATTGCCTGATTGAAACCTCGAAGGGTAAATCCTCTGTTGAAACAAAAGATGAGCAAGAAATTCGCTGTGCCTTTCCTTCCCGTATGAAAAGAAAAACCTTCGTTACAGTT
AATACAGAGGGTGTTGATGAAGAAAGATACAATATTTGA
Protein sequenceShow/hide protein sequence
MDWIKPNKTKSKNWVQSPPGPSPYKSREFKRSKDPNSKIQRLKISSKSAASNTSYMGPITRSRSREIQPEESQTPVIANKILRQMTESAQVDATIAENLVFGKITFPSLE
PSETSNPGVMSVMMPDAGISEERMVEFEKKINMLLKAVEERDYEISSLKNQIESRDATESTSLIQFGTLEPVEIWLQREFLNTSYGQPKHTENEDEGWTLVTRRKKQKQR
CIQREPHAFRNYKRKNVPRKKNKKIKSWRKCKIVKEDNEDFVRPRQPITLNDFLPRGFLNNYSKDVPEITTCHVVNVSAEGENQILIKKESIKEVDACEVKTSKPSTNSA
NEKVSKEEKSSNLPVLRYIPLSRRKKGESPFTECSEKLKSGDVEILKESFITPLTTIAKQEIRNSKVNNAKIAFPERRTKNGFDPKAYKLLSNAGYDFTTHTTFKSSKIF
DERPELSSTQKKLLKEGYMIPTSRKGLGYKSPEPIRITRKGKEKMADANHITIEEVDDSNKEGNVIRRTSVFRRVGPLATRLSTLQRIGVTAEESVLPGPTFDSTRSSIR
QRLRMPTDKEKRTSITPTLLQPSALQQLGTFTKERKNIPLTSNVKRPSAFHRLSVATSKSEGPSMSVFDLLQHDCLIETSKGKSSVETKDEQEIRCAFPSRMKRKTFVTV
NTEGVDEERYNI