; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038829 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038829
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:28298909..28306082
RNA-Seq ExpressionLag0038829
SyntenyLag0038829
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]2.4e-9134.32Show/hide
Query:  MADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARD
        MA++++   P+ ++DY +PV  G  S I+  PINANNFELK                            +IC T+KINGV++D IRLRLFPFSL+DKAR 
Subjt:  MADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARD

Query:  WLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW---------------------------DSV
        WLQS+ PGSI +W  + + FL KFFP AKT +LR+EIG F+Q   E L+EAWER+K+L+R C       W                              
Subjt:  WLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW---------------------------DSV

Query:  VEDRGKCLH--TSRGY-------------GVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE-SASALASTPQEETIEQ-------------------
         E     L    S  Y             G+ E++ ++AL AQ+ +L++     +  R  QS E  AS     P  E  ++                   
Subjt:  VEDRGKCLH--TSRGY-------------GVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE-SASALASTPQEETIEQ-------------------

Query:  ---------------------------------------LEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAA
                                               LED + +F+ E++ +  K +  +  I +  +   A +KN+E Q+GQL + +N   +G   +
Subjt:  ---------------------------------------LEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAA

Query:  EQEKTQMEYCKAITVHQEKADEE-PESEDYDTPTG--------EVEED----TSSDEAEKP-----EPEPPIPSPTLMVPKEKKKKRKKKNNQVQFDKFM
          E    E CKAIT+   K  E  P  E   TPT         +VEE+     + +E + P        PPI +P L  P+  +K++  K    QF KF+
Subjt:  EQEKTQMEYCKAITVHQEKADEE-PESEDYDTPTG--------EVEED----TSSDEAEKP-----EPEPPIPSPTLMVPKEKKKKRKKKNNQVQFDKFM

Query:  NAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLN
        + F  ++INIPF +ALE MP Y +F+K+ ++KKR+ ++  TV L+  CS  +Q+K+P+K+ DP SF++PC+ G  +F R LCDLGASIN++P  +C+KL 
Subjt:  NAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLN

Query:  IGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRVQERAPLLD-SKKRSLLEARST
        +GE+K   + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR +ID+++ ELT+RV +   + +  +     E  ST
Subjt:  IGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRVQERAPLLD-SKKRSLLEARST

Query:  C-RANDVKQAL
        C R + +KQ +
Subjt:  C-RANDVKQAL

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]5.8e-9334.6Show/hide
Query:  MADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARD
        MA++++   P+ ++DY +PV  G  S I+  PINANNFELK                            +IC T+KINGV++D IRLRLFPFSL+DKAR 
Subjt:  MADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARD

Query:  WLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW---------------------------DSV
        WLQS+ PGSI +W  + + FL KFFP AKT +LR+EIG F+Q   E L+EAWER+K+L+R C       W                              
Subjt:  WLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW---------------------------DSV

Query:  VEDRGKCLH--TSRGY-------------GVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE-SASALASTPQEETIEQ-------------------
         E     L    S  Y             G+ E++ ++AL AQ+ +L++     +  R  QS E  AS     P  E  ++                   
Subjt:  VEDRGKCLH--TSRGY-------------GVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE-SASALASTPQEETIEQ-------------------

Query:  ---------------------------------------LEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAA
                                               LED + +F+ E++ +  K +  +  I +  +   A +KN+E Q+GQL + +N   +G   +
Subjt:  ---------------------------------------LEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAA

Query:  EQEKTQMEYCKAITVHQEKADEE-PESEDYDTPTG--------EVEED----TSSDEAEKP-----EPEPPIPSPTLMVPKEKKKKRKKKNNQVQFDKFM
          E    E CKAIT+   K  E  P  E   TPT         +VEE+     + +E + P        PPI +P L  P+  +K++  K    QF KF+
Subjt:  EQEKTQMEYCKAITVHQEKADEE-PESEDYDTPTG--------EVEED----TSSDEAEKP-----EPEPPIPSPTLMVPKEKKKKRKKKNNQVQFDKFM

Query:  NAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLN
        + F  ++INIPF +ALE MP Y +F+K+ ++KKR+ ++  TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G  +F R LCDLGASIN++P S+C+KL 
Subjt:  NAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLN

Query:  IGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRVQERAPLLD-SKKRSLLEARST
        +GE+K   + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR +ID+++ ELT+RV +   + +  +     E  ST
Subjt:  IGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRVQERAPLLD-SKKRSLLEARST

Query:  C-RANDVKQAL
        C R + +KQ +
Subjt:  C-RANDVKQAL

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]5.4e-9935.23Show/hide
Query:  SRELISLDPEIERTILRIQRENREIIHMADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICG
        SR++I +DPEIERT+  ++R   +I+ MA++++   P+ ++DY +PV  G  S I+  PINANNFELK                            +IC 
Subjt:  SRELISLDPEIERTILRIQRENREIIHMADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICG

Query:  TIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW---
        T+KINGV++D IRLRLFPFSL+DKAR WLQS+ PGSI +W  + + FL KFFP AKT +LR+EIG F+Q   E L+EAWER+K+L+R C       W   
Subjt:  TIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW---

Query:  ------------------------DSVVEDRGKCLH--TSRGY-------------GVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE-SASALAST
                                    E     L    S  Y             G+ +++ ++AL AQ+ +L++     +  R  QS E  AS     
Subjt:  ------------------------DSVVEDRGKCLH--TSRGY-------------GVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE-SASALAST

Query:  PQEETIEQ----------------------------------------------------------LEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHS
        P  E  ++                                                          LED + +F+ E++ +  K +  +  I +  +   
Subjt:  PQEETIEQ----------------------------------------------------------LEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHS

Query:  AAIKNIETQLGQLVSVVNTMNKGKAAAEQEKTQMEYCKAITVHQEK-ADEEPESEDYDTPT--------GEVEED-TSSDEAEKPE--------PEPPIP
        AAIKNIE Q+GQL + +N   +G   +  E    E CKAIT+   K  +  P  E   TPT         +VEED   +D  E+ +          PPI 
Subjt:  AAIKNIETQLGQLVSVVNTMNKGKAAAEQEKTQMEYCKAITVHQEK-ADEEPESEDYDTPT--------GEVEED-TSSDEAEKPE--------PEPPIP

Query:  SPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFG
        +P L  P+  +K++  K    QF KF++ F  ++INIPF +ALE MP Y +F+K+ ++KKR+ ++  TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G
Subjt:  SPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFG

Query:  TYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERK
          +F + LCDLGASIN++PLS+C+KL + E+K   + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR +ID+++ 
Subjt:  TYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERK

Query:  ELTVRVQERAPLLD-SKKRSLLEARSTC-RANDVKQAL
        ELT+RV +   L    +   + E  STC R + +KQ +
Subjt:  ELTVRVQERAPLLD-SKKRSLLEARSTC-RANDVKQAL

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]2.2e-9236.08Show/hide
Query:  MRSSR--ELISLDPEIERT--ILR-IQRENREIIHMAD---QNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------
        MR +R  +L+ +DPE ERT  ILR IQR  RE +   D    N+  + + IRDY +PV     SGI    I A NFELK                     
Subjt:  MRSSR--ELISLDPEIERT--ILR-IQRENREIIHMAD---QNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------

Query:  ------TDICGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC
               +IC T+K+NGV++DAIRLRLF FSL+DKA+ W QS+P GSITTWD L Q FL K+FP +K+ +LR EI  F+Q   E  +EAWERFK+LLR C
Subjt:  ------TDICGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC

Query:  ------KWDSV------VEDRGKCLHTSRGYGVL------------------------------------EVDKLSALQAQMTSLANAFMKFSGTRSAQS
              KW  +      +  + + +  +   G+L                                    EVD ++AL AQ+ SL N  +  +   + Q+
Subjt:  ------KWDSV------VEDRGKCLHTSRGYGVL------------------------------------EVDKLSALQAQMTSLANAFMKFSGTRSAQS

Query:  IESASALASTPQEETI--EQ-------------------------------------------------------LEDLVGAFIAESSNKTTKLEEAVIA
        ++S  + +S+ QE  +  EQ                                                       LED++G FI+E+ ++  K E  +  
Subjt:  IESASALASTPQEETI--EQ-------------------------------------------------------LEDLVGAFIAESSNKTTKLEEAVIA

Query:  INSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQEKTQMEYCKAITVHQEKADEEPESEDYDTPTGEV-EEDTSSDEAEKPEPE------------
        I + V+   A +KN+E Q+GQL +++ +  KGK  ++ E    E+C AIT+   K  EE + +    PT +V   D    E +K E E            
Subjt:  INSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQEKTQMEYCKAITVHQEKADEEPESEDYDTPTGEV-EEDTSSDEAEKPEPE------------

Query:  ---PPIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSF
           PPI  P L  P+   KK+       QF KF+  F  ++INIPF E L +MP Y +F+KE ++ K+K ++  T+ L   CS  + QK+P K+ DPGSF
Subjt:  ---PPIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSF

Query:  SVPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRV
        ++PC+ G   F RALCD GASIN++PLS+ KKL +GE+K   + LQLAD+S+  P G+IE+VL++V +F LP+D  V+DM EN  IP+ILGRPFL TGR 
Subjt:  SVPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRV

Query:  IIDI
        +ID+
Subjt:  IIDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.2e-9034.14Show/hide
Query:  MADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARD
        MA+  Q  +P+ ++DY +P+     SGI    INANNFELK                            +IC TIK+NGV++D IRLRLFPFSL+DKAR 
Subjt:  MADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK---------------------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARD

Query:  WLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW------------------------------
        WLQS+ PGSIT+W  + + FL KFFP AKT +LR+EIG F+Q   E L+EAWER+K+L+R C       W                              
Subjt:  WLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNC------KW------------------------------

Query:  ----DSVVEDRGKCLH--------TSRGYGVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE--SASALASTPQEETIEQLEDLVG------------
             S++E+     +          +  G+ E++  +AL AQ+ SL++     +  R  Q  E  +AS++     E + EQ++ +              
Subjt:  ----DSVVEDRGKCLH--------TSRGYGVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIE--SASALASTPQEETIEQLEDLVG------------

Query:  -------------------------AFIAESSNKTTKLEEAVIAI----------------NSTVNCHS--AAIKNIETQLGQLVSVVNTMNKGKAAAEQ
                                  F ++ S K   LE+A+++                 N   +C +  A +KN+E Q+GQL + +N   +G   +  
Subjt:  -------------------------AFIAESSNKTTKLEEAVIAI----------------NSTVNCHS--AAIKNIETQLGQLVSVVNTMNKGKAAAEQ

Query:  EKTQMEYCKAITVHQ-EKADEEPESEDYDTPTG-------------EVEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNL
        E    E CKAIT+    + +  P  E   TPT              E+ EDT  +    P    P   P L  P    ++ +K+    QF KF++ F  +
Subjt:  EKTQMEYCKAITVHQ-EKADEEPESEDYDTPTG-------------EVEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNL

Query:  NINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKS
        +INIPF +ALE MP Y +F+K+ ++KKR+ ++  TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G  +F + LCDLGASIN++PLS+ +KL +GE+K 
Subjt:  NINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKS

Query:  ARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRV-QERAPLLDSKKRSLLEARSTCRANDV
          + LQLAD+S+  P GIIE+VL++V +F  P D  V+DM E+  +P+ILGRPFL TGR ++D+++ ELT+RV +E       +     E  STC   DV
Subjt:  ARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRV-QERAPLLDSKKRSLLEARSTCRANDV

TrEMBL top hitse value%identityAlignment
A0A5N6N163 Retrotrans_gag domain-containing protein7.4e-7834.48Show/hide
Query:  SSRELISLDPEIERTILRIQRENREIIHMADQNQPKEP--KPIRDYFQPVFQGQQSGIVYAPINANNFELKTDICGTIK--INGVSDDAIRLRLFPFSLQ
        ++ +L++   E ER   +  R  +E   MA  N    P  + I DY +P   G +S IV   + ANNF +   I   I+  INGVS D I+LRLFPFSL 
Subjt:  SSRELISLDPEIERTILRIQRENREIIHMADQNQPKEP--KPIRDYFQPVFQGQQSGIVYAPINANNFELKTDICGTIK--INGVSDDAIRLRLFPFSLQ

Query:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNCK------WDSV-----------VEDRGKCLHTS
        D+A  WL S+P GSI TW  +   FL ++FP +K  ++R+ I  + Q+  E   E WERFKELLR C       W  +           V    +C  T 
Subjt:  DKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNCK------WDSV-----------VEDRGKCLHTS

Query:  RG----YGVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIESASALASTPQEETIEQLEDLVGAFIAESS----NKTTKLEEAVIAINSTVNCHSAAIK
         G       ++ D L +L+       + F     + SA             Q E  E +E+++  ++A +        TK +E  I + +      AAI+
Subjt:  RG----YGVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIESASALASTPQEETIEQLEDLVGAFIAESS----NKTTKLEEAVIAINSTVNCHSAAIK

Query:  NIETQLGQLVSVVNTMNKGKAAAEQEKTQMEYCKAITVHQEKADEEPESEDYDTPTGEVEEDTSSD-----------------EAEKPEPEPPIPSPTLM
        NIE  LGQ+ + ++   +G   A  EK   E+ KA+     K  +  E+     P  E EE+T  +                 E  KP  +PP+P P   
Subjt:  NIETQLGQLVSVVNTMNKGKAAAEQEKTQMEYCKAITVHQEKADEEPESEDYDTPTGEVEEDTSSD-----------------EAEKPEPEPPIPSPTLM

Query:  VPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTY-YF
               + +K+  + ++ KF+  F  L+IN+PFVEAL +MP+Y +F+K+ L  K+K ++ +TV L+  CS  +Q K+P K++DPGSF++PC   +    
Subjt:  VPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTY-YF

Query:  RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVR
         AL DLGA+IN++P S+  KL++GE    R+ +QLAD+SV  P GI+EN+L++VG+F  P D  ++DM ++  +P+ILGRPFL T R +ID+   +LT+R
Subjt:  RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVR

Query:  VQERAPLLDSKK
        V E     D KK
Subjt:  VQERAPLLDSKK

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129455.3e-8433.43Show/hide
Query:  RSSRELISLDPEIERTILRIQRENREII----HMADQNQ----------PKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK----------------
        R++  L+  DP+IERT  R +REN ++      MA+ N           P+  + +RDY  P+ QG    I    INANNFE+K                
Subjt:  RSSRELISLDPEIERTILRIQRENREII----HMADQNQ----------PKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELK----------------

Query:  ------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFK
                     +IC T K NGV+DDAIRLRLFPFSL+DKA+ WL S+P GSITTW+ L Q FL KFFP AKT K+R +I +F Q   E L+EAWERFK
Subjt:  ------------TDICGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFK

Query:  ELLRNCK-------------WDSVVEDRGKCLHTSRGYGVLEVDKLSALQAQMTSLANAFM---KFSGTRSAQSIESASALAS-TPQEETIEQLEDLVGA
        ELLR C              ++ +V      +  + G  ++  + + A        +N +    + SG+R A       AL + T Q   + +  D +G 
Subjt:  ELLRNCK-------------WDSVVEDRGKCLHTSRGYGVLEVDKLSALQAQMTSLANAFM---KFSGTRSAQSIESASALAS-TPQEETIEQLEDLVGA

Query:  FIAESS-------------------------------------------------------------------------------NKTTKLEEAVI----
           ++S                                                                                K ++LEE ++    
Subjt:  FIAESS-------------------------------------------------------------------------------NKTTKLEEAVI----

Query:  AINSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQE--KTQMEYCKAITVHQEKADE-------EPESEDYDTPTGEVEEDTSSDEAEKPEPEPPI
          ++ +    A+++N+ETQ+GQL + +N   +G   ++ +      E C+AIT+   K  E       E E E  D   G  E +    + +  + E   
Subjt:  AINSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQE--KTQMEYCKAITVHQEKADE-------EPESEDYDTPTGEVEEDTSSDEAEKPEPEPPI

Query:  PSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF
         S  +  P    ++ +K+  + QF KF+N F  L+INIPF EALE MP Y +F+K+ L+KKRK  +  TV+L   CS  +Q K+P K+ DPGSF++PC+ 
Subjt:  PSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEALE-MPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF

Query:  GTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIER
        G  +F +AL DLGASIN++P S+ +KL +GE K   V LQLAD+S V P GIIE+VL++V +F  P+D  ++DM E+  IP+ILGRPFL T   IID+  
Subjt:  GTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIER

Query:  KELTVRVQE
         +++ +V E
Subjt:  KELTVRVQE

A0A6P6X9H2 Reverse transcriptase7.6e-7533.91Show/hide
Query:  DICGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNCKWDSVVE
        +IC TIK+NGVSD+AIRLRLFPFSL+DKA+ WL S  P + TTWD L +AFL K+FP  KT KLR +I  F Q   E L+E WERF++LLR C    + E
Subjt:  DICGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNCKWDSVVE

Query:  -------------DRGKCLHTSRG---YGVLEVDKLSALQAQMTSLANAFMKFSG-------------------------------------TRSAQSIE
                          +  + G    G++E+D L+ L AQM ++     +  G                                      R+AQ+  
Subjt:  -------------DRGKCLHTSRG---YGVLEVDKLSALQAQMTSLANAFMKFSG-------------------------------------TRSAQSIE

Query:  SASAL----------------------------ASTPQEETIEQLEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHSAAIKNIETQLGQLVSVVNTMNK
         ++                              +  PQ ET    E  V      +S++  ++E  +  +       +   +N+E Q+GQ+ S +N  N+
Subjt:  SASAL----------------------------ASTPQEETIEQLEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHSAAIKNIETQLGQLVSVVNTMNK

Query:  GKAAAEQEKTQMEYCKAITVHQEKADEEP--------ESEDYDTPTGEVEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKRKKKNNQV--QFDKFMNAF
        G+  ++ E    E+ KAIT+   K  E+P        ESE+        E     +  + P    P  S  + +P      ++ K N+    F+KF+  F
Subjt:  GKAAAEQEKTQMEYCKAITVHQEKADEEP--------ESEDYDTPTGEVEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKRKKKNNQV--QFDKFMNAF

Query:  MNLNINIPFVEA-LEMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGE
          L+INIPF +A L++P Y +F+KE + +KRK +   T+ L   CS  +Q K+P K+ DPGSFS+PC+ G+  F +ALCDLGAS+++IPL++ ++L + E
Subjt:  MNLNINIPFVEA-LEMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGE

Query:  IKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRVQE
        +K   + LQLAD+S+  P+G++ENVLI+V +F +P+D  V+DM E+ S+P+ILGRPFL T   IID++  +L  ++ E
Subjt:  IKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKELTVRVQE

A0A6P8DD93 uncharacterized protein LOC1162064536.9e-7632.02Show/hide
Query:  MRSSR--ELISLDPEIERTILRIQRENR-----EIIHMADQNQPKE----PKPIRDYFQPVFQGQQSGIVYAPINANNFELKTDI---------------
        MR SR  EL+ LDPEIERT+ R++RENR     +++ MAD +  ++     + +RDY  P   G  S I    I ANNFELK  +               
Subjt:  MRSSR--ELISLDPEIERTILRIQRENR-----EIIHMADQNQPKE----PKPIRDYFQPVFQGQQSGIVYAPINANNFELKTDI---------------

Query:  ------------CGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELL
                    C T+K+N V+DD IRL+LFPFSL+DKAR W  S+P  SITTW  L   FL++FFP A+T +LR EI  F +   E L+EAWERFKE +
Subjt:  ------------CGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RNC----------------------------------------KWDSVVEDRGKCLHT-------SRGYGVLEVDKLSALQAQMTSLANAFMKFSGTRSA
        R C                                        +  +++E+     H        SR   V ++D ++ L  Q+++L     K +   S 
Subjt:  RNC----------------------------------------KWDSVVEDRGKCLHT-------SRGYGVLEVDKLSALQAQMTSLANAFMKFSGTRSA

Query:  QSIESASALASTPQEETIEQLEDLVGA--------FI------------------------------------------------AESSNKTTKLEEAVI
         + + A     +    T+E +     A        F+                                                A      +++EE ++
Subjt:  QSIESASALASTPQEETIEQLEDLVGA--------FI------------------------------------------------AESSNKTTKLEEAVI

Query:  A----INSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQEKT-------QMEYCKAITVHQEKADEEPESEDYDTPTGEVEEDTSSDEAEKPEPEP
        +     ++ +    A I+N+E Q+ Q+   ++    G   +  E+         +   K + +   KA  + ES + D    +VEE        KP   P
Subjt:  A----INSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQEKT-------QMEYCKAITVHQEKADEEPESEDYDTPTGEVEEDTSSDEAEKPEPEP

Query:  PIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQ---KVPEKVADPGSFS
        P+P P         ++ K++    QF KF++ F  L INIPF EAL +MP Y RFMK+ L KKRK      V L   CS  +Q+    +P K  D GSF+
Subjt:  PIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQ---KVPEKVADPGSFS

Query:  VPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVI
        VPC+ G ++F   L D GASIN++PLS+ +KL +GE K   V LQLAD+S+  P GI+ENVL++V +F  P+D  V++M E+  +P+ILGRPFL TG+ +
Subjt:  VPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVI

Query:  IDIERKELTVRV
        ID+E+ +LT+RV
Subjt:  IDIERKELTVRV

A0A6P8DKJ2 uncharacterized protein LOC1162042313.4e-7531.88Show/hide
Query:  MRSSR--ELISLDPEIERTILRIQRENR-----EIIHMADQNQPKE----PKPIRDYFQPVFQGQQSGIVYAPINANNFELKTDI---------------
        MR SR  EL+ LDPEIERT+ R++RENR     +++ MAD +  ++     + +RDY  P   G  S I    I ANNFELK  +               
Subjt:  MRSSR--ELISLDPEIERTILRIQRENR-----EIIHMADQNQPKE----PKPIRDYFQPVFQGQQSGIVYAPINANNFELKTDI---------------

Query:  ------------CGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELL
                    C T+K+N V+DD IRL+LFPFSL+DKAR W  S+P  SITTW  L   FL++FFP A+T +LR EI  F +   E L+EAWERFKE +
Subjt:  ------------CGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPPGSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RNC----------------------------------------KWDSVVEDRGKCLHT-------SRGYGVLEVDKLSALQAQMTSLANAFMKFSGTRSA
        R C                                        +  +++E+     H        SR   V ++D ++ L  Q+++L     K +   S 
Subjt:  RNC----------------------------------------KWDSVVEDRGKCLHT-------SRGYGVLEVDKLSALQAQMTSLANAFMKFSGTRSA

Query:  QSIESASALASTPQEETIEQLEDLVGA--------FI------------------------------------------------AESSNKTTKLEEAVI
         + + A     +    T+E +     A        F+                                                A      +++EE ++
Subjt:  QSIESASALASTPQEETIEQLEDLVGA--------FI------------------------------------------------AESSNKTTKLEEAVI

Query:  A----INSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQEKT-------QMEYCKAITVHQEKADEEPESEDYDTPTGEVEEDTSSDEAEKPEPEP
        +     ++ +    A I+N+E Q+ Q+   ++    G   +  E+         +   K + +   KA  + ES + D    +VEE        KP   P
Subjt:  A----INSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQEKT-------QMEYCKAITVHQEKADEEPESEDYDTPTGEVEEDTSSDEAEKPEPEP

Query:  PIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQ---KVPEKVADPGSFS
        P+P P          + K++    QF KF++ F  L INIPF EAL +MP Y RFMK+ L KKRK      V L   CS  +Q+    +P K  D GSF+
Subjt:  PIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEAL-EMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQ---KVPEKVADPGSFS

Query:  VPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVI
        VPC+ G ++F   L D GASIN++PLS+ +KL +GE K   + LQLAD+S+  P GI+ENVL++V +F  P+D  V++M E+  +P+ILGRPFL TG+ +
Subjt:  VPCSFGTYYF-RALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVI

Query:  IDIERKELTVRV
        ID+E+ +LT+RV
Subjt:  IDIERKELTVRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAGTAGTAGAGAATTGATATCGTTGGATCCCGAGATCGAGAGAACAATTCTTAGGATTCAAAGAGAAAATAGAGAAATTATTCACATGGCTGACCAAAATCAACC
TAAGGAGCCTAAGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGACA
TTTGTGGGACGATAAAAATTAATGGAGTCTCAGATGATGCTATTCGCTTACGCTTATTTCCTTTTTCTTTGCAAGATAAAGCACGAGATTGGTTGCAGTCTATTCCCCCT
GGGAGCATCACCACCTGGGATGCTTTAGTCCAGGCCTTTTTGAAGAAATTTTTCCCTCTTGCAAAGACGGACAAGTTGAGGACCGAGATTGGGACATTCCAACAACAATA
TGATGAGCAGCTGTTTGAGGCTTGGGAGCGATTCAAAGAGTTGTTGAGGAACTGCAAGTGGGACTCTGTTGTCGAAGACCGTGGAAAATGCTTGCATACTTCTAGAGGAT
ATGGAGTGTTGGAGGTTGATAAGTTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACACGGAGTGCTCAATCAATTGAATCA
GCTTCTGCTTTGGCATCTACACCTCAGGAGGAGACCATTGAACAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAAGACAACCAAATTAGAGGAGGC
AGTCATTGCCATCAACTCAACAGTGAATTGCCACAGTGCTGCCATCAAGAACATTGAAACTCAGTTGGGACAGTTGGTAAGTGTCGTAAACACCATGAATAAAGGTAAGG
CCGCAGCTGAGCAGGAGAAAACCCAGATGGAATATTGTAAGGCAATCACTGTGCACCAGGAGAAAGCTGACGAGGAGCCTGAGTCTGAGGACTATGACACGCCTACAGGG
GAAGTTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAGGAAGAA
AAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGTAGAGGCATTAGAGATGCCCCAGTATAATAGGTTCATGA
AGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTAACACTGTATATCTTGCTTCCACATGTAGCACCAGAGTACAACAGAAGGTACCTGAAAAAGTAGCAGATCCA
GGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTATTATTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATTATTCCTTTGTCCTTGTGTAAAAAGTTAAACAT
AGGTGAGATTAAATCTGCCCGTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTTAAACCAGTTGGCATTATAGAAAATGTTTTAATTAGAGTAGGTAGATTTTTCCTCC
CTATTGATTTGTATGTTATGGACATGATAGAAAATCCTTCAATTCCTGTCATATTAGGAAGACCATTCCTCGATACTGGGCGAGTGATTATTGATATTGAGCGCAAGGAG
CTCACTGTTAGAGTGCAAGAAAGAGCACCTCTGTTGGATTCAAAGAAAAGAAGCCTCCTTGAAGCACGGTCAACATGTCGAGCTAATGACGTTAAACAAGCGCTTATGGG
AGGCAACCCAACCTCCAAGTGTTCTGTTCTGCTCTGTTTACGGCTCCAAAAGGTCCCAAATCGAACTCCCTCAAAACCTAGTCTCGACGCTGCCTTAAAAACGTGCGTTT
CAGTAAGCGAAATAACACAGCGTCGAGACGCTGTGACCTTTACGCGCCTAATCAGATATGGAATGACAAAGAGAGATACTGAGGAAGAGGAAGTGACCATTACGCCTGAG
GCACCGAAGACAATGGCAAAGAGAAGGAAAACGCCGGAAGAGAGAGAGGCTAAGAGACGAAGAAGACAACAGAGAGCTGAGGTTACGAAAGTAGTGAGAAATGTGATTGA
GGACATTGCTGATGAAGCGGTTGAGGAAGAACGACCAAAGGAACCTAAGGAAAAGAAAGATCCTGAAAAGAATTCCACGTCGTCGCCGTCGAAAGCAAAAGGCAAGCCGA
ATCAAGGTAATCAGGACAGACACTCCATTGCGGTTGACGACAGAATCTCAGAAAGAAGAACGAGAGAAAAGGGGGCAGAGGACCAGGAAAGAGAAGAAAGGGAAGAGAAA
GTAGAAGAGGAAGCTTTGGTGAAGCATCAAGAAGACAAGGGTGGAGTCAATTCTGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAGTAGTAGAGAATTGATATCGTTGGATCCCGAGATCGAGAGAACAATTCTTAGGATTCAAAGAGAAAATAGAGAAATTATTCACATGGCTGACCAAAATCAACC
TAAGGAGCCTAAGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGACA
TTTGTGGGACGATAAAAATTAATGGAGTCTCAGATGATGCTATTCGCTTACGCTTATTTCCTTTTTCTTTGCAAGATAAAGCACGAGATTGGTTGCAGTCTATTCCCCCT
GGGAGCATCACCACCTGGGATGCTTTAGTCCAGGCCTTTTTGAAGAAATTTTTCCCTCTTGCAAAGACGGACAAGTTGAGGACCGAGATTGGGACATTCCAACAACAATA
TGATGAGCAGCTGTTTGAGGCTTGGGAGCGATTCAAAGAGTTGTTGAGGAACTGCAAGTGGGACTCTGTTGTCGAAGACCGTGGAAAATGCTTGCATACTTCTAGAGGAT
ATGGAGTGTTGGAGGTTGATAAGTTAAGTGCACTCCAGGCCCAGATGACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACACGGAGTGCTCAATCAATTGAATCA
GCTTCTGCTTTGGCATCTACACCTCAGGAGGAGACCATTGAACAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAAGACAACCAAATTAGAGGAGGC
AGTCATTGCCATCAACTCAACAGTGAATTGCCACAGTGCTGCCATCAAGAACATTGAAACTCAGTTGGGACAGTTGGTAAGTGTCGTAAACACCATGAATAAAGGTAAGG
CCGCAGCTGAGCAGGAGAAAACCCAGATGGAATATTGTAAGGCAATCACTGTGCACCAGGAGAAAGCTGACGAGGAGCCTGAGTCTGAGGACTATGACACGCCTACAGGG
GAAGTTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAGGAAGAA
AAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGTAGAGGCATTAGAGATGCCCCAGTATAATAGGTTCATGA
AGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTAACACTGTATATCTTGCTTCCACATGTAGCACCAGAGTACAACAGAAGGTACCTGAAAAAGTAGCAGATCCA
GGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTATTATTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATTATTCCTTTGTCCTTGTGTAAAAAGTTAAACAT
AGGTGAGATTAAATCTGCCCGTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTTAAACCAGTTGGCATTATAGAAAATGTTTTAATTAGAGTAGGTAGATTTTTCCTCC
CTATTGATTTGTATGTTATGGACATGATAGAAAATCCTTCAATTCCTGTCATATTAGGAAGACCATTCCTCGATACTGGGCGAGTGATTATTGATATTGAGCGCAAGGAG
CTCACTGTTAGAGTGCAAGAAAGAGCACCTCTGTTGGATTCAAAGAAAAGAAGCCTCCTTGAAGCACGGTCAACATGTCGAGCTAATGACGTTAAACAAGCGCTTATGGG
AGGCAACCCAACCTCCAAGTGTTCTGTTCTGCTCTGTTTACGGCTCCAAAAGGTCCCAAATCGAACTCCCTCAAAACCTAGTCTCGACGCTGCCTTAAAAACGTGCGTTT
CAGTAAGCGAAATAACACAGCGTCGAGACGCTGTGACCTTTACGCGCCTAATCAGATATGGAATGACAAAGAGAGATACTGAGGAAGAGGAAGTGACCATTACGCCTGAG
GCACCGAAGACAATGGCAAAGAGAAGGAAAACGCCGGAAGAGAGAGAGGCTAAGAGACGAAGAAGACAACAGAGAGCTGAGGTTACGAAAGTAGTGAGAAATGTGATTGA
GGACATTGCTGATGAAGCGGTTGAGGAAGAACGACCAAAGGAACCTAAGGAAAAGAAAGATCCTGAAAAGAATTCCACGTCGTCGCCGTCGAAAGCAAAAGGCAAGCCGA
ATCAAGGTAATCAGGACAGACACTCCATTGCGGTTGACGACAGAATCTCAGAAAGAAGAACGAGAGAAAAGGGGGCAGAGGACCAGGAAAGAGAAGAAAGGGAAGAGAAA
GTAGAAGAGGAAGCTTTGGTGAAGCATCAAGAAGACAAGGGTGGAGTCAATTCTGTGTGA
Protein sequenceShow/hide protein sequence
MRSSRELISLDPEIERTILRIQRENREIIHMADQNQPKEPKPIRDYFQPVFQGQQSGIVYAPINANNFELKTDICGTIKINGVSDDAIRLRLFPFSLQDKARDWLQSIPP
GSITTWDALVQAFLKKFFPLAKTDKLRTEIGTFQQQYDEQLFEAWERFKELLRNCKWDSVVEDRGKCLHTSRGYGVLEVDKLSALQAQMTSLANAFMKFSGTRSAQSIES
ASALASTPQEETIEQLEDLVGAFIAESSNKTTKLEEAVIAINSTVNCHSAAIKNIETQLGQLVSVVNTMNKGKAAAEQEKTQMEYCKAITVHQEKADEEPESEDYDTPTG
EVEEDTSSDEAEKPEPEPPIPSPTLMVPKEKKKKRKKKNNQVQFDKFMNAFMNLNINIPFVEALEMPQYNRFMKEWLAKKRKEKKVNTVYLASTCSTRVQQKVPEKVADP
GSFSVPCSFGTYYFRALCDLGASINIIPLSLCKKLNIGEIKSARVKLQLADQSVVKPVGIIENVLIRVGRFFLPIDLYVMDMIENPSIPVILGRPFLDTGRVIIDIERKE
LTVRVQERAPLLDSKKRSLLEARSTCRANDVKQALMGGNPTSKCSVLLCLRLQKVPNRTPSKPSLDAALKTCVSVSEITQRRDAVTFTRLIRYGMTKRDTEEEEVTITPE
APKTMAKRRKTPEEREAKRRRRQQRAEVTKVVRNVIEDIADEAVEEERPKEPKEKKDPEKNSTSSPSKAKGKPNQGNQDRHSIAVDDRISERRTREKGAEDQEREEREEK
VEEEALVKHQEDKGGVNSV