; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000809 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000809
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr4:17120812..17131435
RNA-Seq ExpressionLag0000809
SyntenyLag0000809
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]1.1e-9734.73Show/hide
Query:  MADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ------------------
        MA++++   PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+I   V   G ++                  
Subjt:  MADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ------------------

Query:  --------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
                 I +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  --------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF---------------------------------------------
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L++                                                  
Subjt:  VENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF---------------------------------------------

Query:  -------------------------------SESSNRTTKLEEAVIA------------------INTTVNGHSAAIKNIETHLGQLVNVVSTMNKG---
                                       S+ S +   LE+A+++                  I T  +   A +KN+E  +GQL   ++   +G   
Subjt:  -------------------------------SESSNRTTKLEEAVIA------------------INTTVNGHSAAIKNIETHLGQLVNVVSTMNKG---

Query:  -----------------------------------KARTEQEKTQMEYWEAEEDTSSDEAEKPN----PEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKF
                                            A   Q K Q+E  E   DT  +    P       PPI +P L  P+  +K+   K    QF KF
Subjt:  -----------------------------------KARTEQEKTQMEYWEAEEDTSSDEAEKPN----PEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKF

Query:  MNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF++PC+ G   F R LCDLGASIN++P  +C+KL
Subjt:  MNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+    GI+E+VL++V                        GRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]2.6e-9935.01Show/hide
Query:  MADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ------------------
        MA++++   PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+I   V   G ++                  
Subjt:  MADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ------------------

Query:  --------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
                 I +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  --------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF---------------------------------------------
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ+ +L++                                                  
Subjt:  VENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF---------------------------------------------

Query:  -------------------------------SESSNRTTKLEEAVIA------------------INTTVNGHSAAIKNIETHLGQLVNVVSTMNKG---
                                       S+ S +   LE+A+++                  I T  +   A +KN+E  +GQL   ++   +G   
Subjt:  -------------------------------SESSNRTTKLEEAVIA------------------INTTVNGHSAAIKNIETHLGQLVNVVSTMNKG---

Query:  -----------------------------------KARTEQEKTQMEYWEAEEDTSSDEAEKPN----PEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKF
                                            A   Q K Q+E  E   DT  +    P       PPI +P L  P+  +K+   K    QF KF
Subjt:  -----------------------------------KARTEQEKTQMEYWEAEEDTSSDEAEKPN----PEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKF

Query:  MNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F R LCDLGASIN++P S+C+KL
Subjt:  MNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+    GI+E+VL++V                        GRPFLATGR +ID+++ ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]9.9e-10734.68Show/hide
Query:  SRELIPLDPEIERTIHRIRRENREIIHMADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI--
        SR++IP+DPEIERT+  +RR   +I+ MA++++   PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+I  
Subjt:  SRELIPLDPEIERTIHRIRRENREIIHMADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI--

Query:  -VYSPGASQ--------------------------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQV
         V   G ++                           I +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQV
Subjt:  -VYSPGASQ--------------------------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQV

Query:  QLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF------------------
        Q+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ+ +L++                       
Subjt:  QLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF------------------

Query:  ----------------------------------------------------------SESSNRTTKLEEAVIA------------------INTTVNGH
                                                                  S+ S R   LE+A+++                  I T  +  
Subjt:  ----------------------------------------------------------SESSNRTTKLEEAVIA------------------INTTVNGH

Query:  SAAIKNIETHLGQLVNVVSTMNKG--KARTE------------------------------------QEKTQMEYWEAEEDTSSDEAEKPN----PEPPI
         AAIKNIE  +GQL   ++   +G   + TE                                    Q K ++E  E   DT  +    P       PPI
Subjt:  SAAIKNIETHLGQLVNVVSTMNKG--KARTE------------------------------------QEKTQMEYWEAEEDTSSDEAEKPN----PEPPI

Query:  PSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSF
         +P L  P+  +K+   K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ 
Subjt:  PSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSF

Query:  GTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIER
        G   F + LCDLGASIN++PLS+C+KL + E+K T + LQLAD+S+    GI+E+VL++V                        GRPFLATGR +ID+++
Subjt:  GTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIER

Query:  RELTIRVRNEKEIFKAVEDS----------KDEVLFMGYKKGARKSTSVGFTKRSLLEARSTRRANDVKQ
         ELT+RV  E+ +FK  +            + +V+  G +K  ++       +R+L   + T  +N+V++
Subjt:  RELTIRVRNEKEIFKAVEDS----------KDEVLFMGYKKGARKSTSVGFTKRSLLEARSTRRANDVKQ

XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica]6.2e-9334.15Show/hide
Query:  RSSRELIPLDPEIERTIHRIRRENREII----HMADQNQ----------PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS
        R++  L+P DP+IERT  R RREN ++      MA+ N           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    + G 
Subjt:  RSSRELIPLDPEIERTIHRIRRENREII----HMADQNQ----------PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS

Query:  PTEDPNSHLKSFLDIVYS-------------------------------PGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWER
        P++DPNSHL +FL+I  +                               P  S  ITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWER
Subjt:  PTEDPNSHLKSFLDIVYS-------------------------------PGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWER

Query:  FKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF--
        FKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K A G +E+D +  L  Q+ +L+     
Subjt:  FKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF--

Query:  ---------------------------------------------------------MKFSESSN------------------------RTTKLEEAVI-
                                                                   FS S+N                        + ++LEE ++ 
Subjt:  ---------------------------------------------------------MKFSESSN------------------------RTTKLEEAVI-

Query:  ---AINTTVNGHSAAIKNIETHLGQLVNVVSTMNKGKARTEQE-----------------------------------------KTQMEYWEAEEDTSSD
             +  +    A+++N+ET +GQL N ++   +G   ++ +                                         + ++E  + ++D + +
Subjt:  ---AINTTVNGHSAAIKNIETHLGQLVNVVSTMNKGKARTEQE-----------------------------------------KTQMEYWEAEEDTSSD

Query:  EAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVA
        +       PP P P         ++ +K+  + QF KF+N F  ++INIPFAEALE MP Y +F+K+ L+KKRK  + +TV+L   CS  +Q K+P K+ 
Subjt:  EAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVA

Query:  DPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFL
        DPGSF++PC+ G   F +AL DLGASIN++P S+ +KL +GE K T V LQLAD+S V   GI+E+VL++V                        GRPFL
Subjt:  DPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFL

Query:  ATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
        AT   IID+   +++ +V  E   F     SK
Subjt:  ATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]3.2e-9734.88Show/hide
Query:  MADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ------------------
        MA+  Q  +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+I   +   G ++                  
Subjt:  MADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ------------------

Query:  --------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
                 IT+W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  --------CITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF---------------------------------------------
         E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +AL AQ+ SL++                                                  
Subjt:  VENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKF---------------------------------------------

Query:  -----------------------------SESSNRTTKLEEAVIA------------------INTTVNGHSAAIKNIETHLGQLVNVVSTMNKG--KAR
                                     S+ S +   LE+A+++                  I T  +   A +KN+E  +GQL   ++   +G   + 
Subjt:  -----------------------------SESSNRTTKLEEAVIA------------------INTTVNGHSAAIKNIETHLGQLVNVVSTMNKG--KAR

Query:  TE------------------------------------QEKTQMEYWEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMN
        TE                                    Q K ++E  E  EDT  +    P+   P   P L  P    ++ +K+    QF KF++ F  
Subjt:  TE------------------------------------QEKTQMEYWEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMN

Query:  MNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIK
        ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+ +KL +GE+K
Subjt:  MNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIK

Query:  STPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
         T + LQLAD+S+    GI+E+VL++V                        GRPFLATGR ++D+++ ELT+RV  E+  F   E  K
Subjt:  STPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.0e-9334.15Show/hide
Query:  RSSRELIPLDPEIERTIHRIRRENREII----HMADQNQ----------PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS
        R++  L+P DP+IERT  R RREN ++      MA+ N           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    + G 
Subjt:  RSSRELIPLDPEIERTIHRIRRENREII----HMADQNQ----------PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS

Query:  PTEDPNSHLKSFLDIVYS-------------------------------PGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWER
        P++DPNSHL +FL+I  +                               P  S  ITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWER
Subjt:  PTEDPNSHLKSFLDIVYS-------------------------------PGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWER

Query:  FKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF--
        FKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K A G +E+D +  L  Q+ +L+     
Subjt:  FKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF--

Query:  ---------------------------------------------------------MKFSESSN------------------------RTTKLEEAVI-
                                                                   FS S+N                        + ++LEE ++ 
Subjt:  ---------------------------------------------------------MKFSESSN------------------------RTTKLEEAVI-

Query:  ---AINTTVNGHSAAIKNIETHLGQLVNVVSTMNKGKARTEQE-----------------------------------------KTQMEYWEAEEDTSSD
             +  +    A+++N+ET +GQL N ++   +G   ++ +                                         + ++E  + ++D + +
Subjt:  ---AINTTVNGHSAAIKNIETHLGQLVNVVSTMNKGKARTEQE-----------------------------------------KTQMEYWEAEEDTSSD

Query:  EAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVA
        +       PP P P         ++ +K+  + QF KF+N F  ++INIPFAEALE MP Y +F+K+ L+KKRK  + +TV+L   CS  +Q K+P K+ 
Subjt:  EAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVA

Query:  DPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFL
        DPGSF++PC+ G   F +AL DLGASIN++P S+ +KL +GE K T V LQLAD+S V   GI+E+VL++V                        GRPFL
Subjt:  DPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFL

Query:  ATGRVIIDIERRELTIRVRNEKEIFKAVEDSK
        AT   IID+   +++ +V  E   F     SK
Subjt:  ATGRVIIDIERRELTIRVRNEKEIFKAVEDSK

A0A6J1DU19 uncharacterized protein LOC1110243614.1e-9040.04Show/hide
Query:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDIVYSPGASQCITTWDA--------------LVQAFLKKFFPP
        IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FLD+  +   +  I   DA              +VQAFL  FFPP
Subjt:  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDIVYSPGASQCITTWDA--------------LVQAFLKKFFPP

Query:  AKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKK
        AKT +LRTEI +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA ILL+DMA NS+QWPSERS  KK
Subjt:  AKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKK

Query:  IAAGVFEVDKVSALQAQMTSLANAFMKFS-------------------------------------------------ESSNRTTKLEEAVIAINTTVNG
        + AG++E+D++S+L+AQ+ +L NA  K S                                                 E  +R +++E  V  +   + G
Subjt:  IAAGVFEVDKVSALQAQMTSLANAFMKFS-------------------------------------------------ESSNRTTKLEEAVIAINTTVNG

Query:  HSAAIKNIETHLGQLVNVVSTMNKGKARTEQEKTQMEYWEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKN--KKKNNQVQFDKFMNAFMNMNIN-I
        ++ +IKN+E  +GQ+   ++TM KGK  ++ E    E+ +A    S  E ++P  +  +  P +   + + K+   K+    +Q DK  ++ ++   N +
Subjt:  HSAAIKNIETHLGQLVNVVSTMNKGKARTEQEKTQMEYWEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKN--KKKNNQVQFDKFMNAFMNMNIN-I

Query:  PFAE-ALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCK----KLDIGEIK
        P+ + ALE MP Y RFMK+ +  KRK +  +TV L   CS  +Q+K+P+K+ DPGSF++PC+  + SF +ALCD+ ASIN++PL + +    K+D     
Subjt:  PFAE-ALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCK----KLDIGEIK

Query:  STPVKLQLADQSLVRLVGIVENVLIRVGRPFLATGRVIIDIERRELTIRVRNEKEIF
        +  V L   + S + ++         +GR FLATG  +ID++   LT+RV  E  +F
Subjt:  STPVKLQLADQSLVRLVGIVENVLIRVGRPFLATGRVIIDIERRELTIRVRNEKEIF

A0A6P6XAQ1 Reverse transcriptase3.2e-7933.59Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ--------------------------CI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+I   +   G S+                            
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDI---VYSPGASQ--------------------------CI

Query:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLED
        TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+
Subjt:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLED

Query:  MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLAN------------------------------------------------------------
        MA N+YQW +ER   ++  AG+ EVD ++ L A+M ++                                                              
Subjt:  MATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLAN------------------------------------------------------------

Query:  --------------------------------------AFMKFSESSN-RTTKLEEAVIAINTTVNGHSAAI----KNIETHLGQLVNVVSTMNKGKART
                                              A  K + +SN +  KL  A       + G    +    +N+E  LGQ+ N V+  N+G   +
Subjt:  --------------------------------------AFMKFSESSN-RTTKLEEAVIAINTTVNGHSAAI----KNIETHLGQLVNVVSTMNKGKART

Query:  EQEKTQMEYWEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQ---FDKFMNAFMNMNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVD
        + E    E+ +A    S  E  +P   P + S      +E KK ++ K    +    +K     + M    P    +  P Y +F+KE + KKRK    +
Subjt:  EQEKTQMEYWEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQ---FDKFMNAFMNMNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVD

Query:  TVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV--------
        T+ L   CS  +Q K+P K+ DPGSF+VPC+ G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+   +GI+ENVLI+V        
Subjt:  TVYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV--------

Query:  ----------------GRPFLATGRVIIDIERRELTIRVRNEKEIF
                        GRPFLAT   IID++R +   ++  E+  F
Subjt:  ----------------GRPFLATGRVIIDIERRELTIRVRNEKEIF

A0A6P8DD93 uncharacterized protein LOC1162064532.0e-8132.87Show/hide
Query:  MRSSR--ELIPLDPEIERTIHRIRRENR-----EIIHMADQNQPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR SR  EL+PLDPEIERT+HR+RRENR     +++ MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSR--ELIPLDPEIERTIHRIRRENR-----EIIHMADQNQPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFL-------------DIV----------------YSPGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL
         P+ H+  FL             D++                ++    + ITTW  L   FL++FFPPA+T +LR EI  F +   E L+EAWERFKE +
Subjt:  DPNSHLKSFL-------------DIV----------------YSPGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELL

Query:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVD-------KVSALQAQMTSLANA-
        RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W +ERS  K   A V ++D       ++SAL  Q++ L +A 
Subjt:  RKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVD-------KVSALQAQMTSLANA-

Query:  -------------------------------------FMKFSESSNR----------------------------------------------TTKLEEA
                                             F+   + SN+                                               +++EE 
Subjt:  -------------------------------------FMKFSESSNR----------------------------------------------TTKLEEA

Query:  VIA----INTTVNGHSAAIKNIETHLGQLVNVVSTMNKGK--ARTEQ--------------------EKTQMEYWEAEEDTSSDEAEKPNPE--------
        +++     +T +    A I+N+E  + Q+   +S    G   + TE+                     K Q +    E+D    + E+P  +        
Subjt:  VIA----INTTVNGHSAAIKNIETHLGQLVNVVSTMNKGK--ARTEQ--------------------EKTQMEYWEAEEDTSSDEAEKPNPE--------

Query:  PPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQ---KVPKKVADPGSF
        PP+P P         ++ K++    QF KF++ F  + INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS  +Q+    +P+K  D GSF
Subjt:  PPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQ---KVPKKVADPGSF

Query:  SVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRV
        +VPC+ G + F   L D GASIN++PLS+ +KL +GE K T V LQLAD+S+    GIVENVL++V                        GRPFLATG+ 
Subjt:  SVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATGRV

Query:  IIDIERRELTIRVRNEKEIFKAVEDSK
        +ID+E+ +LT+RV NE+  F   +  K
Subjt:  IIDIERRELTIRVRNEKEIFKAVEDSK

A0A6P8DKJ2 uncharacterized protein LOC1162042314.1e-8232.78Show/hide
Query:  YCMRSSR--ELIPLDPEIERTIHRIRRENR-----EIIHMADQNQPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSP
        Y MR SR  EL+PLDPEIERT+HR+RRENR     +++ MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P
Subjt:  YCMRSSR--ELIPLDPEIERTIHRIRRENR-----EIIHMADQNQPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSP

Query:  TEDPNSHLKSFL-------------DIV----------------YSPGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKE
         E P+ H+  FL             D++                ++    + ITTW  L   FL++FFPPA+T +LR EI  F +   E L+EAWERFKE
Subjt:  TEDPNSHLKSFL-------------DIV----------------YSPGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKE

Query:  LLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVD-------KVSALQAQMTSLAN
         +RKCP HG PD L +++FY  L  + +++VDAAAGG L+ K  + A  L+E+MA++++ W +ERS  K   A V ++D       ++SAL  Q++ L +
Subjt:  LLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVD-------KVSALQAQMTSLAN

Query:  A--------------------------------------FMKFSESSNR----------------------------------------------TTKLE
        A                                      F+   + SN+                                               +++E
Subjt:  A--------------------------------------FMKFSESSNR----------------------------------------------TTKLE

Query:  EAVIA----INTTVNGHSAAIKNIETHLGQLVNVVSTMNKGK--ARTEQ--------------------EKTQMEYWEAEEDTSSDEAEKPNPE------
        E +++     +T +    A I+N+E  + Q+   +S    G   + TE+                     K Q +    E+D    + E+P  +      
Subjt:  EAVIA----INTTVNGHSAAIKNIETHLGQLVNVVSTMNKGK--ARTEQ--------------------EKTQMEYWEAEEDTSSDEAEKPNPE------

Query:  --PPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQ---KVPKKVADPG
          PP+P P  +         K++    QF KF++ F  + INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS  +Q+    +P+K  D G
Subjt:  --PPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASICSTRVQQ---KVPKKVADPG

Query:  SFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATG
        SF+VPC+ G + F   L D GASIN++PLS+ +KL +GE K T + LQLAD+S+    GIVENVL++V                        GRPFLATG
Subjt:  SFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRV------------------------GRPFLATG

Query:  RVIIDIERRELTIRVRNEKEIFKAVEDSK
        + +ID+E+ +LT+RV NE+  F   +  K
Subjt:  RVIIDIERRELTIRVRNEKEIFKAVEDSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGATATAGCGAGTGGTGGTGATGCACAAGACAAAACACTCAATGAGGCTAGAGAGATAATTTCGAGCTTGGCAAGGTTGGAGGTGGGGTATGTATTTTATTTTCT
GTTGAACCCTGAGCATTTGTGCGTCGGACCAGCGTTTGCGCTTAATACTCTCAACCCGTTGTCCGCTGAGATAGAAAGGGTCGAGCGTGCAGTGCAAGTCTTCTTTTCAG
CTCCACAATATCCTTCGCCTACCTGCATTGTTAATCCCGAAGAAGATAATTTTGTGCCCCTAGCTGAATTTATTTTCTCAGTTCCTTCCTTTTATGGTGATTCAAGAGCT
AAAGGCTACTATCATGTGTCCGATTTCTACTCGAGTTGCTCCGAGAAGAAATTCCCTAAGATGTTTGAAGATCAATTCAAATTGAGGGTATTTCCATATTCACTACAAGG
AGGAGCTAGAATGTGGATTTTCTCACTACCCTCAGGATCGGTAGCTTCTTGGTGCGAGATGAGAAATATTTTCTTGAAGAGGTACCATCCTACCGCTCTGCTGATTGAAG
CGACTGAGAGAATTGTCAACGACACGCAAGCAGCTGAGGAAATATTCCATGAGTTCGTGGACCGCTCTAGTGGACACTACATGGTTCCTCACACTACATTATTCAAGGAG
TCCTTAATAGAAAGCCTCTATGAATGGTTTGAAGGATTGGAATTTCGTAATTCCGTAAAGCAGAAGCGTGATTGGGACCGATTCGGTAGTTTTGTTTATTGCATGCGCAG
TAGTAGAGAATTGATACCGTTGGATCCCGAGATCGAAAGAACAATTCATAGGATTCGAAGAGAAAATAGAGAAATTATTCACATGGCTGACCAAAATCAACCTGAGGAGC
CTAGGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGGTCTCATTCAG
ATGGCTCGAGACTGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTTCTGGACATAGTCTATTCCCCTGGAGCATCCCAATGCATCACCAC
TTGGGATGCTTTGGTCCAGGCCTTTTTGAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAAATTGGGACATTCCAACAACAATATGATGAGCAGTTGT
TCGAGGCGTGGGAGCGATTCAAAGAGTTGTTGAGGAAGTGTCCTCAGCATGGTTACCCCGACTGGCTCCAGGTACAATTGTTTTATAATGGTTTAACTCCTAGTACGAAA
ACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCG
GTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGAGTCTA
GTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACACCACAGTGAATGGCCACAGTGCTGCCATCAAGAACATTGAGACTCATCTGGGACAGTTGGTGAAT
GTTGTGAGCACCATGAATAAAGGTAAGGCCCGAACTGAGCAAGAGAAAACCCAGATGGAGTACTGGGAAGCTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTAA
CCCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCTAAGGAAAAGAAAAAGAAAAATAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTA
TGAATATGAACATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACT
GTATATCTTGCTTCCATATGCAGCACCAGAGTACAACAGAAGGTACCTAAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAG
AGCATTATGTGATTTAGGTGCTAGCATTAATATTATTCCTTTGTCCTTGTGTAAAAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATC
AATCTCTAGTTAGATTAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCGTAGGGAGCTCACT
ATTAGAGTCAGGAACGAAAAAGAAATTTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTTATGGGCTACAAGAAAGGTGCGAGAAAGAGCACCTCTGTTGGATT
CACAAAAAGAAGCCTCCTTGAAGCACGGTCAACACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGCTTGTTTGTGCCTTATTTTCTTTATACAT
TGAGGACAATGAGTGTTTTAAGTTTGGGGGTGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCGATATAGCGAGTGGTGGTGATGCACAAGACAAAACACTCAATGAGGCTAGAGAGATAATTTCGAGCTTGGCAAGGTTGGAGGTGGGGTATGTATTTTATTTTCT
GTTGAACCCTGAGCATTTGTGCGTCGGACCAGCGTTTGCGCTTAATACTCTCAACCCGTTGTCCGCTGAGATAGAAAGGGTCGAGCGTGCAGTGCAAGTCTTCTTTTCAG
CTCCACAATATCCTTCGCCTACCTGCATTGTTAATCCCGAAGAAGATAATTTTGTGCCCCTAGCTGAATTTATTTTCTCAGTTCCTTCCTTTTATGGTGATTCAAGAGCT
AAAGGCTACTATCATGTGTCCGATTTCTACTCGAGTTGCTCCGAGAAGAAATTCCCTAAGATGTTTGAAGATCAATTCAAATTGAGGGTATTTCCATATTCACTACAAGG
AGGAGCTAGAATGTGGATTTTCTCACTACCCTCAGGATCGGTAGCTTCTTGGTGCGAGATGAGAAATATTTTCTTGAAGAGGTACCATCCTACCGCTCTGCTGATTGAAG
CGACTGAGAGAATTGTCAACGACACGCAAGCAGCTGAGGAAATATTCCATGAGTTCGTGGACCGCTCTAGTGGACACTACATGGTTCCTCACACTACATTATTCAAGGAG
TCCTTAATAGAAAGCCTCTATGAATGGTTTGAAGGATTGGAATTTCGTAATTCCGTAAAGCAGAAGCGTGATTGGGACCGATTCGGTAGTTTTGTTTATTGCATGCGCAG
TAGTAGAGAATTGATACCGTTGGATCCCGAGATCGAAAGAACAATTCATAGGATTCGAAGAGAAAATAGAGAAATTATTCACATGGCTGACCAAAATCAACCTGAGGAGC
CTAGGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATCAATGCCAACAACTTTGAGCTGAAGACCGGTCTCATTCAG
ATGGCTCGAGACTGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTTCTGGACATAGTCTATTCCCCTGGAGCATCCCAATGCATCACCAC
TTGGGATGCTTTGGTCCAGGCCTTTTTGAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAAATTGGGACATTCCAACAACAATATGATGAGCAGTTGT
TCGAGGCGTGGGAGCGATTCAAAGAGTTGTTGAGGAAGTGTCCTCAGCATGGTTACCCCGACTGGCTCCAGGTACAATTGTTTTATAATGGTTTAACTCCTAGTACGAAA
ACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCG
GTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGAGTCTA
GTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACACCACAGTGAATGGCCACAGTGCTGCCATCAAGAACATTGAGACTCATCTGGGACAGTTGGTGAAT
GTTGTGAGCACCATGAATAAAGGTAAGGCCCGAACTGAGCAAGAGAAAACCCAGATGGAGTACTGGGAAGCTGAGGAGGACACATCATCAGATGAGGCTGAAAAGCCTAA
CCCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCTAAGGAAAAGAAAAAGAAAAATAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTA
TGAATATGAACATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACT
GTATATCTTGCTTCCATATGCAGCACCAGAGTACAACAGAAGGTACCTAAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAG
AGCATTATGTGATTTAGGTGCTAGCATTAATATTATTCCTTTGTCCTTGTGTAAAAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATC
AATCTCTAGTTAGATTAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCGTAGGGAGCTCACT
ATTAGAGTCAGGAACGAAAAAGAAATTTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTTATGGGCTACAAGAAAGGTGCGAGAAAGAGCACCTCTGTTGGATT
CACAAAAAGAAGCCTCCTTGAAGCACGGTCAACACGTCGAGCTAATGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGCTTGTTTGTGCCTTATTTTCTTTATACAT
TGAGGACAATGAGTGTTTTAAGTTTGGGGGTGGAATAG
Protein sequenceShow/hide protein sequence
MLDIASGGDAQDKTLNEAREIISSLARLEVGYVFYFLLNPEHLCVGPAFALNTLNPLSAEIERVERAVQVFFSAPQYPSPTCIVNPEEDNFVPLAEFIFSVPSFYGDSRA
KGYYHVSDFYSSCSEKKFPKMFEDQFKLRVFPYSLQGGARMWIFSLPSGSVASWCEMRNIFLKRYHPTALLIEATERIVNDTQAAEEIFHEFVDRSSGHYMVPHTTLFKE
SLIESLYEWFEGLEFRNSVKQKRDWDRFGSFVYCMRSSRELIPLDPEIERTIHRIRRENREIIHMADQNQPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQ
MARDCAYRGSPTEDPNSHLKSFLDIVYSPGASQCITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTK
TIVDAAAGGTLLSKTVENARILLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSESSNRTTKLEEAVIAINTTVNGHSAAIKNIETHLGQLVN
VVSTMNKGKARTEQEKTQMEYWEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKNKKKNNQVQFDKFMNAFMNMNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDT
VYLASICSTRVQQKVPKKVADPGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSLVRLVGIVENVLIRVGRPFLATGRVIIDIERRELT
IRVRNEKEIFKAVEDSKDEVLFMGYKKGARKSTSVGFTKRSLLEARSTRRANDVKQALMGGNPSLFVPYFLYTLRTMSVLSLGVE