; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008055 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008055
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:10969005..10973843
RNA-Seq ExpressionLag0008055
SyntenyLag0008055
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]4.3e-4740.99Show/hide
Query:  KNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWEDEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINI
        K I  + G+ +          TP      Q +   +EE+  +D  E+            PPI +P L  P+  +K+K  K    QF KF++ F  ++INI
Subjt:  KNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWEDEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINI

Query:  PFAEALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVK
        PFA+ALE +P Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF++PC+ G   F R LCDLGASIN++P  +C+KL +GE+K T + 
Subjt:  PFAEALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVK

Query:  LQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFKAVEDSK
        LQL D+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID++  ELT+RV  E+ +F   +  K
Subjt:  LQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFKAVEDSK

KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]5.0e-12046.5Show/hide
Query:  MADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+     +PR ++DY +P+     SGI    INANNFELK  L  M +   + GSP +DPN HL  FL+IC T+K+NGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSIT+W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAVALASRPQEETIEQVQYVSNSNCRGYNNNFT
         E A +LLE+MA+N+YQWP+ER+  K + AG+ E++  +AL AQ+ SL++     +     Q  E  +A ++     E + EQVQY++N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAVALASRPQEETIEQVQYVSNSNCRGYNNNFT

Query:  PTHYHPNNRNHENFSYANTKNILN-PPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPA
        P +YHP  RNHENFSY NTKN+L  PPGF  Q  E K  LED + +F+ E+     K +  +  I +  +   A +KN+E Q+GQL   ++   +G  P+
Subjt:  PTHYHPNNRNHENFSYANTKNILN-PPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPA

Query:  EQEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG----------EDEEDTSSDEAEKPKP-------EPPIPSPTLLVPKERKKKKKKKNNQVQFDKFM
          E    E CKAIT+    E E  P  E   TPT           E+EE       E   P        PPI S  L  P+  +K+K  K    QF KF+
Subjt:  EQEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG----------EDEEDTSSDEAEKPKP-------EPPIPSPTLLVPKERKKKKKKKNNQVQFDKFM

Query:  NAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP
        + F  ++INIPFA+ALE MP Y KF+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P
Subjt:  NAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.4e-12246.51Show/hide
Query:  MADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY +P+  G  S I+  PINANNFELK  L  M +   + GSP +DPN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAVALASRPQEETIEQVQYVSNSNCRGYNNNFT
         E A  LLE+MA+N+YQWP+ER+  K + AG+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAVALASRPQEETIEQVQYVSNSNCRGYNNNFT

Query:  PTHYHPNNRNHENFSYANTKNIL---NPPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKT
        P +YHP  RNHEN SY NTKN+L   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHENFSYANTKNIL---NPPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKT

Query:  PAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPTGE---------DEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDK
        P+  E    E CKAIT+   +E E  P  E   TPT           +EE+  +D  E+            PPI +P L  P+  +K+K  K    QF K
Subjt:  PAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPTGE---------DEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP
        F++ F  ++INIPFA+ALE MP Y KF+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P
Subjt:  FMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.0e-4841.7Show/hide
Query:  KNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWEDEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINI
        K I  + G+ +          TP      Q +   +EE+  +D  E+            PPI +P L  P+  +K+K  K    QF KF++ F  ++INI
Subjt:  KNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWEDEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINI

Query:  PFAEALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVK
        PFA+ALE +P Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F R LCDLGASIN++P S+C+KL +GE+K T + 
Subjt:  PFAEALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVK

Query:  LQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFKAVEDSK
        LQL D+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR +ID++  ELT+RV  E+ +F   +  K
Subjt:  LQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.4e-12246.51Show/hide
Query:  MADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY +P+  G  S I+  PINANNFELK  L  M +   + GSP +DPN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT
        WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT
Subjt:  WLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKT

Query:  VENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAVALASRPQEETIEQVQYVSNSNCRGYNNNFT
         E A  LLE+MA+N+YQWP+ER+  K + AG+ E++ ++AL AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  N  
Subjt:  VENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--SAVALASRPQEETIEQVQYVSNSNCRGYNNNFT

Query:  PTHYHPNNRNHENFSYANTKNIL---NPPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKT
        P +YHP  RNHEN SY NTKN+L   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL   ++   +G  
Subjt:  PTHYHPNNRNHENFSYANTKNIL---NPPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKT

Query:  PAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPTGE---------DEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDK
        P+  E    E CKAIT+   +E E  P  E   TPT           +EE+  +D  E+            PPI +P L  P+  +K+K  K    QF K
Subjt:  PAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPTGE---------DEEDTSSDEAEKP--------KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDK

Query:  FMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP
        F++ F  ++INIPFA+ALE MP Y KF+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P
Subjt:  FMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]2.6e-12946.7Show/hide
Query:  MRRNKDLILAPLDPEIERTIHRLRRENRENVQMADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSF
        MRR +   + P+DPEIERT+  LRR   + + MA+++    PR ++DY +P+  G  S I+  PINANNFELK  L  M +   + GSP +DPN HL  F
Subjt:  MRRNKDLILAPLDPEIERTIHRLRRENRENVQMADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSF

Query:  LDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYP
        L+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG P
Subjt:  LDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYP

Query:  DWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--S
        DWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  K + AG+ +++ ++AL AQ+ +L++     +     QS E  +
Subjt:  DWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIE--S

Query:  AVALASRPQEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKNIL---NPPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAIN
        + ++     E + EQVQYV+N N   Y  N  P +YHP  RNHEN SY NTKN+L   +PPGF  Q  E K  LED + +F+ E++ R  K +  +  I 
Subjt:  AVALASRPQEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKNIL---NPPGFAPQTKENK-KLEDLVGAFIAESSNRTTKLEEAVIAIN

Query:  STVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEDEEDTSSDEAEKPKPE----------
        +  +   AAIKNIE Q+GQL   ++   +G  P+  E    E CKAIT+   +E E  P  E   TPT    G+ +     DE      E          
Subjt:  STVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEDEEDTSSDEAEKPKPE----------

Query:  ---PPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP
           PPI +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y KF+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P
Subjt:  ---PPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]3.9e-4845.97Show/hide
Query:  QEKSQMEYWEDEEDTSSDEAEKP----KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-IPQYNRFMKEWLAKKRKEKKV
        Q K+++E  E   DT  +    P       PPI +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE +P Y +F+K+ ++KKR+ ++ 
Subjt:  QEKSQMEYWEDEEDTSSDEAEKP----KPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-IPQYNRFMKEWLAKKRKEKKV

Query:  DTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPI
        +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+C+KL + E+K T + LQL D+S+  P GI+E+VL++V +F  P 
Subjt:  DTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPI

Query:  DLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFK
        D  V+DM E+  +P+ILGRPFLATGR +ID++  ELT+RV  E+ +FK
Subjt:  DLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.4e-12554.66Show/hide
Query:  LAPLDPEIERTIHR-LRRENRENVQMADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTV
        L PLDPEI+RT  R LR    +  +MA+    E P+ IRDYFQP     Q GI+  PIN NNFELK GL QMAR+ A+RG   EDP+ HL+SFL+ICGTV
Subjt:  LAPLDPEIERTIHR-LRRENRENVQMADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTV

Query:  KINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQL
        K+NGVS DAI+LRLFPFSLQD+A+DWL++I P SITTW+ L QAFL K+FPPAK+ +LRTEIGTF+Q  DEQL+EAWER+K+LLR+CPQHGYPDWLQ+QL
Subjt:  KINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQL

Query:  FYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKM-IAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQ----SIESAVAL
        FYNGL  STK+I+DA AGG++ SK  + A T+LED+AT SY WP ER++P +  AAG++EVD+V++L+AQM SL NA  K +  G AQ    SI S  AL
Subjt:  FYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKM-IAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQ----SIESAVAL

Query:  ASR-PQEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKNILN-PPGF-APQTKENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNG
        AS        E   YV   + R Y +   PTHYHPN RNHENFSYAN KN+L  P GF      +   LED++  F+ ES +RTT LE +V AI STV  
Subjt:  ASR-PQEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKNILN-PPGF-APQTKENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNG

Query:  HSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEDEEDTSSD
           A++N+E QL Q+   + TM KGK P+  E    E CKA+T+   +    P   D      EDEE    D
Subjt:  HSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEDEEDTSSD

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.7e-6736.25Show/hide
Query:  EWLAKKRKEKKVDTVYLASTCSTRVQQKVPALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQIQYVSNSNFRGYNNNVTPTHYHPNNR
        +W  ++   KKV  ++          +   AL AQ+ SL++     +     Q  E  +A+++     E + EQ+QY++N N+  Y  N  P +YHP  R
Subjt:  EWLAKKRKEKKVDTVYLASTCSTRVQQKVPALQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQIQYVSNSNFRGYNNNVTPTHYHPNNR

Query:  NHENFSYANTKNILN-PPGFAPQTQENK-KLEDLVGVFIAESSNRTTKLEEAVITINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKTPAE--------
        NHENFSY NTKN+L  PPGF  Q  E K  LED +  F+ E+     K +  +  I +  +   A +KN+E Q+GQL   ++   +G  P+         
Subjt:  NHENFSYANTKNILN-PPGFAPQTQENK-KLEDLVGVFIAESSNRTTKLEEAVITINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKTPAE--------

Query:  ------------------------------QEKSQMEYWEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFA
                                      Q K+++E  E  EDT  +    P    P   P L  P    ++ +K+    QF KF++ F  ++INIPFA
Subjt:  ------------------------------QEKSQMEYWEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFA

Query:  EALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQL
        +ALE +P Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+ +KL +GE+K T + LQL
Subjt:  EALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQL

Query:  VDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFKAVEDSK
         D+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGRPFLATGR ++D++  ELT+RV  E+  F   E  K
Subjt:  VDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129455.2e-10743.21Show/hide
Query:  MRRNKDLILAPLDPEIERTIHRLRRENRE----NVQMADQN----------PPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCA-Y
        M+R  +L L P DP+IERT  R RREN +    N  MA+ N           PE  R +RDY  PL QG    I    INANNFE+K    QM +    +
Subjt:  MRRNKDLILAPLDPEIERTIHRLRRENRE----NVQMADQN----------PPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCA-Y

Query:  RGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWE
         G P++DPNSHL +FL+IC T K NGV++DAIRLRLFPFSL+DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT K+R +I +F Q   E L+EAWE
Subjt:  RGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWE

Query:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFM
        RFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +  A G +E+D +  L  Q+ +L+    
Subjt:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFM

Query:  KFSGTGSAQSIESAVALASRPQEE--------TIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKNILN-----PPGF----APQTKENK-K
        K   T    ++++++ +     +           E VQ+V N N R  NN ++ T Y+P  RNH NFS++N     N     PPGF     PQ  E K +
Subjt:  KFSGTGSAQSIESAVALASRPQEE--------TIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKNILN-----PPGF----APQTKENK-K

Query:  LEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAE-QEKPQ-MEYCKAIT---------VHQEESEEEPESED
        LE+L+  +I+++              ++ +    A+++N+ETQ+GQL + ++   +G  P++ Q  P+  E C+AIT         V+Q+  E E E  D
Subjt:  LEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAE-QEKPQ-MEYCKAIT---------VHQEESEEEPESED

Query:  YETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLAS
         E   G  E +    + +  K E    S  +  P    ++ +K+  + QF KF+N F  L+INIPFAEALE MP Y KF+K+ L+KKRK  + +TV+L  
Subjt:  YETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKKRKEKKVDTVYLAS

Query:  TCSTRVQQKVP
         CS  +Q K+P
Subjt:  TCSTRVQQKVP

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.7e-5235.85Show/hide
Query:  EQIQYVSNSNFRGYNNNVTPTHYHPNNRNHENFSYANTKNILN-----PPGF----APQTQENK-KLEDLVGVFIAESSNRTTKLEEAVITINSTVNGHS
        E +Q+V   NF    NN     Y+P  RNH NFS++N     N     PPGF     PQ  E K +LE+L+  +I+++              ++ +    
Subjt:  EQIQYVSNSNFRGYNNNVTPTHYHPNNRNHENFSYANTKNILN-----PPGF----APQTQENK-KLEDLVGVFIAESSNRTTKLEEAVITINSTVNGHS

Query:  AAIKNIETQLGQLVNVVSTMNKGKTPAEQE----------------------------KSQMEYWED----EEDTSSDEAEKPKPEPPIPSPTLLVPKER
        A+++N+ETQ+GQL N ++   +G  P++ +                            +S++E+ +     E +    + +  K E    S  +  P   
Subjt:  AAIKNIETQLGQLVNVVSTMNKGKTPAEQE----------------------------KSQMEYWED----EEDTSSDEAEKPKPEPPIPSPTLLVPKER

Query:  KKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCD
         ++ +K+  + QF KF+N F  L+INIPFAEALE +P Y +F+K+ L+KKRK  + +TV+L   CS  +Q K+P K+ DPGSF++PC+ G   F +AL D
Subjt:  KKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-IPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCD

Query:  LGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEK
        LGASIN++P S+ +KL +GE K T V LQL D+S V P GI+E+VL++V +F  P+D  ++DM E+  +P+ILGRPFLAT   IID+   +++ +V  E 
Subjt:  LGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEK

Query:  EIFKAVEDSK
          F     SK
Subjt:  EIFKAVEDSK

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129453.5e-9543.32Show/hide
Query:  IRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITT
        IRDY QP F     GI+  PINANN ELK GL QM R+  +RG+ TEDPN+HL  FLD+CGTVK+NGV +DAIRLRLFP SLQDK               
Subjt:  IRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITT

Query:  WDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMA
           +VQAFL  FFPPAKT +LRTEI +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA
Subjt:  WDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMA

Query:  TNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAVALASRP-QEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHEN
         NS+QWPSERS  K + AG++E+D++S+L+AQ+ +L NA  K SG G++ S E   A  +    E TIEQ Q+ S                HP       
Subjt:  TNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAVALASRP-QEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHEN

Query:  FSYANTKNILNPPGFAPQTKENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITV
                           ++   LEDL+GAFI E  +R +++E  V  +   + G++ +IKN+E Q+GQ+   ++TM KGK P++ E    E+CKA+T+
Subjt:  FSYANTKNILNPPGFAPQTKENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITV

Query:  HQEESEEEPESEDYETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKK
           +  +EPE +  E P    EE  + +E  K        +P L   K          N + + +                ALE MP Y +FMK+ +  K
Subjt:  HQEESEEEPESEDYETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNKFMKEWLAKK

Query:  RKEKKVDTVYLASTCSTRVQQKVP
        RK +  +TV L   CS  +Q+K+P
Subjt:  RKEKKVDTVYLASTCSTRVQQKVP

A0A6J1DU19 uncharacterized protein LOC1110243618.8e-3831.59Show/hide
Query:  EWLAKKRKEKKVDTVYLASTCSTRVQQKVPALQAQMTSLANAFMKFSGTGSAQSIESAAALASRP-QEETIEQIQYVSNSNFRGYNNNVTPTHYHPNNRN
        +W +++   KKV  +Y     S        +L+AQ+ +L NA  K SG G++ S E  AA  +    E TIEQ Q+ S                HP    
Subjt:  EWLAKKRKEKKVDTVYLASTCSTRVQQKVPALQAQMTSLANAFMKFSGTGSAQSIESAAALASRP-QEETIEQIQYVSNSNFRGYNNNVTPTHYHPNNRN

Query:  HENFSYANTKNILNPPGFAPQTQENKKLEDLVGVFIAESSNRTTKLEEAVITINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWED
                              ++   LEDL+G FI E  +R +++E  V  +   + G++ +IKN+E Q+GQ+   ++TM KGK P++ E    E+ + 
Subjt:  HENFSYANTKNILNPPGFAPQTQENKKLEDLVGVFIAESSNRTTKLEEAVITINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWED

Query:  ---EEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKK---KKKNNQVQFDKFMNAFMNLNIN-IPFAE-ALE-IPQYNRFMKEWLAKKRKEKKVDTVYLAS
                  E EK K E P+     +  +ER+ K+   K+    +Q DK  ++ ++   N +P+ + ALE +P Y RFMK+ +  KRK +  +TV L  
Subjt:  ---EEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKK---KKKNNQVQFDKFMNAFMNLNIN-IPFAE-ALE-IPQYNRFMKEWLAKKRKEKKVDTVYLAS

Query:  TCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDM
         CS  +Q+K+P+K+ DPGSF++PC+  + SF +ALCD+ ASIN++PL                             G++E+VL++V R   P D  V+  
Subjt:  TCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDM

Query:  IENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIF
         E+  +P+ILGR FLATG  +ID++   LT+RV  E  +F
Subjt:  IENPSMPVILGRPFLATGRVIIDIEHRELTIRVKNEKEIF

A0A6J1DU19 uncharacterized protein LOC1110243613.9e-9441.19Show/hide
Query:  MRRNKDLILAPLDPEIERTIHRLRRENRENVQMADQNPPEEP-RPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQM-ARDCAYRGSPTEDPNSHLK
        MRRN++L L PLDPEIERT+ RL++E ++  +       E+  R + DY  PL  G  S I    I ANNFE+K  + QM A    + G P +DPN+H+ 
Subjt:  MRRNKDLILAPLDPEIERTIHRLRRENRENVQMADQNPPEEP-RPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQM-ARDCAYRGSPTEDPNSHLK

Query:  SFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHG
        +FL++C T K NGV++DA+RLRL PFSL+DKA+ WL S+   +ITTWD L + FL KFFPP KTVK+R +I TF Q   E L+EAWER+KELLRKCP HG
Subjt:  SFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHG

Query:  YPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAF--MKFSGTGSAQSI
         P W+QVQ FYNGL  +T+T +DAA GGTL+ K+ E A  L+E+MATN+YQWPS+    K I  GV E+D +SAL AQ+ +L+     MK     S   +
Subjt:  YPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAF--MKFSGTGSAQSI

Query:  -------ESAVALASRPQEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKN-ILNPPGF-APQTKENKKLEDLVGAFIAESSNRTTKLE
                  V         + EQV YVSN + +  NN ++ T Y+P  RNH NFS+ N +N    PP F  PQ +E   LE ++  FI++  ++    +
Subjt:  -------ESAVALASRPQEETIEQVQYVSNSNCRGYNNNFTPTHYHPNNRNHENFSYANTKN-ILNPPGF-APQTKENKKLEDLVGAFIAESSNRTTKLE

Query:  EAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITVH-----------QEESEEEPESEDYETPTGEDEEDTSSDEAEKP
         A+    + +     AI+NIE  +GQL ++++   +G  P+  E    E  +AIT+            + E E+ P     +      EE    ++   P
Subjt:  EAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITVH-----------QEESEEEPESEDYETPTGEDEEDTSSDEAEKP

Query:  ----KPEPPIP-SPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE
             P+ P P  P +  P+  +K K  K    QF KF++ F  L++NIPFA+ALE
Subjt:  ----KPEPPIP-SPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE

A0A6P6XAQ1 Reverse transcriptase4.0e-9141.74Show/hide
Query:  RPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  L QM +   Y G+ TEDPNSHL +FL+IC T+K NGVSEDAI+LRLFPFSL+DKA+ WLQS  P + 
Subjt:  RPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSI

Query:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED
        TTWD L +AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+
Subjt:  TTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLED

Query:  MATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAVALASRPQEE-----TIEQVQYVSNSNCRGYNNNFTPTHYHPN
        MA N+YQW +ER   +   AG+ EVD ++ L A+M ++     +  G+ S Q +  A         +     + EQVQY++N N    NN ++ T Y+P 
Subjt:  MATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAVALASRPQEE-----TIEQVQYVSNSNCRGYNNNFTPTHYHPN

Query:  NRNHENFSY---ANTKNILNPPGFAPQ--TKENKKLEDLVGAFIAESSN-RTTKLEEAVIAINSTVNGHSAAI----KNIETQLGQLVDVVSTMNKGKTP
         RNH NF +    N +  +NPPGF  +    E+K   +L    +A +SN +  KL  A       + G    +    +N+E QLGQ+ + V+  N+G  P
Subjt:  NRNHENFSY---ANTKNILNPPGFAPQ--TKENKKLEDLVGAFIAESSN-RTTKLEEAVIAINSTVNGHSAAI----KNIETQLGQLVDVVSTMNKGKTP

Query:  AEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE
        ++ E    E+ KAIT+   +   EP        +G + E   + +  + K             KE K K+K + N++Q        M     IP      
Subjt:  AEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE

Query:  MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP
        +P Y KF+KE + KKRK    +T+ L   CS  +Q K+P
Subjt:  MPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP

A0A6P6XAQ1 Reverse transcriptase1.1e-4534.13Show/hide
Query:  LQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQIQYVSNSNFRGYNNNVTPTHYHPNNRNHENFSY---ANTKNILNPPGFAPQ--T
        L A+M ++     +  G+ S Q +  A+        +     + EQ+QY++N N R   NN     Y+P  RNH NF +    N +  +NPPGF  +   
Subjt:  LQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEE-----TIEQIQYVSNSNFRGYNNNVTPTHYHPNNRNHENFSY---ANTKNILNPPGFAPQ--T

Query:  QENKKLEDLVGVFIAESSN-RTTKLEEAVITINSTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWEDEEDTSSDEAEKPKPEPP
         E+K   +L    +A +SN +  KL  A       + G    +    +N+E QLGQ+ N V+  N+G  P++ E +  E+ +     S  E  +P   P 
Subjt:  QENKKLEDLVGVFIAESSN-RTTKLEEAVITINSTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKTPAEQEKSQMEYWEDEEDTSSDEAEKPKPEPP

Query:  IPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL--EIPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPC
        + S      +E KK  + K    + +K         + +  A  +   IP Y +F+KE + KKRK    +T+ L   CS  +Q K+P K+ DPGSF+VPC
Subjt:  IPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL--EIPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPC

Query:  SFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDI
        + G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQL D+S+  P+GI+ENVLI+V +F +P+D  V+DM E+ ++P+ILGRPFLAT   IID+
Subjt:  SFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDI

Query:  EHRELTIRVKNEKEIF
        +  +   ++  E+  F
Subjt:  EHRELTIRVKNEKEIF

A0A6P6XAQ1 Reverse transcriptase9.9e-9039.57Show/hide
Query:  IRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITT
        +RD+  P  QG Q+ I    +NAN FE++  L QM +   Y G+ TED +SHL +F +IC T+K NGVS+DAI+ RLFPFSL+DKA+ WLQ  +P + T 
Subjt:  IRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITT

Query:  WDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMA
        W  L + FL KFF P KT K R +I +F QQ +E L+E WER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KTV+ A+ L+E+MA
Subjt:  WDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMA

Query:  TNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAVALASRPQEE-----TIEQVQYVSNSNCRGYNNNFTPTHYHPNNR
         N+YQW +ER   +   AG+ EVD ++ L A+M ++     ++ G+ S + +  A         +     +  QVQY++N N    NN ++ T Y+P  R
Subjt:  TNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAVALASRPQEE-----TIEQVQYVSNSNCRGYNNNFTPTHYHPNNR

Query:  NHENFSY---ANTKNILNPPGFAPQ--TKENKKLEDLVGAFIAESSNRTTKLEEAVIA-------INSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTP
        NH NF +    N +  +NPP F P+    E+K    L    +A  SN   K+E+ V A       I   ++  +   +N+E QLGQ+ +VV+  N+   P
Subjt:  NHENFSY---ANTKNILNPPGFAPQ--TKENKKLEDLVGAFIAESSNRTTKLEEAVIA-------INSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTP

Query:  AEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEDEEDTSSDEAEKPK----------------PEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMN
        ++ E    E+ KAIT+  ++   E    ++E      E    S+  E  K                   PIP P   VP  ++ K  K  N  +F+KF+N
Subjt:  AEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEDEEDTSSDEAEKPK----------------PEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMN

Query:  AFMNLNINIPFAEA-LEMPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP
         F  L+INIPF +A L++P Y KF+KE + KKRK    +T+ L   C+  +Q K+P
Subjt:  AFMNLNINIPFAEA-LEMPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAGGAATAAAGATTTAATTTTAGCACCTTTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTCGAAGGGAGAATAGAGAAAACGTTCAAATGGCTGACCAAAA
TCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTTTGTTTCAGGGACAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAACTTTGAGCTGAAGA
CCGGTCTCACTCAGATGGCTCGAGATTGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGTAAAGATTAAT
GGGGTCTCAGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCCTGGGAGTATCACCACTTGGGATGC
TTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGAACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTT
GGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTT
GATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACC
TAAAATGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTGAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCAC
AGTCAATTGAATCAGCTGTTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATTCTAATTGTAGGGGATATAATAATAATTTTACA
CCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAAGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAACTAAAGAAAATAAAAA
GTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCACAGCGCTGCCA
TCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGGATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAACCCCAGATGGAGTATTGTAAGGCA
ATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGATGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAA
ACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTA
TGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAAGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACT
GTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGCACTACAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAG
TGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATTCTAATTTTAGGGGATATAATAATAATT
TTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAGGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAACTCAAGAAAAT
AAAAAGTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCACAGCGC
TGCCATCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGGATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAACCCCAGATGGAGTATTGTA
AGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGATGAGGAGGACACATCATCTGATGAGGCTGAAAAG
CCTAAACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGC
TTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAAGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTG
ACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGCACTACAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACA
GGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATTCTAATTTTAGGGGATATAATAA
TAATTTTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAGGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAACTCAAG
AAAATAAAAAGTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCAC
AGCGCTGCCATCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAACCCCAGATGGAGTA
TTGTAAGGCAATCACTGTGTACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGATGAGGAGGACACATCATCTGATGAGGCTG
AAAAGCCTAAACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATG
AATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAA
GGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAG
GTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGATTCAGTATGTATCAAATTCTAATTTTAGGGGATAT
AATAATAATGTTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAAGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAAC
TCAAGAAAATAAAAAGTTAGAGGATCTTGTTGGAGTTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTACCATCAATTCAACAGTGAATG
GCCACAGCGCTGCCATCAAGAATATTGAGACTCAGCTAGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAATCCCAGATG
GAGTATTGGGAAGATGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAAACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAA
AAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATTCCCCAATACAACA
GGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACTAGAGTACAACAGAAAGTACCTGAAAAAGTA
GCAGACCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTACTCTTTTAGAGCATTATGTGACTTAGGTGCTAGCATTAATATTATCCCATTGTCCTTGTGTAAGAA
GTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGTTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTAATCAGAGTAGGTAGAT
TTTTCCTCCCTATTGATTTATATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGACGACCATTCCTCGCTACTGGGCGAGTGATTATAGATATTGAG
CACAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAGATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGTTTTTCATGGGCTATAGGAAAGGTGCAAGAAAGAG
AACCTCCGTTGGATTCACAGAACAAAAGCCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAGGAATAAAGATTTAATTTTAGCACCTTTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTCGAAGGGAGAATAGAGAAAACGTTCAAATGGCTGACCAAAA
TCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTTTGTTTCAGGGACAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAACTTTGAGCTGAAGA
CCGGTCTCACTCAGATGGCTCGAGATTGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGTAAAGATTAAT
GGGGTCTCAGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCCTGGGAGTATCACCACTTGGGATGC
TTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGAACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTT
GGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTT
GATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACC
TAAAATGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTGAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCAC
AGTCAATTGAATCAGCTGTTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATTCTAATTGTAGGGGATATAATAATAATTTTACA
CCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAAGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAACTAAAGAAAATAAAAA
GTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCACAGCGCTGCCA
TCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGGATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAACCCCAGATGGAGTATTGTAAGGCA
ATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGATGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAA
ACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTA
TGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAAGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACT
GTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGCACTACAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAG
TGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATTCTAATTTTAGGGGATATAATAATAATT
TTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAGGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAACTCAAGAAAAT
AAAAAGTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCACAGCGC
TGCCATCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGGATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAACCCCAGATGGAGTATTGTA
AGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGATGAGGAGGACACATCATCTGATGAGGCTGAAAAG
CCTAAACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGC
TTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAAGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTG
ACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGCACTACAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACA
GGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGGTTCAGTATGTATCAAATTCTAATTTTAGGGGATATAATAA
TAATTTTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAGGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAACTCAAG
AAAATAAAAAGTTAGAGGATCTTGTTGGAGCTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCAC
AGCGCTGCCATCAAGAATATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAACCCCAGATGGAGTA
TTGTAAGGCAATCACTGTGTACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGATGAGGAGGACACATCATCTGATGAGGCTG
AAAAGCCTAAACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATG
AATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAA
GGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAG
GTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATTGAACAGATTCAGTATGTATCAAATTCTAATTTTAGGGGATAT
AATAATAATGTTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCAAATACGAAGAATATTCTTAACCCACCTGGTTTTGCCCCTCAAAC
TCAAGAAAATAAAAAGTTAGAGGATCTTGTTGGAGTTTTCATTGCAGAGTCAAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTACCATCAATTCAACAGTGAATG
GCCACAGCGCTGCCATCAAGAATATTGAGACTCAGCTAGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAAACCCCAGCTGAACAAGAGAAATCCCAGATG
GAGTATTGGGAAGATGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAAACCTGAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAGGAAAAAGAA
AAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATTCCCCAATACAACA
GGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACTAGAGTACAACAGAAAGTACCTGAAAAAGTA
GCAGACCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTACTCTTTTAGAGCATTATGTGACTTAGGTGCTAGCATTAATATTATCCCATTGTCCTTGTGTAAGAA
GTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGTTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTAATCAGAGTAGGTAGAT
TTTTCCTCCCTATTGATTTATATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGACGACCATTCCTCGCTACTGGGCGAGTGATTATAGATATTGAG
CACAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAGATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGTTTTTCATGGGCTATAGGAAAGGTGCAAGAAAGAG
AACCTCCGTTGGATTCACAGAACAAAAGCCTCCTTGA
Protein sequenceShow/hide protein sequence
MRRNKDLILAPLDPEIERTIHRLRRENRENVQMADQNPPEEPRPIRDYFQPLFQGQQSGIVYAPINANNFELKTGLTQMARDCAYRGSPTEDPNSHLKSFLDICGTVKIN
GVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIV
DAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKMIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAVALASRPQEETIEQVQYVSNSNCRGYNNNFT
PTHYHPNNRNHENFSYANTKNILNPPGFAPQTKENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKA
ITVHQEESEEEPESEDYETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNKFMKEWLAKKRKEKKVDT
VYLASTCSTRVQQKVPALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQVQYVSNSNFRGYNNNFTPTHYHPNNRNHENFSYANTRNILNPPGFAPQTQEN
KKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVDVVSTMNKGKTPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEDEEDTSSDEAEK
PKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNKFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPALQAQMTSLANAFMKFSGT
GSAQSIESAAALASRPQEETIEQVQYVSNSNFRGYNNNFTPTHYHPNNRNHENFSYANTRNILNPPGFAPQTQENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGH
SAAIKNIETQLGQLVNVVSTMNKGKTPAEQEKPQMEYCKAITVYQEESEEEPESEDYETPTGEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFM
NAFMNLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQIQYVSNSNFRGY
NNNVTPTHYHPNNRNHENFSYANTKNILNPPGFAPQTQENKKLEDLVGVFIAESSNRTTKLEEAVITINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKTPAEQEKSQM
EYWEDEEDTSSDEAEKPKPEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEIPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKV
ADPGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLVDQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIE
HRELTIRVKNEKEIFKAVEDSKDEVFFMGYRKGARKRTSVGFTEQKPP