; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016014 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016014
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr12:31555621..31570597
RNA-Seq ExpressionLag0016014
SyntenyLag0016014
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]5.9e-10637.75Show/hide
Query:  MADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKI+GV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITHGSITTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG-------
        WLQS+  GSI +W  +                                                        V +FYNGL   T+TIVDAA+G       
Subjt:  WLQSITHGSITTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG-------

Query:  ---------------------------IGSAQSIESAAALASR----------------PQ---------------EETIEQVQYVSNFNSRRYNNSSTP
                                   +     +E  AAL+++                PQ               E + EQVQYV+N N   Y  +  P
Subjt:  ---------------------------IGSAQSIESAAALASR----------------PQ---------------EETIEQVQYVSNFNSRRYNNSSTP

Query:  THYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAL
         +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL   ++   +G   
Subjt:  THYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAL

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF
        +  E    E CKAIT+   +E E  P  E   TPT    G++++    +E     LE             PPI +P L  P+  +K+K  K    QF KF
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DP SF++PC+ G   F R LCDLGASIN++P  +C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+IL RPFLATGRA+ID+++ ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.4e-10738.04Show/hide
Query:  MADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARD
        MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKI+GV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITHGSITTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG-------
        WLQS+  GSI +W  +                                                        V +FYNGL   T+TIVDAA+G       
Subjt:  WLQSITHGSITTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG-------

Query:  ---------------------------IGSAQSIESAAALASR----------------PQ---------------EETIEQVQYVSNFNSRRYNNSSTP
                                   +     +E  AAL+++                PQ               E + EQVQYV+N N   Y  +  P
Subjt:  ---------------------------IGSAQSIESAAALASR----------------PQ---------------EETIEQVQYVSNFNSRRYNNSSTP

Query:  THYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAL
         +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I +  +   A +KN+E Q+GQL   ++   +G   
Subjt:  THYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAL

Query:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF
        +  E    E CKAIT+   +E E  P  E   TPT    G++++    +E     LE             PPI +P L  P+  +K+K  K    QF KF
Subjt:  AEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLE-------------PPIPSPTLLVPKERKKKKKKKNNQVQFDKF

Query:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL
        ++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F R LCDLGASIN++P S+C+KL
Subjt:  MNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL

Query:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
         +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+IL RPFLATGRA+ID+++ ELT+RV  E+ +F   +  K
Subjt:  DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.4e-11539.3Show/hide
Query:  SREVAPFDPEIERTIHRLRRENREIIQMADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICG
        SR++ P DPEIERT+  LRR   +I+ MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC 
Subjt:  SREVAPFDPEIERTIHRLRRENREIIQMADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICG

Query:  TVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL--------------------------------------------------------V
        TVKI+GV+ED IRLRLFPFSL+DKAR WLQS+  GSI +W  +                                                        V
Subjt:  TVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL--------------------------------------------------------V

Query:  LLFYNGLTPSTKTIVDAAAG----------------------------------IGSAQSIESAAALASR----------------PQ------------
         +FYNGL   T+TIVDAA+G                                  +     +E  AAL+++                PQ            
Subjt:  LLFYNGLTPSTKTIVDAAAG----------------------------------IGSAQSIESAAALASR----------------PQ------------

Query:  ---EETIEQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGH
           E + EQVQYV+N N   Y  +  P +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +  +  I +  +  
Subjt:  ---EETIEQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGH

Query:  SAAIKNIETQLGQLVNVVSTMNKGKALAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLE-------------PPI
         AAIKNIE Q+GQL   ++   +G   +  E    E CKAIT+   +E E  P  E   TPT    G+++     DE     LE             PPI
Subjt:  SAAIKNIETQLGQLVNVVSTMNKGKALAEQEKPQMEYCKAITVHQ-EESEEEPESEDYETPT----GEAEEDTSADEAEKPNLE-------------PPI

Query:  PSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF
         +P L  P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ 
Subjt:  PSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSF

Query:  GTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIER
        G   F + LCDLGASIN++PLS+C+KL + E+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+IL RPFLATGRA+ID+++
Subjt:  GTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIER

Query:  RELTIRVKNEKEIFK
         ELT+RV  E+ +FK
Subjt:  RELTIRVKNEKEIFK

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]1.5e-10138.05Show/hide
Query:  MRSSR--EVAPFDPEIERTIHRLR---RENREIIQMADQNAPEE---PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN
        MR +R  ++   DPE ERT   LR   R  RE +   D  A  E    R IRDY +PV     SGI    I A NFELK GLI M +   + G+  EDPN
Subjt:  MRSSR--EVAPFDPEIERTIHRLR---RENREIIQMADQNAPEE---PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPN

Query:  SHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL----------------------------------------------
        +HL SFL+IC TVK++GV+EDAIRLRLF FSL+DKA+ W QS+ +GSITTWD L                                              
Subjt:  SHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL----------------------------------------------

Query:  ----------VLLFYNGLTPSTKTIVDAAAG---------------------------------------------------------------IGSAQS
                  + +FYNGL   T+T+VDAAAG                                                                G+ Q+
Subjt:  ----------VLLFYNGLTPSTKTIVDAAAG---------------------------------------------------------------IGSAQS

Query:  IESAAALASRPQEETI--EQVQYVSNFNSRRYNNSS--TPTHYHPNNRNHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEA
        ++S  + +S  QE  +  EQVQY+   +SR YN        HYHP  RNHEN SY N +N L  PPGF  Q  D K  LED++G FI+E+ +R  K E  
Subjt:  IESAAALASRPQEETI--EQVQYVSNFNSRRYNNSS--TPTHYHPNNRNHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEA

Query:  VIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAEQEKPQMEYCKAITVHQEESEEEPESEDYETP------TGEAEEDTSADEAEKPNL-----
        +  I + V+   A +KN+E Q+GQL  ++ +  KGK  ++ E    E+C AIT+   +  EE + +    P      T E + +    EAE   +     
Subjt:  VIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAEQEKPQMEYCKAITVHQEESEEEPESEDYETP------TGEAEEDTSADEAEKPNL-----

Query:  -----EPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADP
              PPI  P L  P+   KKK       QF KF+  F  ++INIPFAE L +MP Y +F+KE ++ K+K ++ +T+ L   CS  + QK+P K+ DP
Subjt:  -----EPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADP

Query:  GSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLAT
        GSF++PC+ G  +F RALCD GASIN++PLS+ KKL +GE+K T + LQLAD+S+  P G++E+VL++V +F LP+D  V+DM EN  +P+IL RPFLAT
Subjt:  GSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLAT

Query:  GRAIIDI
        GRA+ID+
Subjt:  GRAIIDI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.9e-10437.21Show/hide
Query:  MADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARD
        MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +DPN HL  FL+IC T+K++GV+ED IRLRLFPFSL+DKAR 
Subjt:  MADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARD

Query:  WLQSITHGSITTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG-------
        WLQS+  GSIT+W  +                                                        V +FYNGL   T+TIVDAA+G       
Subjt:  WLQSITHGSITTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG-------

Query:  ---------------------------IGSAQSIESAAALASR----------------PQ---------------EETIEQVQYVSNFNSRRYNNSSTP
                                   +     +E  AAL+++                PQ               E + EQVQY++N N   Y  +  P
Subjt:  ---------------------------IGSAQSIESAAALASR----------------PQ---------------EETIEQVQYVSNFNSRRYNNSSTP

Query:  THYHPNNRNHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE
         +YHP  RNHENFSY NTKNVL  PPGF  Q  + K  LED + +F+ E+     K +  +  I +  +   A +KN+E Q+GQL   ++   +G   + 
Subjt:  THYHPNNRNHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE

Query:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG-------------EAEEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMN
         E    E CKAIT+    E E  P  E   TPT              E  EDT  +    P++  P   P L  P    ++ +K+    QF KF++ F  
Subjt:  QEKPQMEYCKAITVHQ-EESEEEPESEDYETPTG-------------EAEEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMN

Query:  LNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIK
        ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+ +KL +GE+K
Subjt:  LNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIK

Query:  STPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
         T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+IL RPFLATGRA++D+++ ELT+RV  E+  F   E  K
Subjt:  STPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129457.5e-9135.59Show/hide
Query:  RSSREVAPFDPEIERTIHRLRRENREII----QMADQN----------APEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS
        R++  + PFDP+IERT  R RREN ++      MA+ N           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    + G 
Subjt:  RSSREVAPFDPEIERTIHRLRRENREII----QMADQN----------APEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGS

Query:  PTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL----------------------------------------
        P++DPNSHL +FL+IC T K +GV++DAIRLRLFPFSL+DKA+ WL S+ +GSITTW+ L                                        
Subjt:  PTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL----------------------------------------

Query:  ----------------VLLFYNGLTPSTKTIVDAAAG-----------------------------IGSAQSI------------ESAAALASR------
                        V  FYNGL  S KTI+DAAAG                              GS +++               AAL+ +      
Subjt:  ----------------VLLFYNGLTPSTKTIVDAAAG-----------------------------IGSAQSI------------ESAAALASR------

Query:  --PQEETI------------------EQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAF
           Q   +                  E VQ+V NFN R+ NN  + T Y+P  RNH NFS++N     N     PPGF     PQ  + K +LE+L+  +
Subjt:  --PQEETI------------------EQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAF

Query:  IAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE-QEKPQ-MEYCKAIT---------VHQEESEEEPESEDYETPTGEA
        I+++              ++ +    A+++N+ETQ+GQL N ++   +G   ++ Q  P+  E C+AIT         V+Q+  E E E  D E   G  
Subjt:  IAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE-QEKPQ-MEYCKAIT---------VHQEESEEEPESEDYETPTGEA

Query:  EEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ
        E +    + +    E    S  +  P    ++ +K+  + QF KF+N F  L+INIPFAEALE MP Y +F+K+ L+KKRK  + +TV+L   CS  +Q 
Subjt:  EEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ

Query:  KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPV
        K+P K+ DPGSF++PC+ G   F +AL DLGASIN++P S+ +KL +GE K T V LQLAD+S V P GI+E+VL++V +F  P+D  ++DM E+  +P+
Subjt:  KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPV

Query:  ILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        IL RPFLAT  AIID+   +++ +V  E   F     SK
Subjt:  ILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

A0A6P6X9H2 Reverse transcriptase3.8e-7434.22Show/hide
Query:  EDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL------------------------------------------
        EDPNSHL +FL+IC T+K++GVS++AIRLRLFPFSL+DKA+ WL S    + TTWD L                                          
Subjt:  EDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDAL------------------------------------------

Query:  --------------VLLFYNGLTPSTKTIVDAAA-------------------------------GIG-SAQSIESAAALASRPQEET-----IEQVQYV
                      +  FYNGL+ STKT++DAAA                               G+G SA +   A       + +T      EQVQ+V
Subjt:  --------------VLLFYNGLTPSTKTIVDAAA-------------------------------GIG-SAQSIESAAALASRPQEET-----IEQVQYV

Query:  SNFNSRRYNNSSTPTHYHPNNRNHENFSY---ANTKNVLNPPGF---APQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQL
        +N+N    NN  + T Y+P  RNH NF +    N     NPPGF    PQ +     E  V      +S+R  ++E         ++  +   +N+E Q+
Subjt:  SNFNSRRYNNSSTPTHYHPNNRNHENFSY---ANTKNVLNPPGF---APQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQL

Query:  GQLVNVVSTMNKGKALAEQEKPQMEYCKAITVHQEESEEEP--------ESEDYETPTGEAEEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQ
        GQ+ + ++  N+G+  ++ E    E+ KAIT+   +  E+P        ESE+        E     +  + P    P  S  + +P      ++ K N+
Subjt:  GQLVNVVSTMNKGKALAEQEKPQMEYCKAITVHQEESEEEP--------ESEDYETPTGEAEEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQ

Query:  V--QFDKFMNAFMNLNINIPFAEA-LEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINII
            F+KF+  F  L+INIPFA+A L++P Y +F+KE + +KRK +  +T+ L   CS  +Q K+P K+ DPGSFS+PC+ G+ +F +ALCDLGAS+++I
Subjt:  V--QFDKFMNAFMNLNINIPFAEA-LEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINII

Query:  PLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVED
        PL++ ++L + E+K T + LQLAD+S+  P+G++ENVLI+V +F +P+D  V+DM E+ SMP+IL RPFLAT   IID++  +L  ++  E+  F   E 
Subjt:  PLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVED

Query:  SK
         K
Subjt:  SK

A0A6P6XAQ1 Reverse transcriptase1.2e-7534.18Show/hide
Query:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSI
        R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+IC T+K +GVSEDAI+LRLFPFSL+DKA+ WLQS    + 
Subjt:  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSI

Query:  TTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG---IG-----SAQSIES
        TTWD L                                                        V  FYNGLT  TKT VDAAAG   +G     + Q IE 
Subjt:  TTWDAL--------------------------------------------------------VLLFYNGLTPSTKTIVDAAAG---IG-----SAQSIES

Query:  AAA-------------------------LASRPQEETI-----------------------------------EQVQYVSNFNSRRYNNSSTPTHYHPNN
         AA                         + S   +  +                                   EQVQY++N+N    NN  + T Y+P  
Subjt:  AAA-------------------------LASRPQEETI-----------------------------------EQVQYVSNFNSRRYNNSSTPTHYHPNN

Query:  RNHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVIAINSTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKALA
        RNH NF +    N +  +NPPGF  +    ++K   +L    +A +SN +  KL  A       + G    +    +N+E QLGQ+ N V+  N+G   +
Subjt:  RNHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVIAINSTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKALA

Query:  EQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEM
        + E    E+ KAIT+   +   EP         G   E    +  +   L+           KE K K+K + N++Q        M     IP      +
Subjt:  EQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEM

Query:  PQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVV
        P Y +F+KE + KKRK    +T+ L   CS  +Q K+P K+ DPGSF+VPC+ G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+ 
Subjt:  PQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVV

Query:  RPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIF
         P+GI+ENVLI+V +F +P+D  V+DM E+ ++P+IL RPFLAT   IID++R +   ++  E+  F
Subjt:  RPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATGRAIIDIERRELTIRVKNEKEIF

A0A6P8DD93 uncharacterized protein LOC1162064532.4e-8132.88Show/hide
Query:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNAPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR SR  E+ P DPEIERT+HRLRRENR     ++++MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNAPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTW----------------------------------------------
         P+ H+  FL  C TVK++ V++D IRL+LFPFSL+DKAR W  S+   SITTW                                              
Subjt:  DPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTW----------------------------------------------

Query:  ----------DALVLLFYNGLTPSTKTIVDAAAG---------------------------------IGSAQSIESAAALASRPQEETI-----------
                  + L+ +FY  L  + +++VDAAAG                                 + S   +++ A L ++    T            
Subjt:  ----------DALVLLFYNGLTPSTKTIVDAAAG---------------------------------IGSAQSIESAAALASRPQEETI-----------

Query:  -------------------------------EQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDL
                                       EQV +V+NF  +R N       Y+P  RNH NFS+ N  N L  PPGF        AP  Q   ++E+L
Subjt:  -------------------------------EQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDL

Query:  VGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEA
        + +++ ++              ++ +    A I+N+E Q+ Q+   +S    G   +  +E P+      +   K + +   +++ + ES + +    + 
Subjt:  VGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEA

Query:  EEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ
        EE        KP + PP+P P         ++ K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS  +Q+
Subjt:  EEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ

Query:  ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPS
            +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +GE K T V LQLAD+S+  P GIVENVL++V +F  P+D  V++M E+  
Subjt:  ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPS

Query:  MPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        +P+IL RPFLATG+A+ID+E+ +LT+RV NE+  F   +  K
Subjt:  MPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

A0A6P8DKJ2 uncharacterized protein LOC1162042311.9e-8132.88Show/hide
Query:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNAPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE
        MR SR  E+ P DPEIERT+HRLRRENR     ++++MAD +   +     R +RDY  P   G  S I    I ANNFELK  LIQM +   + G P E
Subjt:  MRSSR--EVAPFDPEIERTIHRLRRENR-----EIIQMADQNAPEE----PRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTE

Query:  DPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTW----------------------------------------------
         P+ H+  FL  C TVK++ V++D IRL+LFPFSL+DKAR W  S+   SITTW                                              
Subjt:  DPNSHLKSFLDICGTVKISGVSEDAIRLRLFPFSLQDKARDWLQSITHGSITTW----------------------------------------------

Query:  ----------DALVLLFYNGLTPSTKTIVDAAAG---------------------------------IGSAQSIESAAALASRPQEETI-----------
                  + L+ +FY  L  + +++VDAAAG                                 + S   +++ A L ++    T            
Subjt:  ----------DALVLLFYNGLTPSTKTIVDAAAG---------------------------------IGSAQSIESAAALASRPQEETI-----------

Query:  -------------------------------EQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDL
                                       EQV +V+NF  +R N       Y+P  RNH NFS+ N  N L  PPGF        AP  Q   ++E+L
Subjt:  -------------------------------EQVQYVSNFNSRRYNNSSTPTHYHPNNRNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDL

Query:  VGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEA
        + +++ ++              ++ +    A I+N+E Q+ Q+   +S    G   +  +E P+      +   K + +   +++ + ES + +    + 
Subjt:  VGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAE-QEKPQ------MEYCKAITVHQEESEEEPESEDYETPTGEA

Query:  EEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ
        EE        KP + PP+P P  L         K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L   CS  +Q+
Subjt:  EEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQ

Query:  ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPS
            +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +GE K T + LQLAD+S+  P GIVENVL++V +F  P+D  V++M E+  
Subjt:  ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPS

Query:  MPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK
        +P+IL RPFLATG+A+ID+E+ +LT+RV NE+  F   +  K
Subjt:  MPVILERPFLATGRAIIDIERRELTIRVKNEKEIFKAVEDSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAGTAGTAGAGAAGTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCATAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCTGACCAAAATGCACC
TGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGTC
TTATTCAGATGGCTCGAGACTGTGCATACAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGTAAAGATTAGTGGAGTG
TCTGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCATGGGAGCATCACCACCTGGGATGCTTTGGT
ACTGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTATAGGGAGTGCTCAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGAC
CTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGAGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAA
AATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTC
TAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACAGTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAATTGGTGA
ATGTTGTAAGCACCATGAATAAAGGTAAGGCCCTAGCTGAACAAGAGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCATCAGGAGGAATCTGAAGAGGAACCT
GAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCAGCAGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGTTGGT
TCCTAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCAT
TAGAAATGCCCCAATACAACAGATTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAA
CAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATTAT
CCCATTGTCCTTGTGTAAGAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATG
TGTTAATCAGAGTATGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGAACGACCATTCCTCGCTACTGGG
CGAGCGATCATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCCAAAGATGAAGTGCTTTACATGGGCTA
CAGGAAAGGTCTTGTTTTCTTCCTCTCCTTTGCTTTCAAGCTTTCAAGATTCAAAGCTCTCAAGCAAGAAGTCATCATCATTCCAGGCAAAAAGAGCTACCCCTGTTGTG
ATCAGTTAGGCCAACCAGAGAAGTCTCAGCAGGCTAGCGTCTCGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAGCG
TCGAGACGTTATGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTTTATGAGCTGGTTTTTCTTGGAAAATGGAAGGGATTCTTCTAATGTGAT
GGGAATGTATGAGAAGGTAGACATTGATAAAGAAGGCTTCAACTCGAACGGTGGTGCTTCGCAATGTAGATCAGATTCTGTACTTGTGGTCCTTGTGGTGCAGGAAGTAC
TGGAGGTATTCTCCTACCATTTTATTCCACTCTGTGATTTTGTTGGTCAACAGGAGGATCACCTCTGGGTGGATCCATTGGAAGTTCTTGAATGTGCTGATGCAAGCTGC
TCTCAATCATTAGAAGGGAATCCATGTGGGCAAAGTTGTCCAATCAGCCCTGAAACACACAATTATGGCATGCTATTTAAAGATTGGAGGCCAAAATTAGAAGGAGACGA
CTGGAAAAGGAAGAAAATAAAGATTGGAGCTGCTGCTGGGCGAAAAAGAGGAAAAAGACAGAGAGAAGCTGTCGTAGGAGACATTTTTCAGCTTGAGAAAGCTGGATTTT
ACCTTGGAAGCTTCTTCTTTAGCTCTTTCCTCTTCTCCTTGGTGTTAGTTGTGTTCGAGGCTGAAATCATGTTATGTGTGAGTTTGGTGCATGAGCGATCCACCTGGGAA
GATCCACTTAATCTCCAGCAGGACCGTGTGCTGCAACCTAATCCACCGCTGGAGCAAAATGGACATCAAAATAATCCGGCTAAGAACCCTATCTTGGTAGCAAATGATAG
GGCCAGAGCCATTCGAGCGTATGCTTTTTCGATGTTTGATGAGTTAAATCCAGGAATTGCAGGTCCTCAAATTGAGGCAGCAAATTTTGAAATGAAACAGGGTATGGTCG
ACGCTTCGACGGTAGGGGCCCTTTTGGTAAAAACTTTTAATGAATCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCAGATGTTAGAGGCTCAAAT
ACAAAAGTTAAGACTGTGTTAGAAGTTGATGGTGTGTCTACCATTAGGGTTGATATTACAATGTTAGCTAACGCTCTTAAAAATGTGACAATGGTTAGTCATCAACAGCC
GCTAGCTGTGGAGCCTGTTGCAGTGGTGAACCAAGTTGTAAAGGAATCATGTGTCTATTGTGGTGAAGAACACAACTACGAGTTTTGTCCCAGCAATCCAGCTTCTGTGT
TTTTTGTAGCTAATCAAAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTTGCATGGGGAGGGCAAGGAAGCAATTCACAAGTCCCT
CAGACACAACAAAAGGTGAACCAATCAGGATTTGTTAAATCACAGGTATTGCCCCAACAAAATAAGCAGGCTTTGGCCCAACAAAATTCGGAGAGTTCTCTTGAGGCAAT
GATGAAAGAATATATGGCTCGTACAGATGCTGCTATTCAAAATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTACAGACAGTGACTCTAAGGAGTGAGTCTTGTA
AAAGTGACGGAGGCAACAATAATGATGTTGGAGCATCTGGTTCTATTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAA
AAGCAGAAGCCTAAGAATCAAGATGGCGCAAGCATTAACCTTATGCCTCTTTCAGTCTATCGAAAGCTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTTCGACT
AGCGATAGATTATGAGGCAGATAAAGATGTCCCAATTATTCTTGGTCGTTCATTTTTGGCTACTGGTAGGGCATTGATAAATGTTCAGAAAGGGGAACTAACAATGAGGG
TTTATGATGAGGAAGTAAACTTCAATGTTTTCAAGGCCATGAAGTATCCAGACGAAGTGGAAGATTGTTCTTTCATTAAGATTCTAGAGAACACAATTGCTGAGACAACA
ATGAAGGATTTGACAAATAAACATTTGGAAGACTATGGGAGATTAGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTAGTAGTAGAGAAGTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCATAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCTGACCAAAATGCACC
TGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGTC
TTATTCAGATGGCTCGAGACTGTGCATACAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGGTAAAGATTAGTGGAGTG
TCTGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCATGGGAGCATCACCACCTGGGATGCTTTGGT
ACTGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTATAGGGAGTGCTCAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGAC
CTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGAGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAA
AATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTC
TAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACAGTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAATTGGTGA
ATGTTGTAAGCACCATGAATAAAGGTAAGGCCCTAGCTGAACAAGAGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCATCAGGAGGAATCTGAAGAGGAACCT
GAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCAGCAGATGAGGCTGAAAAGCCTAACCTTGAGCCTCCTATTCCTTCTCCCACACTGTTGGT
TCCTAAGGAAAGGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCAT
TAGAAATGCCCCAATACAACAGATTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTTTATCTTGCTTCCACATGCAGCACCAGAGTACAA
CAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGTGCTAGCATTAATATTAT
CCCATTGTCCTTGTGTAAGAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATG
TGTTAATCAGAGTATGTAGATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGAACGACCATTCCTCGCTACTGGG
CGAGCGATCATAGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCCAAAGATGAAGTGCTTTACATGGGCTA
CAGGAAAGGTCTTGTTTTCTTCCTCTCCTTTGCTTTCAAGCTTTCAAGATTCAAAGCTCTCAAGCAAGAAGTCATCATCATTCCAGGCAAAAAGAGCTACCCCTGTTGTG
ATCAGTTAGGCCAACCAGAGAAGTCTCAGCAGGCTAGCGTCTCGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTCACAGCG
TCGAGACGTTATGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTTTATGAGCTGGTTTTTCTTGGAAAATGGAAGGGATTCTTCTAATGTGAT
GGGAATGTATGAGAAGGTAGACATTGATAAAGAAGGCTTCAACTCGAACGGTGGTGCTTCGCAATGTAGATCAGATTCTGTACTTGTGGTCCTTGTGGTGCAGGAAGTAC
TGGAGGTATTCTCCTACCATTTTATTCCACTCTGTGATTTTGTTGGTCAACAGGAGGATCACCTCTGGGTGGATCCATTGGAAGTTCTTGAATGTGCTGATGCAAGCTGC
TCTCAATCATTAGAAGGGAATCCATGTGGGCAAAGTTGTCCAATCAGCCCTGAAACACACAATTATGGCATGCTATTTAAAGATTGGAGGCCAAAATTAGAAGGAGACGA
CTGGAAAAGGAAGAAAATAAAGATTGGAGCTGCTGCTGGGCGAAAAAGAGGAAAAAGACAGAGAGAAGCTGTCGTAGGAGACATTTTTCAGCTTGAGAAAGCTGGATTTT
ACCTTGGAAGCTTCTTCTTTAGCTCTTTCCTCTTCTCCTTGGTGTTAGTTGTGTTCGAGGCTGAAATCATGTTATGTGTGAGTTTGGTGCATGAGCGATCCACCTGGGAA
GATCCACTTAATCTCCAGCAGGACCGTGTGCTGCAACCTAATCCACCGCTGGAGCAAAATGGACATCAAAATAATCCGGCTAAGAACCCTATCTTGGTAGCAAATGATAG
GGCCAGAGCCATTCGAGCGTATGCTTTTTCGATGTTTGATGAGTTAAATCCAGGAATTGCAGGTCCTCAAATTGAGGCAGCAAATTTTGAAATGAAACAGGGTATGGTCG
ACGCTTCGACGGTAGGGGCCCTTTTGGTAAAAACTTTTAATGAATCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCAGATGTTAGAGGCTCAAAT
ACAAAAGTTAAGACTGTGTTAGAAGTTGATGGTGTGTCTACCATTAGGGTTGATATTACAATGTTAGCTAACGCTCTTAAAAATGTGACAATGGTTAGTCATCAACAGCC
GCTAGCTGTGGAGCCTGTTGCAGTGGTGAACCAAGTTGTAAAGGAATCATGTGTCTATTGTGGTGAAGAACACAACTACGAGTTTTGTCCCAGCAATCCAGCTTCTGTGT
TTTTTGTAGCTAATCAAAGGAATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTTGCATGGGGAGGGCAAGGAAGCAATTCACAAGTCCCT
CAGACACAACAAAAGGTGAACCAATCAGGATTTGTTAAATCACAGGTATTGCCCCAACAAAATAAGCAGGCTTTGGCCCAACAAAATTCGGAGAGTTCTCTTGAGGCAAT
GATGAAAGAATATATGGCTCGTACAGATGCTGCTATTCAAAATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTACAGACAGTGACTCTAAGGAGTGAGTCTTGTA
AAAGTGACGGAGGCAACAATAATGATGTTGGAGCATCTGGTTCTATTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAA
AAGCAGAAGCCTAAGAATCAAGATGGCGCAAGCATTAACCTTATGCCTCTTTCAGTCTATCGAAAGCTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACACTTCGACT
AGCGATAGATTATGAGGCAGATAAAGATGTCCCAATTATTCTTGGTCGTTCATTTTTGGCTACTGGTAGGGCATTGATAAATGTTCAGAAAGGGGAACTAACAATGAGGG
TTTATGATGAGGAAGTAAACTTCAATGTTTTCAAGGCCATGAAGTATCCAGACGAAGTGGAAGATTGTTCTTTCATTAAGATTCTAGAGAACACAATTGCTGAGACAACA
ATGAAGGATTTGACAAATAAACATTTGGAAGACTATGGGAGATTAGTGTAG
Protein sequenceShow/hide protein sequence
MRSSREVAPFDPEIERTIHRLRRENREIIQMADQNAPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKISGV
SEDAIRLRLFPFSLQDKARDWLQSITHGSITTWDALVLLFYNGLTPSTKTIVDAAAGIGSAQSIESAAALASRPQEETIEQVQYVSNFNSRRYNNSSTPTHYHPNNRNHE
NFSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKALAEQEKPQMEYCKAITVHQEESEEEP
ESEDYETPTGEAEEDTSADEAEKPNLEPPIPSPTLLVPKERKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQ
QKVPEKVADPGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVCRFFLPIDLYVMDMIENPSMPVILERPFLATG
RAIIDIERRELTIRVKNEKEIFKAVEDSKDEVLYMGYRKGLVFFLSFAFKLSRFKALKQEVIIIPGKKSYPCCDQLGQPEKSQQASVSTLALERLDAHIPYQIRRVKLTA
SRRYDRKRPDATVFPYSERAFMSWFFLENGRDSSNVMGMYEKVDIDKEGFNSNGGASQCRSDSVLVVLVVQEVLEVFSYHFIPLCDFVGQQEDHLWVDPLEVLECADASC
SQSLEGNPCGQSCPISPETHNYGMLFKDWRPKLEGDDWKRKKIKIGAAAGRKRGKRQREAVVGDIFQLEKAGFYLGSFFFSSFLFSLVLVVFEAEIMLCVSLVHERSTWE
DPLNLQQDRVLQPNPPLEQNGHQNNPAKNPILVANDRARAIRAYAFSMFDELNPGIAGPQIEAANFEMKQGMVDASTVGALLVKTFNESHEILERISTNSCQWSDVRGSN
TKVKTVLEVDGVSTIRVDITMLANALKNVTMVSHQQPLAVEPVAVVNQVVKESCVYCGEEHNYEFCPSNPASVFFVANQRNNPYSNFYNPGWRNHPNFAWGGQGSNSQVP
QTQQKVNQSGFVKSQVLPQQNKQALAQQNSESSLEAMMKEYMARTDAAIQNTEHPRREGKEQVQTVTLRSESCKSDGGNNNDVGASGSIPDVEPPYVPPPPYVPPLPFPQ
KQKPKNQDGASINLMPLSVYRKLGIGEARPTTVTLRLAIDYEADKDVPIILGRSFLATGRALINVQKGELTMRVYDEEVNFNVFKAMKYPDEVEDCSFIKILENTIAETT
MKDLTNKHLEDYGRLV