CuGenDBv2

Lag0022696 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022696
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr7:35847213..35855791
RNA-Seq Expression: Lag0022696
Synteny: Lag0022696
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
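
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch of how such a string might be split into its parts, the snippet below parses it and reports the span length. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:35847213..35855791")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.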


Homology
GenBank top hits | e value | % identity | Alignment
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] | 1.1e-32 | 32.85
Query:  EDKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDCF-----------------------
        +D+A+DWL++IPP SITTW+ L QAFL K FPPAK+ +LRTEIGTF+Q  DEQL+EAWER+K+LLR+CPQHGYPD                         
Subjt:  EDKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDCF-----------------------

Query:  ---------------------------------------------SALQAQMSSLANAFLKFSGTGNAQ----SIESTAALASQ----------------
                                                     ++L+AQM+SL NA  K +  G AQ    SI S AALAS+                
Subjt:  ---------------------------------------------SALQAQMSSLANAFLKFSGTGNAQ----SIESTAALASQ----------------

Query:  ------------------TQEENL----EQNVLN-PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVV
                             EN      +NVL  P GF         SLED++  F+ +S + +  LE +V AI + V     A++N+E QL Q+ + +
Subjt:  ------------------TQEENL----EQNVLN-PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVV

Query:  NTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQ---VAEEEETTKTEE
         TM KGK P+  E +  E CKAV++   ++     + +E+E  + ++
Subjt:  NTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQ---VAEEEETTKTEE

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris] | 5.8e-29 | 30.82
Query:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYP---------------------------
        DKA+ W QS+P GSITTWD L Q FL K FPP+K+ +LR EI  F+Q   E  +EAWERFK+LLR+CPQHG+                            
Subjt:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYP---------------------------

Query:  ---------------------------------------DCFSALQAQMSSLANAFLKFSGTGNAQSIESTAALASQTQE--------------------
                                               D  +AL AQ++SL N  +  +  GN Q+++S  + +S  QE                    
Subjt:  ---------------------------------------DCFSALQAQMSSLANAFLKFSGTGNAQSIESTAALASQTQE--------------------

Query:  --------------ENL----EQNVLN-PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGK
                      ENL     +N L  PPGF  Q+ + K  LED++G FI+++ +  NK E  +  I   V+   A +KN+E Q+GQL +++ +  KGK
Subjt:  --------------ENL----EQNVLN-PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGK

Query:  APAEQEKSSLEYCKAVSV
         P++ E +  E+C A+++
Subjt:  APAEQEKSSLEYCKAVSV

XP_022868533.1 uncharacterized protein LOC111388104 [Olea europaea var. sylvestris] | 3.2e-27 | 30.03
Query:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYP---------------------------
        DKA+ W QS+P  SITTWD L Q FL K FPP+K+ +LR EI  F+Q   E  +EAWERFK+LLR+CPQ+G+                            
Subjt:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYP---------------------------

Query:  ---------------------------------------DCFSALQAQMSSLANAFLKFSGTGNAQSIESTAALASQTQE--------------------
                                               D  +AL AQ+SSL N  +  +  GN Q ++S  + +S  QE                    
Subjt:  ---------------------------------------DCFSALQAQMSSLANAFLKFSGTGNAQSIESTAALASQTQE--------------------

Query:  --------------ENL----EQNVLNPP-GFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGK
                      ENL     +N L PP GF  Q+ + K  LED++G FI+++ +  NK E  +  I   V+   A +KN+E Q+ QL + + +  KGK
Subjt:  --------------ENL----EQNVLNPP-GFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGK

Query:  APAEQEKSSLEYCKAVSVHHKEETQVAEEEETTKTEELAGEVK
         P++ E +  E+CKAV +   ++T  +E  E    E++    K
Subjt:  APAEQEKSSLEYCKAVSVHHKEETQVAEEEETTKTEELAGEVK

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber] | 5.5e-27 | 31.3
Query:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDC-------------------------
        DKAR WLQS+ PGSIT+W  + + FL K FPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PD                          
Subjt:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDC-------------------------

Query:  -----------------------------------------FSALQAQMSSLANAFLKFSGTGNAQSIESTAAL--------ASQTQ-------------
                                                 F+AL AQ++SL++     S     QS E  AA         ASQ Q             
Subjt:  -----------------------------------------FSALQAQMSSLANAFLKFSGTGNAQSIESTAAL--------ASQTQ-------------

Query:  -------------EENLE----QNVLN-PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGK
                      EN      +NVL  PPGF  Q  E K SLED + +F+ ++     K +  +  I    +   A +KN+E Q+GQL + +N   +G 
Subjt:  -------------EENLE----QNVLN-PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGK

Query:  APAEQEKSSLEYCKAVSVH--HKEETQVAEEEETTKTEELAGEVK
         P+  E +  E CKA+++    + ET  ++E ETT T    G+ K
Subjt:  APAEQEKSSLEYCKAVSVH--HKEETQVAEEEETTKTEELAGEVK

XP_030964936.1 uncharacterized protein LOC115986224 [Quercus lobata] | 2.9e-28 | 31.46
Query:  NCMWENHV------CSGEDKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPD------CF
        NC+ E+ +       S  DKAR WLQS+ PGSIT+W  + + FL K+FPPAKT +LR++IG F+Q   E L+EAWER+K+L+R+CPQHG PD       +
Subjt:  NCMWENHV------CSGEDKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPD------CF

Query:  SALQAQMSSL----------------ANAFLKFSGTGNAQ-SIEST---------------------AALASQTQEENLE--------------------
        + L  Q  ++                A++ L+   + N Q S E T                     A+L+ Q Q  N                      
Subjt:  SALQAQMSSL----------------ANAFLKFSGTGNAQ-SIEST---------------------AALASQTQEENLE--------------------

Query:  --------QNVLNP-PGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGKAPAEQEKSSLEYCK
                +NVL P PGF  Q  E K SLED + +F+ ++     K +  +  I    +   A +KN+E Q+GQL + +N   +G  P+  E +  E CK
Subjt:  --------QNVLNP-PGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGKAPAEQEKSSLEYCK

Query:  AVSVHHKEETQVAEEEETTKT
        A+++    E +    +ET  T
Subjt:  AVSVHHKEETQVAEEEETTKT

TrEMBL top hits | e value | % identity | Alignment
A0A2I4F4C8 uncharacterized protein LOC108995373 | 2.2e-26 | 31.71
Query:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPD------CFSALQAQMSSLANA-----
        DKAR WLQS+  GSIT+W  + + FL K FPPAKT +LR+EI  F+Q   E L+EAWER+K L+R CPQHG P+       ++ L  +  ++ +A     
Subjt:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPD------CFSALQAQMSSLANA-----

Query:  -----------FLKFSGTGN-----------------------AQSIESTAAL--------ASQTQEENLEQNVLNPPGFTPQSQESKKSLEDLVGAFIA
                   +L    T N                        Q I+  AA         ASQ Q + +     N  GF  Q  +   SLED + +F+ 
Subjt:  -----------FLKFSGTGN-----------------------AQSIESTAAL--------ASQTQEENLEQNVLNPPGFTPQSQESKKSLEDLVGAFIA

Query:  KSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAEEEETTKTEELA
        +++    K +  +  I+   +   AAIKNIE Q+G+L +++N   +G  P+  E +  E CKA+++    E + +  +ET  T  +A
Subjt:  KSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAEEEETTKTEELA

A0A2I4G4Q3 uncharacterized protein LOC109004712 | 4.4e-22 | 28.32
Query:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDC-------------------------
        D+AR WLQS+ P SIT+W  + + F  K FPPAKT +LR+EIG F+Q   E L+EAWE +K+L+R+CPQHG PD                          
Subjt:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDC-------------------------

Query:  -----------------------------------------FSALQAQMSSLANAFLKFSGTGNAQSIE-----STAALASQTQEENLE-----------
                                                  +AL AQ+++L++     +     QS E     S   L+++  +E ++           
Subjt:  -----------------------------------------FSALQAQMSSLANAFLKFSGTGNAQSIE-----STAALASQTQEENLE-----------

Query:  ----------------------QNVLN---PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAI-KNIETQLGQLVSVVNTMN
                              +NVL    PPGF  QS E K SLED + +FI +++    K +  +  I    +   AAI KNIE Q+GQL + +N   
Subjt:  ----------------------QNVLN---PPGFTPQSQESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAI-KNIETQLGQLVSVVNTMN

Query:  KGKAPAEQEKSSLEYCKAVSVHHKEE----TQVAEEEETTKTEELA
        +G  P+  E +  E CKA+ +    E     ++ E ++TT + +LA
Subjt:  KGKAPAEQEKSSLEYCKAVSVHHKEE----TQVAEEEETTKTEELA

A0A6J1DU19 uncharacterized protein LOC111024361 | 2.9e-26 | 34.12
Query:  QAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHG---------------------------------------------------
        QAFL   FPPAKT +LRTEI +F++   EQLFE WER+KELLRKCPQHG                                                   
Subjt:  QAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHG---------------------------------------------------

Query:  YP---------------DCFSALQAQMSSLANAFLKFSGTGNAQSIESTAALASQTQEENLEQNVLNPPGFTPQSQESKKSLEDLVGAFIAKSSNISNKL
        +P               D  S+L+AQ+ +L NA  K SG G + S E  AA    T   +  +  +    FT    E K SLEDL+GAFI +  + ++++
Subjt:  YP---------------DCFSALQAQMSSLANAFLKFSGTGNAQSIESTAALASQTQEENLEQNVLNPPGFTPQSQESKKSLEDLVGAFIAKSSNISNKL

Query:  EEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAE----EEETTKTEELAGE---VKGAPPQMK
        E  V  +   + G+  +IKN+E Q+GQ+   +NTM KGK P++ E    E+CKAV++   +E Q  E    EE    TEE   +   VK A P ++
Subjt:  EEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAE----EEETTKTEELAGE---VKGAPPQMK

A0A6P3YRS3 uncharacterized protein LOC107403777 | 6.7e-23 | 34.4
Query:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDCFSALQAQMSSLANAFLKFSGTGNAQ
        DKA+ WL S+P G+ITTWDA+   FL K FPP+KT KL+++I  F Q   E L++AWERFKELLR+CP HG+P   + +Q Q+      F      GN Q
Subjt:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDCFSALQAQMSSLANAFLKFSGTGNAQ

Query:  SIESTA---------ALASQTQEENLEQNVLNPPGFTPQSQESKKSLEDL----VGAFIA----KSSNIS----NKLEEAVIAINNIVNGHFAAIKNIET
         +++ A         + A +  EE    N   P       +   K++ED+    +GA +A    K+ N       + E+ +   + + N   AAIK++E 
Subjt:  SIESTA---------ALASQTQEENLEQNVLNPPGFTPQSQESKKSLEDL----VGAFIA----KSSNIS----NKLEEAVIAINNIVNGHFAAIKNIET

Query:  QLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAEEEETTK
        Q+GQL +     ++G  P++ EK+  E  +A+++  +   QVA  +E+ K
Subjt:  QLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAEEEETTK

A0A6P6GBP4 uncharacterized protein LOC107421998 | 2.0e-22 | 34.4
Query:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDCFSALQAQMSSLANAFLKFSGTGNAQ
        DKA+ WL S+P G+ITTWDA+   FL K FPP+KT KL+++I  F Q   E L++AWERFKELLR+CP HG+P   + +Q Q+      F      GN Q
Subjt:  DKARDWLQSIPPGSITTWDALGQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDCFSALQAQMSSLANAFLKFSGTGNAQ

Query:  SIESTA--ALASQTQEENLE-------QNVLNPPGFTPQSQESKKSLEDL----VGAFIA----KSSNIS----NKLEEAVIAINNIVNGHFAAIKNIET
         +++ A  +L  + Q E  E        N   P       +   K++ED+    +GA +A    K+ N       + E+ +   + + N   AAIK++E 
Subjt:  SIESTA--ALASQTQEENLE-------QNVLNPPGFTPQSQESKKSLEDL----VGAFIA----KSSNIS----NKLEEAVIAINNIVNGHFAAIKNIET

Query:  QLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAEEEETTK
        Q+GQL +     ++G  P++ EK+  E  +A+++  +   QV   +E+ K
Subjt:  QLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAEEEETTK

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAAGTTAGGGAGAGAACTCTTAAAACCAAGCCTTACCCTACTTGTAGGATTTGGGGGAGAAAGAGTGACACCACAGAATACAGTGATGAAGATAGTAAACTTCGTAGT
CGTAGATTGCATTTCAACCTACAATGAAATCTTAGGAAGGTCGACCTTGCATGATATGACGGCCATAACTTCAACTTTATCATCAGTTGCTGAAGTTTCCAACTCAGAGT
GGTGTAGGAATTATCAAAGGAGAGCAGAAGGCGTCGAGGGAGTGTTACTGGACAACACTGAAAGAAGACAAGCCGCAGAAGGTCGAGGCCGACCAGGGGTGAACTGTGTC
GAGGCTGACCAGGGCGCACTTCCACCAACACTGAATCTGAACTTATGGAGTTTCAGGGCTTTGTATTTATTGCATTTCATCATGTTGTTCATTAGAGGAGCACTGATACT
TAAGGACCAAGAGGTAGCCCAGGGCGAGATCAAGGACCTCTTGCTGCTGGTTTTGGGTGAAAACCAAGGAGATTTGGCGAAACGGATCAAAAATTTCTACAAAGATAGGG
ATATAGGGTTCAATTGCATGTGGGAGAATCATGTATGCTCAGGAGAGGACAAAGCGAGGGACTGGTTACAATCGATTCCACCTGGGAGCATCACCACCTGGGATGCTCTA
GGCCAGGCATTTCTGAAGAAAATTTTCCCCCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACGTTCCAGCAGCAGTTTGATGAACAACTGTTCGAGGCCTGGGA
GCGATTTAAAGAGCTGCTAAGGAAATGCCCTCAGCATGGCTATCCCGACTGCTTCAGTGCACTTCAGGCCCAGATGTCTTCCCTTGCTAATGCATTTTTAAAGTTTTCAG
GTACAGGGAATGCTCAATCGATTGAGTCTACAGCTGCCCTTGCTTCCCAAACTCAGGAGGAAAATCTCGAACAGAATGTTCTGAATCCTCCTGGCTTCACCCCTCAAAGT
CAAGAAAGTAAGAAATCTCTAGAGGATCTCGTTGGAGCTTTTATTGCAAAATCTAGTAACATATCAAATAAGCTTGAGGAGGCCGTGATTGCCATAAACAACATTGTCAA
TGGCCATTTTGCAGCCATCAAGAACATAGAGACTCAACTGGGACAACTGGTAAGTGTTGTCAACACAATGAATAAAGGAAAAGCCCCAGCTGAACAGGAGAAATCTTCAT
TGGAGTACTGCAAGGCCGTCTCTGTGCATCACAAGGAGGAGACTCAAGTAGCTGAGGAGGAGGAGACTACTAAAACTGAGGAACTAGCTGGAGAAGTTAAGGGGGCACCA
CCTCAAATGAAGCTGAAAAGCTTATACCTGAGCCCTCTATCCCTTCTCCTACTGTTTTAG
mRNA sequence
ATGAAGTTAGGGAGAGAACTCTTAAAACCAAGCCTTACCCTACTTGTAGGATTTGGGGGAGAAAGAGTGACACCACAGAATACAGTGATGAAGATAGTAAACTTCGTAGT
CGTAGATTGCATTTCAACCTACAATGAAATCTTAGGAAGGTCGACCTTGCATGATATGACGGCCATAACTTCAACTTTATCATCAGTTGCTGAAGTTTCCAACTCAGAGT
GGTGTAGGAATTATCAAAGGAGAGCAGAAGGCGTCGAGGGAGTGTTACTGGACAACACTGAAAGAAGACAAGCCGCAGAAGGTCGAGGCCGACCAGGGGTGAACTGTGTC
GAGGCTGACCAGGGCGCACTTCCACCAACACTGAATCTGAACTTATGGAGTTTCAGGGCTTTGTATTTATTGCATTTCATCATGTTGTTCATTAGAGGAGCACTGATACT
TAAGGACCAAGAGGTAGCCCAGGGCGAGATCAAGGACCTCTTGCTGCTGGTTTTGGGTGAAAACCAAGGAGATTTGGCGAAACGGATCAAAAATTTCTACAAAGATAGGG
ATATAGGGTTCAATTGCATGTGGGAGAATCATGTATGCTCAGGAGAGGACAAAGCGAGGGACTGGTTACAATCGATTCCACCTGGGAGCATCACCACCTGGGATGCTCTA
GGCCAGGCATTTCTGAAGAAAATTTTCCCCCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACGTTCCAGCAGCAGTTTGATGAACAACTGTTCGAGGCCTGGGA
GCGATTTAAAGAGCTGCTAAGGAAATGCCCTCAGCATGGCTATCCCGACTGCTTCAGTGCACTTCAGGCCCAGATGTCTTCCCTTGCTAATGCATTTTTAAAGTTTTCAG
GTACAGGGAATGCTCAATCGATTGAGTCTACAGCTGCCCTTGCTTCCCAAACTCAGGAGGAAAATCTCGAACAGAATGTTCTGAATCCTCCTGGCTTCACCCCTCAAAGT
CAAGAAAGTAAGAAATCTCTAGAGGATCTCGTTGGAGCTTTTATTGCAAAATCTAGTAACATATCAAATAAGCTTGAGGAGGCCGTGATTGCCATAAACAACATTGTCAA
TGGCCATTTTGCAGCCATCAAGAACATAGAGACTCAACTGGGACAACTGGTAAGTGTTGTCAACACAATGAATAAAGGAAAAGCCCCAGCTGAACAGGAGAAATCTTCAT
TGGAGTACTGCAAGGCCGTCTCTGTGCATCACAAGGAGGAGACTCAAGTAGCTGAGGAGGAGGAGACTACTAAAACTGAGGAACTAGCTGGAGAAGTTAAGGGGGCACCA
CCTCAAATGAAGCTGAAAAGCTTATACCTGAGCCCTCTATCCCTTCTCCTACTGTTTTAG
Protein sequence
MKLGRELLKPSLTLLVGFGGERVTPQNTVMKIVNFVVVDCISTYNEILGRSTLHDMTAITSTLSSVAEVSNSEWCRNYQRRAEGVEGVLLDNTERRQAAEGRGRPGVNCV
EADQGALPPTLNLNLWSFRALYLLHFIMLFIRGALILKDQEVAQGEIKDLLLLVLGENQGDLAKRIKNFYKDRDIGFNCMWENHVCSGEDKARDWLQSIPPGSITTWDAL
GQAFLKKIFPPAKTVKLRTEIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDCFSALQAQMSSLANAFLKFSGTGNAQSIESTAALASQTQEENLEQNVLNPPGFTPQS
QESKKSLEDLVGAFIAKSSNISNKLEEAVIAINNIVNGHFAAIKNIETQLGQLVSVVNTMNKGKAPAEQEKSSLEYCKAVSVHHKEETQVAEEEETTKTEELAGEVKGAP
PQMKLKSLYLSPLSLLLLF
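
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this gene (an assumption; the page does not name a translation table). The `translate` helper and the `CODON_TABLE` name are hypothetical and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
# ASSUMPTION: the standard table is the right one for this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGAAGTTAGGGAGAGAA"))
```

Pasting the full wrapped CDS into `translate` and comparing the result with the protein sequence is one way to confirm that the two listings agree.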