; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022191 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022191
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr7:20674500..20677150
RNA-Seq ExpressionLag0022191
SyntenyLag0022191
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059834.1 uncharacterized protein E6C27_scaffold108G001170 [Cucumis melo var. makuwa]4.0e-5646.1Show/hide
Query:  MKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQYQNCC
        MKID+P YNGK + E+FL+WVK  +NFF YM+T D KKV LVAL+L+GGA AWWDQLE+NRQR GK PI  WE+MKK++K RFLP N+EQ +YNQYQNC 
Subjt:  MKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQYQNCC

Query:  QGTRT--------------KKLGRPKQ-SISRTI-----------LPERN------------------TSQNPSATVKGKQIDLGT-----TKEPTPK-K
        QG+R+                LG  +Q  I+R I           L   N                  T++ PS +V GK  D+ T      K+ T K K
Subjt:  QGTRT--------------KKLGRPKQ-SISRTI-----------LPERN------------------TSQNPSATVKGKQIDLGT-----TKEPTPK-K

Query:  NANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLLAPKTDPTYQ
        + N Y RP+L KCFRC Q+GHLSN CPQR+T+++AD        D  + EE+ +F+E  +G+ +S VIQR+L+APK +   Q
Subjt:  NANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLLAPKTDPTYQ

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]1.4e-5642.68Show/hide
Query:  RRDQRIPYHQQE-NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERF
        R +QRI   + E +D+KMKID+P Y+GK  IEAFL+W+K  ENFF YM+TP+ KKV LVALKL+ GA AWWDQLE+NRQR GK+P+R WE+MKK++K RF
Subjt:  RRDQRIPYHQQE-NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERF

Query:  LPTNFEQILYNQYQNCCQGT------------------------------------------------------------------RTKKLGRPKQSISR
        LP N+EQ LYNQYQNC QG                                                                   R+K L R     + 
Subjt:  LPTNFEQILYNQYQNCCQGT------------------------------------------------------------------RTKKLGRPKQSISR

Query:  TILPERNTSQNPSATVKGKQID-----LGTTKEPTPKKNA-NAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEG
        +   + N   + S   KGK+ID     +   KE T K +  N+Y RP+LGKCFRC QTGHLS+ CPQRKT+AIA+    Q   D  + EE+ + +E  +G
Subjt:  TILPERNTSQNPSATVKGKQID-----LGTTKEPTPKKNA-NAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEG

Query:  EPLSLVIQRLLLAPKTDPTYQ
        E +S VIQRLL+ PK +   Q
Subjt:  EPLSLVIQRLLLAPKTDPTYQ

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]2.8e-5743.3Show/hide
Query:  RRDQRIPYHQQE-NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERF
        R ++RI   + E +D+KMKID+P Y GK  IEAFL+W+K  ENFFTYM+TP+ KKV LVALKL+ GA AWWDQLE+NRQR GK+P+R WE+MKK++K RF
Subjt:  RRDQRIPYHQQE-NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERF

Query:  LPTNFEQILYNQYQNCCQGTRT------------------------------------------------------------------KKLGRPKQSISR
        LP N+EQ LYNQYQNC QG RT                                                                  K L R     + 
Subjt:  LPTNFEQILYNQYQNCCQGTRT------------------------------------------------------------------KKLGRPKQSISR

Query:  TILPERNTSQNPSATVKGKQID-----LGTTKEPTPKKNA-NAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEG
        +   + N   + S   KGK+ID     +   KE T K +  N Y RP+LGKCFRC QTGHLSN CPQRKT+AIA+    Q   D  + EE+ + +E  +G
Subjt:  TILPERNTSQNPSATVKGKQID-----LGTTKEPTPKKNA-NAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEG

Query:  EPLSLVIQRLLLAPKTDPTYQ
        E +S VIQRLL+ PK +   Q
Subjt:  EPLSLVIQRLLLAPKTDPTYQ

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]3.9e-5945.97Show/hide
Query:  RRDQRIPYHQQE-NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERF
        R ++RI   + E +D+KMKID+P Y+GK  IEAFL+W+K  ENFF YM+TP+ KKV LVALKL+ GA AWWDQLE+NRQR GK+PIR WE+MKK++K RF
Subjt:  RRDQRIPYHQQE-NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERF

Query:  LPTNFEQILYNQYQNCCQGT-----------------------------------------RTKKLGRPKQSISRTILPERNTSQNPSATVKGK------
        LP N+EQ LYNQYQNC QG                                          R+K L R  +S   T   +  T+  PS + KGK      
Subjt:  LPTNFEQILYNQYQNCCQGT-----------------------------------------RTKKLGRPKQSISRTILPERNTSQNPSATVKGK------

Query:  -QIDLGTTKEPTPKKNA-NAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLLAPKTDPTYQ
         ++ +   KE T K +  N+Y RP+LGKCFRC QTGHLSN CPQRKT+AIA+    Q   D  + EE+ + +E  +GE +S  IQR+L+ PK +   Q
Subjt:  -QIDLGTTKEPTPKKNA-NAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLLAPKTDPTYQ

XP_031745523.1 uncharacterized protein LOC116405899 [Cucumis sativus]4.6e-5239.09Show/hide
Query:  MKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQYQNCC
        MK+D+P+Y+GK +IE+FL+W+K  ENFF+YM+TP+ KKV+LVALKLKGGA AWW+QLE+NRQR  KRP+R WE+MKK++K RFLP N+EQ LYNQYQNC 
Subjt:  MKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQYQNCC

Query:  QGTR--TKKLGRPKQSISRTILPERNTSQ----------------------------------NPSATVKGKQIDLGTTKEPTP----------------
        QGTR  T+ +    +  +RT L E    Q                                       +K K ++  TT EPTP                
Subjt:  QGTR--TKKLGRPKQSISRTILPERNTSQ----------------------------------NPSATVKGKQIDLGTTKEPTP----------------

Query:  -------------------------KKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLL
                                  KN N Y RP+LGKCFRC Q GHLSN CPQRKT+A+A+        D S+  E+ + +E  EG+   ++  +  L
Subjt:  -------------------------KKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLL

Query:  APKTDPTYQDMPCLRHVARLMGKSALLLLI
         P+    +      +  A LMG+ A  LL+
Subjt:  APKTDPTYQDMPCLRHVARLMGKSALLLLI

TrEMBL top hitse value%identityAlignment
A0A5A7UXS4 CCHC-type domain-containing protein1.9e-5646.1Show/hide
Query:  MKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQYQNCC
        MKID+P YNGK + E+FL+WVK  +NFF YM+T D KKV LVAL+L+GGA AWWDQLE+NRQR GK PI  WE+MKK++K RFLP N+EQ +YNQYQNC 
Subjt:  MKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQYQNCC

Query:  QGTRT--------------KKLGRPKQ-SISRTI-----------LPERN------------------TSQNPSATVKGKQIDLGT-----TKEPTPK-K
        QG+R+                LG  +Q  I+R I           L   N                  T++ PS +V GK  D+ T      K+ T K K
Subjt:  QGTRT--------------KKLGRPKQ-SISRTI-----------LPERN------------------TSQNPSATVKGKQIDLGT-----TKEPTPK-K

Query:  NANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLLAPKTDPTYQ
        + N Y RP+L KCFRC Q+GHLSN CPQR+T+++AD        D  + EE+ +F+E  +G+ +S VIQR+L+APK +   Q
Subjt:  NANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPYEGEPLSLVIQRLLLAPKTDPTYQ

A0A5D3CJ99 Reverse transcriptase7.1e-5146.5Show/hide
Query:  NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQY
        +D+KMKID+PTY GK ++E+FL+W+++ ENFF YM+T D KKV LVALKL+GGA AWWDQ+E+NRQR+GK PI  WE+MKK+MK RFLP N+EQ LYNQY
Subjt:  NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTNFEQILYNQY

Query:  QNCCQGTR-------------------------------TKKLGRPKQSISRTILPERNTSQNPSATVKGKQIDLGTTKEPTPKKNA------NAYLRPT
        QNC QG++                               T +    +   S+       T++ PS +V  K  D+   +    K+NA      N Y RP+
Subjt:  QNCCQGTR-------------------------------TKKLGRPKQSISRTILPERNTSQNPSATVKGKQIDLGTTKEPTPKKNA------NAYLRPT

Query:  LGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQ
        LGKCFRC+QT +LSN CPQRKT+A+A+  EY    D S   E+
Subjt:  LGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQ

A0A5D3DGR0 Reverse transcriptase1.0e-4937.09Show/hide
Query:  ERRRDQRIPYHQQE---NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMM
        E RR + +   +Q+   +++KMKID+P+Y+GK  IE FL+W+K+ ENFF YM T   KKV LVALKLKGGA AWWDQ+ +NRQ+ GK PIR WE+MKK+M
Subjt:  ERRRDQRIPYHQQE---NDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMM

Query:  KERFLPTNFEQILYNQYQNCCQGTRT--------KKLG------------------------------RPKQSISRTIL--------------PERNTSQ
        K+RF+P N+EQ LY QYQNC QG R          +LG                              +P Q +S  I                 R    
Subjt:  KERFLPTNFEQILYNQYQNCCQGTRT--------KKLG------------------------------RPKQSISRTIL--------------PERNTSQ

Query:  NPSATVK----GKQIDLGTTKEPT--------------PKKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPY
         PSA+ K      ++   T+++P                KK  N Y RP  G C+RC Q GH SN+CPQRKT+A+A + +   +    + +E+ + +E  
Subjt:  NPSATVK----GKQIDLGTTKEPT--------------PKKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVDFLEPY

Query:  EGEPLSLVIQRLLLAPKTDPTYQDMPCLRHVARLMGK
        EG+ LS ++QR+L++PK +   Q     +    + GK
Subjt:  EGEPLSLVIQRLLLAPKTDPTYQDMPCLRHVARLMGK

A0A6J1CAS9 uncharacterized protein LOC111009540 isoform X19.3e-5138.3Show/hide
Query:  PTERRRDQRIPYHQQENDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMK
        P   R +  +       DFKMKID+PT+NGKM++E FL+ VK+VENFF Y NTP+ KKVKLVA K++ GA AWWDQLE+N +R GK+PIR W RM ++M+
Subjt:  PTERRRDQRIPYHQQENDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMK

Query:  ERFLPTNFEQILYNQYQNCCQGTRT--------KKLG------------------------------RPKQSISRTIL-------------PERNTS---
        ERFLP NFEQ+LY  YQ C QG +T         +LG                              +P   ++  I+             P R T    
Subjt:  ERFLPTNFEQILYNQYQNCCQGTRT--------KKLG------------------------------RPKQSISRTIL-------------PERNTS---

Query:  ---QNPSATVKGKQIDLGTTKEPT-------------------PKKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVD
              + T  GK + +GTT   T                    K+  N Y+RPTLGKCFRC Q  HLSNECPQR+ LA+ D  +  E       E+   
Subjt:  ---QNPSATVKGKQIDLGTTKEPT-------------------PKKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVD

Query:  FLEPYEGEPLSLVIQRLLLAPKTDPTYQDMPCLRHVARLMGK
        ++EP EG+ LS V+Q+ +L PK +   Q     R    + GK
Subjt:  FLEPYEGEPLSLVIQRLLLAPKTDPTYQDMPCLRHVARLMGK

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X29.3e-5138.3Show/hide
Query:  PTERRRDQRIPYHQQENDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMK
        P   R +  +       DFKMKID+PT+NGKM++E FL+ VK+VENFF Y NTP+ KKVKLVA K++ GA AWWDQLE+N +R GK+PIR W RM ++M+
Subjt:  PTERRRDQRIPYHQQENDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMK

Query:  ERFLPTNFEQILYNQYQNCCQGTRT--------KKLG------------------------------RPKQSISRTIL-------------PERNTS---
        ERFLP NFEQ+LY  YQ C QG +T         +LG                              +P   ++  I+             P R T    
Subjt:  ERFLPTNFEQILYNQYQNCCQGTRT--------KKLG------------------------------RPKQSISRTIL-------------PERNTS---

Query:  ---QNPSATVKGKQIDLGTTKEPT-------------------PKKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVD
              + T  GK + +GTT   T                    K+  N Y+RPTLGKCFRC Q  HLSNECPQR+ LA+ D  +  E       E+   
Subjt:  ---QNPSATVKGKQIDLGTTKEPT-------------------PKKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYSDNEEQVD

Query:  FLEPYEGEPLSLVIQRLLLAPKTDPTYQDMPCLRHVARLMGK
        ++EP EG+ LS V+Q+ +L PK +   Q     R    + GK
Subjt:  FLEPYEGEPLSLVIQRLLLAPKTDPTYQDMPCLRHVARLMGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTTCCAACGGAAAGGAGAAGAGATCAAAGAATTCCCTACCACCAACAAGAAAATGATTTCAAGATGAAAATCGATATCCCAACTTATAATGGGAAAATGGAAAT
TGAAGCATTCTTGGAATGGGTGAAACATGTGGAGAATTTTTTTACCTATATGAATACACCAGACACCAAGAAAGTTAAGTTGGTAGCTCTTAAATTGAAAGGTGGGGCGC
AAGCTTGGTGGGATCAACTTGAGTTAAATCGGCAAAGATTTGGAAAGAGACCAATTCGACGGTGGGAAAGAATGAAAAAGATGATGAAAGAACGGTTCCTACCGACCAAT
TTCGAGCAAATTCTATACAACCAATATCAAAATTGTTGCCAGGGCACACGTACGAAGAAACTTGGGCGACCAAAGCAATCCATATCAAGAACCATTCTTCCTGAGAGAAA
TACTTCACAAAATCCCTCTGCAACAGTCAAAGGCAAACAAATTGATCTGGGAACAACAAAAGAACCGACACCTAAGAAAAATGCAAACGCATACCTGAGACCTACCTTGG
GTAAATGCTTTCGTTGTAATCAGACCGGCCATTTGTCAAACGAATGCCCACAAAGAAAGACACTAGCTATTGCTGATAATGTAGAATATCAAGAAGACAGCGACTACTCG
GACAATGAAGAACAGGTTGATTTTCTTGAACCATATGAGGGAGAACCATTGTCATTGGTGATTCAAAGGCTCCTTTTGGCCCCAAAGACAGACCCAACATACCAAGACAT
GCCTTGTTTAAGACACGTTGCACGATTAATGGGAAAATCTGCACTGTTATTATTGATAGTGTCTTTGCCTCCGCCGTTTGAGTTTCAGTCTTCCCCACTGTCGTCTCCGT
TCATCTTCCCGATTTCTGGCTTCGTTTCAGATTCAGGTATGTTTCGTTCATTTGTTCTTCTTGTTCTTCGTGATTTGAATGCCTTTTCTGCATTCTCCAAACCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCTTCCAACGGAAAGGAGAAGAGATCAAAGAATTCCCTACCACCAACAAGAAAATGATTTCAAGATGAAAATCGATATCCCAACTTATAATGGGAAAATGGAAAT
TGAAGCATTCTTGGAATGGGTGAAACATGTGGAGAATTTTTTTACCTATATGAATACACCAGACACCAAGAAAGTTAAGTTGGTAGCTCTTAAATTGAAAGGTGGGGCGC
AAGCTTGGTGGGATCAACTTGAGTTAAATCGGCAAAGATTTGGAAAGAGACCAATTCGACGGTGGGAAAGAATGAAAAAGATGATGAAAGAACGGTTCCTACCGACCAAT
TTCGAGCAAATTCTATACAACCAATATCAAAATTGTTGCCAGGGCACACGTACGAAGAAACTTGGGCGACCAAAGCAATCCATATCAAGAACCATTCTTCCTGAGAGAAA
TACTTCACAAAATCCCTCTGCAACAGTCAAAGGCAAACAAATTGATCTGGGAACAACAAAAGAACCGACACCTAAGAAAAATGCAAACGCATACCTGAGACCTACCTTGG
GTAAATGCTTTCGTTGTAATCAGACCGGCCATTTGTCAAACGAATGCCCACAAAGAAAGACACTAGCTATTGCTGATAATGTAGAATATCAAGAAGACAGCGACTACTCG
GACAATGAAGAACAGGTTGATTTTCTTGAACCATATGAGGGAGAACCATTGTCATTGGTGATTCAAAGGCTCCTTTTGGCCCCAAAGACAGACCCAACATACCAAGACAT
GCCTTGTTTAAGACACGTTGCACGATTAATGGGAAAATCTGCACTGTTATTATTGATAGTGTCTTTGCCTCCGCCGTTTGAGTTTCAGTCTTCCCCACTGTCGTCTCCGT
TCATCTTCCCGATTTCTGGCTTCGTTTCAGATTCAGGTATGTTTCGTTCATTTGTTCTTCTTGTTCTTCGTGATTTGAATGCCTTTTCTGCATTCTCCAAACCCTAG
Protein sequenceShow/hide protein sequence
MNLPTERRRDQRIPYHQQENDFKMKIDIPTYNGKMEIEAFLEWVKHVENFFTYMNTPDTKKVKLVALKLKGGAQAWWDQLELNRQRFGKRPIRRWERMKKMMKERFLPTN
FEQILYNQYQNCCQGTRTKKLGRPKQSISRTILPERNTSQNPSATVKGKQIDLGTTKEPTPKKNANAYLRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVEYQEDSDYS
DNEEQVDFLEPYEGEPLSLVIQRLLLAPKTDPTYQDMPCLRHVARLMGKSALLLLIVSLPPPFEFQSSPLSSPFIFPISGFVSDSGMFRSFVLLVLRDLNAFSAFSKP