CuGenDBv2

Spg017536 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg017536
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Transposon Ty3-I Gag-Pol polyprotein
Genome location: scaffold4:35387744..35402565
RNA-Seq Expression: Spg017536
Synteny: Spg017536
Gene Ontology terms: NA
InterPro domains: NA
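
The genome location in the record is written as scaffold:start..end. A minimal sketch of how such a string could be split into its parts and turned into a span length is given here. The function names parse_location and span_length are hypothetical, and the length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


if __name__ == "__main__":
    scaffold, start, end = parse_location("scaffold4:35387744..35402565")
    print(scaffold, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers the whole gene locus on the scaffold and is therefore not expected to equal the CDS length.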


Homology
GenBank top hits: e value, % identity, alignment
XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo] (e value: 1.4e-32; % identity: 48.22)
Query:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQ-PPVVEPAAVV---------MMKE
        +VDASA  A+L++T+NEAYEILE I++N+CQW+DVR    +K + VLEVD +S+I A LA + N L+N+ +         V  AAV+         ++KE
Subjt:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQ-PPVVEPAAVV---------MMKE

Query:  FMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKE------PSKTQDIDDNCDR--NVVVEKE
        +MA+ D AIQS QAS+R LE+QVGQLANEL+ RP  KLP+DTE P+REG EQ +A+ LRSGK +  R E       S++Q+  D   R    VV++E
Subjt:  FMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKE------PSKTQDIDDNCDR--NVVVEKE
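
In the alignment blocks, the unlabelled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. A rough sketch of counting these per block is given here. The function name midline_stats is hypothetical, and because the displayed blocks may cover only part of a hit, the counts need not reproduce the % identity reported for the hit.

```python
def midline_stats(query_segment, midline):
    """Count identities and positives from one BLAST-style alignment block.

    query_segment: the aligned query text (after the 'Query:' label).
    midline: the unlabelled line beneath it, same starting column.
    Returns (identities, positives, alignment_length).
    """
    length = len(query_segment)
    # Trailing spaces of the midline are often stripped; pad them back.
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, length


if __name__ == "__main__":
    ident, pos, length = midline_stats("MVDASAGVAL", "+VDASA  A+")
    print(f"identities {ident}/{length}, positives {pos}/{length}")
```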

XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa] (e value: 1.6e-23; % identity: 34.9)
Query:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPVV--------------------
        ++DASA  A+L++++NEA+EILE I++N+ QWS  R  T++KV  VLEVD ++ + A +A + N LKN+ +    QP                       
Subjt:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPVV--------------------

Query:  --EPAAVV--------------------------------------------MMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEH
           PA+V                                             +M+++MA+ D  IQS  AS++ LE+Q+GQLAN+LK RPQG LPSDTE+
Subjt:  --EPAAVV--------------------------------------------MMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEH

Query:  PRREGKEQVKAVTLRSGKPLEER------KEPSKTQDIDDNCDRNVVVEKELETG
        PRR+GKE  KAVTLRSGK +E        KE S  Q   +   +  +   E+  G
Subjt:  PRREGKEQVKAVTLRSGKPLEER------KEPSKTQDIDDNCDRNVVVEKELETG

XP_030494874.1 uncharacterized protein LOC115710657 [Cannabis sativa] (e value: 6.0e-23; % identity: 36.21)
Query:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKN--------------VTVISHQQ----------
        ++DAS   A+L++++NE +EILE I++N+ QWS+ R  T++KV  VLEVD ++ + A +A + N LKN              V  + +Q           
Subjt:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKN--------------VTVISHQQ----------

Query:  --------------------------------PP-------------VVEPAAV-VMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPS
                                        PP               +P+++  +M+++MA+ D  IQS  AS+R LELQ+G LANELKARPQG LPS
Subjt:  --------------------------------PP-------------VVEPAAV-VMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPS

Query:  DTEHPRREGKEQVKAVTLRSGKPLEERKEPSK
        DTE+PRR+GKEQ  A+ LRSGK L+  +E  K
Subjt:  DTEHPRREGKEQVKAVTLRSGKPLEERKEPSK

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa] (e value: 1.0e-22; % identity: 36.29)
Query:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTV-------------IS--------------
        ++DASA  A+L++++NEA+EILE I++N+ QWS  R  T++KV  VLEVD ++ + A +A + N LKN+ +             IS              
Subjt:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTV-------------IS--------------

Query:  ------------------------------------------HQQPPVVEP---------AAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKA
                                                   QQP   +P         +   +M+++MA+ D  IQS  AS+R LE+Q+GQLAN+LK 
Subjt:  ------------------------------------------HQQPPVVEP---------AAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKA

Query:  RPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEER------KEPSKTQ
        RPQG LPSDTE+PRR+GKE  KAVTLRSGK +E        KEPS  Q
Subjt:  RPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEER------KEPSKTQ

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa] (e value: 3.1e-27; % identity: 43.92)
Query:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISH--------------------QQP---
        ++DASA  A+L++++NEA+EILE I++N+ QWS+ R  T++KV  VLEVD ++ + A +A + N   N++                        QQP   
Subjt:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISH--------------------QQP---

Query:  ---PVVEPAAV-VMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKEPSK
              +P+++  +M+++MA+ D  IQS  AS+R LELQ+G LANELKARPQG LPSDTE+PRR+GKEQ K++ LRSGK L+  +E  K
Subjt:  ---PVVEPAAV-VMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKEPSK

TrEMBL top hits: e value, % identity, alignment
A0A061EW79 Retrotrans_gag domain-containing protein (e value: 2.6e-16; % identity: 35.29)
Query:  VDASAGVALLARTFNEAYEILEIISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQ---------------------QPPVVE
        +DA+   AL++++ ++AY++LE I +N+ QW   R   +K+  + E+D ++T+   L   A  +  ++V + Q                     + P++E
Subjt:  VDASAGVALLARTFNEAYEILEIISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQ---------------------QPPVVE

Query:  --PAAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERKEPSKTQD
          P+   +  +FM +T+  IQ+   S+R LE+QVGQLA+ L  RPQG LPSDTE +PRREGKE   A+TL +GK   E K P   +D
Subjt:  --PAAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSGKPLEERKEPSKTQD

A0A5B6UYR6 Aspartic proteinase CDR1-like (e value: 3.8e-15; % identity: 35.79)
Query:  HERSAWGMVDASAGVALLARTFNEAYEILEIISTNSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLAMIANALK--NVTVISHQQPPVVE--PAAVV---
        H R A   V AS    LL + +NEAY+ILE I+ N  Q+  +R GT ++V  V+E+D ++++ A   ++   +K   +T +  ++  V +  P+ +    
Subjt:  HERSAWGMVDASAGVALLARTFNEAYEILEIISTNSCQWSDVR-GTNKKVKSVLEVDGVSTIRADLAMIANALK--NVTVISHQQPPVVE--PAAVV---

Query:  ----------------------MMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL
                              M +E+MA+ +  IQS  A++RALE QV Q+AN L +R QG LPS+TE+ R +GKE  KA+TLRSG  L
Subjt:  ----------------------MMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPL

A0A5B6VNY6 Gag-asp_proteas domain-containing protein (e value: 2.4e-17; % identity: 42.24)
Query:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVT--VISH--QQPPVVEPAAVVMMKEFMARTD
        +VDASA   LL++++NEAY I++ I++ +CQW   R  + ++V  V EVD ++++ A +  I++ LK  T   ++H   QPP       V+   +MA+ D
Subjt:  MVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADLAMIANALKNVT--VISH--QQPPVVEPAAVVMMKEFMARTD

Query:  TAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEER
          IQ   A+++ LE +VGQLA EL  RPQG  PSD ++PR  GKE  K V LRSGK LE +
Subjt:  TAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEER

A0A6J1G7Q6 uncharacterized protein LOC111451598 (e value: 1.2e-16; % identity: 30)
Query:  KGANSVLEQSWE------RKLPRVSLVH-----------ERSAWGMVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRGT-NKKVKSVLEVDGVSTI
        K  N  L ++WE      RK P   L H             +   +VDASA   +L++T+NEAYEILE I++N+CQW DVR    KK + VLEVD +S+I
Subjt:  KGANSVLEQSWE------RKLPRVSLVH-----------ERSAWGMVDASAGVALLARTFNEAYEILEIISTNSCQWSDVRGT-NKKVKSVLEVDGVSTI

Query:  RADLAMIANALKNV---------------TVI------------------------------------------------------------------SH
         A LA + N L+N+               TV+                                                                  + 
Subjt:  RADLAMIANALKNV---------------TVI------------------------------------------------------------------SH

Query:  QQPP---------------------------------VVEPAAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKE
        Q PP                                 +       ++KE+MAR D  IQS Q S+R LE+QVGQLANEL+ RP GKLP+DTE P+REG E
Subjt:  QQPP---------------------------------VVEPAAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKE

A0A6J1H7K8 uncharacterized protein LOC111461167 (e value: 1.1e-14; % identity: 49.56)
Query:  QQPPVVEPAAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKE------PSKTQDIDD
        Q   + E +   ++KE+MA+ D  IQS QAS+R LE+QVGQLANEL+ RP GKLPSDTE P+REG EQ +A+ LRSGK +  R+E       S++Q+  D
Subjt:  QQPPVVEPAAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKE------PSKTQDIDD

Query:  NCDR--NVVVEKE
           R    VV++E
Subjt:  NCDR--NVVVEKE

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found

Sequences
CDS sequence
ATGAAACGTCACTCCACTTCTCAGTCGAAAAACCCCACTCCATCCCAGAAACCGCCTAAGGGTTCTTCTTCTGAGAGGTTGAAGGTGGTTTCTAAAACTTCTCCTTTTCC
ACGGCTAATGAGTACCCACGATTTCGAACGTTCGGCCTCCCTAAGGTCGAAAACTCAAGAGGAGGAAGAAGAGGAAGTAACACATGCAAGGATGCCAAAACCAGTGAGGG
ATGAGGAGAGAAAGAGAGTAGGAATCAAGTACGTGAGAAGGAAAATCAAGGAGCAAGCTAAGCCAAACAGCCCAGTGGCGGCCGCAAATGCGGTGCAAACTGGTGTTTCT
TTTGCAGGAAATCAGGAATTTGCAGCCGCAAATGTGGCCCAACCAGAGAAAGAAGTGCCAGCTTCTTGTTGTGCTCCTGCAAGCGTACGGTTGCCCAACAAATCCAAGCC
AAAAACCCGAGAGGACAGCGTCTTGACGCCACAAACTAGCGTCTCGACGGTGTCGTCGGGAGCCAACCACGCAATTTGGTGTTATTTCGGCCCAATTATTGAGTTTTGGA
GCCATCTCGGTGTTCTTGGACGGCAAGGAGACAAAACTGAGAAAAAAGGTAGCAAGGCTCGGATTGGGGACGGCGTCGAGACGCTGCCTGATAAGGAAAAACAGAGGAAA
ACTGGAAAACCCCAGAAATGCGACCACATTTATGGGAAGGCAAAATCCAAATGCGACCGCATTTCTGGGCAGACAGAGGCAGTTTCGAGTCCGTCGCGGGTCGTTAAGCA
GCCGCAGAAGAACTTTCAACCAGTCAAAGTACAGAATCAAGAGTCAAATCTAGAGGCTCTGATGAAGGAGTACATGGCAAGAAATGATGTCGCTGTCAGGAATTTGGAAG
TACATATTGGTAAGGAGCAGTGCGAGGCCGTCACCTTGAGAAGTATATTAGAATACGATGGACCAGAATACCCCGTGAATCAAGAAGTAAGGAAAATCCCAAAAAAAGCT
CCAAAAGAAGTTCCAGAAAAGAAGTCAAAAGTTACCACCGAAGTTTTTCAGAATAAGGAAAATAAAGAAAAGACCAAGGAATTAGAAACCTTGGAAGAAAAAAGTGAAAG
TTCAAGTTCAGAGAAAAAGCAGGAAAATGCAACCGCAAAGCCTAAAAAGTTTATTATAGATCCAGATTACAGACCACCACCTCCATATCCTCAGAGATTCAAATATGCCT
CACAAGACGCACAGTTTAAAAAGTTTCTAGATATCCTCAAGCTGAAGTGTCTTCCCTACTATTCTCCGAGCCAAGAACGACTAGTGGGAGTCTATAGAGATAGGGAATGG
GAAGGGAATGCTAGAAGAGCAAATCCTTGCACCTTGGAGTCTCAAGTTGAAGGACACATGGTGGTAATGCAAGCCAAAGTGGATTCTAGAATTGAGTTAGAAGTGGTGAT
TATTTGTCCATGCCGAAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAATGCTCAGAATCTGTTGCTGGGCGACTTAAGGGAGCAAACTCTGTGCTGGAGCAAA
GCTGGGAGCGAAAACTGCCACGTGTGAGTTTGGTGCATGAGCGATCCGCCTGGGGTATGGTTGATGCTTCGGCTGGAGTGGCCCTTTTGGCAAGAACTTTTAACGAAGCC
TATGAAATTTTAGAAATAATATCTACTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAAGTTGATGGTGTGTCCACCATTAG
GGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCGCCAGTTGTGGAGCCTGCTGCAGTGGTAATGATGAAAGAATTTATGG
CTCGTACAGACACCGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCCAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCC
TCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGTGGTAAGCCACTAGAAGAAAGAAAAGAGCCTAGTAAAACCCAGGATAT
AGATGATAATTGTGATAGAAATGTTGTTGTTGAGAAAGAGTTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAAAGATGCTGGAGCATCTGAAGGTTGTTACGGCAAAG
TTATGGCTGAAGCAAATCTTCCGTCCGGATGGAAAGGGTTGTTGTGA
mRNA sequence
ATGAAACGTCACTCCACTTCTCAGTCGAAAAACCCCACTCCATCCCAGAAACCGCCTAAGGGTTCTTCTTCTGAGAGGTTGAAGGTGGTTTCTAAAACTTCTCCTTTTCC
ACGGCTAATGAGTACCCACGATTTCGAACGTTCGGCCTCCCTAAGGTCGAAAACTCAAGAGGAGGAAGAAGAGGAAGTAACACATGCAAGGATGCCAAAACCAGTGAGGG
ATGAGGAGAGAAAGAGAGTAGGAATCAAGTACGTGAGAAGGAAAATCAAGGAGCAAGCTAAGCCAAACAGCCCAGTGGCGGCCGCAAATGCGGTGCAAACTGGTGTTTCT
TTTGCAGGAAATCAGGAATTTGCAGCCGCAAATGTGGCCCAACCAGAGAAAGAAGTGCCAGCTTCTTGTTGTGCTCCTGCAAGCGTACGGTTGCCCAACAAATCCAAGCC
AAAAACCCGAGAGGACAGCGTCTTGACGCCACAAACTAGCGTCTCGACGGTGTCGTCGGGAGCCAACCACGCAATTTGGTGTTATTTCGGCCCAATTATTGAGTTTTGGA
GCCATCTCGGTGTTCTTGGACGGCAAGGAGACAAAACTGAGAAAAAAGGTAGCAAGGCTCGGATTGGGGACGGCGTCGAGACGCTGCCTGATAAGGAAAAACAGAGGAAA
ACTGGAAAACCCCAGAAATGCGACCACATTTATGGGAAGGCAAAATCCAAATGCGACCGCATTTCTGGGCAGACAGAGGCAGTTTCGAGTCCGTCGCGGGTCGTTAAGCA
GCCGCAGAAGAACTTTCAACCAGTCAAAGTACAGAATCAAGAGTCAAATCTAGAGGCTCTGATGAAGGAGTACATGGCAAGAAATGATGTCGCTGTCAGGAATTTGGAAG
TACATATTGGTAAGGAGCAGTGCGAGGCCGTCACCTTGAGAAGTATATTAGAATACGATGGACCAGAATACCCCGTGAATCAAGAAGTAAGGAAAATCCCAAAAAAAGCT
CCAAAAGAAGTTCCAGAAAAGAAGTCAAAAGTTACCACCGAAGTTTTTCAGAATAAGGAAAATAAAGAAAAGACCAAGGAATTAGAAACCTTGGAAGAAAAAAGTGAAAG
TTCAAGTTCAGAGAAAAAGCAGGAAAATGCAACCGCAAAGCCTAAAAAGTTTATTATAGATCCAGATTACAGACCACCACCTCCATATCCTCAGAGATTCAAATATGCCT
CACAAGACGCACAGTTTAAAAAGTTTCTAGATATCCTCAAGCTGAAGTGTCTTCCCTACTATTCTCCGAGCCAAGAACGACTAGTGGGAGTCTATAGAGATAGGGAATGG
GAAGGGAATGCTAGAAGAGCAAATCCTTGCACCTTGGAGTCTCAAGTTGAAGGACACATGGTGGTAATGCAAGCCAAAGTGGATTCTAGAATTGAGTTAGAAGTGGTGAT
TATTTGTCCATGCCGAAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAATGCTCAGAATCTGTTGCTGGGCGACTTAAGGGAGCAAACTCTGTGCTGGAGCAAA
GCTGGGAGCGAAAACTGCCACGTGTGAGTTTGGTGCATGAGCGATCCGCCTGGGGTATGGTTGATGCTTCGGCTGGAGTGGCCCTTTTGGCAAGAACTTTTAACGAAGCC
TATGAAATTTTAGAAATAATATCTACTAATAGTTGTCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAAGTTGATGGTGTGTCCACCATTAG
GGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCGCCAGTTGTGGAGCCTGCTGCAGTGGTAATGATGAAAGAATTTATGG
CTCGTACAGACACCGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCCAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCC
TCAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGTGGTAAGCCACTAGAAGAAAGAAAAGAGCCTAGTAAAACCCAGGATAT
AGATGATAATTGTGATAGAAATGTTGTTGTTGAGAAAGAGTTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAAAGATGCTGGAGCATCTGAAGGTTGTTACGGCAAAG
TTATGGCTGAAGCAAATCTTCCGTCCGGATGGAAAGGGTTGTTGTGA
Protein sequence
MKRHSTSQSKNPTPSQKPPKGSSSERLKVVSKTSPFPRLMSTHDFERSASLRSKTQEEEEEEVTHARMPKPVRDEERKRVGIKYVRRKIKEQAKPNSPVAAANAVQTGVS
FAGNQEFAAANVAQPEKEVPASCCAPASVRLPNKSKPKTREDSVLTPQTSVSTVSSGANHAIWCYFGPIIEFWSHLGVLGRQGDKTEKKGSKARIGDGVETLPDKEKQRK
TGKPQKCDHIYGKAKSKCDRISGQTEAVSSPSRVVKQPQKNFQPVKVQNQESNLEALMKEYMARNDVAVRNLEVHIGKEQCEAVTLRSILEYDGPEYPVNQEVRKIPKKA
PKEVPEKKSKVTTEVFQNKENKEKTKELETLEEKSESSSSEKKQENATAKPKKFIIDPDYRPPPPYPQRFKYASQDAQFKKFLDILKLKCLPYYSPSQERLVGVYRDREW
EGNARRANPCTLESQVEGHMVVMQAKVDSRIELEVVIICPCRKNYFAAAELGFAECSESVAGRLKGANSVLEQSWERKLPRVSLVHERSAWGMVDASAGVALLARTFNEA
YEILEIISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPVVEPAAVVMMKEFMARTDTAIQSNQASMRALELQVGQLANELKARPQGKLP
SDTEHPRREGKEQVKAVTLRSGKPLEERKEPSKTQDIDDNCDRNVVVEKELETGQGAGGSNKDAGASEGCYGKVMAEANLPSGWKGLL
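
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS. A minimal sketch is given here, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name translate_cds is hypothetical, and the short input is the first 27 nucleotides of the CDS, which should give the first nine residues of the protein.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G
# at each of the three positions; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases (e.g. N)
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 nt of the CDS sequence in this record.
    print(translate_cds("ATGAAACGTCACTCCACTTCTCAGTCG"))  # MKRHSTSQS
```

To check the whole record, the CDS lines would be joined and passed in as a single string, and the result compared with the joined protein lines.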