; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr2:13457689..13460252
RNA-Seq ExpressionMoc02g18010
SyntenyMoc02g18010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]4.1e-6437Show/hide
Query:  MNPQVQPN--APIRPNVRIEKIVDGAPIVADPEVAVPPLNAVLLADDIDREIKAYAASTFYNVNPIITEPEIETSKFELK--------------------
        MNP   PN   PI PNVRIE+IVDG P+  + EV VP LN VLLA  IDREI+AYAA TFYN NP+ITE EI   KFELK                    
Subjt:  MNPQVQPN--APIRPNVRIEKIVDGAPIVADPEVAVPPLNAVLLADDIDREIKAYAASTFYNVNPIITEPEIETSKFELK--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------PVMFQMLQ--------------------------------------TVDLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNI
                 F +L+                                       +DLV RSMT+QS+VGA  GKAN S  QG S SF+ G HHYNNC GN 
Subjt:  ------PVMFQMLQ--------------------------------------TVDLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNI

Query:  ESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRR
        ESVY LGN  NS+NN Y NTYNPG RNHPN                                      E R    +V    +  + + +           
Subjt:  ESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRR

Query:  EPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEED
             +      K+VE+  E QN+SN+ VN V       GSTQ RV KKRKQ EH+++ AEY+  P YP+R QKKE NVQ  KFLDVLKQLHVNIPL E 
Subjt:  EPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEED

Query:  LQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAKMK
        L++M NYVRFLK+I TKK  LGEY+TV MTKACSTILTSKIP KMK
Subjt:  LQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAKMK

XP_022155097.1 uncharacterized protein LOC111022229 [Momordica charantia]1.4e-4375.57Show/hide
Query:  GEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKK
        GE Q  S Q ++ +I++ PE+GSTQ RVP+KRKQAEH+N+ AEYR TP YP+RLQKK+ NVQ KKFLD LKQLHV IPL E L+QMPNYVRFLKEI  KK
Subjt:  GEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKK

Query:  RTLGEYKTVAMTKACSTILTSKIPAKMKDPG
        RTLGEY+TVAMTKACSTILTSKIPAKMKDPG
Subjt:  RTLGEYKTVAMTKACSTILTSKIPAKMKDPG

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]1.9e-4080Show/hide
Query:  DLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKV
        +LVMRSMT+Q++VGAS GKANVS IQGISCSF EG+HHYNN   N ESVYYLGN QN+  N Y NTYNPG RNHPNFSWSGNQGGNNAGTSNAPAYQQK 
Subjt:  DLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKV

Query:  SYPPGFANQG
        SYPP F+NQG
Subjt:  SYPPGFANQG

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]1.6e-7660.73Show/hide
Query:  LVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVS
        LVMRSM +QSSVGA  G ANV+QIQGISCSF EGDHHYNNC GN ESVYYLGNPQN++NN Y NTYNPG RNHPNFSWSG+QGG+NAGTS+APA+Q KVS
Subjt:  LVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVS

Query:  YPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQV-PEVGSTQTRVPKKRKQAE
        YPPGF NQG ++                Q++ SIA      K+     +A V            + Q TS + + L + Q+  ++ S   RVP+KRKQAE
Subjt:  YPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQV-PEVGSTQTRVPKKRKQAE

Query:  HDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAM
        H+N+ AEY P P YP+RLQKKE NVQ  KFLDVLKQLHVNIPL E L+QMPNYVRFLKEI  KKRTLGEY T+ +
Subjt:  HDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAM

XP_022158740.1 uncharacterized protein LOC111025203 [Momordica charantia]5.4e-6449.18Show/hide
Query:  MTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGF
        MT++++ G  A KANVS IQGIS SF EG+HHYN+C  N +SVYYLGN  N+ NNPY NTYN G  +HPNFSWS NQG N+ GTSNAPAYQQK +YPP  
Subjt:  MTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGF

Query:  ANQG----------------CIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQT
        ANQG                 ++K++   + +    +     R++  + G      +      L    K+ E+  + Q+ +++ VN V A+    G++  
Subjt:  ANQG----------------CIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQT

Query:  RVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAK
        +V +KRK+ EH+++  E+RPTP YP+RL+KKE +VQ +KFLDVL QLHVNIPL E  +QM  YVRFLK+I  KKR L EYKTVAMTK  S IL SKIP K
Subjt:  RVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAK

Query:  MKDPG
        +KD G
Subjt:  MKDPG

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189102.0e-6437Show/hide
Query:  MNPQVQPN--APIRPNVRIEKIVDGAPIVADPEVAVPPLNAVLLADDIDREIKAYAASTFYNVNPIITEPEIETSKFELK--------------------
        MNP   PN   PI PNVRIE+IVDG P+  + EV VP LN VLLA  IDREI+AYAA TFYN NP+ITE EI   KFELK                    
Subjt:  MNPQVQPN--APIRPNVRIEKIVDGAPIVADPEVAVPPLNAVLLADDIDREIKAYAASTFYNVNPIITEPEIETSKFELK--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------PVMFQMLQ--------------------------------------TVDLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNI
                 F +L+                                       +DLV RSMT+QS+VGA  GKAN S  QG S SF+ G HHYNNC GN 
Subjt:  ------PVMFQMLQ--------------------------------------TVDLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNI

Query:  ESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRR
        ESVY LGN  NS+NN Y NTYNPG RNHPN                                      E R    +V    +  + + +           
Subjt:  ESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRR

Query:  EPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEED
             +      K+VE+  E QN+SN+ VN V       GSTQ RV KKRKQ EH+++ AEY+  P YP+R QKKE NVQ  KFLDVLKQLHVNIPL E 
Subjt:  EPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEED

Query:  LQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAKMK
        L++M NYVRFLK+I TKK  LGEY+TV MTKACSTILTSKIP KMK
Subjt:  LQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAKMK

A0A6J1DNF6 uncharacterized protein LOC1110222296.7e-4475.57Show/hide
Query:  GEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKK
        GE Q  S Q ++ +I++ PE+GSTQ RVP+KRKQAEH+N+ AEYR TP YP+RLQKK+ NVQ KKFLD LKQLHV IPL E L+QMPNYVRFLKEI  KK
Subjt:  GEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKK

Query:  RTLGEYKTVAMTKACSTILTSKIPAKMKDPG
        RTLGEY+TVAMTKACSTILTSKIPAKMKDPG
Subjt:  RTLGEYKTVAMTKACSTILTSKIPAKMKDPG

A0A6J1DRG1 uncharacterized protein LOC1110236699.0e-4180Show/hide
Query:  DLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKV
        +LVMRSMT+Q++VGAS GKANVS IQGISCSF EG+HHYNN   N ESVYYLGN QN+  N Y NTYNPG RNHPNFSWSGNQGGNNAGTSNAPAYQQK 
Subjt:  DLVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKV

Query:  SYPPGFANQG
        SYPP F+NQG
Subjt:  SYPPGFANQG

A0A6J1DWN2 uncharacterized protein LOC1110252032.6e-6449.18Show/hide
Query:  MTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGF
        MT++++ G  A KANVS IQGIS SF EG+HHYN+C  N +SVYYLGN  N+ NNPY NTYN G  +HPNFSWS NQG N+ GTSNAPAYQQK +YPP  
Subjt:  MTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGF

Query:  ANQG----------------CIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQT
        ANQG                 ++K++   + +    +     R++  + G      +      L    K+ E+  + Q+ +++ VN V A+    G++  
Subjt:  ANQG----------------CIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQT

Query:  RVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAK
        +V +KRK+ EH+++  E+RPTP YP+RL+KKE +VQ +KFLDVL QLHVNIPL E  +QM  YVRFLK+I  KKR L EYKTVAMTK  S IL SKIP K
Subjt:  RVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAK

Query:  MKDPG
        +KD G
Subjt:  MKDPG

A0A6J1E1F3 uncharacterized protein LOC1110250657.8e-7760.73Show/hide
Query:  LVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVS
        LVMRSM +QSSVGA  G ANV+QIQGISCSF EGDHHYNNC GN ESVYYLGNPQN++NN Y NTYNPG RNHPNFSWSG+QGG+NAGTS+APA+Q KVS
Subjt:  LVMRSMTEQSSVGASAGKANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVS

Query:  YPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQV-PEVGSTQTRVPKKRKQAE
        YPPGF NQG ++                Q++ SIA      K+     +A V            + Q TS + + L + Q+  ++ S   RVP+KRKQAE
Subjt:  YPPGFANQGCIIKEHRVASRIVSDRFEEQTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQV-PEVGSTQTRVPKKRKQAE

Query:  HDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAM
        H+N+ AEY P P YP+RLQKKE NVQ  KFLDVLKQLHVNIPL E L+QMPNYVRFLKEI  KKRTLGEY T+ +
Subjt:  HDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHVNIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGAATGAATCCACAAGTTCAACCTAATGCACCTATAAGACCGAATGTTAGGATTGAGAAAATAGTAGATGGGGCTCCTATCGTTGCTGACCCTGAGGTAGCAGT
GCCCCCTCTTAATGCTGTACTGTTAGCAGACGACATCGACAGGGAAATCAAAGCATATGCAGCTTCGACATTCTACAATGTCAACCCAATTATCACGGAGCCAGAAATTG
AAACTTCCAAGTTTGAGCTAAAACCAGTGATGTTTCAGATGCTCCAGACAGTGGACTTGGTAATGAGAAGTATGACGGAACAAAGTTCAGTGGGAGCATCAGCTGGTAAA
GCTAATGTTAGTCAAATCCAAGGGATTTCTTGTTCTTTCTACGAGGGAGATCATCATTATAACAATTGCCTGGGAAATATTGAGTCGGTGTACTATTTGGGAAACCCGCA
GAATAGTAAAAATAACCCATATTTAAATACGTACAATCCTGGCTTGAGGAATCACCCCAACTTTAGTTGGAGTGGCAATCAGGGAGGAAACAATGCTGGGACGTCCAATG
CTCCAGCATATCAGCAGAAAGTAAGTTATCCTCCAGGTTTTGCGAATCAAGGTTGCATCATTAAGGAACATAGAGTTGCAAGTAGGATAGTTAGCGATAGATTTGAAGAG
CAGACCGATCGGAGCATTGCTGAGCGATATGGAAGTGCCAAAAAGAGACGTGAGCCAGTAGAAGCAGTTGTACTCACTCCACCGGAGAAGATGGTTGAGAAACTAGGGGA
AGTTCAGAATACGTCCAATCAAATGGTTAACCTAGTAATTGCTCAGGTACCTGAAGTAGGGTCAACACAGACCAGAGTGCCCAAGAAAAGAAAGCAGGCAGAGCATGATA
ATTCTTCAGCAGAATATAGGCCAACACCATCATACCCTGAACGATTGCAAAAGAAAGAACATAATGTTCAGTCTAAGAAGTTCTTAGATGTCTTGAAGCAGCTGCATGTG
AATATACCGTTGGAGGAAGACCTACAACAAATGCCAAATTATGTGAGATTTCTGAAAGAAATACACACAAAGAAGAGAACGCTGGGAGAGTATAAAACTGTAGCAATGAC
CAAGGCCTGCAGCACCATACTCACAAGCAAAATTCCCGCAAAGATGAAAGATCCTGGGAAAGAATGTTGTGTGTTAAAGATTTTAGATGAAGCATTAATGGAGGAATTGG
AAATAGAAGCTATGCTGGAGCATTTAGAAGCGAGTGACGCTGGAGGTGTTGTTGACGAACTTGAAGAAGAACTAGACGATAACCAGTTGGTATGTATGAATGCCAGTGAA
GGATTGGTGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATAGAATGAATCCACAAGTTCAACCTAATGCACCTATAAGACCGAATGTTAGGATTGAGAAAATAGTAGATGGGGCTCCTATCGTTGCTGACCCTGAGGTAGCAGT
GCCCCCTCTTAATGCTGTACTGTTAGCAGACGACATCGACAGGGAAATCAAAGCATATGCAGCTTCGACATTCTACAATGTCAACCCAATTATCACGGAGCCAGAAATTG
AAACTTCCAAGTTTGAGCTAAAACCAGTGATGTTTCAGATGCTCCAGACAGTGGACTTGGTAATGAGAAGTATGACGGAACAAAGTTCAGTGGGAGCATCAGCTGGTAAA
GCTAATGTTAGTCAAATCCAAGGGATTTCTTGTTCTTTCTACGAGGGAGATCATCATTATAACAATTGCCTGGGAAATATTGAGTCGGTGTACTATTTGGGAAACCCGCA
GAATAGTAAAAATAACCCATATTTAAATACGTACAATCCTGGCTTGAGGAATCACCCCAACTTTAGTTGGAGTGGCAATCAGGGAGGAAACAATGCTGGGACGTCCAATG
CTCCAGCATATCAGCAGAAAGTAAGTTATCCTCCAGGTTTTGCGAATCAAGGTTGCATCATTAAGGAACATAGAGTTGCAAGTAGGATAGTTAGCGATAGATTTGAAGAG
CAGACCGATCGGAGCATTGCTGAGCGATATGGAAGTGCCAAAAAGAGACGTGAGCCAGTAGAAGCAGTTGTACTCACTCCACCGGAGAAGATGGTTGAGAAACTAGGGGA
AGTTCAGAATACGTCCAATCAAATGGTTAACCTAGTAATTGCTCAGGTACCTGAAGTAGGGTCAACACAGACCAGAGTGCCCAAGAAAAGAAAGCAGGCAGAGCATGATA
ATTCTTCAGCAGAATATAGGCCAACACCATCATACCCTGAACGATTGCAAAAGAAAGAACATAATGTTCAGTCTAAGAAGTTCTTAGATGTCTTGAAGCAGCTGCATGTG
AATATACCGTTGGAGGAAGACCTACAACAAATGCCAAATTATGTGAGATTTCTGAAAGAAATACACACAAAGAAGAGAACGCTGGGAGAGTATAAAACTGTAGCAATGAC
CAAGGCCTGCAGCACCATACTCACAAGCAAAATTCCCGCAAAGATGAAAGATCCTGGGAAAGAATGTTGTGTGTTAAAGATTTTAGATGAAGCATTAATGGAGGAATTGG
AAATAGAAGCTATGCTGGAGCATTTAGAAGCGAGTGACGCTGGAGGTGTTGTTGACGAACTTGAAGAAGAACTAGACGATAACCAGTTGGTATGTATGAATGCCAGTGAA
GGATTGGTGAAGTGA
Protein sequenceShow/hide protein sequence
MDRMNPQVQPNAPIRPNVRIEKIVDGAPIVADPEVAVPPLNAVLLADDIDREIKAYAASTFYNVNPIITEPEIETSKFELKPVMFQMLQTVDLVMRSMTEQSSVGASAGK
ANVSQIQGISCSFYEGDHHYNNCLGNIESVYYLGNPQNSKNNPYLNTYNPGLRNHPNFSWSGNQGGNNAGTSNAPAYQQKVSYPPGFANQGCIIKEHRVASRIVSDRFEE
QTDRSIAERYGSAKKRREPVEAVVLTPPEKMVEKLGEVQNTSNQMVNLVIAQVPEVGSTQTRVPKKRKQAEHDNSSAEYRPTPSYPERLQKKEHNVQSKKFLDVLKQLHV
NIPLEEDLQQMPNYVRFLKEIHTKKRTLGEYKTVAMTKACSTILTSKIPAKMKDPGKECCVLKILDEALMEELEIEAMLEHLEASDAGGVVDELEEELDDNQLVCMNASE
GLVK