; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g04600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g04600
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr7:3940385..3948024
RNA-Seq ExpressionMoc07g04600
SyntenyMoc07g04600
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8501049.1 hypothetical protein CXB51_003148 [Gossypium anomalum]1.6e-7750.14Show/hide
Query:  YSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTI----------------------------------
        Y P PPYP+RLQK+++ VQF KFLDVLKQLH+NIPLVEALEQMPNYV+F+K+IL KKR LGE++T+                                  
Subjt:  YSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTI----------------------------------

Query:  -GLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK----
         G ALCDLGA+INLMP+SI+ KLGIGE RPT VTLQLADRS+ H EGKI+DVLV VDKF FPADF+ILD++ADKEVPIILGRPFLATGR L+DV K    
Subjt:  -GLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK----

Query:  ----------------------EECSVIKILDEALME-ELET-EVMFEHLEAIDAESVADAFEEELEDVQSEYMNTN-KGF-VKRMYKFLDVTNSELRLP
                              ++CSV+  L++ ++E EL + E + E +  +D  S      +E ED     +  N KGF  +  ++ LD+   +   P
Subjt:  ----------------------EECSVIKILDEALME-ELET-EVMFEHLEAIDAESVADAFEEELEDVQSEYMNTN-KGF-VKRMYKFLDVTNSELRLP

Query:  KPSIEDPPVLELKALPQHLKYAYLGLSETLPIIIAADLPLENEQMLL
        K SIE+PP LELK LP HLKY YLG + TLP+I++A+L  E E+ L+
Subjt:  KPSIEDPPVLELKALPQHLKYAYLGLSETLPIIIAADLPLENEQMLL

XP_017227899.1 PREDICTED: uncharacterized protein LOC108203467 [Daucus carota subsp. sativus]3.3e-8046.56Show/hide
Query:  KLTKEKLNQNRTIRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTI----
        K++ +K  +N  +  P    +  H        P PP+P+R QK++++VQF KFLDVLKQLH+NIPLVEALEQMPNYV+F+K+IL KKR LGE++T+    
Subjt:  KLTKEKLNQNRTIRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTI----

Query:  -------------------------------GLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDY
                                       G+ALCDLGA+INLMP+S++ KLGIGE RPT VTLQLADRS+ HPEGKIEDVLV VDKF FPADFI+LDY
Subjt:  -------------------------------GLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDY

Query:  DADKEVPIILGRPFLATGRALVDVHK--------------------------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQ-S
        +AD+EVPIILGRPFLATGR L+DV                            E+CS I I DE + ++L++    + LE +    + +  +EE ++V+  
Subjt:  DADKEVPIILGRPFLATGRALVDVHK--------------------------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQ-S

Query:  EYMNTNKGFVKRMYKF--LDVTNSELRLPKPSIEDPPVLELKALPQHLKYAYLGLSETLPIIIAADLPLENEQMLLNL
         ++  N    +   +F  LD+++ E + PK SI++PP LELK LP HLKYAYLG S TLP+II+A+L +  E+ L+ L
Subjt:  EYMNTNKGFVKRMYKF--LDVTNSELRLPKPSIEDPPVLELKALPQHLKYAYLGLSETLPIIIAADLPLENEQMLLNL

XP_022144016.1 uncharacterized protein LOC111013805 [Momordica charantia]3.6e-8264.49Show/hide
Query:  KKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYD-----------------------------------TIGLALCDLGANI
        KKE+N +F KFLDVLKQLHVN+PLVEALEQMPNYVRFLKEIL KKR LGEY+                                    IGL LCD+GA+I
Subjt:  KKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYD-----------------------------------TIGLALCDLGANI

Query:  NLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK----------------
        N+MPLSIYNKLGI EARPT VTLQLADRSITHPEGKIEDV V V+KF FPADFIILDYDA KEVPIILGRPFLATGRALVDV+K                
Subjt:  NLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK----------------

Query:  ----------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDV
                  EECSV+KILDEALMEELE EVM E LEA+ A+SV +AFEEELEDV+SE +NTNKGFVK++Y+ LDV
Subjt:  ----------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDV

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]1.8e-9495.74Show/hide
Query:  QNRTIRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTIGLALCDLGANIN
        +++ IRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTIGLALCDLGANIN
Subjt:  QNRTIRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTIGLALCDLGANIN

Query:  LMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHKEECSV
        LMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK E ++
Subjt:  LMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHKEECSV

XP_024028757.1 uncharacterized protein LOC112093792 [Morus notabilis]2.9e-7648.55Show/hide
Query:  PAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDT-----------------------------------IG
        P PP+P+R Q ++++ QF +FLDVLKQLH+NIPLVEALEQMP+YV+F+K+IL KKR LGE++T                                   IG
Subjt:  PAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDT-----------------------------------IG

Query:  LALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK------
         ALCDLGA+INLMP+SI+ KLGIGE  PT VTLQLADRS  HPEGKIEDVLV VDKF FPADFI+LDY+ADKEVPIILGRPFLATG+ L+DV K      
Subjt:  LALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK------

Query:  --------------------EECSVIKILDEALMEELE---TEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDVTNSELRLPKPS
                            EECS + +LD  +  E E    E +    + ID+E   D  ++++  ++  +  T     +R ++ LD++   LR  KPS
Subjt:  --------------------EECSVIKILDEALMEELE---TEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDVTNSELRLPKPS

Query:  IEDPPVLELKALPQHLKYAYLGLSETLPIIIAADLPLENEQMLLNL
        +E+PP+LEL+ LP HL+YAYLG S+TLP+IIA+ L    E  LL +
Subjt:  IEDPPVLELKALPQHLKYAYLGLSETLPIIIAADLPLENEQMLLNL

TrEMBL top hitse value%identityAlignment
A0A2G9HYA0 Reverse transcriptase3.2e-6843.37Show/hide
Query:  RNILKLTKEKL-NQNRTIRVPEKRKQAEHENAPAEYSP----APPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLG
        R + ++ KE   ++ + +   EK K+ E   AP E S      PP+P+RLQK++   QF KFL+V K+LH+NIP  EALEQMP+YV+F+K+IL KKR LG
Subjt:  RNILKLTKEKL-NQNRTIRVPEKRKQAEHENAPAEYSP----APPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLG

Query:  EYDTI-----------------------------------GLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFF
        +Y+T+                                   G ALCDLGA+INLMP SIY  LG+GEA+PT +TLQLADRS+T+P+G IED+LV VDKF F
Subjt:  EYDTI-----------------------------------GLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFF

Query:  PADFIILDYDADKEVPIILGRPFLATGRALVDVHK--------------------------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFE
        PADF++LD + D EVPIILGRPFLATGR L+DV K                          +EC  + + D+    E   E   + LE    + + +  E
Subjt:  PADFIILDYDADKEVPIILGRPFLATGRALVDVHK--------------------------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFE

Query:  EELEDVQSEYMNTNKGFVKRMYKFLDVTNSELRLPKPSIEDPPVLELKALPQHLKYAYLGLSETLPIIIAADL-PLENEQMLLNLGQHLEAV
        E+LE V++  ++ +K    R  + L+ T    ++ KPSIEDPP LELK LP HL YAYLG S+TLP+II++ L  L+ E++L  L  H  A+
Subjt:  EELEDVQSEYMNTNKGFVKRMYKFLDVTNSELRLPKPSIEDPPVLELKALPQHLKYAYLGLSETLPIIIAADL-PLENEQMLLNLGQHLEAV

A0A6J1CS22 uncharacterized protein LOC1110138051.7e-8264.49Show/hide
Query:  KKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYD-----------------------------------TIGLALCDLGANI
        KKE+N +F KFLDVLKQLHVN+PLVEALEQMPNYVRFLKEIL KKR LGEY+                                    IGL LCD+GA+I
Subjt:  KKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYD-----------------------------------TIGLALCDLGANI

Query:  NLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK----------------
        N+MPLSIYNKLGI EARPT VTLQLADRSITHPEGKIEDV V V+KF FPADFIILDYDA KEVPIILGRPFLATGRALVDV+K                
Subjt:  NLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK----------------

Query:  ----------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDV
                  EECSV+KILDEALMEELE EVM E LEA+ A+SV +AFEEELEDV+SE +NTNKGFVK++Y+ LDV
Subjt:  ----------EECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDV

A0A6J1D3P6 uncharacterized protein LOC1110170143.0e-7461.9Show/hide
Query:  KLTKEKLNQNRT----IRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDT-
        K+T  ++N ++      RV +KRKQAEH  APA+Y P PPYPKRLQKKE+NVQF K LDVLKQLHVNIP VEALEQ+PNYVRFLKEILIKKR L E +T 
Subjt:  KLTKEKLNQNRT----IRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDT-

Query:  ----------------------------------IGLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFI
                                          IGLALCDLGA+INL+PLSIYNKLGIGEARPT VTLQLAD+S+THPEGKIEDVLV VDKF FP DFI
Subjt:  ----------------------------------IGLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFI

Query:  ILDYDADKEVPIILGRPFLATGRALVDVHK------------------------EECSVIKILDEALMEELET
        ILDYDADKEV II+ RPFLAT RALV+VHK                        EEC VIKILDEALM+ELET
Subjt:  ILDYDADKEVPIILGRPFLATGRALVDVHK------------------------EECSVIKILDEALMEELET

A0A6J1DV77 uncharacterized protein LOC1110238183.5e-7551.06Show/hide
Query:  PAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYD---------------------------------
        P E+   PPYP+RLQKK ++VQF++FL+VLKQLH+NIPL+EALEQMPNYV+FLK+IL KKR LGE++                                 
Subjt:  PAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYD---------------------------------

Query:  --TIGLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHKE
           +G ALCDLGA+INLMPLS+Y KLGIGEARP  VTLQLADRSIT+ EGKIEDVLV VDKF FPADFIILDY+ADKE+PIILGRPFL+TGRAL+DVH  
Subjt:  --TIGLALCDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHKE

Query:  ECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDVTNSELRLP-KPSIEDPPVLELKALPQHLKYAYLGL
        E + I++ D+ +   +   + +     ID E    ++    +D+ S+ + T +   +   +   +    ++ P +PS+   P LELK LP HLKYAYLG 
Subjt:  ECSVIKILDEALMEELETEVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDVTNSELRLP-KPSIEDPPVLELKALPQHLKYAYLGL

Query:  SETLPIIIAADLPLENEQMLLN-LGQHLEAV
         ETLP+ IAADL  E E  L+  L  H +A+
Subjt:  SETLPIIIAADLPLENEQMLLN-LGQHLEAV

A0A6J1E1F3 uncharacterized protein LOC1110250658.8e-9595.74Show/hide
Query:  QNRTIRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTIGLALCDLGANIN
        +++ IRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTIGLALCDLGANIN
Subjt:  QNRTIRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTIGLALCDLGANIN

Query:  LMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHKEECSV
        LMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHK E ++
Subjt:  LMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHKEECSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCAGCATCGATCAAGAAATATACTCAAGCTGACCAAGGAGAAACTCAATCAGAACAGGACAATTAGAGTGCCCGAGAAAAGAAAGCAGGCAGAGCATGAAAATGC
TCCAGCAGAGTATAGTCCAGCGCCACCATATCCTAAACGGTTGCAGAAGAAGGAGCGAAACGTTCAATTCAATAAGTTTTTAGATGTCTTGAAGCAACTGCACGTGAACA
TACCGTTGGTGGAAGCTTTAGAGCAAATGCCGAATTATGTGAGGTTCCTGAAAGAGATACTCATAAAGAAGAGAACGCTGGGAGAGTATGATACGATAGGGCTTGCCTTA
TGTGATCTGGGCGCCAACATCAATCTCATGCCGTTATCAATTTACAACAAGTTGGGTATTGGAGAAGCAAGGCCCACAATAGTGACATTGCAGCTAGCAGATCGATCCAT
TACGCATCCAGAGGGAAAAATTGAAGACGTTTTGGTAATAGTAGATAAATTTTTTTTCCCTGCGGACTTCATTATTTTGGATTATGATGCAGATAAGGAGGTTCCTATTA
TTCTTGGAAGACCATTCCTTGCAACTGGGAGAGCTTTGGTAGATGTCCACAAAGAAGAGTGCTCAGTAATAAAGATATTGGATGAAGCATTAATGGAGGAGCTGGAAACA
GAAGTCATGTTTGAGCACCTAGAAGCAATTGACGCCGAAAGTGTTGCTGACGCGTTTGAAGAGGAGCTAGAAGATGTCCAATCAGAATACATGAACACTAACAAAGGATT
TGTGAAAAGAATGTATAAGTTCCTAGACGTCACAAACTCGGAGCTTAGATTACCAAAGCCATCTATTGAAGATCCGCCAGTGTTGGAGCTTAAAGCATTGCCACAACATC
TGAAATATGCTTACCTGGGTTTATCAGAGACATTGCCAATCATCATAGCAGCGGACTTACCCTTAGAAAATGAACAAATGCTTTTAAATTTAGGGCAACATCTGGAAGCT
GTACGAAGACAGTTAAAATTGGGGCAACAAGTTACTGCGCAGTTACATATGCGAGAGCATCCAGACTGGATAGACTCTAGCATGGAAACTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCCAGCATCGATCAAGAAATATACTCAAGCTGACCAAGGAGAAACTCAATCAGAACAGGACAATTAGAGTGCCCGAGAAAAGAAAGCAGGCAGAGCATGAAAATGC
TCCAGCAGAGTATAGTCCAGCGCCACCATATCCTAAACGGTTGCAGAAGAAGGAGCGAAACGTTCAATTCAATAAGTTTTTAGATGTCTTGAAGCAACTGCACGTGAACA
TACCGTTGGTGGAAGCTTTAGAGCAAATGCCGAATTATGTGAGGTTCCTGAAAGAGATACTCATAAAGAAGAGAACGCTGGGAGAGTATGATACGATAGGGCTTGCCTTA
TGTGATCTGGGCGCCAACATCAATCTCATGCCGTTATCAATTTACAACAAGTTGGGTATTGGAGAAGCAAGGCCCACAATAGTGACATTGCAGCTAGCAGATCGATCCAT
TACGCATCCAGAGGGAAAAATTGAAGACGTTTTGGTAATAGTAGATAAATTTTTTTTCCCTGCGGACTTCATTATTTTGGATTATGATGCAGATAAGGAGGTTCCTATTA
TTCTTGGAAGACCATTCCTTGCAACTGGGAGAGCTTTGGTAGATGTCCACAAAGAAGAGTGCTCAGTAATAAAGATATTGGATGAAGCATTAATGGAGGAGCTGGAAACA
GAAGTCATGTTTGAGCACCTAGAAGCAATTGACGCCGAAAGTGTTGCTGACGCGTTTGAAGAGGAGCTAGAAGATGTCCAATCAGAATACATGAACACTAACAAAGGATT
TGTGAAAAGAATGTATAAGTTCCTAGACGTCACAAACTCGGAGCTTAGATTACCAAAGCCATCTATTGAAGATCCGCCAGTGTTGGAGCTTAAAGCATTGCCACAACATC
TGAAATATGCTTACCTGGGTTTATCAGAGACATTGCCAATCATCATAGCAGCGGACTTACCCTTAGAAAATGAACAAATGCTTTTAAATTTAGGGCAACATCTGGAAGCT
GTACGAAGACAGTTAAAATTGGGGCAACAAGTTACTGCGCAGTTACATATGCGAGAGCATCCAGACTGGATAGACTCTAGCATGGAAACTCTGTAG
Protein sequenceShow/hide protein sequence
MLQHRSRNILKLTKEKLNQNRTIRVPEKRKQAEHENAPAEYSPAPPYPKRLQKKERNVQFNKFLDVLKQLHVNIPLVEALEQMPNYVRFLKEILIKKRTLGEYDTIGLAL
CDLGANINLMPLSIYNKLGIGEARPTIVTLQLADRSITHPEGKIEDVLVIVDKFFFPADFIILDYDADKEVPIILGRPFLATGRALVDVHKEECSVIKILDEALMEELET
EVMFEHLEAIDAESVADAFEEELEDVQSEYMNTNKGFVKRMYKFLDVTNSELRLPKPSIEDPPVLELKALPQHLKYAYLGLSETLPIIIAADLPLENEQMLLNLGQHLEA
VRRQLKLGQQVTAQLHMREHPDWIDSSMETL