; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038262 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038262
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:14491865..14493058
RNA-Seq ExpressionLag0038262
SyntenyLag0038262
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7597660.1 Reverse transcriptase domain [Arabidopsis suecica]3.3e-3938.55Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKK---NGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        IIS +Q+AFVPGR I DN++V HE LH+LKSK+   +G   +K D+SKAYDRVEW FLE+++  +GF  +WV+ IM+CV+T  + IL+NG P G + P R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKK---NGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKK--PVLVDILAIPVVTDLGKYLDIPSSFTRRRGEDFQSIKQR------------VYGKILVGNH
        GI QGDPLSPYLFL C E LS ++  A+ + HI G K       I  +    D   +    SS  ++    FQ  ++             ++G  +    
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKK--PVLVDILAIPVVTDLGKYLDIPSSFTRRRGEDFQSIKQR------------VYGKILVGNH

Query:  RDKMENPL--EKMGGFMPSKGFGWLKLHRLRVLIKHYLLNRIEEKRWKW
        R +++  L  EK+GG     G    +  R +V +  Y++ R++E+   W
Subjt:  RDKMENPL--EKMGGFMPSKGFGWLKLHRLRVLIKHYLLNRIEEKRWKW

XP_013654532.1 uncharacterized protein LOC106359365 [Brassica napus]1.3e-3844.56Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLK---SKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        I+S+NQSAFVPGR I DN+++ HE LH LK   ++K     +K DMSKAYDR+EW F+  +L  +GFH KW+  IM+CV T  +S L+NG+P G + P R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLK---SKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKK-----PVLVDILAIPVV--------TDLGKYLDIPSSFTRRRGEDFQSIKQRVYGK
        GI QGDPLSPY+F+LC E LS +   AQ    + G +     P    +  I           + +GKYL +P  F R +G  F SI  R+  K
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKK-----PVLVDILAIPVV--------TDLGKYLDIPSSFTRRRGEDFQSIKQRVYGK

XP_030483480.1 uncharacterized protein LOC115700064 [Cannabis sativa]8.7e-4049.17Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        +IS  QSAFVPGR I DN++VG E LH+L  K NG +G   L++DMSKAYDRVEW FL  ++   GF  +WV L++DCV T ++  L+NG P GC+VP R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGEDFQSIKQRVYGKI
        G+ QG PL PYLFL C EALSSMI  A+    +S +  VL D+L +  V    KYL +P+   R +   F SI  +V  ++
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGEDFQSIKQRVYGKI

XP_035551721.1 uncharacterized protein LOC118349890 [Juglans regia]7.4e-3956.3Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        IIS  QS+F+PGR I DN++V +E LH LK KK G++G   +K+DMSKAYDRVEW F+E +++ +GFH+K ++L+M CV+T  FSIL+NG+PTG IVP R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG
        G+ QGDP+SPYLFLLC E L S++  A+    ++G
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG

XP_042958109.1 uncharacterized protein LOC122293655 [Carya illinoinensis]7.4e-3940.65Show/hide
Query:  DIISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQ
        DIIS NQSAF+PGR I DNI+V +E LHT+++++ G++G   +K+DMSKAYDRVEW FL+ +L+ +GF DKW +LIM CV+T  +S+++NGTP     P 
Subjt:  DIISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQ

Query:  RGICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG--------------------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRG
        RG+ QGDPLSPYLFLLC E LSS++ NA+    I G                                 +  ++      V  +  KYL +P++  R + 
Subjt:  RGICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG--------------------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRG

Query:  EDFQSIKQRVYGKI
          F+ +K++++ +I
Subjt:  EDFQSIKQRVYGKI

TrEMBL top hitse value%identityAlignment
A0A2N9H1U1 Reverse transcriptase domain-containing protein2.9e-4142.72Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        IIS+ QSAFVPGR I DNI+V  E LH +K++  G+ G   LK+DMSKAYDRVEWVFL+ ++  +GF+ KWV L+M+C+ +  +SIL+NG+P G + P R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKH-------------ISG-------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGE
        G+ QGDPLSPYLFLLC E L +     Q  K+             +SG                   K+  ++ +L +PVV +  KYL +PS   R R E
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKH-------------ISG-------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGE

Query:  DFQSIKQRVYGKI
         F  IK++++ ++
Subjt:  DFQSIKQRVYGKI

A0A2N9HRK6 Uncharacterized protein2.9e-4148.63Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        I+S +QSAFVPGR I DN+++  E LH +   K GR+G   LK+DMSKAYDRVEW +LE+++  +GFH KW+ L++ C+ +  +S+L+NG P G I P R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKKPVLVDILAIPVVTDL--GKYLDIPSSFTRRRGEDFQSIKQRVYGKI
        G+ QGDPLSPYLFLLC E L S+I  A+     SG    +      P +T L   KYL +PS   R R E F  IK+RV+ K+
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISGKKPVLVDILAIPVVTDL--GKYLDIPSSFTRRRGEDFQSIKQRVYGKI

A0A2N9HW04 Reverse transcriptase domain-containing protein1.7e-4143.19Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        IIS+ QSAFVPGR I DNI+V  E LH +K++  G+ G   LK+DMSKAYDRVEWVFL+ ++  +GF+ KWV L+M+C+ +  +SIL+NG+P G + P R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKH-------------ISG-------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGE
        G+ QGDPLSPYLFLLC E L +     Q  K+             +SG                   K+  ++ IL +PVV +  KYL +PS   R R E
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKH-------------ISG-------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGE

Query:  DFQSIKQRVYGKI
         F  IK++++ ++
Subjt:  DFQSIKQRVYGKI

A0A2N9HYS7 Uncharacterized protein1.1e-4041.47Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        +IS++QSAFVPGR I DNI++  E LH + +++ G+ G   LK+DMSKAYDRVEW FL++++  +GFHD+W+ LIM+C+ T  +SIL+NG PTG I P R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNA-----------------QHHKHISG-------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTR
        G+ QGDP+SPYLFLLC E L+ +I  A                 Q ++  SG                   K+  + +IL +P +    KYL +PS   +
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNA-----------------QHHKHISG-------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTR

Query:  RRGEDFQSIKQRVYGKI
         +   F  IK+RV+ K+
Subjt:  RRGEDFQSIKQRVYGKI

A0A2N9J7E4 Uncharacterized protein1.7e-4142.18Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR
        +IS++QSAFVPGR I DNI++  E LH + +++ G+ G   LK+DMSKAYDRVEW FL++++  +GFHD+W+ LIM+C+ T  +SIL+NG PTG I P R
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQG---LKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQR

Query:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG------------------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGEDF
        G+ QGDP+SPYLFLLC E L+ +I  A     I G                               K  + +IL +P +    KYL +PS   + +   F
Subjt:  GICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG------------------------------KKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGEDF

Query:  QSIKQRVYGKI
          IK+RV+ K+
Subjt:  QSIKQRVYGKI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.0e-1124.9Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECL-HTLKSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI
        +I  +Q  F+PG     NI      + H  ++K      + +D  KA+D+++  F+ K LN +G    ++++I         +I+LNG        + G 
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECL-HTLKSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI

Query:  CQGDPLSPYLFLLCVEALSSMITNAQHHKHIS-GKKPVLVDILAIPVVTDL-------GKYLDIPSSFTRRRGEDFQSIKQRVYGKILVGNHRDKMENPL
         QG PLSP LF + +E L+  I   +  K I  GK+ V + + A  ++  L          L + S+F++  G      K + +   L  N+R      +
Subjt:  CQGDPLSPYLFLLCVEALSSMITNAQHHKHIS-GKKPVLVDILAIPVVTDL-------GKYLDIPSSFTRRRGEDFQSIKQRVYGKILVGNHRDKMENPL

Query:  EKMGGFMPSKGFGWLKLHRLR----VLIKHY--LLNRIEEKRWKWL-IDCVFQGSMD
         ++   + SK   +L +   R    +  ++Y  LL  I+E   KW  I C + G ++
Subjt:  EKMGGFMPSKGFGWLKLHRLR----VLIKHY--LLNRIEEKRWKWL-IDCVFQGSMD

P08548 LINE-1 reverse transcriptase homolog1.8e-0828.48Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECL-HTLKSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI
        II  +Q  F+PG     NI      + H  K K      L +D  KA+D ++  F+ + L  IG    +++LI         +I+LNG        + G 
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECL-HTLKSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI

Query:  CQGDPLSPYLFLLCVEALSSMITNAQHHKHIS-GKKPVLVDILAIPVVTDL
         QG PLSP LF + +E L+  I   +  K I  G + + + + A  ++  L
Subjt:  CQGDPLSPYLFLLCVEALSSMITNAQHHKHIS-GKKPVLVDILAIPVVTDL

P11369 LINE-1 retrotransposable element ORF2 protein1.0e-1130.06Show/hide
Query:  IISKNQSAFVPGRFIQDNIIVGHECLHTL-KSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI
        II  +Q  F+PG     NI      +H + K K      + +D  KA+D+++  F+ K+L   G    ++ +I         +I +NG     I  + G 
Subjt:  IISKNQSAFVPGRFIQDNIIVGHECLHTL-KSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI

Query:  CQGDPLSPYLFLLCVEALSSMITNAQHHKHIS-GKKPVLVDILAIPVVTDLGKYLDIPSSFTR
         QG PLSPYLF + +E L+  I   +  K I  GK+ V + +LA     D+  Y+  P + TR
Subjt:  CQGDPLSPYLFLLCVEALSSMITNAQHHKHIS-GKKPVLVDILAIPVVTDLGKYLDIPSSFTR

P14381 Transposon TX1 uncharacterized 149 kDa protein1.8e-0830.17Show/hide
Query:  DIISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI
        ++I  +QS  VPGR I DN+ +  + LH  +        L +D  KA+DRV+  +L   L    F  ++V  +     + +  + +N + T  +   RG+
Subjt:  DIISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGI

Query:  CQGDPLSPYLFLLCVE
         QG PLS  L+ L +E
Subjt:  CQGDPLSPYLFLLCVE

P92555 Uncharacterized mitochondrial protein AtMg012506.3e-0951.02Show/hide
Query:  LLNGTPTGCIVPQRGICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG
        ++NG P G + P RG+ QGDPLSPYLF+LC E LS +   AQ    + G
Subjt:  LLNGTPTGCIVPQRGICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.2e-1040.28Show/hide
Query:  DIISKNQSAFVPGRFIQDNIIVGHECLHTLKSKK--NGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWV
        ++I   Q++F+PGR   DNI+   E +H+++ KK   G   LK+D+ KAYDR+ W +LE  L   GF + W+
Subjt:  DIISKNQSAFVPGRFIQDNIIVGHECLHTLKSKK--NGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWV

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.5e-1051.02Show/hide
Query:  LLNGTPTGCIVPQRGICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG
        ++NG P G + P RG+ QGDPLSPYLF+LC E LS +   AQ    + G
Subjt:  LLNGTPTGCIVPQRGICQGDPLSPYLFLLCVEALSSMITNAQHHKHISG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTATCTCTAAAAATCAATCAGCTTTTGTTCCAGGTCGGTTTATTCAGGATAACATTATTGTAGGGCATGAATGTCTTCATACATTAAAATCAAAGAAAAATGG
TAGACAAGGTCTAAAGGTGGACATGAGCAAAGCATATGATCGAGTGGAGTGGGTATTTTTAGAGAAACTGCTGAATATGATTGGATTTCATGATAAATGGGTGAGGTTGA
TCATGGATTGTGTTAAAACTACGAAATTCTCTATTCTGCTCAATGGTACTCCAACAGGTTGCATTGTTCCTCAGAGAGGAATTTGTCAGGGAGATCCACTATCTCCTTAT
CTATTCTTACTGTGTGTCGAAGCATTATCGTCTATGATTACAAATGCTCAACATCATAAACACATATCAGGGAAGAAACCAGTCCTTGTTGATATTTTAGCCATCCCAGT
GGTTACCGACTTGGGTAAATATCTAGACATTCCTTCTTCCTTTACCCGACGTAGAGGGGAAGATTTTCAAAGTATTAAGCAAAGGGTGTATGGAAAAATTTTGGTGGGGA
ACCACAGAGACAAAATGGAAAATCCATTGGAAAAAATGGGAGGATTTATGCCTTCCAAAGGATTTGGGTGGCTTAAGCTTCATAGACTTAGAGTTTTAATAAAGCATTAC
TTGCTAAACAGGATTGAGGAAAAGCGTTGGAAATGGTTGATCGATTGCGTTTTTCAAGGATCCATGGATTCCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATTATCTCTAAAAATCAATCAGCTTTTGTTCCAGGTCGGTTTATTCAGGATAACATTATTGTAGGGCATGAATGTCTTCATACATTAAAATCAAAGAAAAATGG
TAGACAAGGTCTAAAGGTGGACATGAGCAAAGCATATGATCGAGTGGAGTGGGTATTTTTAGAGAAACTGCTGAATATGATTGGATTTCATGATAAATGGGTGAGGTTGA
TCATGGATTGTGTTAAAACTACGAAATTCTCTATTCTGCTCAATGGTACTCCAACAGGTTGCATTGTTCCTCAGAGAGGAATTTGTCAGGGAGATCCACTATCTCCTTAT
CTATTCTTACTGTGTGTCGAAGCATTATCGTCTATGATTACAAATGCTCAACATCATAAACACATATCAGGGAAGAAACCAGTCCTTGTTGATATTTTAGCCATCCCAGT
GGTTACCGACTTGGGTAAATATCTAGACATTCCTTCTTCCTTTACCCGACGTAGAGGGGAAGATTTTCAAAGTATTAAGCAAAGGGTGTATGGAAAAATTTTGGTGGGGA
ACCACAGAGACAAAATGGAAAATCCATTGGAAAAAATGGGAGGATTTATGCCTTCCAAAGGATTTGGGTGGCTTAAGCTTCATAGACTTAGAGTTTTAATAAAGCATTAC
TTGCTAAACAGGATTGAGGAAAAGCGTTGGAAATGGTTGATCGATTGCGTTTTTCAAGGATCCATGGATTCCGATTGA
Protein sequenceShow/hide protein sequence
MDIISKNQSAFVPGRFIQDNIIVGHECLHTLKSKKNGRQGLKVDMSKAYDRVEWVFLEKLLNMIGFHDKWVRLIMDCVKTTKFSILLNGTPTGCIVPQRGICQGDPLSPY
LFLLCVEALSSMITNAQHHKHISGKKPVLVDILAIPVVTDLGKYLDIPSSFTRRRGEDFQSIKQRVYGKILVGNHRDKMENPLEKMGGFMPSKGFGWLKLHRLRVLIKHY
LLNRIEEKRWKWLIDCVFQGSMDSD