CuGenDBv2

Lag0015826 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015826
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr12:26210808..26212673
RNA-Seq Expression: Lag0015826
Synteny: Lag0015826
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
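
The genome location field above packs chromosome, start, and end into one string. A minimal sketch of splitting it and computing the span, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly); `parse_location` is a hypothetical helper name, not part of CuGenDB:

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr12:26210808..26212673")
print(chrom, start, end, span)
```

Under that assumption the gene spans 1866 bp on chr12.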


Homology
GenBank top hits (e value, % identity, alignment)
KAF4351405.1 hypothetical protein F8388_001025, partial [Cannabis sativa] | e value: 5.1e-34 | % identity: 33.66
Query:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEW-WWRFTGFYENSDQS
        NP A+ +LR +V K++P ++FLSETK     A  IRR++ FSN   V+C G+SGGL+LLW +  E+SV S+S GHID  +K      WRFTGFY N   S
Subjt:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEW-WWRFTGFYENSDQS

Query:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM
         R DSWQLL  L    +LPWI GGDFNE++                                     +TW   +      +ERLDRY    +  +L   +
Subjt:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM

Query:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPT-RLEESWLNFEGSKTAFKESWNSSVVA--NNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITE
        +V +  ++HSDHR I   +  EN V    + K  + R E  WL     +    ++W S  V   N  +         + +  WNK +  GSI   +   E
Subjt:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPT-RLEESWLNFEGSKTAFKESWNSSVVA--NNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITE

Query:  SEIRRFSNY
          +    +Y
Subjt:  SEIRRFSNY

KAG2693398.1 hypothetical protein I3760_08G095200 [Carya illinoinensis] | e value: 2.3e-34 | % identity: 30.6
Query:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIK--TLEWWWRFTGFYE
        RG  NPR IR+LR LV K  P ++FL ETK +S++  R++ ++G  NC++V+CEG+SGGL LLW+ + E+++ S+SK HID  +K    E  W+ TG Y 
Subjt:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIK--TLEWWWRFTGFYE

Query:  NSDQSKRKDSWQLLESLNEATNLPWIIGGDFNEV-------------------------------------MYTWTISKNNKEAAKERLDRYLATSKMAS
        + + ++R+++W L+ SL+   +  W++ GDFNE+                                      +TW  ++    A  ERLDR LA +    
Subjt:  NSDQSKRKDSWQLLESLNEATNLPWIIGGDFNEV-------------------------------------MYTWTISKNNKEAAKERLDRYLATSKMAS

Query:  LAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDI
             +V H S  +SDH  I+L+   E  VQ+    K P R E  W+  E      + +W   VV       +KI    + + RWNK    G+++  + +
Subjt:  LAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDI

Query:  TESEIRRF--SNYKNSN
           ++ +    N+++ N
Subjt:  TESEIRRF--SNYKNSN

TXG57064.1 hypothetical protein EZV62_018377 [Acer yangbiense] | e value: 6.0e-35 | % identity: 30.9
Query:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISI-KTLEWWWRFTGFYENSDQS
        N R I +L+ ++ K +P ++FLSETK    +A   +++LGF N  SV+C G+SGGL+LLW +  ++S+ S+SKGHID+ + +  E  WRF+GFY   +Q 
Subjt:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISI-KTLEWWWRFTGFYENSDQS

Query:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM
         ++DSW+LL  L    +  W+ GGDFNE++                                      TW   ++     +ER+DR LA +    L    
Subjt:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM

Query:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNI-NFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESE
        RV+HL Y  SDHR +LL  + +        ++ P + E  WL  E      +E+WN   V +++ +  RK+      +  W+  +  GS++  I+  + E
Subjt:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNI-NFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESE

Query:  I
        +
Subjt:  I

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia] | e value: 3.2e-44 | % identity: 40.81
Query:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEWWWRFTGFYENSDQSK
        NP   R+LR+LV +  P ++FLSETK N  L  R +R+L F  C+SV   G+SGGLMLLW +   + + S S GHID  I      WRFTGFY N    K
Subjt:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEWWWRFTGFYENSDQSK

Query:  RKDSWQLLESLNEATNLPWIIGGDFNEVMYTWT----ISKNNKEAAK----ERLDRYLATSKMASLAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGF
        R  SW+LLE L    +LPWIIGGDFNE++        + +N  +       ERLDR+L    M +    ++V HL  L SDHR IL     E        
Subjt:  RKDSWQLLESLNEATNLPWIIGGDFNEVMYTWT----ISKNNKEAAK----ERLDRYLATSKMASLAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGF

Query:  SKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESEIRR
         +   R EESWL  +G +     +W S        F  KI   L  +  WNK RL  S+KGAI   E E+ R
Subjt:  SKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESEIRR

XP_024021734.1 uncharacterized protein LOC112091706 [Morus notabilis] | e value: 2.7e-35 | % identity: 31.51
Query:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEW-WWRFTGFYEN
        RG  NPRA   LR L+   +P + FL ET+  S+ A  ++R+ GF     V+C G+SGGLML+WK + E+ + SYS+ HID  ++     WWRFTGFY N
Subjt:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEW-WWRFTGFYEN

Query:  SDQSKRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASL
          +S R  SW LL  L   +NLPW++ GDFNE++                                     +TW   +      +ERLDR L   +  +L
Subjt:  SDQSKRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASL

Query:  AKEMRVEHLSYLHSDHRLILLK-ISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSS-VVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAID
             V ++ +  SDHR + L+ +   N   +G   +   R E  W+  E  K   + SW ++  V +   F RK+Q     +  W++ +  G I   + 
Subjt:  AKEMRVEHLSYLHSDHRLILLK-ISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSS-VVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAID

Query:  ITESEIRRFSN
            EI R  N
Subjt:  ITESEIRRFSN

TrEMBL top hits (e value, % identity, alignment)
A0A2N9I611 Uncharacterized protein | e value: 5.5e-34 | % identity: 31.58
Query:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEWWWRFTGFYENS
        RG  NPRA+R+LR L     P V+FLSETK N +    IR  L + +   V  +G+SGGL LLW    ++S+ SY++ HID  IK+ E  WRFTGFY + 
Subjt:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEWWWRFTGFYENS

Query:  DQSKRKDSWQLLESLNEATNLPWIIGGDFNEVM-------YTWTISKNNKEAAKERLDRYLATSKMASLAKEMRVEHLSYLHSDHRLILLKISCENAVQQ
        + +KRK SW LL+ L E + LPW+  GDFNE++        TW  S++ +  +  RLDR         + K   + ++   +SDH  I L + C + +++
Subjt:  DQSKRKDSWQLLESLNEATNLPWIIGGDFNEVM-------YTWTISKNNKEAAKERLDRYLATSKMASLAKEMRVEHLSYLHSDHRLILLKISCENAVQQ

Query:  GGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFN--RKIQEGLEAMKRWNKERLQGSIKGAIDITESEIRRFSNYKNSNELDKLLQAEIKLNKL
           +    R E++W   E  +    + W  +       F    K++   + +  W+K    G+IK  ++  E  +++  +        ++   ++++N+L
Subjt:  GGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFN--RKIQEGLEAMKRWNKERLQGSIKGAIDITESEIRRFSNYKNSNELDKLLQAEIKLNKL

Query:  LEEE
        LE+E
Subjt:  LEEE

A0A5B6U6G4 Reverse transcriptase | e value: 2.1e-33 | % identity: 31.25
Query:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKT--LEWWWRFTGFYE
        RG  +PRA+R LR+++ +H+P ++FL ETK +S+   R+RR  GF+N I+V  EG  GGL L W++   +++ SYSK HID+ IK   ++  WRFTGFY 
Subjt:  RGGENPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKT--LEWWWRFTGFYE

Query:  NSDQSKRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMAS
        +     +K  W LLE L++  + PW++ GDFNE+M                                     +TW     ++   +ERLDR +A  K  +
Subjt:  NSDQSKRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMAS

Query:  LAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNK--ERLQGSIKGAI
        L    R++HL ++ SDH  +LL    +N +            E  W   E  +   +  W SS     I   + +Q GLE   +W K  +R +G +K   
Subjt:  LAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNK--ERLQGSIKGAI

Query:  DITESEIRRFSNYKNSNELDKLLQAEIKLNKLLEEE
         +TE         ++   L KL+  +I+LN  +++E
Subjt:  DITESEIRRFSNYKNSNELDKLLQAEIKLNKLLEEE

A0A5C7HJN1 Uncharacterized protein | e value: 2.9e-35 | % identity: 30.9
Query:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISI-KTLEWWWRFTGFYENSDQS
        N R I +L+ ++ K +P ++FLSETK    +A   +++LGF N  SV+C G+SGGL+LLW +  ++S+ S+SKGHID+ + +  E  WRF+GFY   +Q 
Subjt:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISI-KTLEWWWRFTGFYENSDQS

Query:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM
         ++DSW+LL  L    +  W+ GGDFNE++                                      TW   ++     +ER+DR LA +    L    
Subjt:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM

Query:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNI-NFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESE
        RV+HL Y  SDHR +LL  + +        ++ P + E  WL  E      +E+WN   V +++ +  RK+      +  W+  +  GS++  I+  + E
Subjt:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNI-NFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESE

Query:  I
        +
Subjt:  I

A0A6J1DUG8 uncharacterized protein LOC111024135 | e value: 1.5e-44 | % identity: 40.81
Query:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEWWWRFTGFYENSDQSK
        NP   R+LR+LV +  P ++FLSETK N  L  R +R+L F  C+SV   G+SGGLMLLW +   + + S S GHID  I      WRFTGFY N    K
Subjt:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEWWWRFTGFYENSDQSK

Query:  RKDSWQLLESLNEATNLPWIIGGDFNEVMYTWT----ISKNNKEAAK----ERLDRYLATSKMASLAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGF
        R  SW+LLE L    +LPWIIGGDFNE++        + +N  +       ERLDR+L    M +    ++V HL  L SDHR IL     E        
Subjt:  RKDSWQLLESLNEATNLPWIIGGDFNEVMYTWT----ISKNNKEAAK----ERLDRYLATSKMASLAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGF

Query:  SKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESEIRR
         +   R EESWL  +G +     +W S        F  KI   L  +  WNK RL  S+KGAI   E E+ R
Subjt:  SKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITESEIRR

A0A7J6DZ24 CCHC-type domain-containing protein | e value: 2.4e-34 | % identity: 33.66
Query:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEW-WWRFTGFYENSDQS
        NP A+ +LR +V K++P ++FLSETK     A  IRR++ FSN   V+C G+SGGL+LLW +  E+SV S+S GHID  +K      WRFTGFY N   S
Subjt:  NPRAIRSLRHLVPKHNPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEW-WWRFTGFYENSDQS

Query:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM
         R DSWQLL  L    +LPWI GGDFNE++                                     +TW   +      +ERLDRY    +  +L   +
Subjt:  KRKDSWQLLESLNEATNLPWIIGGDFNEVM-------------------------------------YTWTISKNNKEAAKERLDRYLATSKMASLAKEM

Query:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPT-RLEESWLNFEGSKTAFKESWNSSVVA--NNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITE
        +V +  ++HSDHR I   +  EN V    + K  + R E  WL     +    ++W S  V   N  +         + +  WNK +  GSI   +   E
Subjt:  RVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPT-RLEESWLNFEGSKTAFKESWNSSVVA--NNINFNRKIQEGLEAMKRWNKERLQGSIKGAIDITE

Query:  SEIRRFSNY
          +    +Y
Subjt:  SEIRRFSNY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAAAATACTCAAGAAGTAGGAAGCACTGTAGTGGAGGAAGGACAAAGTTCTTCGAAAATGGAAGATGGCAACATATGTAAACAACTCCATGAGTTCAATCTAACAGT
GGAAGAAAGAGGGAAGGTTTTAGGAATGGAAGATGAGGAGTTAAGAGAAAAGGTGCAAGACCTGGAATGTGTGCTGGAATACTTCGAGTCAACCGTTGGGATATCGACGG
GAGCTGTGGAGCAACCCCGCCGGACGCATGATTATTCTAAGTTGGAACGTCCAAGGGGTGGGGAAAACCCTCGAGCGATCCGTTCCTTGCGGCATTTAGTTCCCAAGCAT
AACCCCATGGTGATTTTTCTTTCGGAGACTAAATGCAATTCTCAACTGGCATATAGAATCAGGAGGAAGTTGGGATTTAGTAATTGTATTAGTGTCAATTGTGAAGGCCA
AAGTGGAGGGCTTATGTTATTATGGAAAAACCAACACGAAATCTCTGTCAACTCTTATTCGAAGGGACATATTGATATCTCTATCAAGACCCTGGAGTGGTGGTGGAGGT
TTACCGGTTTTTATGAGAACTCGGATCAAAGCAAGAGGAAAGATTCGTGGCAGCTCTTGGAGAGCCTCAACGAAGCTACAAATCTGCCTTGGATCATCGGTGGTGACTTC
AACGAAGTGATGTACACTTGGACAATAAGTAAAAATAACAAAGAAGCAGCAAAGGAAAGGCTGGACAGATACTTGGCAACCTCTAAAATGGCCTCTTTAGCAAAGGAAAT
GAGGGTGGAGCATTTAAGTTACTTACACTCTGACCATAGGTTGATCCTGCTGAAGATAAGCTGTGAAAATGCTGTCCAGCAAGGAGGATTCTCGAAAATGCCTACTAGAC
TTGAGGAGAGTTGGCTCAACTTTGAAGGAAGCAAAACAGCTTTTAAGGAGTCCTGGAATTCTAGTGTAGTCGCCAATAATATTAATTTTAATCGAAAAATTCAAGAAGGT
CTCGAAGCTATGAAGAGATGGAACAAAGAGAGACTGCAAGGCTCAATCAAAGGGGCCATTGACATAACTGAATCAGAAATCAGGAGATTTTCAAACTACAAGAACTCCAA
TGAATTAGACAAATTGTTACAAGCTGAGATAAAGCTAAACAAGCTCTTAGAGGAGGAATAA
mRNA sequence
ATGGAAAATACTCAAGAAGTAGGAAGCACTGTAGTGGAGGAAGGACAAAGTTCTTCGAAAATGGAAGATGGCAACATATGTAAACAACTCCATGAGTTCAATCTAACAGT
GGAAGAAAGAGGGAAGGTTTTAGGAATGGAAGATGAGGAGTTAAGAGAAAAGGTGCAAGACCTGGAATGTGTGCTGGAATACTTCGAGTCAACCGTTGGGATATCGACGG
GAGCTGTGGAGCAACCCCGCCGGACGCATGATTATTCTAAGTTGGAACGTCCAAGGGGTGGGGAAAACCCTCGAGCGATCCGTTCCTTGCGGCATTTAGTTCCCAAGCAT
AACCCCATGGTGATTTTTCTTTCGGAGACTAAATGCAATTCTCAACTGGCATATAGAATCAGGAGGAAGTTGGGATTTAGTAATTGTATTAGTGTCAATTGTGAAGGCCA
AAGTGGAGGGCTTATGTTATTATGGAAAAACCAACACGAAATCTCTGTCAACTCTTATTCGAAGGGACATATTGATATCTCTATCAAGACCCTGGAGTGGTGGTGGAGGT
TTACCGGTTTTTATGAGAACTCGGATCAAAGCAAGAGGAAAGATTCGTGGCAGCTCTTGGAGAGCCTCAACGAAGCTACAAATCTGCCTTGGATCATCGGTGGTGACTTC
AACGAAGTGATGTACACTTGGACAATAAGTAAAAATAACAAAGAAGCAGCAAAGGAAAGGCTGGACAGATACTTGGCAACCTCTAAAATGGCCTCTTTAGCAAAGGAAAT
GAGGGTGGAGCATTTAAGTTACTTACACTCTGACCATAGGTTGATCCTGCTGAAGATAAGCTGTGAAAATGCTGTCCAGCAAGGAGGATTCTCGAAAATGCCTACTAGAC
TTGAGGAGAGTTGGCTCAACTTTGAAGGAAGCAAAACAGCTTTTAAGGAGTCCTGGAATTCTAGTGTAGTCGCCAATAATATTAATTTTAATCGAAAAATTCAAGAAGGT
CTCGAAGCTATGAAGAGATGGAACAAAGAGAGACTGCAAGGCTCAATCAAAGGGGCCATTGACATAACTGAATCAGAAATCAGGAGATTTTCAAACTACAAGAACTCCAA
TGAATTAGACAAATTGTTACAAGCTGAGATAAAGCTAAACAAGCTCTTAGAGGAGGAATAA
Protein sequence
MENTQEVGSTVVEEGQSSSKMEDGNICKQLHEFNLTVEERGKVLGMEDEELREKVQDLECVLEYFESTVGISTGAVEQPRRTHDYSKLERPRGGENPRAIRSLRHLVPKH
NPMVIFLSETKCNSQLAYRIRRKLGFSNCISVNCEGQSGGLMLLWKNQHEISVNSYSKGHIDISIKTLEWWWRFTGFYENSDQSKRKDSWQLLESLNEATNLPWIIGGDF
NEVMYTWTISKNNKEAAKERLDRYLATSKMASLAKEMRVEHLSYLHSDHRLILLKISCENAVQQGGFSKMPTRLEESWLNFEGSKTAFKESWNSSVVANNINFNRKIQEG
LEAMKRWNKERLQGSIKGAIDITESEIRRFSNYKNSNELDKLLQAEIKLNKLLEEE
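
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper written for illustration, and only the first and last few codons of the CDS are used here:

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGAAAATACTCAAGAAGTAGGAAGCACT"))
print(translate("GAGGAGGAATAA"))
```

Passing the full CDS (with the line breaks joined) to `translate` allows a residue-by-residue comparison against the protein sequence listed in the record.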